Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

knowledge: add document for Boston public transportation history #1100

Closed
wants to merge 1 commit into from

Conversation

nathan-weinberg
Copy link
Member

@nathan-weinberg nathan-weinberg commented May 20, 2024

If your PR is related to a contribution to the taxonomy, please, fill
out the following questionnaire. If not, replace this whole text and the
following questionnaire with whatever information is applicable to your PR.

Describe the contribution to the taxonomy

  • Knowledge for the model about the history of public transportation in Boston

I do not have the hardware to do this locally, so I will be doing the following steps with instructlab-bot and updating these prompts accordingly

Input given at the prompt

What agency preceded the MBTA?

Response from the original model

The Metropolitan Boston Railroad Company (MBTA) was preceded by several railroad lines and companies that operated in the Boston area. The first railroad in the region, the Boston and Providence Railroad, began operating in 1835. Over time, several other railroads were established, including the Lowell and Boston Railroad, the Cambridge and Lowell Railroad, and the Worcester Railroad. These early railroads laid the groundwork for the development of a comprehensive rail network in the Boston area, eventually leading to the formation of the MBTA in 1964. The MBTA, or "T," as it is commonly referred to, is the largest public transit system in New England, serving more than 900,000 daily riders.

Response from the fine-tuned model

  ...

Contribution checklist

  • The contribution was tested with ilab generate
  • No errors or warnings were produced by ilab generate
  • All commits are signed off (DCO)
  • The qna.yaml file contains at least 5 seed_examples
  • The qna.yaml file was linted and prettified (yaml-validator can do both)
  • An attribution.txt file in the same folder as the qna.yaml file
  • Content does not include PII or otherwise sensitive or confidential information
  • Content does not include anything documented in the project's Avoid these Topics guidelines

@github-actions github-actions bot added triage-needed (Auto labeled) skill is ready to be triaged knowledge (Auto labeled) labels May 20, 2024
Copy link

Beep, boop 🤖, Hi, I'm @instructlab-bot and I'm going to help you with your pull request. Thanks for you contribution! 🎉

I support the following commands:

  • @instructlab-bot precheck -- Check existing model behavior using the questions in this proposed change.
  • @instructlab-bot generate -- Generate a sample of synthetic data using the synthetic data generation backend infrastructure.
  • @instructlab-bot generate-local -- Generate a sample of synthetic data using a local model.
  • @instructlab-bot help -- Print this help message again.

Note

Results or Errors of these commands will be posted as a pull request check in the Checks section below

Note

Currently only maintainers belongs to [[taxonomy-triagers taxonomy-approvers taxonomy-maintainers labrador-org-maintainers instruct-lab-bot-maintainers]] teams are allowed to run these commands.

@luke-inglis
Copy link
Contributor

@instructlab-bot precheck

Copy link

Beep, boop 🤖, Generating test data for your PR with the job type: precheck. Your Job ID is 321. The results will be presented below in the pull request status box. This may take several minutes...

Copy link

Results for job ID: 321 using the model instructlab/granite-7b-lab!

Results can be found here.

@nathan-weinberg
Copy link
Member Author

@luke-inglis for the record I'm trying this out locally still, that's why I opened it as a draft

@jjasghar jjasghar removed the triage-needed (Auto labeled) skill is ready to be triaged label May 20, 2024
@nathan-weinberg nathan-weinberg marked this pull request as ready for review May 21, 2024 17:29
@nathan-weinberg
Copy link
Member Author

Taking this out of draft so it can be properly triaged/I can continue with the bot commands, let me know if I need to readd the triage-needed label

@nathan-weinberg
Copy link
Member Author

@instructlab-bot generate

Copy link

User nathan-weinberg is not allowed to run the InstructLab bot. Only [taxonomy-triagers taxonomy-approvers taxonomy-maintainers labrador-org-maintainers instruct-lab-bot-maintainers] teams are allowed to access the bot functions.

@nathan-weinberg nathan-weinberg added the triage-needed (Auto labeled) skill is ready to be triaged label May 21, 2024
@jjasghar
Copy link
Member

The linting issue is due to a schema PR we are waiting on.

@jjasghar
Copy link
Member

@instructlab-bot generate

Copy link

Beep, boop 🤖, Generating test data for your PR with the job type: sdg-svc. Your Job ID is 331. The results will be presented below in the pull request status box. This may take several minutes...

Copy link

Results for job ID: 331 using the model sdg service backend!

Results can be found here.

@bjhargrave
Copy link
Contributor

The linting issue is due to a schema PR we are waiting on.

#769

@nathan-weinberg nathan-weinberg force-pushed the boston-mta branch 2 times, most recently from c41c4ea to 7026939 Compare May 30, 2024 19:39
@nathan-weinberg
Copy link
Member Author

This has been updated with a new commit hash as @juliadenham graciously fixed my Markdown file for me 😄

@bjhargrave bjhargrave added triage-requested-changes skill has been reviewed; changes requested from contributor and removed triage-needed (Auto labeled) skill is ready to be triaged labels May 31, 2024
Copy link

Beep, boop 🤖, Hi, I'm @instructlab-bot and I'm going to help you with your pull request. Thanks for you contribution! 🎉

I support the following commands:

  • @instructlab-bot precheck -- Check existing model behavior using the questions in this proposed change.
  • @instructlab-bot generate -- Generate a sample of synthetic data using the synthetic data generation backend infrastructure.
  • @instructlab-bot generate-local -- Generate a sample of synthetic data using a local model.
  • @instructlab-bot help -- Print this help message again.

Note

Results or Errors of these commands will be posted as a pull request check in the Checks section below

Note

Currently only maintainers belongs to [[taxonomy-triagers taxonomy-approvers taxonomy-maintainers labrador-org-maintainers instruct-lab-bot-maintainers]] teams are allowed to run these commands.

@github-actions github-actions bot added the triage-needed (Auto labeled) skill is ready to be triaged label Jun 3, 2024
Copy link

Beep, boop 🤖, Hi, I'm @instructlab-bot and I'm going to help you with your pull request. Thanks for you contribution! 🎉

I support the following commands:

  • @instructlab-bot precheck -- Check existing model behavior using the questions in this proposed change.
  • @instructlab-bot generate -- Generate a sample of synthetic data using the synthetic data generation backend infrastructure.
  • @instructlab-bot generate-local -- Generate a sample of synthetic data using a local model.
  • @instructlab-bot help -- Print this help message again.

Note

Results or Errors of these commands will be posted as a pull request check in the Checks section below

Note

Currently only maintainers belongs to [[taxonomy-triagers taxonomy-approvers taxonomy-maintainers labrador-org-maintainers instruct-lab-bot-maintainers]] teams are allowed to run these commands.

@bjhargrave
Copy link
Contributor

@instructlab-bot precheck

@bjhargrave
Copy link
Contributor

@instructlab-bot precheck

Copy link

Beep, boop 🤖, Generating test data for your PR with the job type: precheck. Your Job ID is 365. The results will be presented below in the pull request status box. This may take several minutes...

Copy link

Results for job ID: 365 using the model instructlab/granite-7b-lab!

Results can be found here.

@bjhargrave bjhargrave removed the triage-needed (Auto labeled) skill is ready to be triaged label Jun 4, 2024
Copy link

This pull request has been automatically marked as stale because it has not had activity within 15 days. It will be automatically closed if no further activity occurs within 31 days.

@github-actions github-actions bot added the stale stale-bot has marked you as stale label Jun 20, 2024
@jjasghar jjasghar removed the stale stale-bot has marked you as stale label Jun 27, 2024
@jjasghar
Copy link
Member

Hi! 👋
It’s been a while since you’ve seen any movement on this PR. We haven’t forgotten about you!  We’ve run into some logistical issues, hence this delay. We absolutely want your PR, and being marketed as e2e-ready is still the last stop before we get it into the upstream model.

We are thankful for your patience and ask that you please keep this PR open. As soon as we finish all our behind-the-scenes work, we’ll test the full model against your submissions and, ideally, accept your amazing contribution(s)! 

Your Community Maintainer Team.

P.S. if you have any specific questions or thoughts, don’t hesitate to comment on pull request this or email [email protected] and [email protected], and we’ll get back to you as soon as possible.

@github-actions github-actions bot added the triage-needed (Auto labeled) skill is ready to be triaged label Aug 19, 2024
created_by: nathan-weinberg, juliadenham
seed_examples:
- context: |
The [Cambridge Railroad](Cambridge_Railroad "wikilink") was the first
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you please update the markdown (in the knowledge md and the context snippets) to remove all wikilinks?

See juliadenham/Summit_knowledge#1 (comment) where @mairin did this for Swifties.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I got this file directly from @juliadenham - Julia could you update the commit with that?

@jjasghar jjasghar added triage-requested-changes skill has been reviewed; changes requested from contributor and removed triage-needed (Auto labeled) skill is ready to be triaged community-build-ready Triage Team has signed off for synthetic data generation labels Aug 19, 2024
@mcorbin-ibm
Copy link
Contributor

Per the new taxonomy (#1215 ), we have a decision to make on this one -- does this belong in "history" or "social sciences/transportation"?

From reviewing the summaries PDF, I think I prefer the more specific topic (/science/social_sciences/transportation/), so I recommend this:

knowledge/science/social_sciences/transportation/history_mbta/

(Instead of knowledge/history/public_transportation/boston/ -- because how likely are we to get more histories of public transportations in other cities???)

@github-actions github-actions bot added the triage-needed (Auto labeled) skill is ready to be triaged label Aug 22, 2024
@jjasghar jjasghar added community-build-ready Triage Team has signed off for synthetic data generation cmb-running We are running these in the CMB process and removed triage-needed (Auto labeled) skill is ready to be triaged triage-requested-changes skill has been reviewed; changes requested from contributor labels Aug 22, 2024
@jjasghar
Copy link
Member

jjasghar commented Sep 3, 2024

Hi! 👋
Thank you for submitting this PR. We are ready to do some validation now, and we have a few candidates to see if they improve the model.
We some resources to run these release candidates, but we need you to help us. Can you reach out to me either on Slack (@awesome) or email me at awesomeATinstructlab.ai so I can get you access via ilab model chat?
We can only run these models for a "week" or so, so please reach out as soon as possible and tell me which one is best for you on this PR.

@jjasghar jjasghar added triage-rejected PR fails to meet criteria and removed community-build-ready Triage Team has signed off for synthetic data generation cmb-running We are running these in the CMB process labels Sep 9, 2024
@jjasghar jjasghar closed this Sep 9, 2024
@jjasghar jjasghar added the follow-up-engineering This PR needs to be investigated by engineering label Sep 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
follow-up-engineering This PR needs to be investigated by engineering knowledge (Auto labeled) triage-rejected PR fails to meet criteria
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants