-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Links to pre-trained ed and eae models are broken #64
Comments
The link is updated here: #61 |
Thanks for the response! When I replace the old links with the new ones, I get the error
Is there anything additional that I need in order to use these files? They are downloading correctly, as far as I can tell, the directory is successfully unzipped and contains the following files:
Do I need to run |
As an urgent candidate, perhaps you can try this model (https://huggingface.co/THU-KEG/ADELIE-DPO-1.5B). It's more general and powerful for information extraction, and its scale is similar to the previous one. I will recheck the issues with the old version of the model soon. |
Have you tried to load the model using Line 55 in ec72e72
|
Thanks for your quick responses! In order: HuggingFacewith LangChainTrying the huggingface model ran into some problems (though admittedly, I am stacking LangChain on top of it): when invoking the model (whether set up using HuggingFaceEndpoint or HuggingFacePipeline), it throws a 404 and hangs. Here's a snippet from the HuggingFacePipeline approach
In the case of the Endpoint approach, the 404 happens during invocation:
In either case, I can't seem to get it to respond to the input. HuggingFace AloneIt did occur to me that maybe this is a LangChain issue, so I bopped around with HuggingFace, following the example in the Llama model card, as there is no invocation example for the ADELIE-DPO-1.5B.
The error:
(I do have safetensors installed) Using infer.pyYup, that's exactly what I tried (inasmuch as it is what's called by the test). I also did a bunch of stuff to try to add comments and logging and see what was going on and what might be causing the problem, but struck out. I'm happy to transition to using the HF model, I just need help with how, exactly, to leverage it. Again, big thanks for the help! |
When trying to run the test, I repeatedly encountered a
FileNotFoundError
regarding a config, which I eventually traced back to the actual problem: there are no such files at these URLs. What gets downloaded from here is an HTML page telling you that the link does not exist. This HTML page cannot be unzipped, of course, so the process errors out long before completing.Neither of these links is valid, meaning that the single test found in the repo cannot run, and the library cannot be used out of the box to perform tasks like the one in the README. Where are the fully pretrained models now? Have they been taken down? If so, submit that the README and some other documentation should change to reflect that there is no out-of-the-box functionality. If not, what should these links be updated to?
My poor baby machine cannot train its own model, I need y'all's.
The text was updated successfully, but these errors were encountered: