model evaluation #7

mars203030 · 2024-02-12T21:50:36Z

Hello,

do you have teh code for model evaluation

Regards,

jingyeyang95 · 2024-02-17T15:13:14Z

Hello, for model evaluation, we randomly hold out 20% of the Biolark-gcs dataset, which you can get from https://data.mendeley.com/datasets/v4t59p8w4z/2
We manually compared the generated results with the dataset's labels for better accuracy. Specifically, we load our pre-trained model and give it text data from biolark as a prompt. The PhenoGPT will generate (predict) phenotypes and HPO IDs. You can compare these directly with the true results for evaluation.

mars203030 · 2024-02-17T22:28:54Z

Thank you so much this is what I am intended too
I tried the code you posted for llama 2 but it seems I am not getting any answer.
is there a step I am missing

mars203030 · 2024-02-17T22:39:55Z

here is wandb results

kaichop · 2024-11-11T21:43:55Z

are there any update on the evaluation? Please note that the phenogpt has been updated a few times and you may try the new version as well.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model evaluation #7

model evaluation #7

mars203030 commented Feb 12, 2024

jingyeyang95 commented Feb 17, 2024

mars203030 commented Feb 17, 2024

mars203030 commented Feb 17, 2024

kaichop commented Nov 11, 2024

model evaluation #7

model evaluation #7

Comments

mars203030 commented Feb 12, 2024

jingyeyang95 commented Feb 17, 2024

mars203030 commented Feb 17, 2024

mars203030 commented Feb 17, 2024

kaichop commented Nov 11, 2024