Feedback Request: Late-Fusion Model Worsens with Satellite. #16
-
Hi @Sukh-P With the diagram I made from our previous discussion, I have tried the late-fusion model with Korean Satellite Images and Korean weather forecasting (NWPs) on Jeju Island for only one month of this year. My current input settings are:
However, when I try this and calculate my NMAE, it comes out as high as 0.5. Ironically, the result with satellite images is worse than without them (2 NWP > 2 NWP + Satellite). May I ask for any quick tips or feedback on this? Thank you for your time and consideration! |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 5 replies
-
Hi @kwon-encored! Great that you were able to use our code, that's so exciting! And I'm sorry it's not going great so far, that's always frustrating. A couple of things off the top of my head, and I'm sure you've done some of that already but just in case anyway:
Hope this is helpful, do get in touch if you have any other questions! |
Beta Was this translation helpful? Give feedback.
-
@kwon-encored it would also be good if you were able to provide a bit more detail on what your target data is and the sources for the different input data you used were i.e is it generation data from the whole island split into different regions or is it generation data from one site on the island? Also are you using the history of the PV generation data as an input to the model? As @AUdaltsova mentioned also worth checking all of your data inputs, especially satellite to see there is decent data quality there. Additionally what was the satellite data source that you used and what satellite channels were used? Also how much satellite history was passed to the model? Hopefully with some additional detail we can provide some assistance, thanks |
Beta Was this translation helpful? Give feedback.
-
@kwon-encored @AUdaltsova I believe I may have found a difference or mistake during my code modification. Follow-up (if my understanding is correct):
|
Beta Was this translation helpful? Give feedback.
-
Hi @kwon-encored, so, in terms of your setup, it looks like you maybe have data on 19 individual plants as opposed to regional aggregates? It shouldn't necessarily cause issues but our site pipeline might be better suited for your data. Generally, if you are working with individual sites I'd select their positions as points of interest instead of creating a grid over the island if that's what you're doing-but I might've misunderstood you. We do not use GSP generation history in the model, so that shouldn't affect your performance, but generally, if it's available to you and is of good enough quality you can feed in a couple of hours (or more) of history, and it will likely help with performance. You're right that neither of those things would explain why including satellite data makes it worse for you though. However, NMAE of 0.5 is really, really high, and it might be worth trying to debug with less data sources, because it doesn't look like the model is functioning correctly. I'd still perform some checks that you're feeding in the right crops, aligned by time, correctly normalised etc. Might also be worth looking at the correlation of some of your data and your targets, though if you're feeding in relevant weather data and your pv doesn't look completely unreasonable I can't imagine it being the problem. Re: your follow-up, someone might correct me on this, but as far as I know each sample will contain data from just one GSP. It will look something like this:
A batch of size 128 will contain 128 samples like that, with some random mix of GSPs used in it for targets and IDs. If you're trying to predict some sort of mixture and do not distinguish between different IDs it might be causing your issues! I hope this helps :) |
Beta Was this translation helpful? Give feedback.
Hi @kwon-encored! Great that you were able to use our code, that's so exciting! And I'm sorry it's not going great so far, that's always frustrating. A couple of things off the top of my head, and I'm sure you've done some of that already but just in case anyway: