Replies: 7 comments 9 replies
-
Hey @brunosan Thanks for the chat yesterday! This is a nice dataset that would be a good start: https://github.com/nick-murray/coastTrain There's around 8,000 points across Fiji as a starting point. We have S-1 and S-2 annual mosaics, which we can use to do prediction on, see the S-1 Mosaic and S-2 GeoMAD here: https://stac-browser.staging.digitalearthpacific.org/ We have some unpolished notebooks doing random forest classification now, but it would be great to see if we can use your model to compare with a simple random forest model, for example. |
Beta Was this translation helpful? Give feedback.
-
Great to chat today, @weiji14 From the Digital Earth Pacific side, we have Sentinel-2 and Sentinel-1 mosaics over all of Fiji, which we are intending on using to do some land-use/land-cover type ML. We're currently doing old-school classification, and we are open to using new methods based on the Clay foundation model. We agreed that we (DEP) will wait until there are some more well established examples, and you said you might be able to help us there. We agreed to work together in the future on use-cases, once there is a process that our team can use as an example and start building on. |
Beta Was this translation helpful? Give feedback.
-
Thanks Wei Ji.
We are currently looking into different species of forest (including mangroves, pine, primary and secondary forest) in Fiji and Tonga and it would be great to understand how the Clay Model + segmentation capabilities could be applied to this too.
It looks like the current work in the repo you shared will already be able to shed some preliminary light onto this.
Best wishes.
…________________________________
From: Wei Ji ***@***.***>
Sent: 14 February 2024 10:33
To: Clay-foundation/model ***@***.***>
Cc: Nick Metherall ***@***.***>; Mention ***@***.***>
Subject: Re: [Clay-foundation/model] Digital Earth Pacific applications (Discussion #140)
You don't often get email from ***@***.*** Learn why this is important<https://aka.ms/LearnAboutSenderIdentification>
Thanks @alexgleith<https://github.com/alexgleith> and @nicholasmetherall<https://github.com/nicholasmetherall> for the call just now. I'm sure we'll have lots to collaborate on in the coming months on DEP 😄
From the Digital Earth Pacific side, we have Sentinel-2 and Sentinel-1 mosaics over all of Fiji, which we are intending on using to do some land-use/land-cover type ML. We're currently doing old-school classification, and we are open to using new methods based on the Clay foundation model.
Sounds good. I think the cool part would be to use the Clay model to do classification/segmentation tasks on the STAC-hosted mosaics, for downstream tasks like the mineral resource detection<https://github.com/digitalearthpacific/mineral-resource-detection>, or potentially a South Pacific-specific Land Use Land Cover map as you mentioned. @yellowcap<https://github.com/yellowcap> has been keen to create embeddings on composites/mosaics (see #128<#128>), and having a working STAC API would definitely simplify things!
We agreed that we (DEP) will wait until there are some more well established examples, and you said you might be able to help us there. We agreed to work together in the future on use-cases, once there is a process that our team can use as an example and start building on.
Yep, we (DevSeed) can handle the ML-engineering part to get an initial finetuning workflow set-up, and that would involve working with y'all at DEP to get the GeoMAD/mosaic data in a good state. I'll poke around some more and probably ask lots of questions along the way!
—
Reply to this email directly, view it on GitHub<#140 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AQB2HOZS6WUGCMTEW36CLZTYTPZ4XAVCNFSM6AAAAABCPDCADCVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM4DINJZG43DI>.
You are receiving this because you were mentioned.
|
Beta Was this translation helpful? Give feedback.
-
Chiming in here! Nice to meet you @alexgleith. A bit of a delayed update but we made some progress on this front. A couple of weeks ago we pushed a notebook that explores use of embeddings derived from the Sentinel-2 composite to see if we could find new "quarry" sites. Essentially, we are mapping filtered ground truth points to patch level embeddings to produce reference embeddings that can be used in a similarity search query. It's a bit hard to validate the results without greater understanding of how these quarries actually represent in a spatial and spectral sense. Stated otherwise, right now the embeddings are derived from 10 meter RGB-Nir inputs. Is that sufficient to capture a signal on quarries in this region? Would other channels be useful? Maybe Sentinel-1 as well? The notebook lives here: https://github.com/Clay-foundation/model/blob/main/docs/tutorial_digital_earth_pacific_patch_level.ipynb I'd love to discuss this further. Let me know if that is of interest! |
Beta Was this translation helpful? Give feedback.
-
Hi @lillythomas, thanks for sharing! I've had a read through of the notebook and my quick take is it's pretty complicated. There's a lot of writing to disk and storing values in databases, which is surprising to me I guess. I don't know if RGB+NIR is enough to identify the sites. In our work using random forest, I think elevation was ranked as important. I'll see if I can run the notebook today and extract some point locations and compare with our results and come back to you. |
Beta Was this translation helpful? Give feedback.
-
Ok, after checking, these bands were important in our current process (descending order of importance): The Sentinel-1 mosaic bands ( |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
@alexgleith let's brainstorm here how Clay can help DEP.
In a nutshell, we seek to learn to predict labels from a dataset of ~1k examples you give us, that can be expected to be seen on Sentinel-2 or Sentinel-1 data.
Beta Was this translation helpful? Give feedback.
All reactions