Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DataCap Application] <Large Sky Area Multi-Object Fiber Spectroscopic Telescope-5> #33

Open
1 of 2 tasks
baogebaoge178 opened this issue Dec 9, 2024 · 43 comments
Open
1 of 2 tasks
Assignees
Labels

Comments

@baogebaoge178
Copy link

baogebaoge178 commented Dec 9, 2024

Data Owner Name

National Astronomical Observatories, Chinese Academy of Sciences

Data Owner Country/Region

China

Data Owner Industry

Environment

Website

http://dr5.lamost.org/

Social Media Handle

http://dr5.lamost.org/

Social Media Type

Other

What is your role related to the dataset

Data Preparer

Total amount of DataCap being requested

5PiB

Expected size of single dataset (one copy)

500TiB

Number of replicas to store

10

Weekly allocation of DataCap requested

1000TiB

On-chain address for first allocation

f174mz57hgfirq7vgdhu4piwnvrpfg7a7j7qrnfka

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

  • Use Custom Multisig

Identifier

No response

Share a brief history of your project and organization

Experienced Personal Data Provider.The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) is a Chinese national scientific research facility operated by the National Astronomical Observatories, Chinese Academy of Sciences. It is a special reflecting Schmidt telescope with 4000 fibers in a field of view of 20 deg2
 in the sky. Until July 2017, LAMOST has completed its pilot survey which was launched in October 2011 and ended in June 2012, and the first five years of regular survey which was initiated on September 2012. After this six-year-survey, we totally obtain 9,026,365 spectra, which consist of stars, galaxies, quasars and other unknown objects[1−7]
. Now, the fifth data release (DR5) has published online (http://dr5.lamost.org/), and released data products include:

Spectra. - IIn general, there are 9,026,365 flux- and wavelength-calibrated, sky-subtracted spectra in DR5, including 8,183,160 stars, 152,863 galaxies, 52,453 quasars, and 637,889 unknown objects, and these spectra cover the wavelength range of 3690-9100 angstrom with a resolution of 1800[2−3]
 at the 5500 angstrom.
Spectroscopic Parameters Catalogs. - In this data release, six spectroscopic parameters catalogs are also published,and they are the LAMOST general catalog, the A, F, G and K type star catalog, the A type star catalog, the M dwarf catalog, the observed plate information catalog, and the input catalog respectively. In the LAMOST general catalog, it includes 36 columns of basic spectroscopic information, for example, right ascension, declination, signal to noise ratio, magnitude, classification and redshift, which are also provided by the A, F, G and K type star catalog, the A type star catalog, and the M dwarf catalog. These three catalogs also provide other parameters, for example, atmospheric parameters (effective temperature, surface gravity, and metallicity), spectral line indices, line widths, the metallicity sensitive parameter, and the magnetic activity flag. In addition, the observed plate information catalog mainly contains nine basic plate information for all published plates, and the input catalog includes 24 basic fields mentioned above and three new fields which are not included in above catalogs.

http://dr5.lamost.org/v3/doc/data-production-description

Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) is a Chinese national scientific research facility operated by the National Astronomical Observatories, Chinese Academy of Sciences. It is a special reflecting Schmidt telescope with 4000 fibers in a field of view of 20 deg2
 in the sky. Until July 2017, LAMOST has completed its pilot survey which was launched in October 2011 and ended in June 2012, and the first five years of regular survey which was initiated on September 2012. After this six-year-survey, we totally obtain 9,026,365 spectra, which consist of stars, galaxies, quasars and other unknown objects[1−7]
. Now, the fifth data release (DR5) has published online (http://dr5.lamost.org/), and released data products include:

Spectra. - IIn general, there are 9,026,365 flux- and wavelength-calibrated, sky-subtracted spectra in DR5, including 8,183,160 stars, 152,863 galaxies, 52,453 quasars, and 637,889 unknown objects, and these spectra cover the wavelength range of 3690-9100 angstrom with a resolution of 1800[2−3]
 at the 5500 angstrom.
Spectroscopic Parameters Catalogs. - In this data release, six spectroscopic parameters catalogs are also published,and they are the LAMOST general catalog, the A, F, G and K type star catalog, the A type star catalog, the M dwarf catalog, the observed plate information catalog, and the input catalog respectively. In the LAMOST general catalog, it includes 36 columns of basic spectroscopic information, for example, right ascension, declination, signal to noise ratio, magnitude, classification and redshift, which are also provided by the A, F, G and K type star catalog, the A type star catalog, and the M dwarf catalog. These three catalogs also provide other parameters, for example, atmospheric parameters (effective temperature, surface gravity, and metallicity), spectral line indices, line widths, the metallicity sensitive parameter, and the magnetic activity flag. In addition, the observed plate information catalog mainly contains nine basic plate information for all published plates, and the input catalog includes 24 basic fields mentioned above and three new fields which are not included in above catalogs.

http://dr5.lamost.org/v3/doc/data-production-description

Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

Where was the data currently stored in this dataset sourced from

Other

If you answered "Other" in the previous question, enter the details here

http://dr5.lamost.org/

If you are a data preparer. What is your location (Country/Region)

None

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

No response

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No response

Please share a sample of the data

http://dr5.lamost.org/v3/sas/catalog/
http://dr5.lamost.org/v3/sas/fits/20111024/F5902/
http://dr5.lamost.org/v3/sas/fits/20111024/F5907/
http://dr5.lamost.org/v3/sas/fits/20111024/F5909/
http://dr5.lamost.org/v3/sas/png/20111024/F5902/
http://dr5.lamost.org/v3/sas/png/20111024/F5907/
http://dr5.lamost.org/v3/sas/png/20111024/F5909/
http://dr5.lamost.org/v3/sas/sky/20111024/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

  • I confirm

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, Africa, North America, South America, Europe, Australia (continent), Antarctica

How will you be distributing your data to storage providers

HTTP or FTP server, Shipping hard drives

How did you find your storage providers

Slack, Filmine, Partners

If you answered "Others" in the previous question, what is the tool or platform you used

No response

Please list the provider IDs and location of the storage providers you will be working with.

f01106668   HK
f03055029   HK
f01518369   US
f01889668   US
f03239692  Frankfurt
f03055005   HK
f03055018   HK
f03254235 London
f01315096   HK
f0870558     HK

How do you plan to make deals to your storage providers

Boost client, Lotus client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

Copy link
Contributor

datacap-bot bot commented Dec 9, 2024

Application is waiting for allocator review

@hash889900
Copy link
Owner

The query finds that this public dataset has already been stored, so why store it again
image

@baogebaoge178
Copy link
Author

baogebaoge178 commented Dec 9, 2024

As I understand, many of the sectors on the SP where this dataset was originally stored have expired. We believe that valuable public datasets deserve to be redundantly stored. Additionally, the content of the data stored by each data preparer is not entirely the same. DR5 has a very large overall data volume, and we are only applying for a portion of it.

@hash889900
Copy link
Owner

Are you storing these public datasets on behalf of a company or an individual?

@hash889900
Copy link
Owner

image
Please update your sp information to include your geographic location.

@hash889900
Copy link
Owner

If you are representing a company, please provide a business license, if you are an individual, please provide personally identifiable information, and contact information sent to [email protected] to verify your information

@baogebaoge178
Copy link
Author

baogebaoge178 commented Dec 10, 2024

I apologize for the late reply. I serve in the role of a personal data preparer, primarily preparing data through official tools such as Lotus, Boost, and Singularity. The main nodes are located in Hong Kong: f01106668, f01315096, f0870558, f03055005, f03055018, f03055029; additionally, two nodes f01518369 and f01889668 are located in the United States,f03239692 Frankfurt,f03254235 London

For the initial sealing phase, we plan to collaborate with some of these SPs (Storage Providers), depending on their preparation of pledged coins. Of course, we are also looking for more SPs from different regions to cooperate with.

@baogebaoge178
Copy link
Author

The E-mail sent,Please check it.Thanks!

datacap-bot bot added a commit that referenced this issue Dec 10, 2024
datacap-bot bot added a commit that referenced this issue Dec 11, 2024
@hash889900
Copy link
Owner

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

@hash889900 hash889900 self-assigned this Dec 11, 2024
@hash889900
Copy link
Owner

Total amount of DataCap being requested
15PiB

Sorry, the total limit for a single client for a single application is 5P,Please revise the application form accordingly

datacap-bot bot added a commit that referenced this issue Dec 11, 2024
@baogebaoge178
Copy link
Author

Total amount of DataCap being requested
15PiB

Sorry, the total limit for a single client for a single application is 5P,Please revise the application form accordingly

I have already revised the total amount of the application according to your requirements.

@baogebaoge178
Copy link
Author

I serve in the role of a personal data preparer, primarily preparing data through official tools such as Lotus, Boost, and Singularity.

I serve in the role of a personal data preparer, primarily preparing data through official tools such as Lotus, Boost, and Singularity.Mainly, it involves creating CAR files.

Copy link
Contributor

datacap-bot bot commented Dec 11, 2024

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

1000TiB

DataCap Amount - First Tranche

256TiB

Client address

f174mz57hgfirq7vgdhu4piwnvrpfg7a7j7qrnfka

Copy link
Contributor

datacap-bot bot commented Dec 11, 2024

DataCap Allocation requested

Multisig Notary address

Client address

f174mz57hgfirq7vgdhu4piwnvrpfg7a7j7qrnfka

DataCap allocation requested

256TiB

Id

1cc6cba7-2164-4d04-bdcc-129352ca0aa1

Copy link
Contributor

datacap-bot bot commented Dec 11, 2024

Application is ready to sign

Copy link
Contributor

datacap-bot bot commented Dec 11, 2024

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacec3amzzrsr4a2fpjcdemkpeyfhhyaso2nae5mhrhnuawydpohihtc

Address

f174mz57hgfirq7vgdhu4piwnvrpfg7a7j7qrnfka

Datacap Allocated

256TiB

Signer Address

f14oq4jctudvidvyhfmrp7qt6ndng7pr7ampp3miq

Id

1cc6cba7-2164-4d04-bdcc-129352ca0aa1

You can check the status here https://filfox.info/en/message/bafy2bzacec3amzzrsr4a2fpjcdemkpeyfhhyaso2nae5mhrhnuawydpohihtc

Copy link
Contributor

datacap-bot bot commented Dec 11, 2024

Application is Granted

@hash889900
Copy link
Owner

checker:manualTrigger

@hash889900
Copy link
Owner

checker:manualTrigger

Copy link
Contributor

datacap-bot bot commented Dec 20, 2024

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 66.67% of Storage Providers have retrieval success rate less than 75%.

⚠️ The average retrieval success rate is 19.70%

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@hash889900
Copy link
Owner

checker:manualTrigger

Copy link
Contributor

datacap-bot bot commented Dec 30, 2024

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 50.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Copy link
Contributor

datacap-bot bot commented Jan 2, 2025

Client used 75% of the allocated DataCap. Consider allocating next tranche.

@baogebaoge178
Copy link
Author

checker:manualTrigger

Copy link
Contributor

datacap-bot bot commented Jan 7, 2025

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 50.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@hash889900
Copy link
Owner

checker:manualTrigger

Copy link
Contributor

datacap-bot bot commented Jan 8, 2025

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 50.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@hash889900
Copy link
Owner

checker:manualTrigger

Copy link
Contributor

datacap-bot bot commented Jan 8, 2025

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 50.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@hash889900
Copy link
Owner

Currently there are only 2 regional SPs,can you elaborate on your plans for the next round?

@baogebaoge178
Copy link
Author

Hello, currently we are in the first round. As the project progresses, an increasing number of SPs from various regions will join sequentially. Please continue to offer your support.

We will promptly disclose the latest SPs before encapsulation once they are confirmed.

@hash889900
Copy link
Owner

Due to insufficient DC quota, another 256TiB will be triggered and the subsequent distribution will be continuously followed up.

Copy link
Contributor

datacap-bot bot commented Jan 8, 2025

Application is in Refill

Copy link
Contributor

datacap-bot bot commented Jan 8, 2025

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceaggtrbdcg2zd4yx5jb4h4dhyakbdggo3ljzktb75ov2mccwlgjqg

Address

f174mz57hgfirq7vgdhu4piwnvrpfg7a7j7qrnfka

Datacap Allocated

256TiB

Signer Address

f14oq4jctudvidvyhfmrp7qt6ndng7pr7ampp3miq

Id

4a23a0f4-6351-4885-b6cc-b72a5f2a41dc

You can check the status here https://filfox.info/en/message/bafy2bzaceaggtrbdcg2zd4yx5jb4h4dhyakbdggo3ljzktb75ov2mccwlgjqg

Copy link
Contributor

datacap-bot bot commented Jan 8, 2025

Application is Granted

@datacap-bot datacap-bot bot added granted and removed Refill labels Jan 8, 2025
@hash889900
Copy link
Owner

checker:manualTrigger

Copy link
Contributor

datacap-bot bot commented Jan 13, 2025

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 50.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@hash889900
Copy link
Owner

checker:manualTrigger

Copy link
Contributor

datacap-bot bot commented Jan 18, 2025

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 75.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@baogebaoge178
Copy link
Author

Continue to update SP: f03055018

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants