Skip to content

Latest commit

 

History

History
56 lines (49 loc) · 15 KB

Datasets-with-Aspect-Categories.md

File metadata and controls

56 lines (49 loc) · 15 KB

Datasets with Aspect Category Annotations

back to README.md

This table gives a list of publicly available datasets for ABSA and its subtasks with aspect category annotations. If you want to add any new datasets or change any information, please create a fork of the master repository and create a pull request, so that we can verify it and commit the change.

Index Year Dataset Paper Domain Link to the Dataset Lng #Revs #Sent #AT-pos #AT-neg #AT-neu #AC-pos #AC-neg #AC-neu
42 2024 ROAST (Chebolu etal., 2024) Amazon FineFoods https://github.com/RiTUAL-UH/ROAST-ABSA EN 2060
41 2024 ROAST (Chebolu etal., 2024) Coursera https://github.com/RiTUAL-UH/ROAST-ABSA EN 2061
40 2024 ROAST (Chebolu etal., 2024) Hotels https://github.com/RiTUAL-UH/ROAST-ABSA EN 2056
39 2024 ROAST (Chebolu etal., 2024) Movies https://github.com/RiTUAL-UH/ROAST-ABSA TE 1643
38 2024 ROAST (Chebolu etal., 2024) Phones https://github.com/RiTUAL-UH/ROAST-ABSA EN 1464
37 2024 ROAST (Chebolu etal., 2024) Phones https://github.com/RiTUAL-UH/ROAST-ABSA HN 2083
36 2024 OATS (Chebolu etal., 2024) Amazon FineFoods https://github.com/RiTUAL-UH/OATS-ABSA EN 1794 8913 5,577 1,187 234 5,577 1,187 234
35 2024 OATS (Chebolu etal., 2024) Coursera https://github.com/RiTUAL-UH/OATS-ABSA EN 1702 8278 4,403 1,008 213 4,403 1,008 213
34 2024 OATS (Chebolu etal., 2024) Hotels https://github.com/RiTUAL-UH/OATS-ABSA EN 1497 7963 6,952 1,207 169 6,952 1,207 169
33 2021 ASQP (Zhang etal., 2021a) Restaurants https://github.com/IsakZhang/ABSA-QUAD EN - 2124 1811 613 110 2229 877 135
32 2021 ASQP (Zhang etal., 2021a) Restaurants https://github.com/IsakZhang/ABSA-QUAD EN - 1580 1407 489 68 1710 701 85
31 2021 ASAP (Bu etal., 2021) Restaurants https://github.com/Meituan-Dianping/asap/tree/master/data CH 46K - - - - 169K 35K 66K
30 2021 Vietnam.Smartph. (Thanh etal., 2021) Smartphones https://github.com/kimkim00/UIT-ViSD4SA VI - 11122 - - - 21732 11206 2214
29 2020 TeluguMovies (Regatte etal., 2020) Movies https://tiny.cc/vdxugz TE - 5027 2480 3251 1129 2480 3251 1129
28 2019 MAMS (Jiang etal., 2019) Restaurants https://github.com/siat-nlp/MAMS-for-ABSA/tree/master/data EN - 3849 - - - 2415 2606 3858
27 2018 Foursquare (Brunand Nikoulina, 2018) Restaurants https://europe.naverlabs.com/Research/Natural-Language-Processing/Aspect-Based-Sentiment-Analysis-Dataset/ EN - 1006 759 108 16 947 191 19
26 2018 ABSITA-2018 (Basile etal., 2018) Hotels https://sag.art.uniroma2.it/absita/data/ IT - 9285 - - - 6893 5288 -
25 2018 BanglaRest., Cricket (RahmanandKumarDey, 2018) Restaurants https://github.com/AtikRahman/Bangla_ABSA_Datasets BG - 1712 - - - 477 1226 371
24 2018 BanglaRest., Cricket (RahmanandKumarDey, 2018) Cricket https://github.com/AtikRahman/Bangla_ABSA_Datasets BG - 2691 - - - 571 2157 266
23 2018 FiQA (de França Costa and da Silva, 2018) Financial** https://sites.google.com/view/fiqa/home EN 1303 - 774 399 - 774 399 -
22 2017 GermEval-2017 (Wojatzki etal., 2017) Soc.Med., blogs, news https://ltdata1.informatik.uni-hamburg.de/germeval2017/ DE - 27800 2802 12571 1459 2815 12690 13932
21 2017 CustomerResponse (Yin etal., 2017) Hotels** https://github.com/HKUST-KnowComp/DMSC EN 29K 375K - - - 120K 66K 49K
20 2017 CustomerResponse (Yin etal., 2017) BeerAdvocate** https://github.com/HKUST-KnowComp/DMSC EN 51K 552K - - - 176K 8902 64K
19 2016 SentiHood (Saeidi etal., 2016) UrbanNeighborhoods https://github.com/uclnlp/jack/tree/master/data/sentihood EN - 5215 - - - 4305 1606 -
18 2016 SE-16 (Pontiki etal., 2016) DigitalCameras https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools CH 200 8100 - - - 1153 587 -
17 2016 SE-16 (Pontiki etal., 2016) MobilePhones https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools CH 200 9500 - - - 1168 794 -
16 2016 SE-16 (Pontiki etal., 2016) Restaurants https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools FR 455 2429 1285 1061 289 1605 1646 233
15 2016 SE-16 (Pontiki etal., 2016) Restaurants https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools RU 405 4699 3139 696 313 3973 1030 379
14 2016 SE-16 (Pontiki etal., 2016) MobilePhones https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools DU 270 1697 - - - 1454 225 110
13 2016 SE-16 (Pontiki etal., 2016) Restaurants https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools DU 400 2286 1016 546 145 1431 857 185
12 2016 SE-16 (Pontiki etal., 2016) Hotels https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools AR 2291 6029 7213 4003 824 7705 4556 852
11 2016 SE-16 (Pontiki etal., 2016) Telecom https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools TR - 3000 - - - - - -
10 2016 SE-16 (Pontiki etal., 2016) Restaurants https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools TR 339 1248 865 555 119 924 635 135
9 2016 SE-16 (Pontiki etal., 2016) Restaurants https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools ES - 2691 1907 672 125 2675 948 168
8 2016 SE-16 (Pontiki etal., 2016) Laptops https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools EN 530 3308 - - - 2118 1358 236
7 2016 SE-16 (Pontiki etal., 2016) Restaurants https://alt.qcri.org/semeval2016/task5/index.php?id=data-and-tools EN 400 2286 1817 634 106 2268 953 145
6 2015 SE-15 (Pontiki etal., 2015) Laptops https://alt.qcri.org/semeval2015/task12/index.php?id=data-and-tools EN 450 2500 - - - 1644 1094 185
5 2015 SE-15 (Pontiki etal., 2015) Restaurants https://alt.qcri.org/semeval2015/task12/index.php?id=data-and-tools EN 350 2000 1326 496 73 1652 749 98
4 2015 HAAD (Al-Smadi etal., 2015) Books https://github.com/msmadi/HAAD AR - 2389 1376 1287 148 721 750 14
3 2014 (Steinberger etal., 2014) Restaurants https://liks.fav.zcu.cz/sentiment/ CZ - 1244 679 725 403 521 569 246
2 2014 SE-14 (Pontiki etal., 2014) Restaurants https://alt.qcri.org/semeval2014/task4/ EN - 3841 2892 1001 829 2836 998 594
1 2011 TripAdvisorHotels (Wang etal., 2011) Hotels** https://www.cs.virginia.edu/~hw5x/dataset.html EN 108K 1M - - - 1.63M 153K 178K

back to README.md