MARTINI_enrich_BERTopic_surf_noise_eng
This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
Usage
To use this model, please install BERTopic:
pip install -U bertopic
You can use the model as follows:
from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_surf_noise_eng")
topic_model.get_topic_info()
Topic overview
- Number of topics: 50
- Number of training documents: 5753
Click here for an overview of all topics.
Topic ID | Topic Keywords | Topic Frequency | Label |
---|---|---|---|
-1 | zelensky - russians - sanctions - eu - nazi | 21 | -1_zelensky_russians_sanctions_eu |
0 | bundestag - schmidt - rheinmetall - tagesspiegel - dusseldorf | 3454 | 0_bundestag_schmidt_rheinmetall_tagesspiegel |
1 | zaporizhzhia - chernobyl - khmelnitsky - rosenergoatom - radioactive | 118 | 1_zaporizhzhia_chernobyl_khmelnitsky_rosenergoatom |
2 | belarus - sanctions - famine - guterres - fertilizers | 108 | 2_belarus_sanctions_famine_guterres |
3 | nazies - shukhevych - genocidium - polska - swastika | 104 | 3_nazies_shukhevych_genocidium_polska |
4 | biden - democrats - polls - delaware - midterm | 92 | 4_biden_democrats_polls_delaware |
5 | zelensky - volodymyr - dzhankarashvili - ladimir - autocracy | 76 | 5_zelensky_volodymyr_dzhankarashvili_ladimir |
6 | britain - boris - partygate - downing - scottish | 75 | 6_britain_boris_partygate_downing |
7 | donetsk - makiivka - shrapnel - victims - voroshylovskyi | 73 | 7_donetsk_makiivka_shrapnel_victims |
8 | hamas - gaza - israeli - airstrikes - aleppo | 73 | 8_hamas_gaza_israeli_airstrikes |
9 | moldova - protesters - aeroflot - chernyshenko - maia | 71 | 9_moldova_protesters_aeroflot_chernyshenko |
10 | outages - latvia - boilers - ludwigshafen - megawatt | 70 | 10_outages_latvia_boilers_ludwigshafen |
11 | ukrainians - refugees - germany - krakow - xenophobia | 67 | 11_ukrainians_refugees_germany_krakow |
12 | bioweapons - pfizer - coronavirus - pentagon - kirillov | 66 | 12_bioweapons_pfizer_coronavirus_pentagon |
13 | protests - netherlands - marne - farmers - bonfires | 62 | 13_protests_netherlands_marne_farmers |
14 | lgbtq - california - legalized - fifa - fairies | 59 | 14_lgbtq_california_legalized_fifa |
15 | britons - inflation - billionaires - households - bills | 53 | 15_britons_inflation_billionaires_households |
16 | hungary - sanctions - balasz - brussels - zakarpatia | 52 | 16_hungary_sanctions_balasz_brussels |
17 | nato - stoltenberg - kyiv - allies - join | 50 | 17_nato_stoltenberg_kyiv_allies |
18 | latvians - daugavpils - russophobia - monuments - partisans | 50 | 18_latvians_daugavpils_russophobia_monuments |
19 | mercenaries - foreigncombatants - sniper - lisichansk - brigades | 49 | 19_mercenaries_foreigncombatants_sniper_lisichansk |
20 | mariupol - azovites - militants - surrendered - battalion | 48 | 20_mariupol_azovites_militants_surrendered |
21 | finland - stockholm - erdogan - nonproliferation - pekka | 45 | 21_finland_stockholm_erdogan_nonproliferation |
22 | pyongyang - missiles - denuclearization - nuked - hiroshima | 45 | 22_pyongyang_missiles_denuclearization_nuked |
23 | atrocities - kupyansk - captured - nazis - videos | 43 | 23_atrocities_kupyansk_captured_nazis |
24 | chernivtsi - zakarpattia - mobilized - verkhovnaya - commissars | 43 | 24_chernivtsi_zakarpattia_mobilized_verkhovnaya |
25 | putin - belarus - yermakov - disarmament - diplomat | 41 | 25_putin_belarus_yermakov_disarmament |
26 | russians - lvov - cheburashka - moseichuk - hatred | 40 | 26_russians_lvov_cheburashka_moseichuk |
27 | crimea - novovladimivka - terrorist - tatarstan - basayev | 36 | 27_crimea_novovladimivka_terrorist_tatarstan |
28 | chomsky - demonizing - totalitarianism - circlejerk - американцев | 35 | 28_chomsky_demonizing_totalitarianism_circlejerk |
29 | kosovska - mitrovica - serbians - pristina - provocations | 35 | 29_kosovska_mitrovica_serbians_pristina |
30 | kharkov - militants - voskresenskoye - mizintsev - atrocities | 33 | 30_kharkov_militants_voskresenskoye_mizintsev |
31 | nordstream - pipelines - sabotage - explosions - leakage | 33 | 31_nordstream_pipelines_sabotage_explosions |
32 | biden - billions - congresswoman - grants - pentagon | 33 | 32_biden_billions_congresswoman_grants |
33 | karabakh - pashinyan - armistice - transcaucasia - alimbayev | 32 | 33_karabakh_pashinyan_armistice_transcaucasia |
34 | saudi - khashoggi - bahrain - jeddah - mecca | 31 | 34_saudi_khashoggi_bahrain_jeddah |
35 | latvians - rusophobic - denaturalization - bans - rapporteurs | 27 | 35_latvians_rusophobic_denaturalization_bans |
36 | monuments - nkvd - bulgarians - vandalized - graves | 27 | 36_monuments_nkvd_bulgarians_vandalized |
37 | lithuania - lukashenka - visas - eu - banning | 27 | 37_lithuania_lukashenka_visas_eu |
38 | euromaidan - moldovans - funds - ammunition - blitzkrieg | 26 | 38_euromaidan_moldovans_funds_ammunition |
39 | severodonetsk - lisichansk - cossacks - thanked - allahu | 25 | 39_severodonetsk_lisichansk_cossacks_thanked |
40 | chernobaevka - missile - zhytomyr - aeroflot - crashed | 25 | 40_chernobaevka_missile_zhytomyr_aeroflot |
41 | missiles - pentagon - drones - cnn - shipped | 25 | 41_missiles_pentagon_drones_cnn |
42 | taiwan - beijing - fujian - pelosi - xinjiang | 24 | 42_taiwan_beijing_fujian_pelosi |
43 | musk - twitter - paywalled - impostors - 22mln | 24 | 43_musk_twitter_paywalled_impostors |
44 | gazprom - eu - pipeline - sudzha - renewable | 23 | 44_gazprom_eu_pipeline_sudzha |
45 | petroleum - khashoggi - cartel - cnooc - prices | 21 | 45_petroleum_khashoggi_cartel_cnooc |
46 | brics - multilateralism - vladivostok - ramaphosa - zhang | 21 | 46_brics_multilateralism_vladivostok_ramaphosa |
47 | sanctions - sberbank - croatia - mitsotakis - tankers | 21 | 47_sanctions_sberbank_croatia_mitsotakis |
48 | tiananmen - xinjiang - hong - pogroms - communists | 21 | 48_tiananmen_xinjiang_hong_pogroms |
Training hyperparameters
- calculate_probabilities: True
- language: None
- low_memory: False
- min_topic_size: 10
- n_gram_range: (1, 1)
- nr_topics: None
- seed_topic_list: None
- top_n_words: 10
- verbose: False
- zeroshot_min_similarity: 0.7
- zeroshot_topic_list: None
Framework versions
- Numpy: 1.26.4
- HDBSCAN: 0.8.40
- UMAP: 0.5.7
- Pandas: 2.2.3
- Scikit-Learn: 1.5.2
- Sentence-transformers: 3.3.1
- Transformers: 4.46.3
- Numba: 0.60.0
- Plotly: 5.24.1
- Python: 3.10.12
- Downloads last month
- 5
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.