MARTINI_enrich_BERTopic_surf_noise_eng

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_surf_noise_eng")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 50
  • Number of training documents: 5753
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 zelensky - russians - sanctions - eu - nazi 21 -1_zelensky_russians_sanctions_eu
0 bundestag - schmidt - rheinmetall - tagesspiegel - dusseldorf 3454 0_bundestag_schmidt_rheinmetall_tagesspiegel
1 zaporizhzhia - chernobyl - khmelnitsky - rosenergoatom - radioactive 118 1_zaporizhzhia_chernobyl_khmelnitsky_rosenergoatom
2 belarus - sanctions - famine - guterres - fertilizers 108 2_belarus_sanctions_famine_guterres
3 nazies - shukhevych - genocidium - polska - swastika 104 3_nazies_shukhevych_genocidium_polska
4 biden - democrats - polls - delaware - midterm 92 4_biden_democrats_polls_delaware
5 zelensky - volodymyr - dzhankarashvili - ladimir - autocracy 76 5_zelensky_volodymyr_dzhankarashvili_ladimir
6 britain - boris - partygate - downing - scottish 75 6_britain_boris_partygate_downing
7 donetsk - makiivka - shrapnel - victims - voroshylovskyi 73 7_donetsk_makiivka_shrapnel_victims
8 hamas - gaza - israeli - airstrikes - aleppo 73 8_hamas_gaza_israeli_airstrikes
9 moldova - protesters - aeroflot - chernyshenko - maia 71 9_moldova_protesters_aeroflot_chernyshenko
10 outages - latvia - boilers - ludwigshafen - megawatt 70 10_outages_latvia_boilers_ludwigshafen
11 ukrainians - refugees - germany - krakow - xenophobia 67 11_ukrainians_refugees_germany_krakow
12 bioweapons - pfizer - coronavirus - pentagon - kirillov 66 12_bioweapons_pfizer_coronavirus_pentagon
13 protests - netherlands - marne - farmers - bonfires 62 13_protests_netherlands_marne_farmers
14 lgbtq - california - legalized - fifa - fairies 59 14_lgbtq_california_legalized_fifa
15 britons - inflation - billionaires - households - bills 53 15_britons_inflation_billionaires_households
16 hungary - sanctions - balasz - brussels - zakarpatia 52 16_hungary_sanctions_balasz_brussels
17 nato - stoltenberg - kyiv - allies - join 50 17_nato_stoltenberg_kyiv_allies
18 latvians - daugavpils - russophobia - monuments - partisans 50 18_latvians_daugavpils_russophobia_monuments
19 mercenaries - foreigncombatants - sniper - lisichansk - brigades 49 19_mercenaries_foreigncombatants_sniper_lisichansk
20 mariupol - azovites - militants - surrendered - battalion 48 20_mariupol_azovites_militants_surrendered
21 finland - stockholm - erdogan - nonproliferation - pekka 45 21_finland_stockholm_erdogan_nonproliferation
22 pyongyang - missiles - denuclearization - nuked - hiroshima 45 22_pyongyang_missiles_denuclearization_nuked
23 atrocities - kupyansk - captured - nazis - videos 43 23_atrocities_kupyansk_captured_nazis
24 chernivtsi - zakarpattia - mobilized - verkhovnaya - commissars 43 24_chernivtsi_zakarpattia_mobilized_verkhovnaya
25 putin - belarus - yermakov - disarmament - diplomat 41 25_putin_belarus_yermakov_disarmament
26 russians - lvov - cheburashka - moseichuk - hatred 40 26_russians_lvov_cheburashka_moseichuk
27 crimea - novovladimivka - terrorist - tatarstan - basayev 36 27_crimea_novovladimivka_terrorist_tatarstan
28 chomsky - demonizing - totalitarianism - circlejerk - американцев 35 28_chomsky_demonizing_totalitarianism_circlejerk
29 kosovska - mitrovica - serbians - pristina - provocations 35 29_kosovska_mitrovica_serbians_pristina
30 kharkov - militants - voskresenskoye - mizintsev - atrocities 33 30_kharkov_militants_voskresenskoye_mizintsev
31 nordstream - pipelines - sabotage - explosions - leakage 33 31_nordstream_pipelines_sabotage_explosions
32 biden - billions - congresswoman - grants - pentagon 33 32_biden_billions_congresswoman_grants
33 karabakh - pashinyan - armistice - transcaucasia - alimbayev 32 33_karabakh_pashinyan_armistice_transcaucasia
34 saudi - khashoggi - bahrain - jeddah - mecca 31 34_saudi_khashoggi_bahrain_jeddah
35 latvians - rusophobic - denaturalization - bans - rapporteurs 27 35_latvians_rusophobic_denaturalization_bans
36 monuments - nkvd - bulgarians - vandalized - graves 27 36_monuments_nkvd_bulgarians_vandalized
37 lithuania - lukashenka - visas - eu - banning 27 37_lithuania_lukashenka_visas_eu
38 euromaidan - moldovans - funds - ammunition - blitzkrieg 26 38_euromaidan_moldovans_funds_ammunition
39 severodonetsk - lisichansk - cossacks - thanked - allahu 25 39_severodonetsk_lisichansk_cossacks_thanked
40 chernobaevka - missile - zhytomyr - aeroflot - crashed 25 40_chernobaevka_missile_zhytomyr_aeroflot
41 missiles - pentagon - drones - cnn - shipped 25 41_missiles_pentagon_drones_cnn
42 taiwan - beijing - fujian - pelosi - xinjiang 24 42_taiwan_beijing_fujian_pelosi
43 musk - twitter - paywalled - impostors - 22mln 24 43_musk_twitter_paywalled_impostors
44 gazprom - eu - pipeline - sudzha - renewable 23 44_gazprom_eu_pipeline_sudzha
45 petroleum - khashoggi - cartel - cnooc - prices 21 45_petroleum_khashoggi_cartel_cnooc
46 brics - multilateralism - vladivostok - ramaphosa - zhang 21 46_brics_multilateralism_vladivostok_ramaphosa
47 sanctions - sberbank - croatia - mitsotakis - tankers 21 47_sanctions_sberbank_croatia_mitsotakis
48 tiananmen - xinjiang - hong - pogroms - communists 21 48_tiananmen_xinjiang_hong_pogroms

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
Downloads last month
5
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.