---
tags:
- bertopic
library_name: bertopic
pipeline_tag: text-classification
---
# bertopic-umap15-hbd15-topn15
This is a [BERTopic](https://github.com/MaartenGr/BERTopic) model.
BERTopic is a flexible, modular topic modeling framework that generates easily interpretable topics from large datasets.
## Usage
To use this model, please install BERTopic:
```
pip install -U bertopic
```
You can use the model as follows:
```python
from bertopic import BERTopic
topic_model = BERTopic.load("ahessamb/bertopic-umap15-hbd15-topn15")
topic_model.get_topic_info()
```
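Beyond inspecting the topic table, a loaded model can assign topics to unseen text. The following is a minimal sketch rather than part of the original card: `label_documents` and `demo` are hypothetical helpers, the sample sentence is illustrative, and the sketch assumes the embedding model saved with the checkpoint can be restored on load, since `transform` needs it to embed new documents.

```python
from typing import Iterable, List, Tuple


def label_documents(topic_model, docs: Iterable[str]) -> List[Tuple[str, int]]:
    """Pair each document with its predicted topic ID (hypothetical helper)."""
    docs = list(docs)
    # BERTopic.transform returns (topics, probabilities); only topics are used here.
    topics, _ = topic_model.transform(docs)
    return [(doc, int(topic)) for doc, topic in zip(docs, topics)]


def demo() -> None:
    """Needs `bertopic` installed and network access to download the model."""
    from bertopic import BERTopic

    topic_model = BERTopic.load("ahessamb/bertopic-umap15-hbd15-topn15")
    sample = ["Ethereum completed the merge to proof of stake."]  # illustrative text
    for doc, topic in label_documents(topic_model, sample):
        # Topic -1 is BERTopic's outlier bucket.
        print(topic, doc)
```

Call `demo()` to run it end to end. The returned topic IDs correspond to the Topic ID column in the overview below.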
## Topic overview
* Number of topics: 105
* Number of training documents: 14320
<details>
<summary>Click here for an overview of all topics.</summary>
| Topic ID | Topic Keywords | Topic Frequency | Label |
|----------|----------------|-----------------|-------|
| -1 | market - price - nft - said - cryptocurrency | 15 | -1_market_price_nft_said |
| 0 | korea - funds - attack - hackers - fraud | 6725 | 0_korea_funds_attack_hackers |
| 1 | usd - 500 - near - bitcoin - consolidating | 706 | 1_usd_500_near_bitcoin |
| 2 | sized - digest - news - blockchain - radar | 417 | 2_sized_digest_news_blockchain |
| 3 | merge - ethereum - proof - fork - beacon | 236 | 3_merge_ethereum_proof_fork |
| 4 | rate - cpi - hikes - fomc - bitcoin | 209 | 4_rate_cpi_hikes_fomc |
| 5 | luna - ustc - entropy - proposal - terraform | 207 | 5_luna_ustc_entropy_proposal |
| 6 | brands - meta - worlds - immersive - decentraland | 206 | 6_brands_meta_worlds_immersive |
| 7 | russia - sanctions - crypto - ruble - settlements | 187 | 7_russia_sanctions_crypto_ruble |
| 8 | gensler - securities - coinbase - industry - regulation | 178 | 8_gensler_securities_coinbase_industry |
| 9 | blockchain - web3 - gamers - p2e - industry | 174 | 9_blockchain_web3_gamers_p2e |
| 10 | miners - carbon - power - bitcoin - report | 157 | 10_miners_carbon_power_bitcoin |
| 11 | funding - round - ventures - capital - gamestop | 151 | 11_funding_round_ventures_capital |
| 12 | xrp - ripple - price - level - resistance | 146 | 12_xrp_ripple_price_level |
| 13 | etf - blackrock - grayscale - bitcoin - futures | 145 | 13_etf_blackrock_grayscale_bitcoin |
| 14 | web3 - disco - mcmullen - identity - platforms | 144 | 14_web3_disco_mcmullen_identity |
| 15 | protocols - decentralized - newsletter - cefi - lending | 141 | 15_protocols_decentralized_newsletter_cefi |
| 16 | inu - lucie - meme - tokens - ecosystem | 139 | 16_inu_lucie_meme_tokens |
| 17 | ftx - sam - bankman - bankruptcy - ceo | 132 | 17_ftx_sam_bankman_bankruptcy |
| 18 | tether - usdt - documents - coindesk - stablecoins | 123 | 18_tether_usdt_documents_coindesk |
| 19 | el - bukele - nayib - bitcoin - x93 | 120 | 19_el_bukele_nayib_bitcoin |
| 20 | dogecoin - musk - meme - twitter - level | 114 | 20_dogecoin_musk_meme_twitter |
| 21 | 26 - resistance - near - btc - bulls | 106 | 21_26_resistance_near_btc |
| 22 | nft - opensea - doppel - marketplaces - rug | 101 | 22_nft_opensea_doppel_marketplaces |
| 23 | cfds - traders - assets - cryptocurrency - adoption | 95 | 23_cfds_traders_assets_cryptocurrency |
| 24 | difficulty - hashrate - bitcoin - network - height | 90 | 24_difficulty_hashrate_bitcoin_network |
| 25 | ubi - cointelegraph - simonin - bitcoin - income | 88 | 25_ubi_cointelegraph_simonin_bitcoin |
| 26 | coinbase - bitkey - india - ceo - fees | 85 | 26_coinbase_bitkey_india_ceo |
| 27 | donated - russia - invasion - transformation - donors | 83 | 27_donated_russia_invasion_transformation |
| 28 | celsius - cel - withdrawals - company - mashinsky | 81 | 28_celsius_cel_withdrawals_company |
| 29 | nfts - collections - million - floor - cryptopunk | 81 | 29_nfts_collections_million_floor |
| 30 | blockchain - bvm - mvc - maestro - databases | 78 | 30_blockchain_bvm_mvc_maestro |
| 31 | crypto - merchants - mastercard - feature - cashapp | 78 | 31_crypto_merchants_mastercard_feature |
| 32 | ada - cardano - bearish - satoshis - market | 76 | 32_ada_cardano_bearish_satoshis |
| 33 | nft - sartoshi - artists - snoop - community | 75 | 33_nft_sartoshi_artists_snoop |
| 34 | solana - bearish - outages - fibonacci - resistance | 72 | 34_solana_bearish_outages_fibonacci |
| 35 | hinman - ripple - speech - emails - xrp | 71 | 35_hinman_ripple_speech_emails |
| 36 | oecd - taxation - framework - india - electronic | 70 | 36_oecd_taxation_framework_india |
| 37 | terraform - montenegro - korea - x93 - milojko | 69 | 37_terraform_montenegro_korea_x93 |
| 38 | order - securities - freeze - restraining - cyprus | 68 | 38_order_securities_freeze_restraining |
| 39 | manchester - sponsorship - bcci - com - fans | 68 | 39_manchester_sponsorship_bcci_com |
| 40 | surveyed - millennials - managers - crypto - report | 67 | 40_surveyed_millennials_managers_crypto |
| 41 | whales - eth - market - transactions - usdt | 66 | 41_whales_eth_market_transactions |
| 42 | binance - kazakhstan - changpeng - expansion - 500m | 61 | 42_binance_kazakhstan_changpeng_expansion |
| 43 | twitter - musk - metatime - jack - yaccarino | 59 | 43_twitter_musk_metatime_jack |
| 44 | rsi - price - line - altcoin - bullish | 59 | 44_rsi_price_line_altcoin |
| 45 | china - huobi - hkma - regulatory - companies | 57 | 45_china_huobi_hkma_regulatory |
| 46 | token - leo - surged - tlos - graph | 57 | 46_token_leo_surged_tlos |
| 47 | cbdcs - governor - banks - mit - project | 56 | 47_cbdcs_governor_banks_mit |
| 48 | daos - chorus - lieberman - decentralized - organizations | 51 | 48_daos_chorus_lieberman_decentralized |
| 49 | fungible - nonfungible - tokens - nft - 2021 | 51 | 49_fungible_nonfungible_tokens_nft |
| 50 | altcoins - levels - overhead - support - bounce | 50 | 50_altcoins_levels_overhead_support |
| 51 | yuan - digital - tax - cbdc - wallets | 43 | 51_yuan_digital_tax_cbdc |
| 52 | depot - company - invest - banking - america | 42 | 52_depot_company_invest_banking |
| 53 | markets - advice - bull - hodlers - nasdaily | 42 | 53_markets_advice_bull_hodlers |
| 54 | eth - level - breakout - tradingview - analysts | 38 | 54_eth_level_breakout_tradingview |
| 55 | nethereum - usd - struggling - resistance - performers | 37 | 55_nethereum_usd_struggling_resistance |
| 56 | ecoterra - trending - swords - presale - neo | 36 | 56_ecoterra_trending_swords_presale |
| 57 | securities - market - binance - coinbase - week | 34 | 57_securities_market_binance_coinbase |
| 58 | staking - eigenlayer - sip - ethereum - tokens | 33 | 58_staking_eigenlayer_sip_ethereum |
| 59 | founder - ethereum - forgotten - values - twitter | 33 | 59_founder_ethereum_forgotten_values |
| 60 | bnb - bauer - upgrade - ecosystem - network | 32 | 60_bnb_bauer_upgrade_ecosystem |
| 61 | price - rsi - bullish - chart - resistance | 32 | 61_price_rsi_bullish_chart |
| 62 | expiry - week - billion - derivatives - bet | 32 | 62_expiry_week_billion_derivatives |
| 63 | vasil - fork - mainnet - newest - scalability | 31 | 63_vasil_fork_mainnet_newest |
| 64 | microstrategy - saylor - btc - rumor - billion | 31 | 64_microstrategy_saylor_btc_rumor |
| 65 | metamask - browser - wallets - features - allows | 31 | 65_metamask_browser_wallets_features |
| 66 | uae - east - chainalysis - singapore - emerging | 31 | 66_uae_east_chainalysis_singapore |
| 67 | outflows - etps - products - week - funds | 31 | 67_outflows_etps_products_week |
| 68 | polygon - zcash - kakarot - starknet - protocol | 29 | 68_polygon_zcash_kakarot_starknet |
| 69 | japanese - jvcea - stablecoin - x93 - fatf | 29 | 69_japanese_jvcea_stablecoin_x93 |
| 70 | asic - miner - gpu - mi300x - ks3 | 28 | 70_asic_miner_gpu_mi300x |
| 71 | arrows - voyager - dcg - genesis - bankruptcy | 28 | 71_arrows_voyager_dcg_genesis |
| 72 | axie - infinity - program - ronin - upgrades | 26 | 72_axie_infinity_program_ronin |
| 73 | withdrawals - platform - freeway - halted - babel | 26 | 73_withdrawals_platform_freeway_halted |
| 74 | addresses - eth - glassnode - underwater - cryptos | 26 | 74_addresses_eth_glassnode_underwater |
| 75 | bottoming - dip - markets - chain - altcoins | 25 | 75_bottoming_dip_markets_chain |
| 76 | mica - eu - conglomerates - jurisdictions - framework | 25 | 76_mica_eu_conglomerates_jurisdictions |
| 77 | liquidations - resting - bid - order - 200 | 25 | 77_liquidations_resting_bid_order |
| 78 | listings - missed - announcements - usdt - exchanges | 25 | 78_listings_missed_announcements_usdt |
| 79 | cbdc - ripple - border - imf - currencies | 25 | 79_cbdc_ripple_border_imf |
| 80 | announcements - delisting - pair - listing - collection | 24 | 80_announcements_delisting_pair_listing |
| 81 | treasury - mixers - sanctioning - github - prank | 24 | 81_treasury_mixers_sanctioning_github |
| 82 | polkadot - parachains - auctions - opengov - referenda | 24 | 82_polkadot_parachains_auctions_opengov |
| 83 | hedge - investors - crypto - traditional - enriquez | 23 | 83_hedge_investors_crypto_traditional |
| 84 | level - resistance - cj - price - cryptocurrency | 23 | 84_level_resistance_cj_price |
| 85 | nexo - citibank - vauld - acquisitions - launched | 22 | 85_nexo_citibank_vauld_acquisitions |
| 86 | huobi - li - citing - pantronics - rumours | 22 | 86_huobi_li_citing_pantronics |
| 87 | nft - textbook - pill - sweeney - x9caccessible | 21 | 87_nft_textbook_pill_sweeney |
| 88 | bored - yacht - apecoin - justin - collection | 21 | 88_bored_yacht_apecoin_justin |
| 89 | apecoin - pattern - chart - head - roc | 21 | 89_apecoin_pattern_chart_head |
| 90 | subscription - investment - binance - dual - 06 | 20 | 90_subscription_investment_binance_dual |
| 91 | halving - correlation - nasdaq - 2024 - powell | 20 | 91_halving_correlation_nasdaq_2024 |
| 92 | announcements - delisting - listing - crypto - slice | 20 | 92_announcements_delisting_listing_crypto |
| 93 | adoption - nigeria - kucoin - lawful - aza | 18 | 93_adoption_nigeria_kucoin_lawful |
| 94 | staff - chatbot - layoffs - hr - terminations | 18 | 94_staff_chatbot_layoffs_hr |
| 95 | ethereum - network - batching - costs - tx | 18 | 95_ethereum_network_batching_costs |
| 96 | suarez - desantis - salary - city - candidate | 18 | 96_suarez_desantis_salary_city |
| 97 | circle - stablecoin - integrating - cybavo - worldpay | 17 | 97_circle_stablecoin_integrating_cybavo |
| 98 | stablecoins - paypal - plabasan - mhel - converge22 | 17 | 98_stablecoins_paypal_plabasan_mhel |
| 99 | week - tokens - tvl - locked - analytical | 17 | 99_week_tokens_tvl_locked |
| 100 | impairment - company - holdings - incurred - btc | 17 | 100_impairment_company_holdings_incurred |
| 101 | cbdc - familiarity - euro - ecb - respondents | 17 | 101_cbdc_familiarity_euro_ecb |
| 102 | marketplace - opensea - popularize - ftx - teaming | 16 | 102_marketplace_opensea_popularize_ftx |
| 103 | executive - leaving - bitstamp - genesis - samir | 15 | 103_executive_leaving_bitstamp_genesis |
</details>
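Each label in the table appears to be the topic ID joined by underscores to the topic's first four keywords. The sketch below splits a label back into those parts and expresses a topic's frequency as a share of the training set. `parse_label` and `topic_share` are hypothetical helpers, and the share calculation assumes the Topic Frequency column counts training documents.

```python
from typing import List, Tuple

TRAINING_DOCS = 14320  # "Number of training documents" from the overview


def parse_label(label: str) -> Tuple[int, List[str]]:
    """Split a label such as '3_merge_ethereum_proof_fork' into ID and keywords."""
    # Split only on the first underscore so a leading '-1' stays intact.
    topic_id, _, keywords = label.partition("_")
    return int(topic_id), keywords.split("_")


def topic_share(frequency: int, total: int = TRAINING_DOCS) -> float:
    """Return a topic's frequency as a percentage of the training documents."""
    return 100.0 * frequency / total


print(parse_label("0_korea_funds_attack_hackers"))
print(round(topic_share(6725), 1))
```

Under that assumption, topic 0 alone covers roughly 47% of the training documents, which is worth keeping in mind when interpreting the smaller topics.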
## Training hyperparameters
* calculate_probabilities: False
* language: None
* low_memory: False
* min_topic_size: 15
* n_gram_range: (1, 1)
* nr_topics: None
* seed_topic_list: None
* top_n_words: 5
* verbose: False
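
The values above map onto keyword arguments of the `BERTopic` constructor. As a rough sketch, they could be passed like this. `CARD_HYPERPARAMETERS` and `build_topic_model` are hypothetical names, and because the card does not record the embedding model, the UMAP and HDBSCAN settings, or the training corpus, a model built this way would not reproduce the published topics.

```python
# Hyperparameters copied from the list above.
CARD_HYPERPARAMETERS = {
    "calculate_probabilities": False,
    "language": None,
    "low_memory": False,
    "min_topic_size": 15,
    "n_gram_range": (1, 1),
    "nr_topics": None,
    "seed_topic_list": None,
    "top_n_words": 5,
    "verbose": False,
}


def build_topic_model(**overrides):
    """Create an untrained BERTopic instance (hypothetical helper).

    Needs `bertopic` installed; extra keyword arguments such as
    `embedding_model` override or extend the card's values.
    """
    from bertopic import BERTopic

    params = {**CARD_HYPERPARAMETERS, **overrides}
    return BERTopic(**params)
```
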
## Framework versions
* Numpy: 1.22.4
* HDBSCAN: 0.8.29
* UMAP: 0.5.3
* Pandas: 1.5.3
* Scikit-Learn: 1.2.2
* Sentence-transformers: 2.2.2
* Transformers: 4.30.2
* Numba: 0.56.4
* Plotly: 5.13.1
* Python: 3.10.12
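
Loading a saved model in an environment that differs from the one above can sometimes cause problems, so it may help to compare installed packages against this list first. The sketch below uses only the standard library. `CARD_VERSIONS`, `installed_version`, and `version_mismatches` are hypothetical names, and the keys are assumed to be the PyPI distribution names for the packages listed.

```python
from importlib import metadata
from typing import Callable, Dict, Optional, Tuple

# Versions from the list above, keyed by assumed PyPI distribution name.
CARD_VERSIONS = {
    "numpy": "1.22.4",
    "hdbscan": "0.8.29",
    "umap-learn": "0.5.3",
    "pandas": "1.5.3",
    "scikit-learn": "1.2.2",
    "sentence-transformers": "2.2.2",
    "transformers": "4.30.2",
    "numba": "0.56.4",
    "plotly": "5.13.1",
}


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def version_mismatches(
    expected: Dict[str, str],
    lookup: Callable[[str], Optional[str]] = installed_version,
) -> Dict[str, Tuple[str, Optional[str]]]:
    """Map each differing package to (version in card, version found)."""
    mismatches = {}
    for name, wanted in expected.items():
        found = lookup(name)
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


for name, (wanted, found) in version_mismatches(CARD_VERSIONS).items():
    print(f"{name}: card lists {wanted}, installed {found or 'not installed'}")
```

A mismatch is not necessarily an error; it only flags where the local environment differs from the one the model was trained in.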