DIY AI For Journalists

hackshackers's Collections

updated Sep 18, 2023

Compiling resources useful for journalists building prototypes with AI

Runtime error

174

Whisper JAX Diarization

🔥

Note: This Space provides a version of Whisper (a speech-to-text model) with speaker diarization. It lets you transcribe audio containing speech and label who is speaking in each part of the transcript.
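Combining a transcript with diarization usually comes down to matching timestamps. The following is a minimal sketch of that idea, not the Space's actual implementation: it assumes you already have transcript chunks and speaker turns with start and end times in seconds. The `Turn` and `Chunk` classes, the `assign_speakers` helper, and the speaker labels are all hypothetical names used for illustration.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """A diarization result: one speaker talking from start to end (seconds)."""
    speaker: str
    start: float
    end: float


@dataclass
class Chunk:
    """A transcription result: a piece of text with its timestamps (seconds)."""
    text: str
    start: float
    end: float


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the overlap between two time intervals (0 if none)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(chunks, turns):
    """Label each chunk with the speaker whose turn overlaps it the most."""
    labelled = []
    for chunk in chunks:
        best_speaker, best_overlap = "UNKNOWN", 0.0
        for turn in turns:
            shared = overlap(chunk.start, chunk.end, turn.start, turn.end)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.speaker, shared
        labelled.append((best_speaker, chunk.text))
    return labelled


# Hypothetical example data, not real model output.
turns = [Turn("SPEAKER_00", 0.0, 4.0), Turn("SPEAKER_01", 4.0, 9.0)]
chunks = [
    Chunk("Good evening.", 0.2, 1.8),
    Chunk("Thanks for having me.", 3.5, 6.0),
    Chunk("(music)", 10.0, 12.0),
]
result = assign_speakers(chunks, turns)
for speaker, text in result:
    print(f"{speaker}: {text}")
```

A chunk that straddles two turns goes to the speaker with the larger share of its duration, and a chunk that overlaps no turn is labelled `UNKNOWN` so it can be checked by hand.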
pyannote/speaker-diarization

Automatic Speech Recognition • Updated May 10, 2024 • 6.51M • 938

Note: This model performs speaker diarization (identifying who is speaking, and when, in an audio recording).
copenlu/scientific-exaggeration-detection

Text Classification • Updated Jul 3, 2024 • 16 • 3

Note: This model measures the causal claim strength of a scientific sentence. Comparing the strengths of two sentences shows whether one exaggerates the causal claim made by the other.
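The comparison step can be sketched in a few lines. This is an illustration under stated assumptions, not the model's documented interface: it assumes each sentence has already been assigned an integer strength level by a classifier, and that higher numbers mean stronger causal claims. The label names in `STRENGTH_LABELS` and the `compare_claims` helper are hypothetical; check the model card for the actual label mapping.

```python
# Assumed ordering of claim strength levels, weakest to strongest.
# The real label set should be taken from the model card.
STRENGTH_LABELS = {
    0: "no relation",
    1: "correlational",
    2: "conditional causal",
    3: "direct causal",
}


def compare_claims(source_strength: int, report_strength: int) -> str:
    """Compare a report's claim strength against the source it describes."""
    if report_strength > source_strength:
        return "exaggerates"
    if report_strength < source_strength:
        return "downplays"
    return "same"


# Hypothetical scores: the source sentence was rated correlational (1),
# while the sentence reporting on it was rated direct causal (3).
verdict = compare_claims(source_strength=1, report_strength=3)
print(f"{STRENGTH_LABELS[1]} -> {STRENGTH_LABELS[3]}: report {verdict} the source")
```

A verdict of "exaggerates" is a prompt for a closer read of both sentences, not a finding in itself, since the underlying scores come from a classifier that can be wrong.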
Running

149

PDF OCR

📝

Convert PDF to text using OCR

Note: A Space that lets you run OCR on PDF documents.
Running

5

Grobid CRF only

🌍

Extract bibliographic data from academic papers and patents

Note: GROBID is a machine learning library for extracting, parsing, and restructuring raw documents such as PDFs into structured XML/TEI-encoded documents, with a particular focus on technical and scientific publications.
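Once a document is in TEI form, the bibliographic fields can be read with a standard XML parser. The sketch below uses only the Python standard library and a small hand-written TEI fragment; the fragment, its title and author names, and the `extract_header` helper are hypothetical, and real GROBID output contains many more elements than shown here.

```python
import xml.etree.ElementTree as ET

# TEI documents use this XML namespace for their elements.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hand-written sample for illustration only, not actual GROBID output.
SAMPLE_TEI = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title level="a" type="main">An Example Paper Title</title>
      </titleStmt>
      <sourceDesc>
        <biblStruct>
          <analytic>
            <author><persName>
              <forename type="first">Ada</forename><surname>Example</surname>
            </persName></author>
            <author><persName>
              <forename type="first">Ben</forename><surname>Sample</surname>
            </persName></author>
          </analytic>
        </biblStruct>
      </sourceDesc>
    </fileDesc>
  </teiHeader>
</TEI>"""


def extract_header(tei_xml: str) -> dict:
    """Pull the main title and author names out of a TEI header."""
    root = ET.fromstring(tei_xml)
    title_el = root.find(".//tei:titleStmt/tei:title", TEI_NS)
    title = title_el.text.strip() if title_el is not None and title_el.text else ""
    authors = []
    for pers in root.findall(".//tei:sourceDesc//tei:author/tei:persName", TEI_NS):
        forename = pers.findtext("tei:forename", default="", namespaces=TEI_NS)
        surname = pers.findtext("tei:surname", default="", namespaces=TEI_NS)
        name = " ".join(part.strip() for part in (forename, surname) if part.strip())
        if name:
            authors.append(name)
    return {"title": title, "authors": authors}


header = extract_header(SAMPLE_TEI)
print(header["title"])
print(", ".join(header["authors"]))
```

Passing the namespace mapping to every lookup matters: without it, the parser finds no elements at all, because every TEI tag is namespace-qualified.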
Running

3

Coconut

🥥

Explore text data with various visualization tools

Note: Coconut Library Tool is an all-in-one data mining and textual analysis tool.
tomaarsen/span-marker-bert-base-uncased-keyphrase-inspec

Token Classification • Updated Sep 26, 2023 • 20 • 11

Note: This is a named entity recognition model trained to extract keyphrases (keywords) from a text.
Running on CPU Upgrade

38

Argilla Space

✍

Note: It is sometimes useful to create your own annotated data for training or evaluating machine learning models. Tools like Argilla can help with creating these annotations.
