Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
673.4
TFLOPS
3
6
37
Tim Bula
timrbula
Follow
brmcg's profile picture
21world's profile picture
2 followers
·
10 following
timrbula
timrbula.com
AI & ML interests
LLMs for language and code
Recent Activity
liked
a model
6 days ago
mistralai/Mistral-Small-24B-Base-2501
reacted
to
singhsidhukuldeep
's
post
with ❤️
10 days ago
It's not every day you see a research paper named "Alice's Adventures in a Differentiable Wonderland," and when you open it, it's a 281-page book! I haven't completed it yet, but this amazing work, written by Simone Scardapane, is a fascinating introduction to deep neural networks and differentiable programming. Some key technical highlights: • Covers core concepts like automatic differentiation, stochastic optimization, and activation functions in depth • Explains modern architectures like convolutional networks, transformers, and graph neural networks • Provides mathematical foundations including linear algebra, gradients, and probability theory • Discusses implementation details in PyTorch and JAX • Explores advanced topics like Bayesian neural networks and neural scaling laws The book takes a unique approach, framing neural networks as compositions of differentiable primitives rather than biological analogs. It provides both theoretical insights and practical coding examples. I especially enjoyed the sections on: • Vector-Jacobian products and reverse-mode autodiff • Stochastic gradient descent and mini-batch optimization • ReLU, GELU, and other modern activation functions • Universal approximation capabilities of MLPs Whether you're new to deep learning or an experienced practitioner, this book offers valuable insights into the fundamentals and latest developments. Highly recommended for anyone working with neural networks!
liked
a model
21 days ago
ibm-granite/granite-3.1-3b-a800m-base
View all activity
Organizations
timrbula
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
6 days ago
mistralai/Mistral-Small-24B-Base-2501
Text Generation
•
Updated
7 days ago
•
4.7k
•
196
liked
7 models
21 days ago
ibm-granite/granite-3.1-3b-a800m-base
Text Generation
•
Updated
7 days ago
•
4.81k
•
5
ibm-granite/granite-3.1-1b-a400m-instruct
Text Generation
•
Updated
7 days ago
•
6.33k
•
13
ibm-granite/granite-3.1-1b-a400m-base
Text Generation
•
Updated
7 days ago
•
3.71k
•
5
ibm-granite/granite-3.1-8b-base
Text Generation
•
Updated
7 days ago
•
7.79k
•
20
ibm-granite/granite-3.1-2b-instruct
Text Generation
•
Updated
7 days ago
•
19.7k
•
31
ibm-granite/granite-3.1-2b-base
Text Generation
•
Updated
7 days ago
•
7.52k
•
10
ibm-granite/granite-3.1-3b-a800m-instruct
Text Generation
•
Updated
7 days ago
•
21k
•
20
liked
a model
28 days ago
ibm-research/materials.smi-ted
Feature Extraction
•
Updated
Nov 7, 2024
•
52.7k
•
21
liked
a model
about 2 months ago
ibm-granite/granite-3.1-8b-instruct
Text Generation
•
Updated
7 days ago
•
79.7k
•
137
liked
8 models
4 months ago
ibm-granite/granite-3.0-3b-a800m-instruct
Text Generation
•
Updated
Dec 19, 2024
•
6.36k
•
17
ibm-granite/granite-3.0-3b-a800m-base
Text Generation
•
Updated
Dec 19, 2024
•
3.68k
•
5
ibm-granite/granite-3.0-1b-a400m-instruct
Text Generation
•
Updated
Dec 19, 2024
•
672
•
19
ibm-granite/granite-3.0-1b-a400m-base
Text Generation
•
Updated
Dec 19, 2024
•
3.73k
•
6
ibm-granite/granite-3.0-8b-instruct
Text Generation
•
Updated
Dec 19, 2024
•
34.9k
•
199
ibm-granite/granite-3.0-8b-base
Text Generation
•
Updated
Dec 19, 2024
•
6.51k
•
22
ibm-granite/granite-3.0-2b-base
Text Generation
•
Updated
Dec 19, 2024
•
5.91k
•
21
ibm-granite/granite-3.0-2b-instruct
Text Generation
•
Updated
Dec 19, 2024
•
17.8k
•
47
liked
a dataset
4 months ago
TokenBender/code_instructions_122k_alpaca_style
Viewer
•
Updated
Jul 20, 2023
•
122k
•
1.94k
•
73
liked
a model
4 months ago
mistralai/Mixtral-8x22B-Instruct-v0.1
Text Generation
•
Updated
Oct 3, 2024
•
860k
•
•
710
Load more