How to Find and Evaluate Uncensored LLM Models on HuggingFace
I needed to find uncensored LLM models on HuggingFace for a local project. The platform hosts over 500,000 models, so finding the right ones requires knowing where to look and how to evaluate what you find.
The Challenge
HuggingFace’s search is powerful but not intuitive for finding uncensored variants. Standard searches for “uncensored” return mixed results - some models are genuinely uncensored, others are poorly documented, and many have significant capability degradation. I needed a systematic approach.
Finding Models: Three Proven Methods
Method 1: Keyword Search on HuggingFace
The fastest way to find uncensored models is using specific keywords in the HuggingFace search bar.
Primary search keywords:
- abliterated - Returns 4,967+ models
- heretic - Returns 2,164+ models
- uncensored - General search, variable quality
I found that “abliterated” and “heretic” yield the most consistent results because these terms have specific technical meanings in the uncensoring community.
Direct URL searches:
https://huggingface.co/models?search=abliteratedhttps://huggingface.co/models?search=heretichttps://huggingface.co/models?search=uncensoredSort by downloads to find community-validated models:
https://huggingface.co/models?search=abliterated&sort=downloadsMethod 2: Programmatic Search with HuggingFace API
For batch discovery, I use the HuggingFace API to search and filter models programmatically.
from huggingface_hub import HfApi
api = HfApi()models = list(api.list_models(search="abliterated", limit=100))
# Extract key informationfor model in models[:10]: print(f"{model.id}: {model.downloads} downloads")This approach lets me filter by downloads, likes, and last modified date to find active, well-maintained models.
Method 3: Curated Collections
Some creators maintain organized collections of uncensored models. The DavidAU Heretic Collection is particularly useful:
https://huggingface.co/collections/DavidAU/heretic-abliterated-uncensored-unrestricted-powerThese collections typically include:
- Detailed model cards explaining uncensoring methodology
- Benchmark comparisons with base models
- Usage examples and known limitations
Understanding the Two Main Uncensoring Approaches
Before evaluating models, I needed to understand what makes them different.
Abliterated Models
Abliterated models use mechanistic interpretability techniques to remove refusal directions from model weights without retraining.
Technical characteristics:
- Identifies refusal directions using SVD/PCA analysis- Projects out refusal vectors from weights- No retraining required- Process completes in minutesBest for: Quick experimentation, limited compute resources, testing before committing to heretic models.
Heretic Models
Heretic models undergo fine-tuning with Bayesian-optimized kernel methods for more comprehensive uncensoring.
Technical characteristics:
- Fine-tuning with Optuna TPE search- 7 global parameters optimized- Parametric kernel optimization- Permanent weight modificationsBest for: Maximum uncensoring needed, GPU resources available, permanent deployment.
Evaluating Model Quality
Finding models is easy. Evaluating quality requires a multi-step approach.
Step 1: Check Open LLM Leaderboard Scores
The Open LLM Leaderboard provides standardized benchmarks for model comparison. I look for these key metrics:
MMLU-Pro: Tests knowledge across 57 subjectsIFEval: Measures instruction following abilityMATH: Tests mathematical reasoningGPQA: Tests fact-based reasoningMuSR: Tests multi-step reasoning chainsTarget scores for usable uncensored models:
MMLU-Pro: 60-75% (good range)IFEval: 70%+ (critical for practical use)MATH: 30-50% (capable models)GPQA: 40-60% (good range)MuSR: 60%+ (usable models)Important limitation: Many uncensored models are not submitted to the official leaderboard. When this happens, I check the model card for self-reported benchmarks.
Step 2: Assess Capability Preservation
Uncensoring can damage model capabilities. I compare uncensored model benchmarks to the base model.
0-10% drop: Acceptable10-20% drop: Warning zone>20% drop: Model likely damaged, rejectI found that well-constructed abliterated models typically show 0-5% degradation, while heretic models can vary more widely depending on the training approach.
Step 3: Check Community Indicators
Download count is a useful quality signal:
>10k downloads: Community-validated quality1k-10k downloads: Worth investigating<1k downloads: Test thoroughly before useI also check:
- Model card quality (detailed methodology, benchmark comparisons)
- Creator reputation (established creators like DavidAU, huihui-ai)
- Update frequency and community engagement
Step 4: Test Refusal Rate
The ultimate test is whether the model actually responds to prompts that typically trigger refusals.
Manual testing protocol:
Test prompts that typically trigger refusalsScore: % of prompts receiving actual responsesTarget: 90%+ response rate for well-uncensored modelsI test 10-20 prompts across different categories to get a realistic refusal rate.
A Practical Workflow
Here’s the workflow I use when finding and evaluating uncensored models.
Phase 1: Discovery (30 minutes)
- Search for “abliterated” models on HuggingFace, sort by downloads
- Search for “heretic” models, check DavidAU’s collection
- Note 5-10 candidate models matching hardware requirements
Phase 2: Evaluation (2-4 hours)
- Check leaderboard scores or model card benchmarks
- Compare benchmark scores to base model
- Assess capability preservation (look for less than 10% degradation)
- Review download counts and model card quality
Phase 3: Testing (1-2 hours)
- Download top 2-3 candidates
- Run refusal tests with 10-20 prompts
- Test practical capabilities (reasoning, creativity, knowledge)
- Select best match for use case
Common Mistakes to Avoid
Mistake 1: Assuming all “uncensored” models are equal
Models labeled “uncensored” vary wildly in quality. Some have residual refusals; others have severe capability degradation. Always test refusal rates and benchmark preservation.
Mistake 2: Ignoring capability degradation
Aggressive uncensoring can damage reasoning and coherence. I reject models with >20% benchmark degradation from base model.
Mistake 3: Overlooking hardware requirements
Before downloading, I calculate VRAM needs:
Model parameters (B) x 2 (FP16) = Minimum VRAM in GB
For 16GB VRAM: target 7B-14B modelsFor 24GB VRAM: target up to 14B models comfortablyFor 48GB VRAM: target up to 30B modelsMistake 4: Trusting single evaluation sources
I combine multiple evaluation methods: benchmarks + refusal testing + community feedback + practical testing. No single source tells the complete story.
Mistake 5: Not reading model cards
Model cards contain critical information: uncensoring methodology, known limitations, special usage instructions. Always read the full model card before downloading.
Starting Points for Different Hardware
For 16GB VRAM:
Abliterated:- lukey03/Qwen3.5-9B-abliterated- huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated (quantized)
Heretic:- DavidAU/Qwen3.5-9B-Claude-4.6-HighIQ-THINKING-HERETIC-UNCENSORED
Community recommended:- Dirty Shirley Writer V1- fluffy/l3-8b-stheno-v3.2For 24GB VRAM:
- llmfan46/Qwen3.5-35B-A3B-heretic-v2 (quantized)- gpt-oss-20b-heretic-ara-v3- Magnum cydoms 24b i1Summary
Finding uncensored LLM models on HuggingFace requires specific keywords (“abliterated”, “heretic”), systematic evaluation (benchmarks, refusal rates, capability preservation), and practical testing. I use the three-phase workflow: discovery through keyword search, evaluation through benchmark analysis, and testing through refusal rate assessment.
The best models balance uncensoring effectiveness with capability preservation: >90% response rate on typically-refused prompts while maintaining >90% of base model benchmark performance. Start with well-documented, high-download models from reputable creators, and always test in your specific use case.
Final Words + More Resources
My intention with this article was to help others share my knowledge and experience. If you want to contact me, you can contact by email: Email me
Here are also the most important links from this article along with some further resources that will help you in this scope:
- 👨💻 HuggingFace Model Hub
- 👨💻 Open LLM Leaderboard
- 👨💻 OBLITERATUS Project
- 👨💻 DavidAU Heretic Collection
Oh, and if you found these resources useful, don’t forget to support me by starring the repo on GitHub!
Comments