Spaces:
Running
Running
File size: 6,356 Bytes
7a48c74 4448285 7a48c74 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 |
---
license: mit
title: Query Fan-Out SEO Analyzer
sdk: streamlit
emoji: π
colorFrom: green
colorTo: yellow
pinned: true
short_description: WordPress Query Fan Out SEO Tool
---
# WordPress Query Fan-Out SEO Analyzer
A comprehensive tool that analyzes WordPress sites for Google AI Mode optimization using query decomposition and multi-hop reasoning strategies.
## Features
- **WordPress Content Crawling**: Fetches all posts, pages, categories, and tags via wp-json API
- **Content Graph Construction**: Builds a knowledge graph of your site's content and internal links
- **Query Pattern Analysis**: Uses Claude AI to identify complex queries and decomposition opportunities
- **Semantic Clustering**: Groups related content using TF-IDF vectorization
- **Multi-Source Optimization**: Identifies content that can serve multiple Google source types
- **Actionable Recommendations**: Provides specific steps to optimize for query fan-out
- **Visual Graph Export**: Creates interactive visualization of your content network
## Installation
```bash
# Clone or download app.py
# Install dependencies
pip install -r requirements.txt
```
## Usage
### Basic Usage
```bash
python app.py https://yourwordpresssite.com YOUR_CLAUDE_API_KEY
```
### With Options
```bash
# Custom output file
python app.py https://yourwordpresssite.com YOUR_CLAUDE_API_KEY --output my_report.json
# With visualization
python app.py https://yourwordpresssite.com YOUR_CLAUDE_API_KEY --visualize
```
## Getting Your Claude API Key
1. Sign up at https://console.anthropic.com
2. Go to API Keys section
3. Create a new API key
4. Copy and use in the command
## What the Analyzer Does
### 1. Content Fetching
- Retrieves all published posts and pages
- Fetches categories, tags, and media information
- Respects rate limits to avoid overloading your server
### 2. Graph Construction
- Creates nodes for each piece of content
- Maps internal links as edges
- Identifies content relationships through categories/tags
### 3. Query Analysis with Claude
- Sends content samples to Claude API
- Identifies potential complex user queries
- Predicts how Google would decompose these queries
- Finds gaps in sub-query coverage
### 4. Content Depth Analysis
- Scores each piece of content for depth and comprehensiveness
- Identifies potential hub pages
- Finds orphaned content with no internal links
- Discovers semantic content clusters
### 5. Recommendation Generation
- Content gaps for unanswered sub-queries
- Internal linking opportunities
- Hub page optimization suggestions
- Semantic bridge creation recommendations
## Output Report Structure
```json
{
"site_url": "https://example.com",
"analysis_date": "2024-01-15T10:30:00",
"summary": {
"total_posts": 156,
"total_pages": 23,
"content_nodes": 205,
"internal_links": 432,
"orphan_content": 12,
"hub_pages": 5,
"semantic_clusters": 8
},
"query_optimization": {
"complex_queries": [
"How do I set up WooCommerce with custom shipping zones for international orders?"
],
"decompositions": {
"query_1": [
"What is WooCommerce?",
"How to install WooCommerce?",
"What are shipping zones?",
"How to set up international shipping?"
]
},
"gaps": [
"No content about shipping zones",
"Missing international shipping guide"
]
},
"recommendations": [
{
"type": "content_gap",
"priority": "high",
"action": "Create new content",
"details": "Create content to answer sub-query: What are shipping zones?",
"impact": "Enables multi-hop reasoning path"
}
],
"action_plan": {
"immediate": [...],
"short_term": [...],
"long_term": [...]
}
}
```
## Interpreting Results
### Content Gaps
These are sub-queries that Google might generate but your site doesn't answer. Creating this content enables Google to use your site in multi-hop reasoning.
### Orphan Content
Valuable content that isn't well-connected to your site's graph. Adding internal links helps Google traverse your content.
### Hub Pages
Pages with high potential to serve as central nodes in query paths. Optimizing these strengthens your site's authority.
### Semantic Clusters
Groups of related content that should be better interconnected to support query fan-out.
## Visualization
If you use the `--visualize` flag, the tool generates an interactive HTML graph showing:
- Blue nodes: Posts
- Orange nodes: Pages
- Green nodes: Categories
- Red nodes: Tags
- Node size: Based on number of connections
- Edges: Internal links and relationships
## Best Practices
1. **Run Regularly**: Monthly analysis helps track improvements
2. **Focus on High-Priority**: Address "immediate" recommendations first
3. **Create Sub-Query Content**: Each piece should comprehensively answer one specific question
4. **Build Semantic Bridges**: Connect related content with contextual internal links
5. **Monitor Results**: Track performance in Google Search Console
## Troubleshooting
### API Rate Limits
If you hit rate limits, the tool automatically slows down. For large sites, the analysis may take 10-20 minutes.
### Memory Issues
For very large sites (1000+ posts), you may need to modify the code to process in batches.
### Claude API Errors
Ensure your API key is valid and you have sufficient credits.
## Example Use Cases
### E-commerce Site
Identifies complex product queries and ensures all comparison factors are covered.
### Tech Blog
Finds tutorial series that need better interconnection for step-by-step learning paths.
### Service Business
Discovers service-related questions that require multiple pages to answer fully.
## Advanced Usage
### Custom Analysis
Modify the `analyze_query_patterns()` method to focus on specific query types relevant to your niche.
### Export Formats
Extend the `export_report()` method to output in different formats (CSV, HTML, etc.).
### Integration
Use the report data to automatically create content briefs or update your content calendar.
## Support
For issues or questions:
1. Check WordPress REST API is enabled: `https://yoursite.com/wp-json/`
2. Verify Claude API key is active
3. Ensure Python dependencies are installed correctly
## License
MIT License - Feel free to modify and use for your SEO optimization needs. |