Common Issues

This page documents common issues and their solutions. Each issue includes symptoms, root cause, step-by-step solution, and prevention tips.

Frontend Issues

Blank Screen After Loading

Symptoms:

Frontend loads but shows blank page
No content visible
Browser console may show errors

Cause: Backend API is unreachable or returning errors

Solution:

Check backend is running:

docker compose ps backend

Check backend logs for errors:

docker compose logs backend

Verify frontend can reach backend:

curl http://localhost:3001/api/health

Check CORS configuration in backend matches frontend URL

Prevention:

Use health check endpoints in deployment
Configure proper CORS settings
Monitor backend availability

API Connection Errors

Symptoms:

"Failed to fetch" errors in browser console
Network errors when making API calls
CORS policy errors

Cause: CORS configuration mismatch between frontend and backend

Solution:

Check CORS_ORIGIN in backend environment:

docker compose exec backend env | grep CORS_ORIGIN

Ensure it matches frontend URL (http://localhost:5173 for development)
Update CORS_ORIGIN if needed:

CORS_ORIGIN=http://localhost:5173

Restart backend:

docker compose restart backend

Prevention:

Set CORS_ORIGIN correctly during deployment
Use environment variables for configuration
Test API connectivity after deployment

Video Won't Play

Symptoms:

Video player shows error or loading indefinitely
Video playback controls disabled
Black screen in video player

Cause: Video file not accessible, wrong format, or server configuration issue

Solution:

Check video exists in /data directory:

ls -lh data/*.mp4

Verify video format (MP4 and WebM supported):

file data/your_video.mp4

Check backend serves video:

curl -I http://localhost:3001/videos/your_video.mp4

Check file permissions:

ls -l data/your_video.mp4

Prevention:

Use supported video formats (MP4, WebM)
Verify file permissions (readable by Docker)
Test video playback after adding new videos

Backend Issues

Database Connection Failed

Symptoms:

Backend logs show "Connection refused" to PostgreSQL
Backend service fails to start
Database migration errors

Cause: Database not ready, wrong credentials, or network issues

Solution:

Check PostgreSQL is running:

docker compose ps postgres

Check PostgreSQL logs:

docker compose logs postgres

Verify DATABASE_URL has correct credentials:

docker compose exec backend printenv | grep DATABASE_URL

Test database connection:

docker compose exec backend npx prisma db execute --stdin <<< "SELECT 1"

Wait for database to be ready (check health status)

Prevention:

Use depends_on with health checks in docker-compose.yml
Configure proper startup order
Monitor database health

Migration Errors

Symptoms:

"Migration failed" on backend startup
Database schema out of sync
Prisma errors about missing tables

Cause: Schema changes conflict with existing data or migration not applied

Solution:

Check migration status:

docker compose exec backend npx prisma migrate status

Apply pending migrations:

docker compose exec backend npx prisma migrate deploy

If migrations fail, check for conflicts in logs
For development only, reset database:

docker compose exec backend npx prisma migrate reset

Prevention:

Test migrations in development first
Backup database before applying migrations
Review migration SQL before applying

High Memory Usage

Symptoms:

Backend container uses more than 4GB RAM
System becomes slow
Docker reports memory warnings
Container may be restarted by Docker

Cause: Memory leak, large dataset in cache, or insufficient resource limits

Solution:

Check current memory usage:

docker stats backend

Restart backend:

docker compose restart backend

Check for memory leaks in logs:

docker compose logs backend | grep -i "memory\|heap"

Reduce cache TTL in Redis if caching large datasets
Set memory limits in docker-compose.yml:

backend:
  deploy:
    resources:
      limits:
        memory: 4G

Prevention:

Monitor memory usage regularly
Set appropriate resource limits
Configure cache expiration
Profile application for memory leaks

Model Service Issues

CUDA Out of Memory

Symptoms:

Model service crashes with "CUDA out of memory" error
GPU memory exhausted messages
Container restarts

Cause: GPU VRAM exhausted by model or batch size

Solution:

Check GPU memory usage:

docker compose exec model-service-gpu nvidia-smi

Reduce batch size in model-service/config/models.yaml
Use smaller models
Switch to CPU mode temporarily:

docker compose --profile gpu down
docker compose up -d

Adjust PyTorch memory settings:

PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:256

Prevention:

Monitor GPU memory usage
Use appropriate model sizes for available VRAM
Configure batch sizes conservatively
Set memory limits in configuration

Model Not Loading

Symptoms:

"Failed to load model" errors
Model service takes very long to start
Inference requests fail

Cause: Model files not downloaded, corrupted, or network issues

Solution:

Check model service logs:

docker compose logs model-service

Look for specific model name in error
Delete model cache and restart:

docker volume rm fovea_model-cache
docker compose up -d model-service

Verify internet connection for model download
Check disk space:

df -h

Prevention:

Pre-download models
Use persistent volume for model cache
Ensure adequate disk space
Monitor model loading times

Slow Inference

Symptoms:

Inference takes more than 30 seconds for simple tasks
Job queue backs up
Timeout errors

Cause: CPU mode, model configuration issue, or resource contention

Solution:

Verify GPU mode if applicable:

docker compose exec model-service-gpu nvidia-smi

Check DEVICE environment variable:

docker compose exec model-service env | grep DEVICE

Switch to faster model in config/models.yaml
Verify no other processes using GPU
Check system resource usage:

docker stats

Prevention:

Use GPU mode for production
Benchmark models before deployment
Monitor inference latency
Configure appropriate model sizes

Infrastructure Issues

Docker Out of Disk Space

Symptoms:

"No space left on device" errors
Build failures
Container start failures

Cause: Docker images, containers, and volumes consuming disk

Solution:

Check disk usage:

df -h
docker system df

Clean up Docker resources:

docker system prune -a --volumes

Remove unused images:

docker image prune -a

Check large volumes:

docker volume ls
docker volume inspect fovea_model-cache

Prevention:

Regular cleanup (weekly/monthly)
Monitor disk space
Set up disk space alerts
Use appropriate volume sizes

Port Already in Use

Symptoms:

"Port 5173 is already allocated" error
"Address already in use" error
Service fails to start

Cause: Another service using the port

Solution:

Find process using port:

lsof -i :5173

Kill process:

kill <PID>

Or change FOVEA port in docker-compose.yml:

frontend:
  ports:
    - "5174:5173"  # Use different external port

Prevention:

Use standard ports consistently
Document port usage
Check for conflicts before deployment
Use port range that does not conflict with common services

Permission Denied

Symptoms:

"Permission denied" accessing /data directory
Cannot write to volumes
Docker socket permission errors

Cause: File permissions mismatch or user permissions

Solution:

Fix file ownership:

sudo chown -R $USER:$USER data/

Check Docker permissions:

sudo usermod -aG docker $USER
newgrp docker

Verify volume permissions:

docker compose exec backend ls -la /data

Set correct permissions in docker-compose.yml:

volumes:
  - ./data:/data:rw

Prevention:

Set correct permissions during setup
Run containers as appropriate user
Document permission requirements
Use Docker user namespace remapping for security

General Troubleshooting Steps

Diagnostic Commands

Check service status:

docker compose ps

View all logs:

docker compose logs -f

Check specific service:

docker compose logs backend
docker compose logs model-service

View resource usage:

docker stats

Inspect network:

docker network inspect fovea_fovea-network

Check health endpoints:

curl http://localhost:3001/api/health
curl http://localhost:8000/health

Restart Strategy

Single service:

docker compose restart backend

All services:

docker compose restart

Clean restart:

docker compose down
docker compose up -d

Rebuild and restart:

docker compose up -d --build

Getting Help

If issues persist after troubleshooting:

Collect diagnostic information:
- Service logs: docker compose logs > logs.txt
- Configuration: docker compose config
- System info: docker version, docker compose version
Check GitHub issues for similar problems
Search documentation for related topics
Include diagnostic information when asking for help

Next Steps

Common Tasks: Daily operations
Monitoring: Set up monitoring
Configuration: Configuration guide

Frontend Issues​

Blank Screen After Loading​

API Connection Errors​

Video Won't Play​

Backend Issues​

Database Connection Failed​

Migration Errors​

High Memory Usage​

Model Service Issues​

CUDA Out of Memory​

Model Not Loading​

Slow Inference​

Infrastructure Issues​

Docker Out of Disk Space​

Port Already in Use​

Permission Denied​

General Troubleshooting Steps​

Diagnostic Commands​

Restart Strategy​

Getting Help​

Next Steps​

Frontend Issues

Blank Screen After Loading

API Connection Errors

Video Won't Play

Backend Issues

Database Connection Failed

Migration Errors

High Memory Usage

Model Service Issues

CUDA Out of Memory

Model Not Loading

Slow Inference

Infrastructure Issues

Docker Out of Disk Space

Port Already in Use

Permission Denied

General Troubleshooting Steps

Diagnostic Commands

Restart Strategy

Getting Help

Next Steps