Wednesday, January 8, 2025

Python code to generate music with MusicGen (facebook/musicgen-melody)

import os
import psutil
import torch
import gc
from transformers import AutoProcessor, MusicgenMelodyForConditionalGeneration, MusicgenMelodyConfig
import scipy.io.wavfile
# https://huggingface.co/docs/transformers/main/model_doc/musicgen_melody

# Function to log memory usage
def log_memory(stage=""):
    process = psutil.Process(os.getpid())
    print(f"Memory Usage after {stage}: {process.memory_info().rss / 1024 ** 2} MB")

log_memory("initial load")

# Hugging Face token for authentication (read it from the environment rather than hard-coding a secret)
token = os.environ.get("HF_TOKEN")

# Load model configuration and manually add missing config attributes
#model_name = "facebook/musicgen-small" # Use smaller variants if available
model_name = "facebook/musicgen-melody" # For better output
config = MusicgenMelodyConfig.from_pretrained(model_name, token=token)

# Manually add the missing 'use_cache' attribute
config.use_cache = False # Works around an AttributeError raised when 'use_cache' is missing from the config

# Manually add the missing initializer_factor if it's required
config.initializer_factor = 1.0 # Default value for initialization

# Experimental configuration overrides for debugging. Note: the pretrained submodules are built from the
# nested text_encoder/audio_encoder/decoder configs, so these top-level values do not resize the loaded weights.
config.dropout = 0.1
config.layerdrop = 0.1
config.max_position_embeddings = 512 # Reduced
config.hidden_size = 128 # Smaller hidden size
config.num_codebooks = 128 # Experimental override (the pretrained melody decoder itself uses 4 codebooks)
config.scale_embedding = True
config.vocab_size = 50257
config.num_hidden_layers = 2 # Fewer layers
config.num_attention_heads = 4 # Fewer attention heads
config.attention_dropout = 0.1
config.activation_function = "gelu"
config.activation_dropout = 0.1
config.ffn_dim = 1024

log_memory("after config")

# Load the model
model = MusicgenMelodyForConditionalGeneration.from_pretrained(model_name, config=config, token=token).eval()

log_memory("after model loaded")

# Processor for the model
processor = AutoProcessor.from_pretrained(model_name, token=token)

# Text prompt describing the music to generate
prompt = "A relaxing jazz track with piano and bass."

inputs = processor(
    text=[prompt],
    padding=True,
    return_tensors="pt",
).to(model.device)

# Inspect the tokenized input shape
print(f"Input tensor shape: {inputs['input_ids'].shape}")

# Generate audio based on input prompt with no_grad to save memory
with torch.no_grad():
    generated_audio = model.generate(**inputs, max_new_tokens=1024)
print(f"Generated audio tensor shape: {generated_audio.shape}")

log_memory("after generation")

# Check type of the audio data
print(f"Type of generated audio: {type(generated_audio)}")

# Save the generated audio to a file
if isinstance(generated_audio, torch.Tensor):
    sampling_rate = model.config.audio_encoder.sampling_rate
    scipy.io.wavfile.write("generated_music.wav", rate=sampling_rate, data=generated_audio.to("cpu")[0, 0].numpy())
else:
    print("Unexpected audio format, unable to save.")

# Cleanup
del generated_audio # Explicitly delete the variable
gc.collect() # Garbage collection
log_memory("after cleanup")

Photorealistic filter for video game footage

Here is a step-by-step guide for creating a photorealistic filter for video game footage using the workflow and free tools mentioned:


Step 1: Extract Video Frames

First, convert the video game footage into individual image frames.

  1. Install FFmpeg:

    • Download FFmpeg from the official website (https://ffmpeg.org).
    • Add FFmpeg to your system’s PATH for easy access via the terminal/command prompt.
  2. Extract Frames:

    • Open a terminal and navigate to the folder containing your video (input.mp4).
    • Run the following command:
      ffmpeg -i input.mp4 -vf fps=30 frames/frame_%04d.png
      
      • input.mp4: Replace this with your video file name.
      • fps=30: Set the output frame rate (30 frames per second).
      • frames/frame_%04d.png: Saves frames in the frames/ folder as frame_0001.png, frame_0002.png, etc.
  3. Verify Output:

    • Check the frames/ folder to ensure the extracted frames are saved as images (a small Python wrapper for this step is sketched below).
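
If you prefer to drive this step from Python (for example, to batch several videos), here is a minimal wrapper around the same FFmpeg command; the file name and 30 fps rate are just the values used above:

import subprocess
from pathlib import Path

def extract_frames(video_path: str, out_dir: str = "frames", fps: int = 30) -> None:
    """Extract video frames as PNGs using the FFmpeg command shown above."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(
        ["ffmpeg", "-i", video_path, "-vf", f"fps={fps}", f"{out_dir}/frame_%04d.png"],
        check=True,  # raise an error if FFmpeg fails
    )

extract_frames("input.mp4")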

Step 2: Process Frames

Option A: Apply Style Transfer with CycleGAN

  1. Download CycleGAN:

    • Clone the repository:
      git clone https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix.git
      cd pytorch-CycleGAN-and-pix2pix
      
  2. Install Dependencies:

    • Install required Python libraries:
      pip install -r requirements.txt
      
  3. Download Pretrained Models:

    • Download a pretrained model (e.g., horse2zebra for stylistic changes, or fine-tune your own model for photorealism):
      bash ./scripts/download_cyclegan_model.sh horse2zebra
      
  4. Apply Style Transfer:

    • Use the test.py script to process frames:
      python test.py --dataroot ./frames --name horse2zebra_pretrained --model test --no_dropout
      
      • Replace horse2zebra_pretrained with the name of your pretrained model.
      • Processed frames will be saved under the results/ folder; see the sketch after this list for collecting them into processed_frames/.
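
The test script normally writes its outputs under results/<experiment_name>/test_latest/images/ with a _fake suffix (verify this layout on your run). A small sketch that copies those outputs back into a sequentially named processed_frames/ folder for Step 3:

import shutil
from pathlib import Path

src = Path("results/horse2zebra_pretrained/test_latest/images")  # assumed default CycleGAN output layout
dst = Path("processed_frames")
dst.mkdir(exist_ok=True)

# Keep only the translated ("fake") images and restore the frame_%04d.png naming for FFmpeg
for i, img in enumerate(sorted(src.glob("*_fake.png")), start=1):
    shutil.copy(img, dst / f"frame_{i:04d}.png")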

Option B: Apply Super-Resolution with ESRGAN

  1. Download ESRGAN:

    • Clone the repository:
      git clone https://github.com/xinntao/ESRGAN.git
      cd ESRGAN
      
  2. Install Dependencies:

    • Install required Python libraries:
      pip install -r requirements.txt
      
  3. Download Pretrained Models:

    • Download the pretrained ESRGAN weights linked in the repository README and place them in the models/ folder.
  4. Run Super-Resolution:

    • Process frames with the repository's test script. By default it reads low-resolution images from the LR/ folder and writes upscaled results to results/ (check the README for the exact invocation in your version):
      python test.py
      
      • Input: ./LR/ (copy or downscale your frames into this folder; a sketch follows this list)
      • Output: ./results/ (copy or rename the upscaled images into processed_frames/ for Step 3)
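
ESRGAN upscales by 4x, so full-resolution frames can easily exhaust 4 GB of VRAM and produce enormous outputs. A small Pillow sketch that shrinks each frame before super-resolution and drops it into the LR/ folder; the 480 px cap is an arbitrary starting point (480 x 4 ≈ 1920 px output), not a requirement:

from pathlib import Path
from PIL import Image

max_side = 480  # tune for your GPU; 480 px in gives roughly 1920 px out after 4x upscaling
src, dst = Path("frames"), Path("LR")  # LR/ is where the default test script looks for inputs
dst.mkdir(exist_ok=True)

for img_path in sorted(src.glob("*.png")):
    img = Image.open(img_path)
    scale = max_side / max(img.size)
    if scale < 1:  # only shrink, never enlarge
        img = img.resize((round(img.width * scale), round(img.height * scale)), Image.LANCZOS)
    img.save(dst / img_path.name)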

Option C: Generate Depth Maps with MiDaS

  1. Download MiDaS:

    • Clone the MiDaS repository:
      git clone https://github.com/isl-org/MiDaS.git
      cd MiDaS
      
  2. Install Dependencies:

    • Install PyTorch and MiDaS dependencies:
      pip install torch torchvision
      pip install -r requirements.txt
      
  3. Run Depth Estimation:

    • Generate depth maps for each frame (a torch.hub-based alternative is sketched after this list):
      python run.py --input_path ./frames --output_path ./depth_maps
      
      • Input: ./frames/
      • Output: ./depth_maps/
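
As an alternative to the cloned repository, MiDaS is also published on PyTorch Hub. A minimal single-frame sketch; the model and transform names follow the MiDaS documentation, but treat them as assumptions and check the repository if they have changed (OpenCV is used here for image I/O):

import os
import cv2
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

# The small model keeps VRAM usage low; larger variants (e.g. "DPT_Large") trade speed for quality
midas = torch.hub.load("intel-isl/MiDaS", "MiDaS_small").to(device).eval()
midas_transforms = torch.hub.load("intel-isl/MiDaS", "transforms")
transform = midas_transforms.small_transform

img = cv2.cvtColor(cv2.imread("frames/frame_0001.png"), cv2.COLOR_BGR2RGB)

with torch.no_grad():
    prediction = midas(transform(img).to(device))
    # Resize the prediction back to the original frame size
    depth = torch.nn.functional.interpolate(
        prediction.unsqueeze(1), size=img.shape[:2], mode="bicubic", align_corners=False
    ).squeeze().cpu().numpy()

os.makedirs("depth_maps", exist_ok=True)
cv2.imwrite("depth_maps/frame_0001.png", cv2.normalize(depth, None, 0, 255, cv2.NORM_MINMAX).astype("uint8"))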

Step 3: Reassemble Processed Frames into a Video

  1. Ensure Processed Frames are Ordered:

    • Processed frames should be named sequentially (frame_0001.png, frame_0002.png, etc.); a quick check for gaps is sketched after this list.
  2. Merge Frames into Video:

    • Run the following FFmpeg command:
      ffmpeg -framerate 30 -i processed_frames/frame_%04d.png -c:v libx264 -pix_fmt yuv420p output.mp4
      
      • -framerate 30: Match the original frame rate (30 FPS).
      • processed_frames/frame_%04d.png: Processed frames directory.
      • output.mp4: Final video file.
  3. Verify Output:

    • Check the output.mp4 file to ensure it combines the processed frames into a smooth video.
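
FFmpeg's image-sequence input stops at the first gap in the numbering, so it is worth confirming that no frames are missing before reassembly. A quick sketch:

import re
from pathlib import Path

frames = sorted(Path("processed_frames").glob("frame_*.png"))
if not frames:
    raise SystemExit("No frames found in processed_frames/")

numbers = [int(re.search(r"(\d+)", p.stem).group(1)) for p in frames]

# The numbering must be contiguous for the frame_%04d.png pattern to pick up every file
missing = sorted(set(range(numbers[0], numbers[-1] + 1)) - set(numbers))
print(f"{len(frames)} frames found", "with no gaps" if not missing else f"- missing: {missing[:10]}")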

Optimizing for GTX 1650 Ti

  • Resolution: Limit frame resolution to 720p (1280x720) to avoid GPU memory issues.
  • Batch Processing: Process frames in batches to reduce VRAM usage.
    • Modify scripts to load and process fewer images at a time.
  • Mixed Precision: Use PyTorch's torch.cuda.amp (exposed as torch.autocast in current releases) for mixed-precision inference to save VRAM; see the sketch below.
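
A generic pattern for the last two points. The tiny convolution layer below is only a stand-in for whichever network you are actually running (CycleGAN, ESRGAN or MiDaS), so adapt the model, preprocessing and output handling to the real scripts:

import torch
from pathlib import Path
from torchvision.io import ImageReadMode, read_image

device = "cuda"   # this section assumes the GTX 1650 Ti is available
batch_size = 4    # lower this if VRAM still runs out
model = torch.nn.Conv2d(3, 3, 3, padding=1).to(device).eval()  # stand-in for the real network

frame_paths = sorted(Path("frames").glob("*.png"))

for i in range(0, len(frame_paths), batch_size):
    chunk = frame_paths[i:i + batch_size]
    batch = torch.stack([read_image(str(p), mode=ImageReadMode.RGB).float() / 255.0 for p in chunk]).to(device)

    # autocast runs eligible ops in float16, roughly halving activation memory
    with torch.no_grad(), torch.autocast(device_type="cuda", dtype=torch.float16):
        outputs = model(batch)

    # ...write each tensor in `outputs` to processed_frames/ using the original frame names...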

Potential Enhancements

  1. Dataset Fine-Tuning:

    • Train CycleGAN or ESRGAN on a custom dataset with real-world images for better photorealism.
    • Use datasets like COCO or Flickr for training.
  2. Post-Processing:

    • Add cinematic effects using DaVinci Resolve (free) for professional video editing.

Let me know if you’d like help setting up any of these tools or troubleshooting specific issues!