When hardware decode (mppvideodec/NV12) is active, wrap the appsink in a
GstBin with a videoscale element so the VPU decodes at full stream
resolution but Python only receives a frame pre-scaled to the SDL display
size (default 640x480).
Effect:
NV12 buffer per frame: 3,133,440 B (1080p) → 460,800 B (640x480)
memmove per frame: ~33 ms (80.5% of the frame budget) → ~5 ms (~12% expected)
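The buffer sizes above follow from NV12's 1.5 bytes per pixel; the
3,133,440 B figure implies the decoder pads 1080p output to 1088 rows
(a common 16-row VPU alignment, inferred here from the arithmetic):

```python
# NV12 stores a full-resolution Y plane plus a half-resolution
# interleaved UV plane: 1.5 bytes per pixel overall.
def nv12_size(width: int, height: int) -> int:
    return width * height * 3 // 2

full = nv12_size(1920, 1088)   # decoded stream resolution, padded to 1088 rows
small = nv12_size(640, 480)    # pre-scaled to the SDL display size

print(full, small)             # 3133440 460800
```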
The videoscale bilinear step runs entirely in software on the A35 cores
but scales down 6.7×, so its cost is far lower than the avoided memmove.
SDL still handles final aspect-ratio fitting inside the viewport, so
visual quality is unchanged relative to what the 640x480 display can show.
Fallback: if videoscale is not available, unscaled NV12 is used as before.
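One way to express the bin wiring and the fallback is to build the
appsink branch of the launch description conditionally. This is a
sketch: `build_sink_branch` is a hypothetical helper, and the
availability flag is passed in so the example runs without GStreamer;
in the real code the check would be something like
`Gst.ElementFactory.find("videoscale") is not None`:

```python
def build_sink_branch(has_videoscale: bool,
                      width: int = 640, height: int = 480) -> str:
    """Return the appsink branch of a gst-launch-style description.

    With videoscale present, the VPU still decodes at full stream
    resolution, but a caps filter forces a software downscale before
    the buffer ever crosses into Python.
    """
    if has_videoscale:
        return (f"videoscale ! "
                f"video/x-raw,format=NV12,width={width},height={height} ! "
                f"appsink name=sink emit-signals=true max-buffers=1 drop=true")
    # Fallback: unscaled NV12, exactly as before.
    return "appsink name=sink emit-signals=true max-buffers=1 drop=true"
```

The appsink properties shown (emit-signals, max-buffers, drop) are
illustrative defaults, not taken from the original pipeline.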
Instead of extract_dup (GLib alloc+memcpy → Python bytes) followed by
from_buffer_copy (Python bytes → ctypes array) — two 3MB copies per frame —
use Gst.Buffer.map(READ) to get a zero-allocation pointer to the decoded
frame memory, then memmove directly into a pre-allocated reusable ctypes
array (_raw_arr).
This reduces the per-frame copy path from two copies (~6 MB) to one memmove
(~3 MB), with no Python bytes object allocation at all. The memmove happens
under _frame_lock so render() on the main thread never reads a partial frame.
_raw_arr is allocated once on the first frame (or on resolution change) and
reused for every subsequent frame.
_Frame no longer carries a pixels field. Tests updated accordingly.
Benchmark updated to use the same buffer.map+memmove path as the app.