Realtek RTD1296 Intelligent voice, video processing platform

From Banana Pi Wiki
Jump to: navigation, search


Realtek RTD1296 Intelligent AI subtitles machine
Exterior industrial design
Hardware base on Banana Pi BPI-W2
Hardware base on Banana Pi BPI-W2
Exterior industrial design
Exterior industrial design
Exterior industrial design
Successful case for BPI:4.0 OEM&ODM

this is a BPI 4.0 Server customization project base on Banana Pi BPI-W2,it with 8G eMMC flash ,and 4G DDR on board, support HDMI in and HDMI out , You can collect the audio and video data of HDMI in, and send them to the display terminal after processing by HDMI out to complete the audio and video services you want to process,also support HDMI bypass funtion, use TS3DV642A0RUAR HDMI bypass chip. Direct connection between IN and OUT when shutdown (continuous electricity). After booting, HDMI IN and HDMI OUT function respectively

RTD 1296 design 1.JPG


Main function

  • Realtek RTD1296, Quad-core ARM Cortex-A53
  • Mali T820 MP3 GPU
  • 8G eMMC flash (Max to 32 G)
  • Realtek rtl8275 b/g/n wifi and BT 4.0 support
  • M.2 Key E interface
  • MicroSD slot supports up to 256GB expansion
  • 1 SATA interface
  • 1XGigabit LAN
  • 1xUSB 3.0 1xUSB 2.0
  • HDMI in & HDMI out support HDMI bypass function
  • TYPE C /Power
  • IR support

Hardware interface

RTD 1296 design interface.JPG

Exterior industrial design

RTD 1296 design case.jpg


Open Source resource

this board base on BPI-W2 ,you can reference Banana Pi BPI-W2

how to begin : Getting Started with W2

Custom application

this project is an intelligent hardware product mainly used to assist the audience rating of hearing impaired people.This product use the advanced speech recognition technology and specialized audio processing technology, the real-time voice into text, and through the intelligent algorithm to generate subtitles, automatic matching output video content after overlay, caption and pictures "a screen" is presented to deaf people, from now on to thoroughly solve the deaf people can't watch without subtitles broadcast video program or number of screen, is only to the quick.Auxiliary subtitle box appearance is shown


  • 1. Subtitle and video "on the one screen" : With the advanced design concept and the use of advanced voice recognition technology and video and audio processing technology, real-time subtitles can not only be provided, but also the real-time subtitles and the corresponding video can be presented to the hearing impaired through a screen, successfully solving the audience rating needs of the hearing impaired.
  • 2. Intelligent engine recognition, better subtitle accuracy:It integrates the voice recognition engine of domestic first-class manufacturers and combines with the optimization algorithm independently developed by us to ensure more accurate real-time subtitle content.
  • 3. Accurate subtitle push for specific programs: For live news, variety shows, sports and other programs with fixed broadcast time and no subtitles, the auxiliary subtitle box supports receiving and pushing accurate subtitles in the background, which improves the audience understanding of the deaf and the seriousness of the corresponding programs
  • 4. Subtitle can be closed and CC technology pioneer experience : Based on the design concept of "closing captions", superposition captions. When captions are not needed, you can open or close the captions at any time through the "captions button" of the remote control. At the same time, subtitle content can adjust the display position and even font size according to the program and the viewing habits of the deaf
  • 5. Luxury hardware configuration, smooth interaction and super computing power:It adopts 4GB super memory, 4K HDMI IN and OUT interface, USB3.0 interface, and BY PASS circuit design to ensure optimal performance and best use experience
  • 6. Applicable to meeting, window service and other scenarios
  • 7. More humanized function design

Banana Pi BPI-W2 Realtek RTD1296 Intelligent voice, video processing platform, Test real-time video stream speech to text,So that the hearing impaired can easily watch the video

Real-time video stream speech to text Technical solution

cooperative partner