Difference between revisions of "Realtek RTD1296 Intelligent voice, video processing platform"
(→Custom application) |
|||
(8 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
=Overview= | =Overview= | ||
+ | |||
+ | [[File:BPI-W2_new_case_design_.jpg|thumb|Realtek RTD1296 Intelligent AI subtitles machine ]] | ||
[[File:RTD_1296_design_1.JPG|thumb|[[Realtek RTD1296 Intelligent voice, video processing platform]]]] | [[File:RTD_1296_design_1.JPG|thumb|[[Realtek RTD1296 Intelligent voice, video processing platform]]]] | ||
Line 8: | Line 10: | ||
[[File:RTD_1296_design_case_3.jpg|thumb|Exterior industrial design]] | [[File:RTD_1296_design_case_3.jpg|thumb|Exterior industrial design]] | ||
[[File:RTD_1296_design_case_4.jpg|thumb|Exterior industrial design]] | [[File:RTD_1296_design_case_4.jpg|thumb|Exterior industrial design]] | ||
+ | [[File:Banana_pi_4g_router_1.jpg|thumb|[[Successful case ]] for BPI:4.0 OEM&ODM]] | ||
this is a [[BPI 4.0 Server]] customization project base on [[Banana Pi BPI-W2]],it with 8G eMMC flash ,and 4G DDR on board, support HDMI in and HDMI out , You can collect the audio and video data of HDMI in, and send them to the display terminal after processing by HDMI out to complete the audio and video services you want to process,also support HDMI bypass funtion, use TS3DV642A0RUAR HDMI bypass chip. Direct connection between IN and OUT when shutdown (continuous electricity). After booting, HDMI IN and HDMI OUT function respectively | this is a [[BPI 4.0 Server]] customization project base on [[Banana Pi BPI-W2]],it with 8G eMMC flash ,and 4G DDR on board, support HDMI in and HDMI out , You can collect the audio and video data of HDMI in, and send them to the display terminal after processing by HDMI out to complete the audio and video services you want to process,also support HDMI bypass funtion, use TS3DV642A0RUAR HDMI bypass chip. Direct connection between IN and OUT when shutdown (continuous electricity). After booting, HDMI IN and HDMI OUT function respectively | ||
Line 45: | Line 48: | ||
how to begin : [[Getting Started with W2]] | how to begin : [[Getting Started with W2]] | ||
− | |||
==Custom application== | ==Custom application== | ||
Line 53: | Line 55: | ||
Features: | Features: | ||
− | *1.Subtitle and video "on the one screen" | + | *1. Subtitle and video "on the one screen" : With the advanced design concept and the use of advanced voice recognition technology and video and audio processing technology, real-time subtitles can not only be provided, but also the real-time subtitles and the corresponding video can be presented to the hearing impaired through a screen, successfully solving the audience rating needs of the hearing impaired. |
− | *2.Intelligent engine recognition, better subtitle accuracy | + | *2. Intelligent engine recognition, better subtitle accuracy:It integrates the voice recognition engine of domestic first-class manufacturers and combines with the optimization algorithm independently developed by us to ensure more accurate real-time subtitle content. |
− | *3.Accurate subtitle push for specific programs | + | *3. Accurate subtitle push for specific programs: For live news, variety shows, sports and other programs with fixed broadcast time and no subtitles, the auxiliary subtitle box supports receiving and pushing accurate subtitles in the background, which improves the audience understanding of the deaf and the seriousness of the corresponding programs |
− | *4.Subtitle can be closed and CC technology pioneer experience | + | *4. Subtitle can be closed and CC technology pioneer experience : Based on the design concept of "closing captions", superposition captions. When captions are not needed, you can open or close the captions at any time through the "captions button" of the remote control. At the same time, subtitle content can adjust the display position and even font size according to the program and the viewing habits of the deaf |
− | *5.Luxury hardware configuration, smooth interaction and super computing power | + | *5. Luxury hardware configuration, smooth interaction and super computing power:It adopts 4GB super memory, 4K HDMI IN and OUT interface, USB3.0 interface, and BY PASS circuit design to ensure optimal performance and best use experience |
− | *6.Applicable to meeting, window service and other scenarios | + | *6. Applicable to meeting, window service and other scenarios |
− | *7.More humanized function design | + | *7. More humanized function design |
+ | |||
+ | Banana Pi BPI-W2 Realtek RTD1296 Intelligent voice, video processing platform, Test real-time video stream speech to text,So that the hearing impaired can easily watch the video | ||
+ | |||
+ | *function demo : https://www.youtube.com/watch?v=oM4hwKwYjiE | ||
+ | |||
+ | Real-time video stream speech to text Technical solution | ||
+ | |||
+ | *function demo : https://www.youtube.com/watch?v=2yQz3UaszFI | ||
=cooperative partner = | =cooperative partner = |
Latest revision as of 03:29, 7 April 2021
Contents
Overview
this is a BPI 4.0 Server customization project base on Banana Pi BPI-W2,it with 8G eMMC flash ,and 4G DDR on board, support HDMI in and HDMI out , You can collect the audio and video data of HDMI in, and send them to the display terminal after processing by HDMI out to complete the audio and video services you want to process,also support HDMI bypass funtion, use TS3DV642A0RUAR HDMI bypass chip. Direct connection between IN and OUT when shutdown (continuous electricity). After booting, HDMI IN and HDMI OUT function respectively
Hardware
Main function
- Realtek RTD1296, Quad-core ARM Cortex-A53
- Mali T820 MP3 GPU
- 4G DDR4 SDRAM
- 8G eMMC flash (Max to 32 G)
- Realtek rtl8275 b/g/n wifi and BT 4.0 support
- M.2 Key E interface
- MicroSD slot supports up to 256GB expansion
- 1 SATA interface
- 1XGigabit LAN
- 1xUSB 3.0 1xUSB 2.0
- HDMI in & HDMI out support HDMI bypass function
- TYPE C /Power
- IR support
Hardware interface
Exterior industrial design
Software
Open Source resource
this board base on BPI-W2 ,you can reference Banana Pi BPI-W2
how to begin : Getting Started with W2
Custom application
this project is an intelligent hardware product mainly used to assist the audience rating of hearing impaired people.This product use the advanced speech recognition technology and specialized audio processing technology, the real-time voice into text, and through the intelligent algorithm to generate subtitles, automatic matching output video content after overlay, caption and pictures "a screen" is presented to deaf people, from now on to thoroughly solve the deaf people can't watch without subtitles broadcast video program or number of screen, is only to the quick.Auxiliary subtitle box appearance is shown
Features:
- 1. Subtitle and video "on the one screen" : With the advanced design concept and the use of advanced voice recognition technology and video and audio processing technology, real-time subtitles can not only be provided, but also the real-time subtitles and the corresponding video can be presented to the hearing impaired through a screen, successfully solving the audience rating needs of the hearing impaired.
- 2. Intelligent engine recognition, better subtitle accuracy:It integrates the voice recognition engine of domestic first-class manufacturers and combines with the optimization algorithm independently developed by us to ensure more accurate real-time subtitle content.
- 3. Accurate subtitle push for specific programs: For live news, variety shows, sports and other programs with fixed broadcast time and no subtitles, the auxiliary subtitle box supports receiving and pushing accurate subtitles in the background, which improves the audience understanding of the deaf and the seriousness of the corresponding programs
- 4. Subtitle can be closed and CC technology pioneer experience : Based on the design concept of "closing captions", superposition captions. When captions are not needed, you can open or close the captions at any time through the "captions button" of the remote control. At the same time, subtitle content can adjust the display position and even font size according to the program and the viewing habits of the deaf
- 5. Luxury hardware configuration, smooth interaction and super computing power:It adopts 4GB super memory, 4K HDMI IN and OUT interface, USB3.0 interface, and BY PASS circuit design to ensure optimal performance and best use experience
- 6. Applicable to meeting, window service and other scenarios
- 7. More humanized function design
Banana Pi BPI-W2 Realtek RTD1296 Intelligent voice, video processing platform, Test real-time video stream speech to text,So that the hearing impaired can easily watch the video
- function demo : https://www.youtube.com/watch?v=oM4hwKwYjiE
Real-time video stream speech to text Technical solution
- function demo : https://www.youtube.com/watch?v=2yQz3UaszFI
cooperative partner
- OEM&ODM please contact : [email protected]
- Good idea ,discuss on forum ptherad : http://forum.banana-pi.org/t/realtek-rtd1296-intelligent-voice-video-processing-platform/10727