ECE4006
From GanjaLinux
[[Category:ECE4006]]
Final Report: [[1]] This is Group 1's project development page for Dr. Barnwell's Real-Time DSP Senior Design course at Georgia Institute of Technology. Please read our Design Report and implementation report for more details.
Next Meeting: none. Bunger-Henry 315.
Final Presentation: December 8, 3 pm.
Georgia Tech ECE 4006 Senior Design Hardware.
Contents |
Minutes
- Meeting 1
- introductions and backgrounds
- potential projects
- Professors and Research Interests:
- Array Processing:
- Russell M. Mersereau - Image and Video Processing
- Biing-Hwang (Fred) Juang - Speech processing
- Douglas B. Williams - Statistical Signal Processing
- Speach Coding
- Thomas P. Barnwell - DSP Hardware and Speech Processing
- Mark A. Clements - Speech Recognition, Aids for Hearing-Impaired
- Chin-Hui Lee - Speech and Language Processing
- GTREP
- Dr. Joel Jackson
- Array Processing:
- Professors and Research Interests:
- Meeting 2 (Lucas & Dr. Mersereau)
- problem: eliminate camera man, automate classroom
- blocks:
- acquisition (locate speaker, scan rate, red/green ratio)
- zoom (in on face, out on back of head)
- respond to gestures (hand movement controls slide progression)
- algorithms:
- particle/condensation filter = difficult to achieve real-time
- contacts:
- Yeong-Seon Lee (research student).
- research: isolate current speaker in conference room using particle filter
- Yeong-Seon Lee (research student).
- equipment:
- polycom camera
- smart board
- Meeting 3 (Overview Design and Planning)
- Teams - Please read up on your area of expertise.
- FPGA Programming / Systems Integration : Jaimin, Lucas, Vince
- Vision System : Lucas, Vince
- Speech System : Justin, Jaimin, Wailing
- Camera / Mic interface : Wailing, Lucas, Vince
- GTREP contact and Data acquisition : Wailing, Lucas
- Teams - Please read up on your area of expertise.
- Meeting 4 (Dr. Barnwell, Lucas, Vince, Justin, Jaimin)
- Project Proposal
- incorporation of previous projects:
- narrow band tracking (using DSK algorithm)
- not applicable to our environment
- video tracking projects were conducted using the GVU labs
- narrow band tracking (using DSK algorithm)
- overall: project is too difficult.
- pick one component -> chose vision tracking
- incorporation of previous projects:
- Meetings
- have milestones for each week (updated weekly)
- Problem Specification (suggested a second meeting with Dr. Mersereau)
- still camera tracks single moving object (teacher head), puts a box around it, and directs a second camera
- moving camera locates the head and tracks
- potential to merge this algorithm with HP's research of using many low resolution cameras to make one high-resolution picture (from this picture we track, using downsample techniques to achieve real-time)
- Test Environment
- Room VL461
- Project Proposal
- Meeting 5
- Agenda
- Discuss abstract
- Obtain Polycom documentation
- how to control PTZ
- video output formats (S-video or composite)
- Select a two input real time capture card (expensive ~1000 USD! see Matrox, Pinnacle, Canopus and Dazzle for examples)
- depends on Polycom output
- physical separation of cameras will require very long video feed cable that will cause issues (degradation of video quality and ability to track)
- firewire or PCI
- call Videoguys - The Electronic Mailbox - 800-323-2325 We are the Digital Video Editing & DVD Production Experts! and get a price quote
- order card
- order IR RS232
- just in case we use it to control the PTZ of the cameras because we can't figure out how to control the Polycoms over the IP network
- Meeting 6
- Mariam Crowder
- Digital Media Lab - VL 483
- Set up meeting to obtain possible data
- Equipment Available
- Sony EVI-D30 Camera
- Stepping motor camera
- Need to find PCI card that interact with camera
- Video Capture Card
- Winnov website http://www.winnov.de
- Model - Videum 1000 VO Plus http://www.winnov.de/index.php?option=com_content&task=view&id=61&Itemid=93
- Driver download not required
- HP Vectra COC 311
- Code Composer Installed
- Digital Signal Processor
- TMS320C6701
- Frequency - 167 MHz
- 4ch DMA
- 2 32-bit GP Timers
- 1.9 V Core Supply, 3.3 V IO Supply
- Floating-Point Digital Signal Processor
Resources
Bitmap Info
Code
- Image Processing Code
- Matlab: Current Files
- Matlab: Old Files
- C
- find_head.c
- bitmap24.h
- Working find_head (DO NOT EDIT)
- Host-side code
- target-side code
Hardware
Papers
IEEE Xplore can be accessed from anywhere with proper login via http://gtel.gatech.edu:2065/Xplore/dynhome.jsp
- Speaker Tracking
- A low-cost real-time stereo vision system for looking at people
- Audio-visual speaker tracking with importance particle filters
- Real-time speaker tracking using particle filter sensor fusion
- Real-time speaker localization and speech separation by audio-visual integration
- Evolutive HMM for multi-speaker tracking system
- A self-calibrated speaker tracking system using both audio and video data
- The Intelligent Classroom
- Robust real time tracking of a vehicle by image processing
- 3-D facial pose and gaze point estimation using a robust real-time tracking paradigm
- Transductive inference for color-based particle filter tracking
- Robust real-time tracking on an active vision head

