Computer vision for robotics презентация

Август 5, 2022

Главная
Информатика
Computer vision for robotics

Содержание

2. Why do we need computer vision? Smart video surveillance Biometrics Automatic Driver Assistance Systems Machine vision
3. Vision is hard! Even for humans…
4. Texai parking
5. Agenda Camera model Stereo vision Stereo vision on GPU Object detection methods Sliding window Local descriptors
6. Pinhole camera model
7. Distortion model
8. Reprojection error
9. Homography
10. Perspective-n-Points problem P4P RANSAC (RANdom SAmple Consensus)
11. Stereo: epipolar geometry Fundamental matrix constraint
12. Stereo Rectification Algorithm steps are shown at right: Goal: Each row of the image contains the
13. Stereo correspondence Block matching Dynamic programming Inter-scanline dependencies Segmentation Belief propagation
14. Stereo correspondence block matching For each block in left image: Search for the corresponding block in
15. Pre- and post processing Low texture filtering SSD/SAD minimum ambiguity removal Using gradients instead of intensities
16. Stereo Matching
17. Parallel implementation of block matching The outer cycle iterates through disparity values We compute SSD and
18. Parallelization scheme
19. Optimization concepts Not using texture – saving registers 1 thread per 8 pixels processing – using
20. Performance summary CPU (i5 750 2.66GHz), GPU (Fermi card 448 cores) Block matching on CPU+2xGPU is
21. Full-HD stereo in realtime http://www.youtube.com/watch?v=ThE7sRAtaWU
22. Applications of stereo vision Machine vision Automatic Driver Assistance Movie production Robotics Object recognition Visual odometry
23. Object detection
24. Sliding window approach
25. Cascade classifier Stage 1 Stage 2 Stage 3 image face face Not face Not face Not
26. Face detection
27. Object detection with local descriptors Detect keypoints Calculate local descriptors for each point Match descriptors for
28. FAST feature detector
29. Keypoints example
30. SIFT descriptor David Lowe, 2004
31. SURF descriptor 4x4 square regions inside a square window 20*s 4 values per square region
32. More descriptors One way descriptor C-descriptor, FERNS, BRIEF HoG Daisy
33. Matching descriptors example
34. Ways to improve matching Increase the inliers to outliers ratio Distance threshold Distance ratio threshold (second
35. Random Sample Consensus Do n iterations until #inliers > inlierThreshold Draw k matches randomly Find the
36. Geometry validation
37. Scaling up FLANN (Fast Library for Approximate Nearest Neighbors) In OpenCV thanks to Marius Muja Bag
38. Projects Textured object detection PR2 robot automatic plugin Visual odometry / SLAM
39. Textured object detection
40. Object detection example Iryna Gordon and David G. Lowe, "What and where: 3D object recognition with
41. Keypoint detection We are looking for small dark regions This operation takes only ~10ms on 640x480
42. Classification with one way descriptor Introduced by Hinterstoisser et al (Technical U of Munich, Ecole Polytechnique)
43. Keypoint classification examples One way descriptor does the most of the outlet detection job for us.
44. Object detection Object pose is reconstructed by geometry validation (using geomertic hashing) Itseez Ltd. http://itseez.com
45. Outlet detection: challenging cases Shadows Severe lighting conditions Partial occlusions Itseez Ltd. http://itseez.com
46. PR2 plugin (outlet and plug detection) http://www.youtube.com/watch?v=GWcepdggXsU
47. Visual odometry
48. Visual odometry (II)
50. Скачать презентацию

Слайд 2

Why do we need computer vision?
Smart video surveillance
Biometrics
Automatic Driver Assistance Systems
Machine

vision (Visual inspection)
Image retrieval (e.g. Google Goggles)
Movie production
Robotics

Слайд 3

Vision is hard! Even for humans…

Слайд 4

Texai parking

Слайд 5

Agenda
Camera model
Stereo vision
Stereo vision on GPU
Object detection methods
Sliding window
Local descriptors
Applications
Textured

object detection
Outlet detection
Visual odometry

Слайд 6

Pinhole camera model

Слайд 7

Distortion model

Слайд 8

Reprojection error

Слайд 9

Homography

Слайд 10

Perspective-n-Points problem
P4P
RANSAC (RANdom SAmple Consensus)

Слайд 11

Stereo: epipolar geometry
Fundamental matrix constraint

Слайд 12

Stereo Rectification
Algorithm steps are shown at right:
Goal:
Each row of the image

contains the same world points
“Epipolar constraint”

Result: Epipolar alignment of features:

All: Gary Bradski and Adrian Kaehler: Learning OpenCV

Слайд 13

Stereo correspondence
Block matching
Dynamic programming
Inter-scanline dependencies
Segmentation
Belief propagation

Слайд 14

Stereo correspondence block matching
For each block in left image:
Search for the

corresponding block in the right image such that SSD or SAD between pixel intensities is minimum

Слайд 15

Pre- and post processing
Low texture filtering
SSD/SAD minimum ambiguity removal
Using gradients instead

of intensities
Speckle filtering

Слайд 16

Stereo Matching

Слайд 17

Parallel implementation of block matching
The outer cycle iterates through disparity values
We

compute SSD and compare it with the current minimum for each pixel in a tile
Different tiles reuse the results of each other

Слайд 18

Parallelization scheme

Слайд 19

Optimization concepts
Not using texture – saving registers
1 thread per 8 pixels

processing – using cache
Reducing the amount of arithmetic operations
Non-parallelizable functions (speckle filtering) are done on CPU

Слайд 20

Performance summary
CPU (i5 750 2.66GHz), GPU (Fermi card 448 cores)
Block matching

on CPU+2xGPU is 10 times faster than CPU implementation with SSE optimization, enabling real-time processing of HD images!

Слайд 21

Full-HD stereo in realtime
http://www.youtube.com/watch?v=ThE7sRAtaWU

Слайд 22

Applications of stereo vision
Machine vision
Automatic Driver Assistance
Movie production
Robotics
Object recognition
Visual odometry /

SLAM

Слайд 23

Object detection

Слайд 24

Sliding window approach

Слайд 25

Cascade classifier
Stage 1
Stage 2
Stage 3
image
face
face
Not face
Not face
Not face
face
Real-time in year 2000!

Слайд 26

Face detection

Слайд 27

Object detection with local descriptors
Detect keypoints
Calculate local descriptors for each point
Match

descriptors for different images
Validate matches with a geometry model

Слайд 28

FAST feature detector

Слайд 29

Keypoints example

Слайд 30

SIFT descriptor
David Lowe, 2004

Слайд 31

SURF descriptor
4x4 square regions inside a square window 20*s
4 values per

square region

Слайд 32

More descriptors
One way descriptor
C-descriptor, FERNS, BRIEF
HoG
Daisy

Слайд 33

Matching descriptors example

Слайд 34

Ways to improve matching
Increase the inliers to outliers ratio
Distance threshold
Distance ratio

threshold (second to first NN distance)
Backward-forward matching
Windowed matching
Increase the amount of inliers
One to many matching

Слайд 35

Random Sample Consensus
Do n iterations until #inliers > inlierThreshold
Draw k matches

randomly
Find the transformation
Calculate inliers count
Remember the best solution

Слайд 36

Geometry validation

Слайд 37

Scaling up
FLANN (Fast Library for Approximate Nearest Neighbors)
In OpenCV thanks to

Marius Muja
Bag of Words
In OpenCV thanks to Ken Chatfield
Vocabulary trees
Is going to be in OpenCV thanks to Patrick Mihelich

Слайд 38

Projects
Textured object detection
PR2 robot automatic plugin
Visual odometry / SLAM

Слайд 39

Textured object detection

Слайд 40

Object detection example
Iryna Gordon and David G. Lowe, "What and where:

3D object recognition with accurate pose," in Toward Category-Level Object Recognition, eds. J. Ponce, M. Hebert, C. Schmid, and A. Zisserman, (Springer-Verlag, 2006), pp. 67-82.

Manuel Martinez Torres, Alvaro Collet Romea, and Siddhartha Srinivasa, MOPED: A Scalable and Low Latency Object Recognition and Pose Estimation System, Proceedings of ICRA 2010, May, 2010.

Слайд 41

Keypoint detection
We are looking for small dark regions
This operation takes only

~10ms on 640x480 image
The rest of the algorithm works only with keypoint regions

Itseez Ltd. http://itseez.com

Слайд 42

Classification with one way descriptor
Introduced by Hinterstoisser et al (Technical U

of Munich, Ecole Polytechnique) at CVPR 2009
A test patch is compared to samples of affine-transformed training patches with Euclidean distance
The closest patch together with a pose guess are reconstructed

Itseez Ltd. http://itseez.com

Слайд 43

Keypoint classification examples
One way descriptor does the most of the outlet

detection job for us. Few holes are misclassified

Ground hole

Power hole

Non-hole keypoint from outlet image

Background keypoint

Itseez Ltd. http://itseez.com

Слайд 44

Object detection
Object pose is reconstructed by geometry validation (using geomertic hashing)
Itseez

Ltd. http://itseez.com

Слайд 45

Outlet detection: challenging cases
Shadows
Severe lighting conditions
Partial occlusions
Itseez Ltd. http://itseez.com

Слайд 46

PR2 plugin (outlet and plug detection)
http://www.youtube.com/watch?v=GWcepdggXsU

Слайд 47

Visual odometry

Слайд 48

Computer vision for robotics презентация

Содержание

Why do we need computer vision?Smart video surveillanceBiometricsAutomatic Driver Assistance SystemsMachine

Vision is hard! Even for humans…

Texai parking

AgendaCamera modelStereo visionStereo vision on GPUObject detection methodsSliding windowLocal descriptors ApplicationsTextured

Pinhole camera model

Distortion model

Reprojection error

Homography

Perspective-n-Points problemP4PRANSAC (RANdom SAmple Consensus)

Stereo: epipolar geometryFundamental matrix constraint

Stereo RectificationAlgorithm steps are shown at right:Goal:Each row of the image

Stereo correspondenceBlock matchingDynamic programmingInter-scanline dependenciesSegmentationBelief propagation

Stereo correspondence block matchingFor each block in left image:Search for the

Pre- and post processingLow texture filteringSSD/SAD minimum ambiguity removalUsing gradients instead

Stereo Matching

Parallel implementation of block matchingThe outer cycle iterates through disparity valuesWe

Parallelization scheme

Optimization conceptsNot using texture – saving registers1 thread per 8 pixels

Performance summaryCPU (i5 750 2.66GHz), GPU (Fermi card 448 cores)Block matching

Full-HD stereo in realtimehttp://www.youtube.com/watch?v=ThE7sRAtaWU

Applications of stereo visionMachine visionAutomatic Driver AssistanceMovie productionRoboticsObject recognitionVisual odometry /

Object detection

Sliding window approach

Cascade classifierStage 1Stage 2Stage 3imagefacefaceNot faceNot faceNot facefaceReal-time in year 2000!

Face detection

Object detection with local descriptorsDetect keypointsCalculate local descriptors for each pointMatch

FAST feature detector

Keypoints example

SIFT descriptorDavid Lowe, 2004

SURF descriptor4x4 square regions inside a square window 20*s4 values per

More descriptorsOne way descriptorC-descriptor, FERNS, BRIEFHoGDaisy

Matching descriptors example

Ways to improve matchingIncrease the inliers to outliers ratioDistance thresholdDistance ratio

Random Sample ConsensusDo n iterations until #inliers > inlierThresholdDraw k matches

Geometry validation

Scaling upFLANN (Fast Library for Approximate Nearest Neighbors)In OpenCV thanks to

ProjectsTextured object detectionPR2 robot automatic pluginVisual odometry / SLAM

Textured object detection

Object detection exampleIryna Gordon and David G. Lowe, "What and where:

Keypoint detectionWe are looking for small dark regionsThis operation takes only

Classification with one way descriptorIntroduced by Hinterstoisser et al (Technical U

Keypoint classification examplesOne way descriptor does the most of the outlet

Object detectionObject pose is reconstructed by geometry validation (using geomertic hashing)Itseez

Outlet detection: challenging casesShadowsSevere lighting conditionsPartial occlusionsItseez Ltd. http://itseez.com

PR2 plugin (outlet and plug detection)http://www.youtube.com/watch?v=GWcepdggXsU

Visual odometry

Visual odometry (II)

Похожие презентации

Why do we need computer vision?
Smart video surveillance
Biometrics
Automatic Driver Assistance Systems
Machine

Agenda
Camera model
Stereo vision
Stereo vision on GPU
Object detection methods
Sliding window
Local descriptors
Applications
Textured

Perspective-n-Points problem
P4P
RANSAC (RANdom SAmple Consensus)

Stereo: epipolar geometry
Fundamental matrix constraint

Stereo Rectification
Algorithm steps are shown at right:
Goal:
Each row of the image

Stereo correspondence
Block matching
Dynamic programming
Inter-scanline dependencies
Segmentation
Belief propagation

Stereo correspondence block matching
For each block in left image:
Search for the

Pre- and post processing
Low texture filtering
SSD/SAD minimum ambiguity removal
Using gradients instead

Parallel implementation of block matching
The outer cycle iterates through disparity values
We

Optimization concepts
Not using texture – saving registers
1 thread per 8 pixels

Performance summary
CPU (i5 750 2.66GHz), GPU (Fermi card 448 cores)
Block matching

Full-HD stereo in realtime
http://www.youtube.com/watch?v=ThE7sRAtaWU

Applications of stereo vision
Machine vision
Automatic Driver Assistance
Movie production
Robotics
Object recognition
Visual odometry /

Cascade classifier
Stage 1
Stage 2
Stage 3
image
face
face
Not face
Not face
Not face
face
Real-time in year 2000!

Object detection with local descriptors
Detect keypoints
Calculate local descriptors for each point
Match

SIFT descriptor
David Lowe, 2004

SURF descriptor
4x4 square regions inside a square window 20*s
4 values per

More descriptors
One way descriptor
C-descriptor, FERNS, BRIEF
HoG
Daisy

Ways to improve matching
Increase the inliers to outliers ratio
Distance threshold
Distance ratio

Random Sample Consensus
Do n iterations until #inliers > inlierThreshold
Draw k matches

Scaling up
FLANN (Fast Library for Approximate Nearest Neighbors)
In OpenCV thanks to

Projects
Textured object detection
PR2 robot automatic plugin
Visual odometry / SLAM

Object detection example
Iryna Gordon and David G. Lowe, "What and where:

Keypoint detection
We are looking for small dark regions
This operation takes only

Classification with one way descriptor
Introduced by Hinterstoisser et al (Technical U

Keypoint classification examples
One way descriptor does the most of the outlet

Object detection
Object pose is reconstructed by geometry validation (using geomertic hashing)
Itseez

Outlet detection: challenging cases
Shadows
Severe lighting conditions
Partial occlusions
Itseez Ltd. http://itseez.com

PR2 plugin (outlet and plug detection)
http://www.youtube.com/watch?v=GWcepdggXsU