Home  

Random  

Nearby  



Log in  



Settings  



Donate  



About Wikipedia  

Disclaimers  



Wikipedia





Pinhole camera model





Article  

Talk  



Language  

Watch  

Edit  





The pinhole camera model describes the mathematical relationship between the coordinates of a point in three-dimensional space and its projection onto the image plane of an ideal pinhole camera, where the camera aperture is described as a point and no lenses are used to focus light. The model does not include, for example, geometric distortions or blurring of unfocused objects caused by lenses and finite sized apertures.[1] It also does not take into account that most practical cameras have only discrete image coordinates. This means that the pinhole camera model can only be used as a first order approximation of the mapping from a 3D scene to a 2D image. Its validity depends on the quality of the camera and, in general, decreases from the center of the image to the edges as lens distortion effects increase.

A diagram of a pinhole camera.

Some of the effects that the pinhole camera model does not take into account can be compensated, for example by applying suitable coordinate transformations on the image coordinates; other effects are sufficiently small to be neglected if a high quality camera is used. This means that the pinhole camera model often can be used as a reasonable description of how a camera depicts a 3D scene, for example in computer vision and computer graphics.

Geometry

edit
 
The geometry of a pinhole camera. Note: the x1x2x3 coordinate system in the figure is left-handed, that is the direction of the OZ axis is in reverse to the system the reader may be used to.

The geometry related to the mapping of a pinhole camera is illustrated in the figure. The figure contains the following basic objects:

The pinhole aperture of the camera, through which all projection lines must pass, is assumed to be infinitely small, a point. In the literature this point in 3D space is referred to as the optical (or lens or camera) center.[3]

Formulation

edit

Next we want to understand how the coordinates   of point Q depend on the coordinates   of point P. This can be done with the help of the following figure which shows the same scene as the previous figure but now from above, looking down in the negative direction of the X2 axis.

 
The geometry of a pinhole camera as seen from the X2 axis

In this figure we see two similar triangles, both having parts of the projection line (green) as their hypotenuses. The catheti of the left triangle are   and f and the catheti of the right triangle are   and  . Since the two triangles are similar it follows that

 or 

A similar investigation, looking in the negative direction of the X1 axis gives

 or 

This can be summarized as

 

which is an expression that describes the relation between the 3D coordinates   of point P and its image coordinates   given by point Q in the image plane.

Rotated image and the virtual image plane

edit

The mapping from 3D to 2D coordinates described by a pinhole camera is a perspective projection followed by a 180° rotation in the image plane. This corresponds to how a real pinhole camera operates; the resulting image is rotated 180° and the relative size of projected objects depends on their distance to the focal point and the overall size of the image depends on the distance f between the image plane and the focal point. In order to produce an unrotated image, which is what we expect from a camera, there are two possibilities:

In both cases, the resulting mapping from 3D coordinates to 2D image coordinates is given by the expression above, but without the negation, thus

 

In homogeneous coordinates

edit

The mapping from 3D coordinates of points in space to 2D image coordinates can also be represented in homogeneous coordinates. Let   be a representation of a 3D point in homogeneous coordinates (a 4-dimensional vector), and let   be a representation of the image of this point in the pinhole camera (a 3-dimensional vector). Then the following relation holds

 

where   is the   camera matrix and the   means equality between elements of projective spaces. This implies that the left and right hand sides are equal up to a non-zero scalar multiplication. A consequence of this relation is that also   can be seen as an element of a projective space; two camera matrices are equivalent if they are equal up to a scalar multiplication. This description of the pinhole camera mapping, as a linear transformation   instead of as a fraction of two linear expressions, makes it possible to simplify many derivations of relations between 3D and 2D coordinates.[citation needed]

See also

edit

References

edit
  1. ^ Szeliski, Richard (2022). Computer Vision: Algorithms and Applications (2 ed.). Springer Nature. p. 74. ISBN 3030343723. Retrieved 30 December 2023.
  • ^ Carlo Tomasi (2016-08-09). "A Simple Camera Model" (PDF). cs.duke.edu. Retrieved 2021-02-18.
  • ^ Andrea Fusiello (2005-12-27). "Elements of Geometric Computer Vision". Homepages.inf.ed.ac.uk. Retrieved 2013-12-18.
  • Bibliography

    edit

    Retrieved from "https://en.wikipedia.org/w/index.php?title=Pinhole_camera_model&oldid=1218032693"
     



    Last edited on 9 April 2024, at 09:41  





    Languages

     


    العربية



     

    Wikipedia


    This page was last edited on 9 April 2024, at 09:41 (UTC).

    Content is available under CC BY-SA 4.0 unless otherwise noted.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Terms of Use

    Desktop