- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- Efficient compression, human-inspired refocusing, and...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
Efficient compression, human-inspired refocusing, and quality assessment of light field videos Mehajabin, Nusrat
Abstract
Light field (LF) technology has transformed digital media by capturing scenes from multiple viewpoints, introducing a new level of visual immersion and interactivity. This thesis explores the complexities of LF technology, with a specific focus on compression techniques, quality metrics for assessing processed light fields, and refocusing strategies aligned with the principles of the human visual system. Efficient compression plays a crucial role in LF technology due to the inherent richness of data. Within this thesis, we introduce innovative pseudo-sequence-based prediction structures and coding orders for LF compression. The first structure, informed by interview-similarity, optimizes compression by leveraging reference adjacency and maximizing B-frame usage. This approach excels in bitrate efficiency, real-time decoding, and competitive encoding times, making it suitable for delay-tolerant applications such as broadcasting. The second compression technique employs a frame distance aware structure, enhancing random-access efficiency through diagonal references. This structure offers the shortest encoding time with competitive bitrate gains, making it suitable for real-time interactive applications. Existing LF refocusing methods often fall short of mimicking the natural behavior of the human visual system, resulting in refocused images that appear artificial. Therefore, we developed an innovative LF refocusing technique aimed at producing human perception consistent refocused LFs from camera arrays. By integrating view synthesis, depth estimation, and object segmentation, the proposed method addresses the challenges of depth-dependent refocusing, yielding authentic-looking, natural post-shoot refocusing for camera-array-based content. Traditional quality assessment metrics prove inadequate in capturing the multidimensional nature of LF, underscoring the necessity for novel quality metrics tailored to LF. Finally, we introduce the first LF quality metric tailored for sparsely sampled LFs. This novel full-reference LF quality assessment technique utilizes a volumetric LF representation's cross-section for holistic analysis employing a deep feature extractor. By simultaneously extracting spatial and angular features, this approach comprehensively captures LF quality. This thesis navigates the potential of LF by examining representation, compression, processing, and quality assessment. As LF continues to shape the future of digital media, the solutions presented herein facilitate its effective utilization.
Item Metadata
Title |
Efficient compression, human-inspired refocusing, and quality assessment of light field videos
|
Creator | |
Supervisor | |
Publisher |
University of British Columbia
|
Date Issued |
2024
|
Description |
Light field (LF) technology has transformed digital media by capturing scenes from multiple
viewpoints, introducing a new level of visual immersion and interactivity. This thesis explores the
complexities of LF technology, with a specific focus on compression techniques, quality metrics
for assessing processed light fields, and refocusing strategies aligned with the principles of the
human visual system.
Efficient compression plays a crucial role in LF technology due to the inherent richness of data.
Within this thesis, we introduce innovative pseudo-sequence-based prediction structures and
coding orders for LF compression. The first structure, informed by interview-similarity, optimizes
compression by leveraging reference adjacency and maximizing B-frame usage. This approach
excels in bitrate efficiency, real-time decoding, and competitive encoding times, making it suitable
for delay-tolerant applications such as broadcasting. The second compression technique employs
a frame distance aware structure, enhancing random-access efficiency through diagonal
references. This structure offers the shortest encoding time with competitive bitrate gains, making
it suitable for real-time interactive applications.
Existing LF refocusing methods often fall short of mimicking the natural behavior of the human
visual system, resulting in refocused images that appear artificial. Therefore, we developed an
innovative LF refocusing technique aimed at producing human perception consistent refocused
LFs from camera arrays. By integrating view synthesis, depth estimation, and object segmentation, the proposed method addresses the challenges of depth-dependent refocusing, yielding authentic-looking, natural post-shoot refocusing for camera-array-based content.
Traditional quality assessment metrics prove inadequate in capturing the multidimensional nature
of LF, underscoring the necessity for novel quality metrics tailored to LF. Finally, we introduce
the first LF quality metric tailored for sparsely sampled LFs. This novel full-reference LF quality
assessment technique utilizes a volumetric LF representation's cross-section for holistic analysis
employing a deep feature extractor. By simultaneously extracting spatial and angular features, this
approach comprehensively captures LF quality.
This thesis navigates the potential of LF by examining representation, compression, processing,
and quality assessment. As LF continues to shape the future of digital media, the solutions
presented herein facilitate its effective utilization.
|
Genre | |
Type | |
Language |
eng
|
Date Available |
2024-07-10
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
DOI |
10.14288/1.0444120
|
URI | |
Degree | |
Program | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
2024-11
|
Campus | |
Scholarly Level |
Graduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International