Hassan A. Sial, Ramon Baldrich, and Maria Vanrell, "Deep intrinsic decomposition trained on surreal scenes yet with realistic light effects," J. Opt. Soc. Am. A 37, 1-15 (2020)
Estimation of intrinsic images remains a challenging task because of weaknesses in ground-truth datasets, which are either too small or physically unrealistic. Meanwhile, end-to-end deep learning architectures have begun to achieve interesting results, which we believe could be further improved by not ignoring important physical cues. In this work, we present a twofold framework: (a) a flexible image-generation procedure that overcomes classical dataset problems, providing a larger dataset with coherent lighting appearance; and (b) a flexible architecture that ties physical properties together through intrinsic losses. Our proposal is versatile, has low computation time, and achieves state-of-the-art results.
From left to right, the columns report: number of images; whether the ground truth (GT) perfectly fulfills the physical model; whether the GT covers the full image or only a part; whether the GT reflects the influence of a diverse background; whether the GT includes cast shadows in addition to shading; and whether the global image presents physically consistent lighting.
Meaning of special cases: ($ \star $) the MIT dataset generally fulfills the product model up to a scale factor, i.e., $ I = \alpha (R \cdot S) $, but this does not hold exactly for all images and shows small deviations; (‡) the Sintel dataset presents more diverse backgrounds than the rest, but with a strong bias toward specific colors due to the high correlation between frames of a video sequence; and (†) the training area is large but still does not cover the full image.
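The MIT special case above says an image fits $ I = \alpha (R \cdot S) $ only up to a per-image scalar $ \alpha $. A minimal sketch of how one might check this numerically (the function name and least-squares fit of $ \alpha $ are our illustration, not the paper's code):

```python
import numpy as np

def product_model_residual(I, R, S):
    """Check how well an image fits the intrinsic product model I = alpha * (R * S).

    I, R, S: float arrays of the same shape (image, reflectance, shading).
    Returns the best-fit scalar alpha and the RMSE of the reconstruction.
    """
    recon = R * S
    # Closed-form least-squares scalar: alpha = <I, recon> / <recon, recon>
    alpha = float(np.sum(I * recon) / np.sum(recon * recon))
    rmse = float(np.sqrt(np.mean((I - alpha * recon) ** 2)))
    return alpha, rmse
```

A dataset whose GT "perfectly fulfills the physical model" would give a near-zero residual with $ \alpha = 1 $; the MIT images give a small but nonzero deviation even after fitting $ \alpha $.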
Table 2.
Errors for Reflectance and Shading Predictions on Our Dataset^a
Comparison between our IUI architecture and the Retinex algorithm. IUI decreases the Retinex error by the factor given in brackets. Errors are reported separately on the object, on the foreground, and on the whole image.
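Reporting an error "on the object, on the foreground, and on the whole image" amounts to evaluating the same metric under different pixel masks. A hedged sketch of that bookkeeping, using MSE for concreteness (the mask names and the choice of MSE are illustrative assumptions, not the paper's exact metric):

```python
import numpy as np

def masked_mse(pred, gt, mask):
    """Mean squared error restricted to pixels where mask is True."""
    return float(((pred - gt) ** 2)[mask].mean())

def region_errors(pred, gt, object_mask, foreground_mask):
    """Error on three nested regions, in the spirit of the per-region
    columns of Table 2: the object, the foreground, and the full image."""
    full_mask = np.ones_like(object_mask, dtype=bool)
    return {
        "object": masked_mse(pred, gt, object_mask),
        "foreground": masked_mse(pred, gt, foreground_mask),
        "image": masked_mse(pred, gt, full_mask),
    }
```

The improvement factor quoted in brackets would then be the ratio of the Retinex error to the IUI error on the same region.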
Table 3.
Estimation Errors on MIT Dataset Reported in Previous Works by Different Methods and for Our IUI Architecture