SCIENTIFIC COMPUTING AND IMAGING INSTITUTE
at the University of Utah

An internationally recognized leader in visualization, scientific computing, and image analysis

SCI Publications

2024


Q.C. Nguyen, T. Tasdizen, M. Alirezaei, H. Mane, X. Yue, J.S. Merchant, W. Yu, L. Drew, D. Li, T.T. Nguyen. “Neighborhood built environment, obesity, and diabetes: A Utah siblings study,” In SSM - Population Health, Vol. 26, 2024.

ABSTRACT

Background

This study utilizes innovative computer vision methods alongside Google Street View images to characterize neighborhood built environments across Utah.

Methods

Convolutional Neural Networks were used to create indicators of street greenness, crosswalks, and building type on 1.4 million Google Street View images. The demographic and medical profiles of Utah residents came from the Utah Population Database (UPDB). We implemented hierarchical linear models with individuals nested within zip codes to estimate associations between neighborhood built environment features and individual-level obesity and diabetes, controlling for individual- and zip code-level characteristics (n = 1,899,175 adults living in Utah in 2015). Sibling random effects models were implemented to account for shared family attributes among siblings (n = 972,150) and twins (n = 14,122).
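
As a rough, self-contained sketch of the modeling setup described above (not the authors' code; the column names and synthetic data are assumptions), a random-intercept model with individuals nested within zip codes can be fit with statsmodels:

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Synthetic stand-in data (the study uses UPDB records, not reproduced here).
rng = np.random.default_rng(0)
n = 2000
df = pd.DataFrame({
    "zipcode": rng.integers(0, 50, n),       # grouping factor
    "greenness": rng.uniform(0, 1, n),       # street greenness indicator
    "sidewalks": rng.uniform(0, 1, n),       # walkability indicator
    "age": rng.integers(18, 90, n),
})
# Outcome with a small zip-code-level random effect baked in.
zip_effect = rng.normal(0, 0.5, 50)
df["bmi"] = (27 - 1.0 * df["greenness"] - 0.5 * df["sidewalks"]
             + 0.02 * df["age"] + zip_effect[df["zipcode"]]
             + rng.normal(0, 3, n))

# Random intercept for each zip code; fixed effects for built-environment
# measures and covariates. A sibling random-effects model would instead use
# a (hypothetical) family identifier as the grouping variable.
model = smf.mixedlm("bmi ~ greenness + sidewalks + age", data=df,
                    groups=df["zipcode"])
result = model.fit()
print(result.summary())
```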

Results

Consistent with prior neighborhood research, the variance partition coefficients (VPC) of our unadjusted models nesting individuals within zip codes were relatively small (0.5%–5.3%), except for HbA1c (VPC = 23%), suggesting a small percentage of the outcome variance is at the zip code level. However, proportional change in variance (PCV) attributable to zip codes after the inclusion of neighborhood built environment variables and covariates ranged between 11% and 67%, suggesting that these characteristics account for a substantial portion of the zip code-level effects. Non-single-family homes (indicator of mixed land use), sidewalks (indicator of walkability), and green streets (indicator of neighborhood aesthetics) were associated with reduced diabetes and obesity. Zip codes in the third tertile for non-single-family homes were associated with a 15% reduction (PR: 0.85; 95% CI: 0.79, 0.91) in obesity and a 20% reduction (PR: 0.80; 95% CI: 0.70, 0.91) in diabetes. This tertile was also associated with a BMI reduction of −0.68 kg/m² (95% CI: −0.95, −0.40).
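
The VPC and PCV quantities reported above follow standard multilevel-model formulas; a minimal sketch with made-up variance components:

```python
def variance_partition_coefficient(var_between: float, var_within: float) -> float:
    """VPC: share of total outcome variance attributable to the cluster
    (zip-code) level in a random-intercept model. For binary outcomes modeled
    on the logit scale, var_within is commonly taken as pi**2 / 3."""
    return var_between / (var_between + var_within)

def proportional_change_in_variance(var_between_null: float,
                                    var_between_adjusted: float) -> float:
    """PCV: fraction of the zip-code-level variance in the unadjusted (null)
    model explained after adding built-environment variables and covariates."""
    return (var_between_null - var_between_adjusted) / var_between_null

# Example with made-up variance components:
print(variance_partition_coefficient(0.4, 7.2))    # ~0.053 -> VPC of about 5.3%
print(proportional_change_in_variance(0.4, 0.2))   # 0.5    -> PCV of 50%
```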

Conclusion

We observe associations between neighborhood characteristics and chronic diseases, accounting for biological, social, and cultural factors shared among siblings in this large population-based study.



Q.C. Nguyen, M. Alirezaei, X. Yue, H. Mane, D. Li, L. Zhao, T.T. Nguyen, R. Patel, W. Yu, M. Hu, D. Quistberg, T. Tasdizen. “Leveraging computer vision for predicting collision risks: a cross-sectional analysis of 2019–2021 fatal collisions in the USA,” In Injury Prevention, BMJ, 2024.

ABSTRACT

Objective The USA has higher rates of fatal motor vehicle collisions than most high-income countries. Previous studies examining the role of the built environment were generally limited to small geographic areas or single cities. This study aims to quantify associations between built environment characteristics and traffic collisions in the USA.

Methods Built environment characteristics were derived from Google Street View images and summarised at the census tract level. Fatal traffic collisions were obtained from the 2019–2021 Fatality Analysis Reporting System. Fatal and non-fatal traffic collisions in Washington DC were obtained from the District Department of Transportation. Adjusted Poisson regression models examined whether built environment characteristics are related to motor vehicle collisions in the USA, controlling for census tract sociodemographic characteristics.
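
A minimal sketch of an adjusted Poisson model of this kind (illustrative only; the column names and synthetic tract-level data are assumptions, not the study's variables):

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf

# Synthetic census-tract-level stand-in data.
rng = np.random.default_rng(1)
n = 500
df = pd.DataFrame({
    "fatal_collisions": rng.poisson(3, n),             # outcome count per tract
    "sidewalk_tertile": rng.integers(1, 4, n),         # 1 = lowest, 3 = highest
    "greenness_tertile": rng.integers(1, 4, n),
    "median_income": rng.normal(60_000, 15_000, n),    # sociodemographic covariate
    "population": rng.integers(1_000, 10_000, n),      # exposure for the offset
})

# Poisson regression with a log link; the population offset turns counts into
# rates, and tertiles enter as categorical indicators.
model = smf.glm(
    "fatal_collisions ~ C(sidewalk_tertile) + C(greenness_tertile) + median_income",
    data=df,
    family=sm.families.Poisson(),
    offset=np.log(df["population"]),
)
result = model.fit()
# exp(coef) gives rate ratios, e.g., highest vs. lowest tertile.
print(np.exp(result.params))
```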

Results Census tracts in the highest tertile of sidewalks, single-lane roads, streetlights and street greenness had 70%, 50%, 30% and 26% fewer fatal vehicle collisions compared with those in the lowest tertile. Street greenness and single-lane roads were associated with 37% and 38% fewer pedestrian-involved and cyclist-involved fatal collisions. Analyses with fatal and non-fatal collisions in Washington DC found streetlights and stop signs were associated with fewer pedestrian- and cyclist-involved vehicle collisions, while road construction had an adverse association.

Conclusion This study demonstrates the utility of using data algorithms that can automatically analyse street segments to create indicators of the built environment to enhance understanding of large-scale patterns and inform interventions to decrease road traffic injuries and fatalities.



R. Nihalaani, T. Kataria, J. Adams, S.Y. Elhabian. “Estimation and Analysis of Slice Propagation Uncertainty in 3D Anatomy Segmentation,” Subtitled “arXiv preprint arXiv:2403.12290,” 2024.

ABSTRACT

Supervised methods for 3D anatomy segmentation demonstrate superior performance but are often limited by the availability of annotated data. This limitation has led to a growing interest in self-supervised approaches in tandem with the abundance of available unannotated data. Slice propagation has emerged as a self-supervised approach that leverages slice registration as a self-supervised task to achieve full anatomy segmentation with minimal supervision. This approach significantly reduces the need for domain expertise, time, and the cost associated with building fully annotated datasets required for training segmentation networks. However, this shift toward reduced supervision via deterministic networks raises concerns about the trustworthiness and reliability of predictions, especially when compared with more accurate supervised approaches. To address this concern, we propose the integration of calibrated uncertainty quantification (UQ) into slice propagation methods, providing insights into the model’s predictive reliability and confidence levels. Incorporating uncertainty measures enhances user confidence in self-supervised approaches, thereby improving their practical applicability. We conducted experiments on three datasets for 3D abdominal segmentation using five UQ methods. The results illustrate that incorporating UQ improves not only model trustworthiness, but also segmentation accuracy. Furthermore, our analysis reveals various failure modes of slice propagation methods that might not be immediately apparent to end-users. This study opens up new research avenues to improve the accuracy and trustworthiness of slice propagation methods.



T.A.J. Ouermi, J. Li, T. Athawale, C.R. Johnson. “Estimation and Visualization of Isosurface Uncertainty from Linear and High-Order Interpolation Methods,” In IEEE Workshop on Uncertainty Visualization: Applications, Techniques, Software, and Decision Frameworks, IEEE, pp. 51--61. 2024.
DOI: 10.1109/UncertaintyVisualization63963.2024.00012

ABSTRACT

Isosurface visualization is fundamental for exploring and analyzing 3D volumetric data. Marching cubes (MC) algorithms with linear interpolation are commonly used for isosurface extraction and visualization. Although linear interpolation is easy to implement, it has limitations when the underlying data is complex and high-order, which is the case for most real-world data. Linear interpolation can output vertices at the wrong location. Its inability to deal with sharp features and features smaller than grid cells can lead to an incorrect isosurface with holes and broken pieces. Despite these limitations, isosurface visualizations typically do not include insight into the spatial location and the magnitude of these errors. We utilize high-order interpolation methods with MC algorithms and interactive visualization to highlight these uncertainties. Our visualization tool helps identify the regions of high interpolation errors. It also allows users to query local areas for details and compare the differences between isosurfaces from different interpolation methods. In addition, we employ high-order methods to identify and reconstruct possible features that linear methods cannot detect. We showcase how our visualization tool helps explore and understand the extracted isosurface errors through synthetic and real-world data.
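
A one-dimensional toy example (not the paper's code) of why linear interpolation can misplace an isovalue crossing along a cell edge, compared with a cubic-spline reconstruction of the same samples:

```python
import numpy as np
from scipy.interpolate import CubicSpline
from scipy.optimize import brentq

# A smooth, higher-order test function sampled on a coarse grid (cell edges).
f = lambda x: np.sin(3.0 * x) - 0.4
xs = np.linspace(0.0, 1.0, 5)          # coarse grid samples
ys = f(xs)

# True crossing of the isovalue 0 on [0, 0.5].
true_root = brentq(f, 0.0, 0.5)

# Linear interpolation between the two samples that bracket the sign change,
# which is what standard marching cubes does along a cell edge.
i = np.where(np.sign(ys[:-1]) != np.sign(ys[1:]))[0][0]
t = ys[i] / (ys[i] - ys[i + 1])
linear_root = xs[i] + t * (xs[i + 1] - xs[i])

# A higher-order (cubic spline) reconstruction of the same samples.
spline = CubicSpline(xs, ys)
cubic_root = np.min(spline.roots(extrapolate=False))

print(f"true   : {true_root:.6f}")
print(f"linear : {linear_root:.6f}  (error {abs(linear_root - true_root):.2e})")
print(f"cubic  : {cubic_root:.6f}  (error {abs(cubic_root - true_root):.2e})")
```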



T.A.J. Ouermi, J. Li, Z. Morrow, B. Waanders, C.R. Johnson. “Glyph-Based Uncertainty Visualization and Analysis of Time-Varying Vector Fields,” In IEEE Workshop on Uncertainty Visualization: Applications, Techniques, Software, and Decision Frameworks, IEEE, pp. 73--77. 2024.
DOI: 10.1109/UncertaintyVisualization63963.2024.00014

ABSTRACT

Uncertainty is inherent to most data, including vector field data, yet it is often omitted in visualizations and representations. Effective uncertainty visualization can enhance the understanding and interpretability of vector field data. For instance, in the context of severe weather events such as hurricanes and wildfires, effective uncertainty visualization can provide crucial insights about fire spread or hurricane behavior and aid in resource management and risk mitigation. Glyphs are commonly used for representing vector uncertainty but are often limited to 2D. In this work, we present a glyph-based technique for accurately representing 3D vector uncertainty and a comprehensive framework for visualization, exploration, and analysis using our new glyphs. We employ hurricane and wildfire examples to demonstrate the efficacy of our glyph design and visualization tool in conveying vector field uncertainty.
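
As a rough illustration of the per-location summaries such a glyph can encode (not the authors' glyph construction), the sketch below reduces a hypothetical ensemble of 3D vectors to a mean direction, a magnitude spread, and an angular spread:

```python
import numpy as np

# Hypothetical ensemble of 3D wind vectors at one grid location
# (e.g., members of a hurricane forecast ensemble).
rng = np.random.default_rng(2)
ensemble = np.array([10.0, 2.0, 0.5]) + rng.normal(0, 1.5, size=(50, 3))

magnitudes = np.linalg.norm(ensemble, axis=1)
mean_vec = ensemble.mean(axis=0)
mean_dir = mean_vec / np.linalg.norm(mean_vec)

# Angle of each member relative to the mean direction (angular uncertainty).
unit = ensemble / magnitudes[:, None]
angles = np.degrees(np.arccos(np.clip(unit @ mean_dir, -1.0, 1.0)))

print("mean vector          :", np.round(mean_vec, 2))
print("magnitude mean ± std :", round(magnitudes.mean(), 2), "±", round(magnitudes.std(), 2))
print("angular spread (95th percentile, deg):", round(np.percentile(angles, 95), 1))
```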



A. Panta, X. Huang, N. McCurdy, D. Ellsworth, A. Gooch. “Web-based Visualization and Analytics of Petascale data: Equity as a Tide that Lifts All Boats,” In Proceedings of the IEEE Visualization Conference, IEEE, 2024.

ABSTRACT

Scientists generate petabytes of data daily to help uncover environmental trends or behaviors that are hard to predict. For example, understanding climate simulations based on the long-term average of temperature, precipitation, and other environmental variables is essential to predicting and establishing root causes of future undesirable scenarios and assessing possible mitigation strategies. While supercomputer centers provide a powerful infrastructure for generating petabytes of simulation output, accessing and analyzing these datasets interactively remains challenging on multiple fronts. This paper presents an approach to managing, visualizing, and analyzing petabytes of data within a browser on equipment ranging from the top NASA supercomputer to commodity hardware like a laptop. Our novel data fabric abstraction layer allows user-friendly querying of scientific information while hiding the complexities of dealing with file systems or cloud services. We also optimize network utilization while streaming from petascale repositories through state-of-the-art progressive compression algorithms. Based on this abstraction, we provide customizable dashboards that can be accessed from any device with any internet connection, enabling interactive visual analysis of vast amounts of data for a wide range of users - from top scientists with access to leadership-class computing environments to undergraduate students from disadvantaged backgrounds at minority-serving institutions. We focus on NASA’s use of petascale climate datasets as an example of particular societal impact and, therefore, a case where achieving equity in science participation is critical. We validate our approach by improving the ability of climate scientists to visually explore their data via two fully interactive dashboards. We further validate our approach by deploying the dashboards and simplified training materials in the classroom at a minority-serving institution. These dashboards, released in simplified form to the general public, contribute significantly to a broader push to democratize the access and use of climate data.



M. Parashar. “Enabling Responsible Artificial Intelligence Research and Development Through the Democratization of Advanced Cyberinfrastructure,” In Harvard Data Science Review, Special Issue 4: Democratizing Data, 2024.

ABSTRACT

Artificial intelligence (AI) is driving discovery, innovation, and economic growth, and has the potential to transform science and society. However, realizing the positive, transformative potential of AI requires that AI research and development (R&D) progress responsibly; that is, in a way that protects privacy, civil rights, and civil liberties, and promotes principles of fairness, accountability, transparency, and equity. This article explores the importance of democratizing AI R&D for achieving the goal of responsible AI and its potential impacts.



M. Parashar. “Everywhere & Nowhere: Envisioning a Computing Continuum for Science,” Subtitled “arXiv:2406.04480v1,” 2024.

ABSTRACT

Emerging data-driven scientific workflows are seeking to leverage distributed data sources to understand end-to-end phenomena, drive experimentation, and facilitate important decision-making. Despite the exponential growth of available digital data sources at the edge, and the ubiquity of non-trivial computational power for processing this data, realizing such science workflows remains challenging. This paper explores a computing continuum that is everywhere and nowhere – one spanning resources at the edges, in the core and in between, and providing abstractions that can be harnessed to support science. It also introduces recent research in programming abstractions that can express what data should be processed and when and where it should be processed, and autonomic middleware services that automate the discovery of resources and the orchestration of computations across these resources.



S. Parsa, B. Wang. “Harmonic Chain Barcode and Stability,” Subtitled “arXiv:2409.06093,” 2024.

ABSTRACT

The persistence barcode is a topological descriptor of data that plays a fundamental role in topological data analysis. Given a filtration of the space of data, a persistence barcode tracks the evolution of its homological features. In this paper, we introduce a new type of barcode, referred to as the canonical barcode of harmonic chains, or harmonic chain barcode for short, which tracks the evolution of harmonic chains. As our main result, we show that the harmonic chain barcode is stable and it captures both geometric and topological information of data. Moreover, given a filtration of a simplicial complex of size n with m time steps, we can compute its harmonic chain barcode in O(m^2 n^ω + mn^3) time, where n^ω is the matrix multiplication time. Consequently, a harmonic chain barcode can be utilized in applications in which a persistence barcode is applicable, such as feature vectorization and machine learning. Our work provides strong evidence in a growing list of literature that geometric (not just topological) information can be recovered from a persistence filtration.
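
Harmonic k-chains are the null space of the combinatorial (Hodge) Laplacian L_k = ∂_k^T ∂_k + ∂_{k+1} ∂_{k+1}^T. The toy sketch below computes them for a single simplicial complex (a hollow triangle); it illustrates the object being tracked, not the barcode algorithm itself:

```python
import numpy as np
from scipy.linalg import null_space

# Hollow triangle: vertices {0, 1, 2}, oriented edges e0=(0,1), e1=(1,2),
# e2=(0,2), and no 2-simplices. Rows of d1 are vertices, columns are edges.
d1 = np.array([
    [-1,  0, -1],   # vertex 0
    [ 1, -1,  0],   # vertex 1
    [ 0,  1,  1],   # vertex 2
], dtype=float)
d2 = np.zeros((3, 0))   # no triangles are filled in

# Hodge Laplacian on 1-chains.
L1 = d1.T @ d1 + d2 @ d2.T

harmonic = null_space(L1)    # basis of harmonic 1-chains
print(harmonic.round(3))     # one column proportional to (1, 1, -1), up to sign:
                             # the loop around the hole
```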



M. Penwarden, H. Owhadi, R.M. Kirby. “Kolmogorov n-Widths for Multitask Physics-Informed Machine Learning (PIML) Methods: Towards Robust Metrics,” Subtitled “arXiv preprint arXiv:2402.11126,” 2024.

ABSTRACT

Physics-informed machine learning (PIML) as a means of solving partial differential equations (PDE) has garnered much attention in the Computational Science and Engineering (CS&E) world. This topic encompasses a broad array of methods and models aimed at solving a single or a collection of PDE problems, called multitask learning. PIML is characterized by the incorporation of physical laws into the training process of machine learning models in lieu of large data when solving PDE problems. Despite the overall success of this collection of methods, it remains incredibly difficult to analyze, benchmark, and generally compare one approach to another. Using Kolmogorov n-widths as a measure of effectiveness of approximating functions, we judiciously apply this metric in the comparison of various multitask PIML architectures. We compute lower accuracy bounds and analyze the model's learned basis functions on various PDE problems. This is the first objective metric for comparing multitask PIML architectures and helps remove uncertainty in model validation from selective sampling and overfitting. We also identify avenues of improvement for model architectures, such as the choice of activation function, which can drastically affect model generalization to "worst-case" scenarios, which is not observed when reporting task-specific errors. We also incorporate this metric into the optimization process through regularization, which improves the models' generalizability over the multitask PDE problem.
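
A common computational proxy for the Kolmogorov n-width of a sampled solution set is the singular-value decay of its snapshot matrix; a toy sketch on an assumed parameterized family of functions (not the paper's benchmark problems):

```python
import numpy as np

# Parameterized family u(x; mu) = exp(-mu * x) sampled on a grid, standing in
# for solutions across a family of PDE tasks.
x = np.linspace(0.0, 1.0, 200)
mus = np.linspace(1.0, 10.0, 100)
snapshots = np.stack([np.exp(-mu * x) for mu in mus], axis=1)   # 200 x 100

# sigma_{n+1} of the snapshot matrix is a common proxy for how well the best
# n-dimensional linear subspace can capture the whole family.
sigma = np.linalg.svd(snapshots, compute_uv=False)
for n in (1, 2, 4, 8):
    print(f"n = {n}: relative proxy error = {sigma[n] / sigma[0]:.2e}")
```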



D.A. Quistberg, S.J. Mooney, T. Tasdizen, P. Arbelaez, Q.C. Nguyen. “Deep Learning Methods to Amplify Epidemiological Data Collection and Analyses,” In American Journal of Epidemiology, Oxford University Press, 2024.

ABSTRACT

Deep learning is a subfield of artificial intelligence and machine learning based mostly on neural networks and often combined with attention algorithms that has been used to detect and identify objects in text, audio, images, and video. Serghiou and Rough (Am J Epidemiol. 0000;000(00):0000-0000) present a primer for epidemiologists on deep learning models. These models provide substantial opportunities for epidemiologists to expand and amplify their research in both data collection and analyses by increasing the geographic reach of studies, including more research subjects, and working with large or high dimensional data. The tools for implementing deep learning methods are not quite yet as straightforward or ubiquitous for epidemiologists as traditional regression methods found in standard statistical software, but there are exciting opportunities for interdisciplinary collaboration with deep learning experts, just as epidemiologists have with statisticians, healthcare providers, urban planners, and other professionals. Despite the novelty of these methods, epidemiological principles of assessing bias, study design, interpretation and others still apply when implementing deep learning methods or assessing the findings of studies that have used them.



S. Saklani, C. Goel, S. Bansal, Z. Wang, S. Dutta, T. Athawale, D. Pugmire, C.R. Johnson. “Uncertainty-Informed Volume Visualization using Implicit Neural Representation,” In IEEE Workshop on Uncertainty Visualization: Applications, Techniques, Software, and Decision Frameworks, IEEE, pp. 62--72. 2024.
DOI: 10.1109/UncertaintyVisualization63963.2024.00013

ABSTRACT

The increasing adoption of Deep Neural Networks (DNNs) has led to their application in many challenging scientific visualization tasks. While advanced DNNs offer impressive generalization capabilities, understanding factors such as model prediction quality, robustness, and uncertainty is crucial. These insights can enable domain scientists to make informed decisions about their data. However, DNNs inherently lack the ability to estimate prediction uncertainty, necessitating new research to construct robust uncertainty-aware visualization techniques tailored for various visualization tasks. In this work, we propose uncertainty-aware implicit neural representations to model scalar field data sets effectively and comprehensively study the efficacy and benefits of estimated uncertainty information for volume visualization tasks. We evaluate the effectiveness of two principled deep uncertainty estimation techniques: (1) Deep Ensemble and (2) Monte Carlo Dropout (MC-Dropout). These techniques enable uncertainty-informed volume visualization in scalar field data sets. Our extensive exploration across multiple data sets demonstrates that uncertainty-aware models produce informative volume visualization results. Moreover, integrating prediction uncertainty enhances the trustworthiness of our DNN model, making it suitable for robustly analyzing and visualizing real-world scientific volumetric data sets.
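
A minimal sketch of the MC-Dropout idea applied to a coordinate-based (implicit neural) scalar-field model: dropout stays active at inference, and repeated stochastic forward passes yield a per-point mean prediction and an uncertainty estimate. The architecture and shapes below are assumptions, not the paper's implementation:

```python
import torch
import torch.nn as nn

class ScalarFieldINR(nn.Module):
    """Tiny coordinate MLP: (x, y, z) -> scalar value, with dropout layers."""
    def __init__(self, hidden: int = 128, p_drop: float = 0.1):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(3, hidden), nn.ReLU(), nn.Dropout(p_drop),
            nn.Linear(hidden, hidden), nn.ReLU(), nn.Dropout(p_drop),
            nn.Linear(hidden, 1),
        )

    def forward(self, coords: torch.Tensor) -> torch.Tensor:
        return self.net(coords)

@torch.no_grad()
def mc_dropout_predict(model: nn.Module, coords: torch.Tensor, n_samples: int = 32):
    """Mean and standard deviation over stochastic forward passes."""
    model.train()            # keep dropout active at inference time
    samples = torch.stack([model(coords) for _ in range(n_samples)])
    return samples.mean(dim=0), samples.std(dim=0)

# Query an (untrained, for illustration) model on a batch of 3D coordinates.
model = ScalarFieldINR()
coords = torch.rand(1024, 3)
mean, std = mc_dropout_predict(model, coords)
print(mean.shape, std.shape)   # value and per-point uncertainty for volume rendering
```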



S.A. Sakin, K.E. Isaacs. “A Literature-based Visualization Task Taxonomy for Gantt Charts,” Subtitled “arXiv:2408.04050,” 2024.

ABSTRACT

Gantt charts are a widely used idiom for visualizing temporal discrete event sequence data where dependencies exist between events. They are popular in domains such as manufacturing and computing for their intuitive layout of such data. However, these domains frequently generate data at scales that tax both the visual representation and the ability to render it at interactive speeds. To aid visualization developers who use Gantt charts in these situations, we develop a task taxonomy of low-level visualization tasks supported by Gantt charts and connect them to the data queries needed to support them. Our taxonomy is derived through a literature survey of visualizations using Gantt charts over the past 30 years.



C. Scully-Allison, I. Lumsden, K. Williams, J. Bartels, M. Taufer, S. Brink, A. Bhatele, O. Pearce, K. Isaacs. “Design Concerns for Integrated Scripting and Interactive Visualization in Notebook Environments,” In IEEE Transactions on Visualization and Computer Graphics, IEEE, 2024.
DOI: 10.1109/TVCG.2024.3354561

ABSTRACT

Interactive visualization can support fluid exploration but is often limited to predetermined tasks. Scripting can support a vast range of queries but may be more cumbersome for free-form exploration. Embedding interactive visualization in scripting environments, such as computational notebooks, provides an opportunity to leverage the strengths of both direct manipulation and scripting. We investigate interactive visualization design methodology, choices, and strategies under this paradigm through a design study of calling context trees used in performance analysis, a field that exemplifies typical exploratory data analysis workflows with Big Data and hard-to-define problems. We first produce a formal task analysis assigning tasks to graphical or scripting contexts based on their specificity, frequency, and suitability. We then design a notebook-embedded interactive visualization and validate it with intended users. In a follow-up study, we present participants with multiple graphical and scripting interaction modes to elicit feedback about notebook-embedded visualization design, finding consensus in support of the interaction model. We report and reflect on observations regarding the process and design implications for combining visualization and scripting in notebooks.



N. Shingde, T. Blattner, A. Bardakoff, W. Keyrouz, M. Berzins. “An illustration of extending Hedgehog to multi-node GPU architectures using GEMM,” In Springer Nature (to appear), 2024.

ABSTRACT

Asynchronous task-based systems offer the possibility of making it easier to take advantage of scalable heterogeneous architectures. This paper extends the previous work, demonstrating how Hedgehog, a dataflow graph-based model developed at the National Institute of Standards and Technology, can be used to obtain high performance for numerical linear algebraic operations as a starting point for complex algorithms. While the results were promising, it was unclear how to scale them to larger matrices and compute node counts. The aim here is to show how the new, improved algorithm inspired by DPLASMA performs equally well using Hedgehog. The results are compared against the leading library DPLASMA to illustrate the performance of different asynchronous dataflow models. The work demonstrates that using general-purpose, high-level abstractions, such as Hedgehog’s dataflow graphs, makes it possible to achieve similar performance to the specialized linear algebra codes such as DPLASMA.
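
Hedgehog itself is a C++ dataflow library; purely to illustrate the tile-level decomposition that such task-based GEMM codes exploit, the sketch below partitions C = A · B into independent output tiles and computes them concurrently (this is not Hedgehog or DPLASMA code):

```python
import numpy as np
from concurrent.futures import ThreadPoolExecutor

def tiled_gemm(A: np.ndarray, B: np.ndarray, tile: int = 256) -> np.ndarray:
    """Blocked C = A @ B; each (i, j) tile of C is an independent task that
    accumulates partial products over the shared k dimension."""
    m, k = A.shape
    k2, n = B.shape
    assert k == k2
    C = np.zeros((m, n), dtype=A.dtype)

    def compute_tile(i: int, j: int) -> None:
        # Tiles of C are disjoint, so tasks can run concurrently without locks.
        for p in range(0, k, tile):
            C[i:i + tile, j:j + tile] += A[i:i + tile, p:p + tile] @ B[p:p + tile, j:j + tile]

    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(compute_tile, i, j)
                   for i in range(0, m, tile)
                   for j in range(0, n, tile)]
        for f in futures:
            f.result()
    return C

A = np.random.rand(1024, 512)
B = np.random.rand(512, 768)
assert np.allclose(tiled_gemm(A, B), A @ B)
```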



A. Singh, S. Adams-Tew, S. Johnson, H. Odeen, J. Shea, A. Johnson, L. Day, A. Pessin, A. Payne, S. Joshi. “Treatment Efficacy Prediction of Focused Ultrasound Therapies Using Multi-parametric Magnetic Resonance Imaging,” In Cancer Prevention, Detection, and Intervention, Springer Nature Switzerland, pp. 190-199. 2024.

ABSTRACT

Magnetic resonance guided focused ultrasound (MRgFUS) is one of the most attractive emerging minimally invasive procedures for breast cancer, which induces localized hyperthermia, resulting in tumor cell death. Accurately assessing the post-ablation viability of all treated tumor tissue and surrounding margins immediately after MRgFUS thermal therapy is essential for evaluating treatment efficacy. While both thermal and vascular MRI-derived biomarkers are currently used to assess treatment efficacy, no adequately accurate methods exist for the in vivo determination of tissue viability during treatment. The non-perfused volume (NPV) acquired three or more days following MRgFUS thermal ablation treatment is most correlated with the gold standard of histology. However, its delayed timing impedes real-time guidance for the treating clinician during the procedure. We present a robust deep-learning framework that leverages multiparametric MR imaging acquired during treatment to predict treatment efficacy. The network uses qualitative T1- and T2-weighted images and MR temperature image-derived metrics to predict the three-day post-ablation NPV. To validate the proposed approach, an ablation study was conducted on a dataset (N=6) of VX2 tumor model rabbits that had undergone MRgFUS ablation. Using a deep learning framework, we evaluated which of the acquired MRI inputs were most predictive of treatment efficacy as compared to the expert radiologist-annotated 3-day post-treatment images.
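
Purely to make the input/output setup concrete, a simplified multi-channel encoder-decoder that maps stacked MR-derived images to a predicted NPV mask might look like the sketch below; the architecture, channel count, and image size are assumptions, not the paper's network:

```python
import torch
import torch.nn as nn

class NPVPredictor(nn.Module):
    """Assumed toy encoder-decoder: input channels are stacked MR-derived
    images (e.g., T1-weighted, T2-weighted, thermal-dose metrics); output is a
    per-pixel probability of belonging to the 3-day non-perfused volume."""
    def __init__(self, in_channels: int = 3):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, 32, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
        )
        self.decoder = nn.Sequential(
            nn.Upsample(scale_factor=2, mode="bilinear", align_corners=False),
            nn.Conv2d(64, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 1, 1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.sigmoid(self.decoder(self.encoder(x)))

x = torch.rand(2, 3, 128, 128)       # batch of 2, three MR-derived channels
print(NPVPredictor()(x).shape)       # -> torch.Size([2, 1, 128, 128])
```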



S. Subramaniam, M. Miller, C.R. Johnson, et al. “Grand Challenges at the Interface of Engineering and Medicine,” In IEEE Open Journal of Engineering in Medicine and Biology, Vol. 5, IEEE, pp. 1--13. 2024.
DOI: 10.1109/OJEMB.2024.3351717

ABSTRACT

Over the past two decades Biomedical Engineering has emerged as a major discipline that bridges societal needs of human health care with the development of novel technologies. Every medical institution is now equipped at varying degrees of sophistication with the ability to monitor human health in both non-invasive and invasive modes. The multiple scales at which human physiology can be interrogated provide a profound perspective on health and disease. We are at the nexus of creating “avatars” (herein defined as an extension of “digital twins”) of human patho/physiology to serve as paradigms for interrogation and potential intervention. Motivated by the emergence of these new capabilities, the IEEE Engineering in Medicine and Biology Society, the Departments of Biomedical Engineering at Johns Hopkins University and Bioengineering at University of California at San Diego sponsored an interdisciplinary workshop to define the grand challenges that face biomedical engineering and the mechanisms to address these challenges. The Workshop identified five grand challenges with cross-cutting themes and provided a roadmap for new technologies, identified new training needs, and defined the types of interdisciplinary teams needed for addressing these challenges. The themes presented in this paper include: 1) accumedicine through creation of avatars of cells, tissues, organs and whole human; 2) development of smart and responsive devices for human function augmentation; 3) exocortical technologies to understand brain function and treat neuropathologies; 4) the development of approaches to harness the human immune system for health and wellness; and 5) new strategies to engineer genomes and cells.



K.M. Sultan, M.H.H. Hisham, B. Orkild, A. Morris, E. Kholmovski, E. Bieging, E. Kwan, R. Ranjan, E. DiBella, S. Elhabian. “HAMIL-QA: Hierarchical Approach to Multiple Instance Learning for Atrial LGE MRI Quality Assessment,” Subtitled “arXiv:2407.07254v1,” 2024.

ABSTRACT

The accurate evaluation of left atrial fibrosis via high-quality 3D Late Gadolinium Enhancement (LGE) MRI is crucial for atrial fibrillation management but is hindered by factors like patient movement and imaging variability. The pursuit of automated LGE MRI quality assessment is critical for enhancing diagnostic accuracy, standardizing evaluations, and improving patient outcomes. The deep learning models aimed at automating this process face significant challenges due to the scarcity of expert annotations, high computational costs, and the need to capture subtle diagnostic details in highly variable images. This study introduces HAMIL-QA, a multiple instance learning (MIL) framework, designed to overcome these obstacles. HAMIL-QA employs a hierarchical bag and sub-bag structure that allows for targeted analysis within sub-bags and aggregates insights at the volume level. This hierarchical MIL approach reduces reliance on extensive annotations, lessens computational load, and ensures clinically relevant quality predictions by focusing on diagnostically critical image features. Our experiments show that HAMIL-QA surpasses existing MIL methods and traditional supervised approaches in accuracy, AUROC, and F1-Score on an LGE MRI scan dataset, demonstrating its potential as a scalable solution for LGE MRI quality assessment automation.
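
A simplified sketch of the hierarchical bag/sub-bag idea: attention pooling first aggregates slice-level features within each sub-bag, then a second attention layer aggregates sub-bag summaries into a volume-level quality prediction. Layer sizes and wiring are assumptions, not the HAMIL-QA implementation:

```python
import torch
import torch.nn as nn

class AttentionPool(nn.Module):
    """Attention-based MIL pooling: weighted sum of instance features."""
    def __init__(self, dim: int, hidden: int = 64):
        super().__init__()
        self.score = nn.Sequential(nn.Linear(dim, hidden), nn.Tanh(), nn.Linear(hidden, 1))

    def forward(self, x: torch.Tensor) -> torch.Tensor:    # x: (n_instances, dim)
        w = torch.softmax(self.score(x), dim=0)             # attention weights
        return (w * x).sum(dim=0)                           # (dim,)

class HierarchicalMILQA(nn.Module):
    def __init__(self, feat_dim: int = 256):
        super().__init__()
        self.subbag_pool = AttentionPool(feat_dim)   # slices -> sub-bag summary
        self.bag_pool = AttentionPool(feat_dim)      # sub-bags -> volume summary
        self.classifier = nn.Linear(feat_dim, 1)     # volume-level quality score

    def forward(self, subbags: list[torch.Tensor]) -> torch.Tensor:
        # subbags: list of (n_slices_i, feat_dim) tensors of slice features.
        summaries = torch.stack([self.subbag_pool(sb) for sb in subbags])
        volume = self.bag_pool(summaries)
        return torch.sigmoid(self.classifier(volume))

# One LGE MRI volume split into 4 sub-bags of slice features (assumed shapes).
subbags = [torch.rand(6, 256) for _ in range(4)]
print(HierarchicalMILQA()(subbags))   # scalar quality probability
```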



X. Tang, B. Zhang, B.S. Knudsen, T. Tasdizen. “DuoFormer: Leveraging Hierarchical Visual Representations by Local and Global Attention,” Subtitled “arXiv:2407.13920,” 2024.

ABSTRACT

Here we propose a novel hierarchical transformer model that adeptly integrates the feature extraction capabilities of Convolutional Neural Networks (CNNs) with the advanced representational potential of Vision Transformers (ViTs). Addressing the lack of inductive biases and dependence on extensive training datasets in ViTs, our model employs a CNN backbone to generate hierarchical visual representations. These representations are then adapted for transformer input through an innovative patch tokenization. We also introduce a 'scale attention' mechanism that captures cross-scale dependencies, complementing patch attention to enhance spatial understanding and preserve global perception. Our approach significantly outperforms baseline models on small and medium-sized medical datasets, demonstrating its efficiency and generalizability. The components are designed as plug-and-play for different CNN architectures and can be adapted for multiple applications.
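
One possible reading of the scale-attention idea (an interpretation with assumed shapes, not the DuoFormer code): tokens drawn from different levels of a CNN feature hierarchy are projected to a shared dimension and attend to one another along the scale axis at each spatial location:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ScaleAttention(nn.Module):
    """Self-attention across the scale dimension of a multi-scale feature pyramid."""
    def __init__(self, in_dims: list[int], dim: int = 256, heads: int = 4):
        super().__init__()
        self.proj = nn.ModuleList([nn.Conv2d(c, dim, kernel_size=1) for c in in_dims])
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, feats: list[torch.Tensor]) -> torch.Tensor:
        # feats: CNN feature maps at different scales, shapes (B, C_s, H_s, W_s);
        # resample all of them to the coarsest grid.
        target = feats[-1].shape[-2:]
        tokens = [F.adaptive_avg_pool2d(p(f), target) for p, f in zip(self.proj, feats)]
        x = torch.stack(tokens, dim=-1)                  # (B, dim, H, W, S)
        B, D, H, W, S = x.shape
        x = x.permute(0, 2, 3, 4, 1).reshape(B * H * W, S, D)
        out, _ = self.attn(x, x, x)                      # attend across scales
        return out.reshape(B, H, W, S, D).mean(dim=3)    # (B, H, W, dim)

feats = [torch.rand(2, 256, 56, 56), torch.rand(2, 512, 28, 28), torch.rand(2, 1024, 14, 14)]
print(ScaleAttention([256, 512, 1024])(feats).shape)     # torch.Size([2, 14, 14, 256])
```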



X. Tang, J. Berquist, B.A. Steinberg, T. Tasdizen. “Hierarchical Transformer for Electrocardiogram Diagnosis,” Subtitled “arXiv:2411.00755,” 2024.

ABSTRACT

Transformers, originally prominent in NLP and computer vision, are now being adapted for ECG signal analysis. This paper introduces a novel hierarchical transformer architecture that segments the model into multiple stages by assessing the spatial size of the embeddings, thus eliminating the need for additional downsampling strategies or complex attention designs. A classification token aggregates information across feature scales, facilitating interactions between different stages of the transformer. By utilizing depth-wise convolutions in a six-layer convolutional encoder, our approach preserves the relationships between different ECG leads. Moreover, an attention gate mechanism learns associations among the leads prior to classification. This model adapts flexibly to various embedding networks and input sizes while enhancing the interpretability of transformers in ECG signal analysis.
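
A compact sketch of the ingredients named above: a depth-wise 1D convolutional encoder that keeps the leads separate, a learned attention gate over the leads, and a transformer encoder with a classification token. Layer sizes and the exact wiring are assumptions, not the paper's architecture:

```python
import torch
import torch.nn as nn

class ECGTransformerSketch(nn.Module):
    def __init__(self, n_leads: int = 12, dim: int = 64, n_classes: int = 5):
        super().__init__()
        # Depth-wise convolutions (groups=n_leads) process each lead separately.
        self.encoder = nn.Sequential(
            nn.Conv1d(n_leads, n_leads * dim, kernel_size=15, stride=4, groups=n_leads),
            nn.ReLU(),
        )
        self.lead_gate = nn.Linear(dim, 1)          # attention gate over leads
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.transformer = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(dim, n_classes)

    def forward(self, ecg: torch.Tensor) -> torch.Tensor:   # ecg: (B, n_leads, T)
        B, L, _ = ecg.shape
        feats = self.encoder(ecg)                            # (B, L*dim, T')
        feats = feats.reshape(B, L, -1, feats.shape[-1])     # (B, L, dim, T')
        feats = feats.permute(0, 3, 1, 2)                    # (B, T', L, dim)
        gate = torch.softmax(self.lead_gate(feats), dim=2)   # weights over leads
        tokens = (gate * feats).sum(dim=2)                   # (B, T', dim)
        tokens = torch.cat([self.cls_token.expand(B, -1, -1), tokens], dim=1)
        out = self.transformer(tokens)
        return self.head(out[:, 0])                          # classify from the CLS token

print(ECGTransformerSketch()(torch.rand(2, 12, 2000)).shape)   # torch.Size([2, 5])
```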