\section{Visuo-Haptic Augmentations of Hand-Object Interactions} \label{visuo_haptic} Perception and manipulation of objects with the hand typically involves both the visual and haptic senses. Each sense has unique capabilities for perceiving certain object properties, such as color for vision or temperature for touch, but they are equally capable for many properties, such as roughness, hardness, or geometry \cite{baumgartner2013visual}. Both \AR and wearable haptic systems integrate virtual content into the user's perception as sensory illusions. It is essential to understand how a visuo-haptic rendering of a \VO is perceived as a coherent object property, and how wearable haptics have been integrated with immersive \AR.%, especially in immersive \AR where the haptic actuator is moved away so as not to cover the inside of the hand. % spatial and temporal integration of visuo-haptic feedback as perceptual cues vs proprioception and real touch sensations % delocalized : not at the point of contact = difficult to integrate with other perceptual cues ? \subsection{Visuo-Haptic Perception of Virtual and Augmented Objects} \label{vh_perception} \subsubsection{Merging the Sensations into a Perception} \label{sensations_perception} A \emph{perception} is the merging of multiple sensations from different sensory modalities (visual, cutaneous, proprioceptive, etc.) about the same event or object property \cite{ernst2004merging}. For example, it is the haptic hardness perceived through skin pressure and force sensations (\secref{hardness}), the hand movement from proprioception and a visual hand avatar (\secref{ar_displays}), or the perceived size of a real object with a co-localized \VO (\secref{ar_tangibles}). If the sensations are redundant, \ie if only one sensation could suffice to estimate the property, they are integrated to form a single perception \cite{ernst2004merging}. No sensory information is completely reliable and may give different answers to the same property when measured multiple times, \eg the weight of an object. Therefore, each sensation $i$ is said to be an estimate $\tilde{s}_i$ with variance $\sigma_i^2$ of the property $s$. The \MLE model then predicts that the integrated estimated property $\tilde{s}$ is the weighted sum of the individual sensory estimates: \begin{equation}{MLE} \tilde{s} = \sum_i w_i \tilde{s}_i \quad \text{with} \quad \sum_i w_i = 1 \end{equation} Where the individual weights $w_i$ are proportional to their inverse variances: \begin{equation}{MLE_weights} w_i = \frac{1/\sigma_i^2}{\sigma^2} \end{equation} And the integrated variance $\sigma^2$ is the inverse of the sum of the individual variances: \begin{equation}{MLE_variance} \sigma^2 = \left( \sum_i \frac{1}{\sigma_i^2} \right)^{-1} \end{equation} This was demonstrated by \textcite{ernst2002humans} in a user study where participants estimated the height of a virtual bar using a fixed-window \OST-\AR display (\secref{ar_displays}) and force-feedback devices worn on the thumb and index finger (\secref{wearability_level}), as shown in \figref{ernst2002humans_setup}. %They first measured the individual variances of the visual and haptic estimates (\figref{ernst2002humans_within}) and then the combined variance of the visuo-haptic estimates (\figref{ernst2002humans_visuo-haptic}). On each trial, participants compared the visuo-haptic reference bar (of a fixed height) to a visuo-haptic comparison bar (of a variable height) in a \TIFC task (one bar is tested first, a pause, then the other) and indicated which was taller. The reference bar had different conflicting visual $s_v$ and haptic $s_h$ heights, and different noise levels were added to the visual feedback to increase its variance. The objective was to determine a \PSE between the comparison and reference bars, where the participant was equally likely to choose one or the other (\percent{50} of the trials). %\figref{ernst2002humans_within} shows the discrimination of participants with only the haptic or visual feedback, and how much the estimation becomes difficult (thus higher variance) when noise is added to the visual feedback. \figref{ernst2004merging_results} shows that when the visual noise was low, the visual feedback had more weight, but as visual noise increased, haptic feedback gained more weight, as predicted by the \MLE model. \begin{subfigs}{ernst2002humans}{ Visuo-haptic perception of height of a virtual bar \cite{ernst2002humans}. }[][ \item Experimental setup.%: Participants estimated height visually with an \OST-\AR display and haptically with force-feedback devices worn on the thumb and index fingers. %\item with only haptic feedback (red) or only visual feedback (blue, with different added noise), %\item combined visuo-haptic feedback (purple, with different visual noises). \item Proportion of trials (vertical axis) where the comparison bar (horizontal axis) was perceived taller than the reference bar as function of increase variance (inverse of reliability) of the visual feedback (colors). The reference had different conflicting visual $s_v$ and haptic $s_h$ heights. ] \subfig[.34]{ernst2002humans_setup} \subfig[.64]{ernst2004merging_results} %\subfig{ernst2002humans_within} %\subfig{ernst2002humans_visuo-haptic} \end{subfigs} %Hence, the \MLE model explains how a (visual) \VO in \AR can be perceived as coherent when combined with real haptic sensations of a tangible or a wearable haptic feedback. The \MLE model implies that when seeing and touching a \VO in \AR, the combination of visual and haptic stimuli, real or virtual, presented to the user can be perceived as a coherent single object property. %As long as the user is able to associate the sensations as the same object property, and even if there are discrepancies between the sensations, the overall perception can be influenced by changing one of the stimuli, as discussed in the next sections. %for example by including tangible objects, wearable haptic feedback, or even by altering the visual rendering of the \VO, as discussed in the next sections. \subsubsection{Influence of Visual Rendering on Haptic Perception} \label{visual_haptic_influence} Thus, a visuo-haptic perception of an object's property is robust to some difference between the two sensory modalities, as long as one can match their respective sensations to the same property. In particular, the texture perception of objects is known to be constructed from both vision and touch \cite{klatzky2010multisensory}. More precisely, when surfaces are evaluated by vision or touch alone, both senses discriminate their materials mainly by the same properties of roughness, hardness, and friction, and with similar performance \cite{bergmanntiest2007haptic,baumgartner2013visual,vardar2019fingertip}. The overall perception can then be modified by changing one of the sensory modalities. \textcite{yanagisawa2015effects} altered the perceived roughness, stiffness, and friction of real tactile materials touched by the finger by superimposing different real visual textures using a half-mirror. In a similar setup, but in immersive \VST-\AR, \textcite{kitahara2010sensory} overlaid visual textures on real textured surfaces touched through a glove: many visual textures were found to match the real haptic textures. \textcite{degraen2019enhancing} and \textcite{gunther2022smooth} also combined multiple \VOs in \VR with \ThreeD-printed hair structures or with everyday real surfaces, respectively. They found that the visual perception of roughness and hardness influenced the haptic perception, and that only a few real, tangible objects seemed to be sufficient to match all the visual \VOs (\figref{gunther2022smooth}). %Taken together, these studies suggest that a set of haptic textures, real or virtual, can be perceived as coherent with a larger set of visual virtual textures. \fig{gunther2022smooth}{In a passive touch context in \VR, only a smooth and a rough real surfaces were found to match all the visual \VOs \cite{gunther2022smooth}.} Visual feedback can even be intentionally designed to influence haptic perception, usually by deforming the visual representation of a user input, creating a \enquote{pseudo-haptic feedback} \cite{ujitoko2021survey}. For example, in a fixed \VST-\AR screen (\secref{ar_displays}), by visually deforming the geometry of a real object touched by the hand, as well as the touching hand, the visuo-haptic perception of the size, shape, or curvature can be altered \cite{ban2013modifying,ban2014displaying}. \textcite{punpongsanon2015softar} used this technique in spatial \AR (\secref{ar_displays}) to induce a softness illusion of a hard real surface by superimposing a virtual texture that deforms when pressed by the hand (\figref{punpongsanon2015softar}). \textcite{ujitoko2019modulating} increased the perceived roughness of a virtual patterned texture rendered as vibrations through a hand-held stylus (\secref{texture_rendering}) by adding small oscillations to the visual feedback of the stylus on a screen. \begin{subfigs}{pseudo_haptic}{Pseudo-haptic feedback in \AR. }[][ \item A virtual soft texture projected on a table and that deforms when pressed by the hand \cite{punpongsanon2015softar}. \item Modifying visually a real object and the hand touching it in \VST-\AR to modify its perceived shape \cite{ban2014displaying}. ] \subfigsheight{42mm} \subfig{punpongsanon2015softar} \subfig{ban2014displaying} \end{subfigs} %In all of these studies, the visual expectations of participants influenced their haptic perception. %In particular, in \AR and \VR, the perception of a haptic rendering or augmentation can be influenced by the visual rendering of the \VO. \subsubsection{Perception of Visuo-Haptic Rendering in AR and VR} \label{ar_vr_haptic} Some studies have investigated the visuo-haptic perception of \VOs rendered with force-feedback and vibrotactile feedback in \AR and \VR. In an immersive \VST-\AR setup, \textcite{knorlein2009influence} rendered a virtual piston using force-feedback haptics that participants pressed directly with their hand (\figref{visuo-haptic-stiffness}). In a \TIFC task (\secref{sensations_perception}), participants pressed two pistons and indicated which was stiffer. One had a reference stiffness but an additional visual or haptic delay, while the other varied with a comparison stiffness but had no delay. \footnote{Participants were not told about the delays and stiffness tested, nor which piston was the reference or comparison. The order of the pistons (which one was pressed first) was also randomized.} Adding a visual delay increased the perceived stiffness of the reference piston, while adding a haptic delay decreased it, and adding both delays cancelled each other out (\figref{knorlein2009influence_2}). \begin{subfigs}{visuo-haptic-stiffness}{ Perception of haptic stiffness in \VST-\AR \cite{knorlein2009influence}. }[][ \item Participant pressing a virtual piston rendered by a force-feedback device with their hand. \item Proportion of comparison piston perceived as stiffer than reference piston (vertical axis) as a function of the comparison stiffness (horizontal axis) and visual and haptic delays of the reference (colors). ] \subfig[.44]{knorlein2009influence_1} \subfig[.55]{knorlein2009influence_2} \end{subfigs} %explained how these delays affected the integration of the visual and haptic perceptual cues of stiffness. The stiffness $\tilde{k}(t)$ of the piston is indeed estimated at time $t$ by both sight and proprioception as the ratio of the exerted force $F(t)$ and the displacement $D(t)$ of the piston, following \eqref{stiffness}, but with potential visual $\Delta t_v$ or haptic $\Delta t_h$ delays: \begin{equation}{stiffness_delay} \tilde{k}(t) = \frac{F(t + \Delta t_h)}{D(t + \Delta t_v)} \end{equation} Therefore, the perceived stiffness $\tilde{k}(t)$ increases with a haptic delay in force and decreases with a visual delay in displacement \cite{diluca2011effects}. In a similar \TIFC user study, participants compared perceived stiffness of virtual pistons in \OST-\AR and \VR \cite{gaffary2017ar}. However, the force-feedback device and the participant's hand were not visible (\figref{gaffary2017ar}). The reference piston was judged to be stiffer when seen in \VR than in \AR, without participants noticing this difference, and more force was exerted on the piston overall in \VR. This suggests that the haptic stiffness of \VOs feels \enquote{softer} in an \AE than in a full \VE. %Two differences that could be worth investigating with the two previous studies are the type of \AR (visuo or optical) and to see the hand touching the \VO. \begin{subfigs}{gaffary2017ar}{Perception of haptic stiffness in \OST-\AR \vs \VR \cite{gaffary2017ar}. }[][ \item Experimental setup: a virtual piston was pressed with a force-feedback placed to the side of the participant. \item View of the virtual piston seen in front of the participant in \OST-\AR and \item in \VR. ] \subfig[0.35]{gaffary2017ar_1} \subfig[0.3]{gaffary2017ar_3} \subfig[0.3]{gaffary2017ar_4} \end{subfigs} Finally, \textcite{diluca2019perceptual} investigated the perceived simultaneity of visuo-haptic contact with a \VO in \VR. The contact was rendered both by a vibrotactile piezoelectric device on the fingertip and by a visual change in the color of the \VO. The visuo-haptic simultaneity was varied by adding a visual delay or by triggering the haptic feedback earlier. No participant (out of 19) was able to detect a \qty{50}{\ms} visual lag and a \qty{15}{\ms} haptic lead, and only half of them detected a \qty{100}{\ms} visual lag and a \qty{70}{\ms} haptic lead. These studies have shown how the latency of the visual rendering of a \VO or the type of environment (\VE or \RE) can affect the perceived haptic stiffness of the object, rendered with a grounded force-feedback device. We describe in the next section how wearable haptics have been integrated with immersive \AR. \subsection{Wearable Haptics for Direct Hand Interaction in AR} \label{vhar_haptics} A few wearable haptic devices have been specifically designed or experimentally tested for direct hand interaction in immersive \AR. Since virtual or augmented objects are naturally touched, grasped, and manipulated directly with the fingertips (\secref{exploratory_procedures} and \secref{grasp_types}), the main challenge of wearable haptics for \AR is to provide haptic sensations of these interactions while keeping the fingertips free to interact with the \RE. Several approaches have been proposed to move the haptic actuator to a different location, on the outside of the finger or the hand, \eg the nail, the top of a phalanx, or the wrist. Yet, they differ greatly in the actuators used (\secref{wearable_haptic_devices}), thus the haptic feedback (\secref{tactile_rendering}), and the placement of the haptic rendering. Other wearable haptic actuators have been proposed for \AR, but are not discussed here. A first reason is that they permanently cover the fingertip and affect the interaction with the \RE, such as thin-skin tactile interfaces \cite{withana2018tacttoo,teng2024haptic} or fluid-based interfaces \cite{han2018hydroring}. Another category of actuators relies on systems that cannot be considered as portable, such as REVEL \cite{bau2012revel}, which provide friction sensations with reverse electrovibration that must modify the real objects to augment, or Electrical Muscle Stimulation (EMS) devices \cite{lopes2018adding}, which provide kinesthetic feedback by contracting the muscles. \subsubsection{Nail-Mounted Devices} \label{vhar_nails} \textcite{ando2007fingernailmounted} were the first to move the actuator from the fingertip to propose the nail, as described in \secref{texture_rendering}. This approach was later extended by \textcite{teng2021touch} with Touch\&Fold, a haptic device able to unfold its end-effector on demand to make contact with the fingertip when touching \VOs (\figref{teng2021touch_1}). This moving platform also contains a \LRA (\secref{moving_platforms}) and provides contact pressure and texture sensations. %The whole system is very compact (\qtyproduct{24 x 24 x 41}{\mm}), lightweight (\qty{9.5}{\g}), and fully portable by including a battery and Bluetooth wireless communication. \qty{20}{\ms} for the Bluetooth When touching \VOs in \OST-\AR with the index finger, this device was found to be more realistic overall (5/7) than vibrations with a \LRA at \qty{170}{\Hz} on the nail (3/7). Still, there is a high (\qty{92}{\ms}) latency for the folding mechanism and this design is not suitable for augmenting real objects. % teng2021touch: (5.27+3.03+5.23+5.5+5.47)/5 = 4.9 % ando2007fingernailmounted: (2.4+2.63+3.63+2.57+3.2)/5 = 2.9 With Fingeret, \textcite{maeda2022fingeret} adapted the belt actuators (\secref{belt_actuators}) as a \enquote{finger-side actuator} that leaves the fingertip free (\figref{maeda2022fingeret}). Two rollers, one on each side, can deform the skin: When rotated inward, they pull the skin, simulating a contact sensation, and when rotated outward, they push the skin, simulating a release sensation. They can also simulate a texture sensation by rapidly rotating in and out. %The device is also very compact (\qty{60 x 25 x 36}{\mm}), lightweight (\qty{18}{\g}), and portable with a battery and Bluetooth wireless communication with \qty{83}{\ms} latency. In a user study not in \AR, but directly touching images on a tablet, Fingeret was found to be more realistic (4/7) than a \LRA at \qty{100}{\Hz} on the nail (3/7) for rendering buttons and a patterned texture (\secref{texture_rendering}), but not different from vibrations for rendering high-frequency textures (3.5/7 for both). However, as with \textcite{teng2021touch}, finger speed was not taken into account when rendering vibrations, which may have been detrimental to texture perception, as described in \secref{texture_rendering}. Finally, \textcite{preechayasomboon2021haplets} (\figref{preechayasomboon2021haplets}) and \textcite{sabnis2023haptic} designed Haplets and Haptic Servo, respectively: These are compact and lightweight vibrotactile \LRA devices designed to provide both integrated finger motion sensing and low latency haptic feedback (\qty{<5}{ms}). However, no proper user study has been conducted to evaluate these devices in \AR. \begin{subfigs}{ar_wearable}{Nail-mounted wearable haptic devices designed for \AR. }[][ %\item A voice-coil rendering a virtual haptic texture on a real sheet of paper \cite{ando2007fingernailmounted}. \item Touch\&Fold provide contact pressure and vibrations on demand to the fingertip \cite{teng2021touch}. \item Fingeret is a finger-side wearable haptic device that pulls and pushs the fingertip skin \cite{maeda2022fingeret}. \item Haplets is a compact nail device with integrated sensing and vibrotactile feedback \cite{preechayasomboon2021haplets}. ] \subfigsheight{33mm} %\subfig{ando2007fingernailmounted} \subfig{teng2021touch_1} \subfig{maeda2022fingeret} \subfig{preechayasomboon2021haplets} \end{subfigs} \subsubsection{Belt Devices} \label{vhar_rings} The haptic ring belt devices of \textcite{minamizawa2007gravity} and \textcite{pacchierotti2016hring}, presented in \secref{belt_actuators}, have been used to improve the manipulation of \VOs in \AR (\secref{ar_interaction}). Recall that these devices have also been used to modify the perceived stiffness, softness, friction and localized bumps and holes on smooth real surfaces (\secref{hardness_rendering}) \cite{detinguy2018enhancing,salazar2020altering}, but have not been tested in \AR. In a \VST-\AR setup, \textcite{scheggi2010shape} explored the effect of rendering the weight of a virtual cube placed on a real surface hold with the thumb, index, and middle fingers (\figref{scheggi2010shape}). The middle phalanx of each of these fingers was equipped with a haptic ring of \textcite{minamizawa2007gravity}. %However, no proper user study was conducted to evaluate this feedback.% on the manipulation of the cube. %that simulated the weight of the cube. %A virtual cube that could push on the cube was manipulated with the other hand through a force-feedback device. \textcite{scheggi2010shape} report that 12 out of 15 participants found the weight haptic feedback essential to feeling the presence of the virtual cube. In a pick-and-place task in non-immersive \VST-\AR involving direct hand manipulation of both virtual and real objects (\figref{maisto2017evaluation}), \textcite{maisto2017evaluation} and \textcite{meli2018combining} compared the effects of providing haptic or visual feedback about fingertip-object contacts. They compared the haptic ring of \textcite{pacchierotti2016hring} on the proximal phalanx, the moving platform of \textcite{chinello2020modular} on the fingertip, and a visual rendering of the tracked fingertips as virtual points. They showed that the haptic feedback improved the completion time, reduced the force exerted on the cubes compared to the visual feedback (\figref{visual-hands}). The haptic ring was also perceived as more effective than the moving platform. However, the measured difference in performance could be due to either the device or the device position (proximal vs fingertip), or both. These two studies were also conducted in non-immersive setups, where users viewed a screen displaying the visual interactions, and only compared the haptic and visual rendering of the hand-object contacts, but did not examine them together. \begin{subfigs}{ar_rings}{Wearable haptic ring devices for \AR. }[][ \item Rendering weight of a virtual cube placed on a real surface \cite{scheggi2010shape}. \item Rendering the contact force exerted by the fingers on a virtual cube \cite{maisto2017evaluation,meli2018combining}. ] \subfigsheight{57mm} \subfig{scheggi2010shape} \subfig{maisto2017evaluation} \end{subfigs} %\subsubsection{Wrist Bracelet Devices} %\label{vhar_bracelets} With their \enquote{Tactile And Squeeze Bracelet Interface} (Tasbi), already mentioned in \secref{belt_actuators}, \textcite{pezent2019tasbi} and \textcite{pezent2022design} explored the use of a wrist-worn bracelet actuator. It is capable of providing a uniform pressure sensation (up to \qty{15}{\N} and \qty{10}{\Hz}) and vibration with six \LRAs (\qtyrange{150}{200}{\Hz} bandwidth). A user study was conducted in \VR to compare the perception of visuo-haptic stiffness rendering \cite{pezent2019tasbi}, and showed that the haptic pressure feedback was more important than the visual displacement. %In a \TIFC task (\secref{sensations_perception}), participants pressed a virtual button with different levels of stiffness via a virtual hand constrained by the \VE (\figref{pezent2019tasbi_2}). %A higher visual stiffness required a larger physical displacement to press the button (C/D ratio, see \secref{pseudo_haptic}), while the haptic stiffness control the rate of the pressure feedback when pressing. %When the visual and haptic stiffness were coherent or when only the haptic stiffness changed, participants easily discriminated two buttons with different stiffness levels (\figref{pezent2019tasbi_3}). %However, if only the visual stiffness changed, participants were not able to discriminate the different stiffness levels (\figref{pezent2019tasbi_4}). %This suggests that in \VR, the haptic pressure is more important perceptual cue than the visual displacement to render stiffness. %A short vibration (\qty{25}{\ms} \qty{175}{\Hz} square-wave) was also rendered when contacting the button, but kept constant across all conditions: It may have affected the overall perception when only the visual stiffness changed. %\begin{subfigs}{pezent2019tasbi}{Visuo-haptic stiffness rendering of a virtual button in \VR with the Tasbi bracelet. }[][ % \item The \VE seen by the user: the virtual hand (in beige) is constrained by the virtual button. The displacement is proportional to the visual stiffness. The real hand (in green) is hidden by the \VE. % \item When the rendered visuo-haptic stiffness are coherents (in purple) or only the haptic stiffness change (in blue), participants easily discrimated the different levels. % \item When varying only the visual stiffness (in red) but keeping the haptic stiffness constant, participants were not able to discriminate the different stiffness levels. % ] % \subfigsheight{45mm} % \subfig{pezent2019tasbi_2} % \subfig{pezent2019tasbi_3} % \subfig{pezent2019tasbi_4} %\end{subfigs} % \cite{sarac2022perceived,palmer2022haptic} not in \AR but studies on relocating to the wrist the haptic feedback of the fingertip-object contacts. %\subsection{Conclusion} %\label{visuo_haptic_conclusion}