Open Access Paper
11 February 2020 Selected problems of image data preprocessing used to perform examination in Parkinson’s disease
Author Affiliations +
Proceedings Volume 11442, Radioelectronic Systems Conference 2019; 114420G (2020) https://doi.org/10.1117/12.2565138
Event: Radioelectronic Systems Conference 2019, 2019, Jachranka, Poland
Abstract
Engineering support in the field of distinguishing Parkinson's disease from other diseases, diagnosing its progression and monitoring the effectiveness of drug treatment is nowadays implemented by way of recording and analyzing equipment fitted with motion sensors. The time series they provide enable quantitative evaluation of a set of symptoms describing daily activities and motor abilities of patients. The paper presents the preliminary results of fundamental research, which based on known medical observations indicating the diminution of facial expressions and micrographic apart from general motor deterioration, suggest that the clinical studies could utilize the techniques of processing image data acquired during the medical history taking. The image data includes video recording of the face and limbs conducted in the course of the coercions suggested in the study and manual drawings by patients. The image data are redundant and require processing for presentations facilitating their interpretation by a physician and enabling efficient utilization of machine learning algorithms in the next study stage. Within the framework of preliminary processing of acquired images, attempts were made to determine the quantitative measures, such as, e.g. blinking frequency and the indicators generated as a result of analyzing the position of characteristic points within the facial image. In the case of limbs, it is suggested to reproduce the motion on the image using a time series acquired thanks to the fixed markers. Preliminary processing of data coming from a graphic tablet also guarantees the generation of time series for images created by patients.

1.

INTRODUCTION

Engineering support in the field of distinguishing Parkinson’s disease (PD) from other diseases, diagnosing its progression and monitoring the effectiveness of drug treatment is nowadays widely implemented within the framework of research focused on recording and analyzing equipment fitted with motion sensors, e.g. accelerometers and gyroscopes, fastened to the body of a patient. They are used for examination and for the quantitative evaluation of various limb tremor forms, as well as postural and gait disorders, and fall detection [1-5]. Despite the fact the technological advancement of such equipment opens broad opportunities in terms of evaluating motor-related PD symptoms, many issues associated with their clinical application has not yet been sufficiently analyzed. Questions regarding the number and type of used sensors and their location on a patient’s body in order to improve method sensitivity are still open; studies aimed at searching for indicators more specific to this disease are also attempted [6].

The theme of the article focuses on studying the possibilities of an innovative application for artificial intelligence in the process of early diagnosis of patients with Parkinson’s disease and the monitoring of its progression. It presents the preliminary results of fundamental research, which suggest using the processing techniques for image data acquired during a clinical study evaluating a patient using a unified numerical scale named UPDRS [7]. The research was conducted on image-based material, which corresponds to the selected components of the scale and provides a potential opportunity to assess the motor function, facial expression, as well as writing and drawing. The presented results supplement the study devoted to preliminary voice analysis in Parkinson’s disease [8] and are based on the data acquired for the same patients and using a measuring set for multimodal data recording [9]. The image data includes video recording of the face and limbs conducted in the course of the impositions suggested in the study and a manual drawing by a patient. The objective of the attempted research was to develop a method for processing redundant image data for simpler representation, facilitating their interpretation and enabling the effective use of machine learning algorithms in the next study stage.

2.

MATERIAL

The research was conducted within a pilot recording program under the participation of the patients from the Department of Neurology at the Medical University of Warsaw. The group consisted of 9 men and 5 women, aged 26 to 64. The tested group comprised of 5 healthy persons and 9 diagnosed with Parkinson’s disease. The group included patients treated pharmacologically, in various disease progress stages, with a disease history of several to a dozen or so years. The majority of patients (6 people) exhibited single-sided symptoms on the right or left part of the body, with two-sided symptoms in 3 patients. During the tests, the patients were in the so-called “On” state, which means with a slightly increased intensity of the symptoms, characteristic for the lower range of the UPDRS applied in diagnostics. In this respect, the standard diagnostic methods are characterized by low sensitivity.

3.

SUGGESTED PROCESSING METHODS

3.1

Patient’s limb motion

In the case of limbs, it is suggested to map the movement on the image using a time series representing time-variable coordinates of the areas of interest or the value of their functions. In order to simplify this task, markers of the same color were fixed to the patient’s limb during the recording, so that it was possible to apply segmentation through color image binarization in the primary color channels, and cluster analysis. The values of the TR, TG and TB thresholds in each of the IR, IG and IB channels were adapted to the color of the used marker. Therefore, the IBIN output image after binarization for the red marker used in the finger tapping test [9] was determined as follows:

00465_psisdg11442_114420g_page_2_1.jpg

The aforementioned binarization shall ensure obtaining points within a plane, the location of which corresponds to the fingertips and shall enable determining the coordinates for their centers using the cluster method. However, the binarization result shown in Fig. 1A indicates the presence of numerous artefacts associated with the occurrence of colors similar to the marker color within the image.

Figure 1.

Idea of image segmentation for the purposes of separating fingertip areas: A – binarization result, B – erroneous detection of fingertips using the k-mean method based on image A, C – median filtration of a binary image, D – correct detection of finger tips using the k-mean method based on image C.

00465_psisdg11442_114420g_page_3_1.jpg

Grouping algorithm for k-means for a two-class problem (2 fingers) provides in such a case false coordinate values for detected clusters – Fig. 1B. The nature of the occurring artefacts is similar to the “salt & pepper” type noise, which is why it can be easily eliminated using median filtration. Fig. 1C shows the result of such filtration using a size-20 square mask filter, along with the result of k-mean clusterization in the form of two-cluster centers. The cluster centers projected on the photo in Fig. 1D correspond to fingertips.

Fig. 2 shows subsequent video stream frames with the effect of applied aforementioned segmentation for tracking patient fingertips for the purposes of the finger tapping test (point III.4 of UPDRS). Using x1 and x2 to mark the coordinates of both cluster centers, it is possible to determine the Euclidean distances between them based on the sequence of successive video recording frames:

Figure 2.

Limb motion tracking illustration based on finger tapping test example (circles in green are coordinates of points returned by the developed algorithm).

00465_psisdg11442_114420g_page_3_3.jpg
00465_psisdg11442_114420g_page_3_2.jpg

and thus, obtain a time series representing relative limb motion.

Fig. 3 shows time waveforms – obtained this way – for the right and left hand of the patient diagnosed with the classic form of Parkinson’s disease with left-sided symptoms, evaluated in terms of the motor component at 24 UPDRS points.

Figure 3.

Illustration of time waveforms and signal spectra of distances between the fingers in point III.4 of a clinical study for the right hand (above) and left hand (below) in a PD patient.

00465_psisdg11442_114420g_page_4_1.jpg

The illustration of time waveforms enables comparing the range of motion for both hands and evaluate the motion regularity disturbances, included in point III.4 of UPDRS. At the same time, simple processing based on, e.g., spectral analysis (Fig. 3) enables to quantitatively express the mean number of finger taps. In the case discussed in Fig. 3 this number was ca. 2.5 per second for the right hand and ca. 1.5 per second for the left. For comparison, Fig. 4 shows a time waveform and the spectrum of a signal obtained in the case of a healthy person. We can notice better regularity and a greater motion speed, which can be assessed at ca. 3.5 taps per second.

Figure 4.

Time waveform and signal spectrum for a distance between the right-hand fingers in a healthy person.

00465_psisdg11442_114420g_page_4_2.jpg

3.2

Drawing created by a patient

The motivation behind commencing research in the field of analyzing artwork created by a patient is a symptom often brought up by the patients themselves, which results from the slowdown, tremor, as well as muscle stiffness in the hand, namely, micrographics [10]. In the course of the implemented pilot research, the patients were first asked to write the sentence “Dzisiaj jest ładna pogoda” (eng. The weather is nice today) five times, one by one. There are naturally ongoing changes during writing, the quantitative assessment of which can find a diagnostic application. These include pressure of the pen on the piece of paper, reducing letters, shifting of the successive sentences relative to the previous ones, appearance of larger spacing between the words and time intervals between subsequent words or sentences resulting from the need to longer think about the following activity, which might indicate the deterioration of cognitive abilities. The second task involved drawing an Archimedean spiral, successively using the right and then the left hand, and then following a contour of a circle, once again, with the right and then the left hand. Both these activities can reveal not only differences associated with right- or left-handedness, but also with the impact of the disease on one side of the body only.

The properties of the used graphic tablet provide a potential to quantitatively describe the performed activities and the changes therein. The completed recording contain such information as the sample number, recording time in [ms], position of the pen on the X and Y axes of the tablet, pressure in arbitrary units, pen inclination angle relative to the tablet surface in [°] and the angle included between the Y axis of the table and the pen, the so-called azimuth in [°]. Table 1 shows a fragment of collected sample data.

Table 1.

Examples of a data set recorded by the used graphic tablet.

sample numbertimestamp [ms]coordinate x [cm]coordinate y [cm]pressure [arb. unit]inclination angle [°]azimuth [°]
102.68518.6121485183
252.68518.61657005183
3102.68518.61969125183
4152.68518.62376965181
5202.68518.62489525081
6252.68618.625102205080
7302.68618.625108685080

The resulting data are in tabulated form, which requires preliminary conversion to a simpler form facilitating their interpretation.

3.2.1

Drawing reproduction

Using a computer screen to recreate an image created by a patient on a piece of paper placed on the surface of the tablet requires, first of all, to remove all redundant samples collected when the pen was raised above the tablet surface. As a result, the drawing will only contain the exact trace, which has been drawn on the piece of paper in reality. A finished image is created as a graph of relationships between the value of coordinates X and Y, using points of appropriate thickness. The ultimate effect is a black image on a white background. Thus reproduced, e.g., Archimedean spiral images drawn with the right and left hands are shown in Fig. 5.

Figure 5.

Reproduced spiral drawings created by the same patient, successively, with the right and left hands.

00465_psisdg11442_114420g_page_5_1.jpg

The analysis of the obtained images provides certain interpretation-related possibilities, but does not utilize information about their creation. Such additional opportunities in terms of interpretation appear thanks to an analysis of the time waveforms corresponding to the image created by the patient.

3.2.2

Time series analysis

The data obtained from the tablet, such as stylus position, pressure, angle of inclination or azimuth can be presented in form of time series illustrating the kinematics and dynamics of image creation. Fig. 6 shows the time waveforms based on data from the tablet, which correspond to the Archimedean spirals created by one person (Fig. 5). The graphs in Fig. 6A showing changes of the stylus position in the course of drawing an Archimedean spiral using the right hand can be used to determine its creation time (ca. 4s). The presented signals have a characteristic waveform, similar with its shape to the sine and cosine function with increasing amplitude, which is an effect of drawing a circle of increasing radius. Additional information on the manner of spiral drawing by a patient can be obtained by analyzing the recording dynamics, which is a change of the pen pressure on the piece of paper. In order to illustrate the difference in the pressure during the same task conducted with the right and left hands, their corresponding pressure time waveforms are projected on each other in Fig. 6B.

Figure 6.

Illustration of the kinematics of creating the Archimedean spiral using the right hand (A) and a comparison of the dynamics of both hands (B).

00465_psisdg11442_114420g_page_6_1.jpg

When analyzing the presented waveform, it can be observed that the time for drawing the spiral with the left hand was longer than in the case of the right one. Despite the fact that the nature of both formed curves is similar, the averaged pen pressure when drawing with the right hand is higher than for the left hand, whereas in both cases, in grows along with creating successive spiral laps. Quantitative evaluation of the obtained waveforms is possible using numerical parameters, such as maximum pressure, average pressure and pressure standard deviation. Table 2 lists the values obtained for the case of signals shows in Fig. 6B.

Table 2.

Selected time waveform parameters of the pressure when drawing the Archimedean spirals.

 right handleft hand
maximum pressure [arb. unit]2891827730
average pressure [arb. unit]2639022215
std. deviation [arb. unit]22295858

Apart from pressure, the difference between drawing with the right and left hands can also be observed when analyzing the time series of the pen inclination angle relative to the tablet surface, which is shown in Fig. 7.

Figure 7.

Change of the pen inclination angle relative to the tablet surface when drawing the spiral with the right and left hands.

00465_psisdg11442_114420g_page_7_1.jpg

Pen holding manner, expressed by an angle included between the pen and the tablet surface, is almost constant in the case of the right hand. The difference between the maximum and minimum angle is only 8°, and the standard deviation is 1.39°. In the case of the left hand, the difference between the maximum and minimum inclination angles is 22°, while the standard deviation is 5.71°. The average inclination angle when hold the pen with the right hand was equal to 48.9°, and lower than in the case of the left hand, with that value amounting to 71.4°. These results show that writing with the left hand was more difficult for this patient, and the manner of holding the pen was more chaotic than for the right hand.

The time series analysis can also be conducted for data representing handwriting samples. In the case of the handwriting sample shown in Fig. 8, by a patient with genetic Parkinson’s disease, already the very analysis of the change of stylus position over time, provides quantitative information regarding the characteristic tendency ongoing in the course of writing.

Figure 8.

Handwriting sample for a patient with genetic Parkinson’s disease.

00465_psisdg11442_114420g_page_7_2.jpg

The graph in Fig. 9 indicates, that the coordinate signals for the stylus position when writing sentences are in the form of a sawtooth. The characteristic vertices determine locations where sentences end. It can be seen that in the case of coordinate X these vertices are lower, which means that the sentences were shorter.

Figure 9.

Stylus position change when writing 5 sentences visible in Figure 8.

00465_psisdg11442_114420g_page_8_1.jpg

The length of written sentences can be of diagnostic significance. Lengths of each of the 5 sentences created by all patients were determined based on the set of data recorded within the pilot research. The list of the basic descriptive statistics for each of them is contained in Table 3. It can be noticed that, with the exception of a single case, the standard deviations of the length of sentences written by sick persons were larger than the standard deviations of the length of sentences written by healthy persons.

Table 3.

List of mean values and standard deviations of a parameter described by sentence length.

 PDhealthy
L̅ [cm]14.0313.8118.3311.7410.0015.2513.629.7712.5613.2110.5016.3312.2110.22
σL [cm]1.021.091.050.730.540.500.550.590.190.360.070.410.420.12
0.700.27

3.3

Patient’s facial expressions

Masked face, otherwise called hypomimia or mimic diminution is a symptom of the Parkinson’s disease, which involves the limited ability of a patient to react spontaneously (e.g. eyelid blinking) and express facial emotions due to a deteriorating of the activity speed and coordination of face muscles. The techniques of processing facial images for the purposes of biometrics developed over the recent years provide potential possibilities for assessing hypomimia, however due to the specificity of the subject and the conditions of the conducted tests, it is not a simple task. First of all, due to the possible movements of a patient’s head in front of a camera, tools suitable for this purpose should be sought after among processing methods, which are independent of subject movements and rotations. The very methodology is also important, and it should minimize the strong impact of individual features on the recorded material. Therefore, this research paper suggests using the detection of characteristic points, between which constant relationships under a fixed facial expression will undergo, regardless of the rotation or movement. With the changing facial expression, these relationships should be disturbed, according to the type of the emotion and the ability of a patient to express them. Therefore potentially, such points are the tip of the nose, eye corners, points on eyebrow and eye arches, mouth contour points, etc.

Contemporary techniques in the field of searching for characteristic facial features are based on supervised machine learning methods, in which a large learning set coming from manually entered points is a base to create models for detecting points within a tested image [13-14]. The algorithm utilized for the purposes of this paper is based on deep learning techniques (convolutional neural network), the development of which was based on 10 000 learning patterns [15]. The motivation behind its application was the fact that the data for learning included facial images with highly diverse pose, partially covered and representing variable expressions. The algorithm detects 68 points but requires prior indication of an area of interest in the form of a framed facial image, which is relatively easy to obtain using the classic Viola-Jones algorithm. Fig. 10 shows the results of combining both of the aforementioned approaches to processing an image recorded during the pilot studies.

Figure 10.

Illustration of face detection within an image using the Viola-Jones algorithm and the detection of characteristic point.

00465_psisdg11442_114420g_page_9_1.jpg

3.3.1

Quantitative emotion expression evaluation

In order to study the quantitative evaluation of hypomimia, the data recording procedure involved the application of a certain form of coercion, which involved asking a patient to express predetermined emotions using his/her face, namely, asking to show how he/she looks when happy, sad, surprised, etc. Fig. 11 shows the results of the preliminary image processing in the form of the distribution of determined characteristic points for a PD and healthy person, whereas for both cases, image A is the reference image, corresponding to a neutral facial expression.

Figure 11.

Distributions of characteristic points on the face of a patient with Parkinson’s disease (top row) and a healthy patient (bottom row) when attempting to express the following facial emotions: A – neutral facial expression, B – joy, C – sadness, D – surprise, E – anger.

00465_psisdg11442_114420g_page_9_2.jpg

A quantitative method for determining the facial expression changes entailing various emotions can be calculating the distance D between appropriate points i and j of a given X distribution as per relationship (2) and relating them, as a percentage, to the distances occurring between the points of a phase with a neutral expression (on the A distribution in Fig. 11). Therefore, the value of the numerical parameter p will be determined from the relationship:

00465_psisdg11442_114420g_page_10_1.jpg

where DX(i, j) is the distance between points numbered i and j, and X adopts letter values depending on the expressed emotion, as shown in Fig. 11. The adopted numeration of characteristic points within a face image is shown in Fig. 12.

Figure 12.

Numeration of detected characteristic spots.

00465_psisdg11442_114420g_page_10_2.jpg

Table 4 shows a list of the p parameter values for 10 exemplary pairs of points describing an emotion of surprise expressed by a face of a sick and healthy person. It should be noted that in each case of the selected points, the change of a distance due to a presented emotion was higher for a healthy person.

Table 4.

Percentage changes in the facial expression when expressing surprise (emotion X=D)

No.points in Fig. 12parameter p
ijPDhealthy
13120-1.6-7.0
23125-1.4-3.7
331580.23.6
431590.43.3
538423.0-20.9
644483.0-20.3
749550.925.7
854590.840.8
92448-2.1-10.7
102141-2.7-12.6

3.3.2

Eyelid blinking frequency

The eyelid blinking frequency evaluation included in point III.2 of UPDRS can be determined within the same image processing methodology. For this purpose, it is sufficient to monitor, by calculation, the distances between the points, which detect eye edges and to count the number of events of such distances declining due to their closure during the recording. Fig. 13 shows a graph illustrating the determined distances for subsequent video stream frames in the case of the studied persons blinking the eyes.

Figure 13.

Eyelid blinking detection: photographs below – subsequent video stream frames – corresponding to the frames of the distance between points 38 and 42 (right eye) and 44 and 48 (left eye).

00465_psisdg11442_114420g_page_11_1.jpg

4.

CONCLUSIONS

The presented material illustrates the preliminary results of fundamental research focusing on the possibility of utilizing image data in the process of supporting clinical studies of Parkinson’s disease. It discusses the methods of preliminary processing of data acquired under the participation of healthy and sick patients, which can be used for the objective analysis of tests evaluating the disease according to UPDRS. The suggested methods for the quantitative evaluation of limb motion regularity as well as frequency and the eyelid blinking frequency fall directly in line with the UPDRS procedures and provide objective quantitative measures supporting the clinical judgement of a physician. The other ones, namely, facial expression and handwriting regularity are their supplementation. The recording conducted for a wider population of patients will enable the utilization of the suggested methods in search for a multi-dimensional description of patients for the purposes of developing a more advanced expert system.

ACKNOWLEDGEMENTS

Project co-financed from the funds of the Polish Ministry of National Defense for the implementation of basic research within the research grant No. GBMON/B9/13-996/2018 “Basic research in the field of sensor technology using innovative data processing methods”.

REFERENCES

[1] 

Maetzler, W., Domingos, J., Srulijes, K., Ferreira, J. J., and Bloem, B. R., “Quantitative wearable sensors for objective assessment of Parkinson’s disease,” Movement Disorders, 28 (12), 1628 –1637 (2013). https://doi.org/10.1002/mds.v28.12 Google Scholar

[2] 

Kumar, Y. D., Prasad, A. M., “MEMS accelerometer system for tremor analysis,” International Journal of Advanced Engineering and Global Technology, 2 (5), 685 –693 (2014). Google Scholar

[3] 

Bermeo, A., Bravo, M., Huerta, M., and Soto,A, “A System to Monitor Tremors in Patients with Parkinson’s Disease,” in 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, conference materials, 5007 –5010 (2016). Google Scholar

[4] 

Pierleoni, P., Belli, A., Bazgir, O., Maurizi, L., Paniccia, M., and Palma, L. P., “A Smart Inertial System for 24h Monitoring and Classification of Tremor and Freezing of Gait in Parkinson’s Disease,” IEEE Sensors Journal (Early Access)., Google Scholar

[5] 

Jakubowski, J., Kwiatos, K., Chwaleba, A., and Osowski, S., “Higher order statistics and neural network for tremor recognition,” IEEE Transactions on Biomedical Engineering, 49 (2), 152 –159 (2002). https://doi.org/10.1109/10.979354 Google Scholar

[6] 

Suzuki, M., Mitoma, H., and Yoneyama, M., “Quantitative analysis of motor status in Parkinson’s disease using wearable devices: From methodological considerations to problems in clinical applications,” Parkinson’s Disease 2017, (2017). Google Scholar

[7] 

“The Unified Parkinson’s Disease Rating Scale (UPDRS): Status and recommendations,” Movement Disorders, 18 (7), 738 –750 (2003). https://doi.org/10.1002/(ISSN)1531-8257 Google Scholar

[8] 

Majda-Zdancewicz, E., “Zastosowanie technik przetwarzania głosu w ocenie pacjentów z chorobą Parkinsona (The use of voice processing techniques in the assessment of patients with Parkinson’s disease),” in X Konferencja Naukowa Urządzenia i Systemy Radioelektroniczne UiSR 2019 (10th Scientific Conference Radioelectronic Equipment and Systems), conference materials, Google Scholar

[9] 

Chmielińska,J., “Stanowisko do rejestracji multimodalnych danych na potrzeby obiektywnej oceny choroby Parkinsona (Multimodal data acquisition set or objective assessment of Parkinson’s disease),” in X Konferencja Naukowa Urządzenia i Systemy Radioelektroniczne UISR 2019 (10th Scientific Conference Radioelectronic Equipment and Systems), conference materials, Google Scholar

[10] 

Gaweł, M., and Potulska-Chromik, A., “Choroby neurodegeneracyjne: choroba Alzheimera i Parkinsona (Neurodegenerative diseases: Alzheimer’s and Parkinson’s diseases),” Postępy Nauk Medycznych, 28 (7), 468 –476 (2015). Google Scholar

[11] 

Gunnery, S. D., Habermann, B., Saint-Hilaire, M., Thomas, C. A., and Tickle-Degnen, L., “The relationship between the experience of hypomimia and social wellbeing in people with Parkinson’s disease and their care partners,” Journal of Parkinson’s disease, 6 (3), 625 –630 (2016). https://doi.org/10.3233/JPD-160782 Google Scholar

[12] 

Bologna, M., “Facial bradykinesia,” Journal of Neurology, Neurosurgery and Psychiatry, online publication, (2012). Google Scholar

[13] 

Xiong, X., De la Torre, F., “Supervised Descent Method and its Application to Face Alignment,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), conference materials, (2013). https://doi.org/10.1109/CVPR.2013.75 Google Scholar

[14] 

Dhananjay, R., “Facial Landmark Localization – A Literature Survey,” International Journal of Current Engineering and Technology, 4 (3), 1901 –1907 (2014). Google Scholar

[15] 

Zhang, Z., Luo, P., Loy, C. C., and Tang, X., “Facial landmark detection by deep multi-task learning,” in European Conference on Computer Vision, conference materials, 94 –108 (2014). Google Scholar
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Kamila Białek, Jacek Jakubowski, Anna Potulska-Chromik, Jolanta Chmielińska, Ewelina Majda-Zdancewicz, Monika Nojszewska, Anna Kostera-Pruszczyk, and Andrzej Dobrowolski "Selected problems of image data preprocessing used to perform examination in Parkinson’s disease", Proc. SPIE 11442, Radioelectronic Systems Conference 2019, 114420G (11 February 2020); https://doi.org/10.1117/12.2565138
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image processing

Tablets

Analytical research

Video

Algorithm development

Eye

Data acquisition

RELATED CONTENT


Back to Top