Cooperative Spectrum Sensing for Cognitive Radio Networks

by

Praveen Kaligineedi

B. Tech, Indian Institute of Technology Kanpur, 2004
M. A. Sc, The University of British Columbia, 2006

A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF

Doctor of Philosophy

in

THE FACULTY OF GRADUATE STUDIES
(Electrical and Computer Engineering)

The University Of British Columbia
(Vancouver)

November 2010

© Praveen Kaligineedi, 2010

Abstract

Radio spectrum is a very scarce and important resource for wireless communication systems. However, a recent study conducted by the Federal Communications Commission (FCC) found that most of the currently allocated radio spectrum is not efficiently utilized by the licensed primary users. Granting opportunistic access of the spectrum to unlicensed secondary users has been suggested as a possible way to improve the utilization of the radio spectrum. Cognitive Radio (CR) is an emerging technology that would allow an unlicensed (cognitive) radio to sense and efficiently use any available spectrum at a given time. Reliable detection of the primary users is an important task for CR systems. Cooperation among a few sensors can offer significant gains in the performance of the CR spectrum sensing system by countering shadow-fading effects.

In this thesis, we consider a parallel fusion based cooperative sensing network, in which the sensors send their sensing information to an access point, which makes the final decision regarding the presence or absence of the primary signal. We assume that energy detection is used at each sensor. The presence of a few malicious users sending false sensing data can severely degrade the performance of such a cooperative sensing system. In this thesis, we investigate schemes to identify malicious users based on outlier detection techniques.
We take into consideration constraints imposed by the CR scenario, such as limited information about the primary signal propagation environment and small sensing data sample size. Considering partial knowledge of the primary user activity, we propose a novel method to identify malicious users. We further propose malicious user detection schemes that take into consideration the spatial location of the sensors.

We then investigate efficient sensor allocation and quantization techniques for a CR network operating in multiple primary bands. We explore different methods to assign CR sensors to various primary bands. We then study efficient single-bit quantization schemes at the sensors. We show that the optimal quantization scheme is, in general, non-convex and propose a suboptimal solution based on a convex restriction of the original problem. We compare the performance of the proposed schemes using simulations.

Preface

I am the primary researcher and author for all the research contributions made in this thesis. I identified the research problems, performed the literature review, and conducted research to address those problems. Mathematical formulation and analysis of the problems and development of novel schemes were carried out by me. I wrote the programs for analyzing the mathematical models and simulating the performance of the proposed schemes. I also prepared the associated manuscripts ([34–37]) for publication.

Dr. Majid Khabbazian is a co-author for the contributions in Chapter 2. I consulted him during the identification and formulation of the research problem. He also provided some technical feedback and editorial corrections for the associated manuscripts ([36, 37]).

My supervisor, Prof. Vijay Bhargava, is a co-author for the contributions made in Chapters 2 and 3. I consulted him during the identification and formulation of the research problems. He also provided editorial feedback during my preparation of the associated manuscripts.

Table of Contents

Abstract
Preface
Table of Contents
List of Tables
List of Figures
Acknowledgments
1 Introduction
    1.1 Scope, Motivation and Objectives
    1.2 Literature Survey
        1.2.1 Detection of Insider Attacks
        1.2.2 Sensor Allocation and Quantization
    1.3 Outline of the Thesis
2 Malicious User Detection
    2.1 Background
    2.2 System Model
        2.2.1 Impact of Malicious Users
    2.3 Assigning Outlier Factors
        2.3.1 Alternatives to the Mean
        2.3.2 Alternatives to Standard Deviation
        2.3.3 Tackling Skew in the Data
    2.4 Malicious User Detection
        2.4.1 Method I
        2.4.2 Method II
    2.5 Malicious User Detection Using Spatial Information
    2.6 Performance Analysis
    2.7 Simulation Results
    2.8 Conclusions
3 Sensor Allocation and Quantization Schemes
    3.1 Background
    3.2 System Model and Problem Formulation
    3.3 Sensor Assignment
        3.3.1 Maximum Weighted Sum Channel Gain Assignment
        3.3.2 Max-Min Channel Gain Assignment
    3.4 Quantization Thresholds
        3.4.1 Max-Min Optimization
    3.5 General k-out-of-N Fusion Rule
    3.6 Simulation Results
    3.7 Conclusions
4 Conclusions and Future Research Directions
    4.1 Conclusions
    4.2 Future Research Directions
        4.2.1 Malicious User Detection
        4.2.2 Sensor Allocation and Quantization Schemes
Bibliography
A Convexity Conditions for the Objective Functions (3.18) and (3.35)
B Log-Concavity of Q-function

List of Tables

Table 3.1 Greedy algorithm
Table 3.2 Values of x̄^(k,N) at different values of k and N
Table 3.3 Values of P^(k,N)_fmax at different values of k and N

List of Figures

Figure 2.1 Empirical influence curves for mean, median and bi-weight location estimate.
Figure 2.2 Empirical influence curves for standard deviation, median absolute deviation (MAD) and bi-weight scale (BWS).
Figure 2.3 Performance of malicious user detection schemes for CR network spread over a small area in the presence of M = 1 malicious user and M_max = 2.
Figure 2.4 Performance of malicious node detection schemes for CR network spread over a large area in the presence of M = 1 malicious user with primary user SNR at (100m, 100m) ignoring fading effects = -5dB.
Figure 2.5 Performance of malicious node detection schemes for CR network spread over a large area in the presence of M = 1 malicious user with primary user SNR at (100m, 100m) ignoring fading effects = 3dB.
Figure 2.6 Performance of Method II at different values of K for M = 1, M_max = 2 and K_m = 16.
Figure 2.7 Performance of Method II at different values of K_m for M = 1, M_max = 2 and K = 32.
Figure 2.8 Performance of malicious user detection schemes at different values of M for M_max = 20 with primary user SNR at (100m, 100m) ignoring fading effects = -5dB.
Figure 2.9 Performance of malicious user detection schemes at different values of M for M_max = 20 with primary user SNR at (100m, 100m) ignoring fading effects = 0dB.
Figure 2.10 Performance of malicious user detection schemes at different values of M for M_max = 20 with primary user SNR at (100m, 100m) ignoring fading effects = 8dB.
Figure 2.11 Performance of malicious user detection schemes using spatial information of the CR network for M = 1 malicious user and M_max = 2.
Figure 2.12 Performance of malicious node detection schemes for CR network spread over a large area in the presence of a single ‘Always Yes’ malicious user.
Figure 2.13 Performance of malicious node detection schemes for CR network spread over a large area in the presence of a single smart malicious user.
Figure 3.1 Sum throughput rate of the CR system using ‘OR’ fusion rule for different sensor allocation and quantization schemes.
Figure 3.2 Min throughput rate among various bands using ‘OR’ fusion rule for different sensor allocation and quantization schemes.
Figure 3.3 Sum throughput rate of the CR system for different sensor allocation and quantization schemes when ‘2’-out-of-‘5’ fusion rule is used at the access point in each primary band.
Figure 3.4 Sum throughput rate of the CR system for different sensor allocation and quantization schemes when ‘3’-out-of-‘5’ fusion rule is used at the access point in each primary band.
Figure 3.5 Comparison of the optimal and greedy max-min assignment algorithms for ‘OR’ fusion rule when maximizing the minimum throughput rate among various primary bands.
Figure 3.6 Comparison of the optimal and greedy max-min assignment algorithms for ‘OR’ fusion rule when maximizing the sum throughput rate of the CR system.

Acknowledgments

First and foremost, I would like to thank my supervisor, Professor Vijay K. Bhargava, for his guidance, encouragement and support. I would like to thank Dr. Majid Khabbazian for contributing valuable insights to my thesis work.
I am very grateful to Professor Robert Schober, Professor Lutz Lampe, Professor Dave Michelson and Professor Vikram Krishnamurthy for serving on my committee. I would like to thank the instructors of my graduate courses for helping me obtain a good understanding of the basic concepts. Finally, I would like to thank all members of our lab for their support and for providing a stimulating and fun environment in which to learn and grow.

Chapter 1

Introduction

Recent explosive growth in the wireless communication market and the proliferation of multimedia-capable mobile devices have led to an increase in demand for radio spectrum. However, most of the radio spectrum has already been licensed to various entities across different geographical areas, giving them exclusive transmission rights in the allocated spectral bands. This was done to avoid interference between two systems operating in the same spectral band in close vicinity to each other. However, a recent study conducted by the Federal Communications Commission (FCC) found that most of the currently allocated radio spectrum is not efficiently utilized by the licensed (primary) users [19]. It has been suggested that the utilization of the radio frequency spectrum could be improved by giving unlicensed secondary users opportunistic access to the spectrum. Cognitive radio (CR) is an emerging technology which would allow an unlicensed (cognitive) radio to automatically sense and make efficient use of any available radio spectrum at a given time [44]. CR design is, therefore, an innovative radio design philosophy which involves smartly sensing the swaths of spectrum and then determining the transmission characteristics of secondary users based on the primary users' behavior. Specifically, CR is likely to be built on software defined radio (SDR) [44], which would allow it to adjust its transmitter characteristics dynamically, based on the interaction with the environment in which it operates.
Due to the immense potential of improving the spectral utilization by using CR, adaptive access system design for CR networks has emerged as one of the most important research areas in the field of wireless communications. The IEEE 802.22 standard based on CR technology for wireless regional area networks (WRAN) is presently under development, to bring broadband access to hard-to-reach rural areas with low population density [60].

Identifying the presence of licensed primary users is a very important task for a CR system [13, 63]. Fast and accurate spectrum sensing is necessary to improve the opportunistic spectrum access gains of the CR system and to decrease the interference caused to the primary user system. If the CR sensing system falsely determines that a primary user is present, even though no primary user is operating in the band, the result is a missed opportunity for transmission, which decreases the CR throughput. On the other hand, if the CR system fails to detect the primary signal, it will cause interference to the primary user, which could be unacceptable to the primary user system. Nevertheless, spectrum sensing is a very difficult task, due to the presence of a wide range of primary users using different modulation schemes, transmission powers and data rates, as well as secondary user interference, variable propagation losses and thermal noise.

Several signal processing techniques have been proposed in the literature to identify primary users [13]. The simplest of the proposed detectors is the energy detector, which measures the energy in a particular spectrum band and declares a primary user present if the energy detected in the band is higher than a certain threshold [67]. The energy detector has low complexity and requires no knowledge of the primary user signal characteristics. The performance of energy detection for CR networks has been studied in [63].
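As a minimal sketch of the energy detector described above, the following Python snippet sums the squared magnitudes of the received samples and compares the result with a threshold. The function names `measure_energy` and `energy_detect`, the sample values and the threshold are illustrative assumptions and are not taken from the thesis or from [67].

```python
def measure_energy(samples):
    """Sum of squared magnitudes of the received samples (real or complex)."""
    return sum(abs(s) ** 2 for s in samples)


def energy_detect(samples, threshold):
    """Declare the primary user present when the measured energy exceeds the threshold."""
    return measure_energy(samples) > threshold


# Hypothetical sample blocks, chosen only to exercise both outcomes.
weak_block = [0.5, -0.5, 0.5, -0.5]    # energy 1.0
strong_block = [1.0, -1.0, 1.0, -1.0]  # energy 4.0
threshold = 3.0                        # assumed value; the sketch does not say how to choose it

print(energy_detect(weak_block, threshold))
print(energy_detect(strong_block, threshold))
```

In such a detector the threshold sets the balance between the two error types discussed earlier: a lower threshold produces more false alarms (missed transmission opportunities), while a higher one produces more missed detections (interference to the primary user).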
Compared to other detectors, energy detection requires a long sensing time to detect a primary signal accurately. Moreover, it was shown in [62] that the energy detector fails to detect the primary signal at very low signal-to-noise ratio (SNR) due to noise uncertainty (uncertainty in estimating the background noise power). The SNR below which detection is impossible is called the SNR wall [62].

Another spectrum sensing technique is cyclostationary feature detection. A signal is said to be nth-order cyclostationary if it exhibits periodicity in its nth-order moments [24]. Most of the signals encountered in wireless communications are 2nd-order cyclostationary, whereas the noise is stationary. As a result, the cyclostationarity of the primary signals can be used to detect their presence. The 2nd-order cyclostationarity of a signal is not reflected in the power spectral density (PSD). However, it is reflected in the spectral correlation density (SCD) function, which is obtained by the Fourier transform of the cyclic autocorrelation function [24]. Signal detectors based on cyclostationarity give better performance than energy detectors for the same number of signal samples and have a lower SNR wall than the energy detector. However, they are highly complex compared to the energy detector and require knowledge of the primary signal characteristics, which might not always be available.

Further, eigenvalue based detection was proposed for CR sensors equipped with multiple receiver antennas [77]. Eigenvalue based detectors utilize the fact that background noise is uncorrelated across the receive antennas whereas the primary signal is correlated. The eigenvalue detector has a higher complexity than the energy detector.

One of the major issues with spectrum sensing using a single sensor is the impact of shadow fading due to the presence of an obstacle between the primary transmitter and the CR sensor [43].
For example, the CR sensing device may not detect the primary signal when the channel between the primary transmitter and the sensing device is under a deep fade. As a result, the CR system might transmit a signal in the corresponding primary user band, causing interference to the nearby primary receiver. This is called the hidden terminal problem and can have a significant effect on the sensing performance of the CR system.

The burden on signal processing techniques can be reduced to a large extent by using cooperative diversity between CR spectrum sensors. Cooperative sensing among a few sensing devices sufficiently distant from one another (in order to ensure independent propagation loss) can improve the detection efficiency of the sensing system and essentially help overcome the hidden terminal problem by countering the shadowing effects [43]. Alternatively, cooperative sensing can be seen as a means to reduce the sensing time for the same level of detection [21, 22]. Cooperative sensing would also reduce the impact of noise uncertainty on the sensing system by lowering the SNR wall [43].

Recently, several cooperative spectrum sensing architectures for CR networks have been proposed in the literature [21–23, 43, 56, 63, 66]. In [56], it has been proposed that either spectrum sensing devices can be collocated with the cognitive users or a separate network of sensors could be used for spectrum sensing. The latter scheme could be used to save bandwidth of the cognitive users, as they do not need to allocate part of their transmission time for sensing. This could be especially useful in areas where the CR density is expected to be high, as the cost of having a separate network of sensors can be compensated by the total throughput gain achieved. Several fusion architectures can be considered to combine the sensing data from various sensing devices.
The most commonly proposed architecture is a parallel fusion network, in which all the sensing devices send their sensing information directly to an access point, which makes a final decision regarding the presence or absence of the primary signal based on their sensing data using a data fusion rule [43, 66]. The parallel fusion architecture is more robust to sensor failures and requires less processing at the sensors. Another possible sensing architecture is the serial fusion architecture [70], in which each sensing device sends its sensing data to another sensing device, which makes a decision based on the received sensing data and its own sensing data and sends this decision to the next sensing device. This process continues until the last sensing device, which is generally the access point, makes a decision regarding the presence or absence of the signal. Yet another sensing mechanism is the decentralized sensing architecture, which does not have any access point [21, 22]. In this architecture, each individual user with a sensing device makes a decision regarding the presence or absence of signals based on its own data and data obtained from other sensing devices according to some predetermined rule.

1.1 Scope, Motivation and Objectives

In this thesis, we consider a parallel fusion cooperative sensing network. Each CR sensor uses energy detection to sense the primary user. Several cooperative sensing techniques for CR networks based on the parallel fusion architecture have recently been considered in the literature [25, 43, 66]. There are several issues that need to be addressed in order to obtain the maximum possible sensing gain from cooperation. In this thesis, we identify two major challenges involved in parallel fusion cooperative sensing schemes. The first challenge is to tackle malicious users, which send false sensing data to the access point.
Another important challenge is to design fast and efficient methods to combine the sensing information available from various sensing devices.

Security is one of the most crucial aspects of a CR cooperative sensing system [12, 78]. Such a system is vulnerable to two different kinds of security threats [12]. One is an outsider attack, in which a malicious transmitter tries to manipulate the sensor readings by transmitting signals emulating the primary user signal characteristics. Techniques to identify these kinds of attacks have been studied in [1, 16, 17]. In [16], the authenticity of the signal is tested by estimating the location of the origin of the signal. If the origin of the signal is not at the same location as that of the primary user transmitter, then the signal is considered malicious. In [17], signal classification algorithms were proposed to distinguish primary and malicious signals. In [1], a primary user emulator is identified using certain distinctive behavior of the primary transmitter.

The other kind of security threat is the insider attack, where a user belonging to the CR sensor network sends false sensing information to the access point. It was shown in [43] that the presence of a few malicious users sending false sensing data can severely affect the performance of a parallel fusion cooperative spectrum sensing system. A CR user might be malicious for selfish reasons or due to sensor malfunctioning. In the former case, a CR might detect that the primary signal is absent. However, it might force the access point to erroneously decide that a primary signal is present by sending false sensing data. The malicious user can then selfishly transmit its own signal on the free channel. If the sensor is malfunctioning, it might generate random energy values. In this thesis, one of our goals is to identify such malicious users and weed them out of the system.
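To make the idea of weeding out false reports concrete, the snippet below gives a minimal sketch of a generic robust outlier score over the energy values reported to the access point, using the median and the median absolute deviation (MAD). It is not necessarily the outlier factor developed later in the thesis; the function names, the example energy values and the cutoff of 3.0 are assumptions made purely for illustration.

```python
from statistics import median


def robust_outlier_scores(energies):
    """Distance of each reported energy from the median, in units of the MAD."""
    med = median(energies)
    mad = median([abs(e - med) for e in energies])
    if mad == 0:
        # Degenerate case: more than half of the reports are identical.
        return [0.0 if e == med else float("inf") for e in energies]
    return [abs(e - med) / mad for e in energies]


def flag_suspects(energies, cutoff=3.0):
    """Indices of sensors whose score exceeds the cutoff (assumed value)."""
    scores = robust_outlier_scores(energies)
    return [i for i, score in enumerate(scores) if score > cutoff]


# Hypothetical reports: five consistent sensors and one inflated report.
reports = [10.2, 9.8, 10.5, 9.9, 10.1, 30.0]
print(flag_suspects(reports))
```

The median and the MAD are used here instead of the mean and the standard deviation because a single extreme report shifts them far less, which matters when only a handful of sensors cooperate.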
CR systems are proposed to simultaneously operate over multiple primary bands and dynamically use the available channels for transmission [30]. Multi-band sensing has been suggested in [42] to take advantage of the sparse nature of the available spectrum. Sensing multiple bands simultaneously can help improve the opportunistic spectral gain by making dynamic and efficient use of the free spectrum. Good spectral utilization requires quick sensing of a large swath of spectrum with high accuracy. However, the sensing resources are usually limited in terms of sensing time and bandwidth. Moreover, due to bandwidth limitations in the control channels, the sensors might have to quantize their sensing data in order to reliably communicate it to the access point. Efficient ways to allocate sensors to various primary bands and quantize the sensing information need to be investigated to achieve the maximum possible opportunistic spectral gain for such a system. In this thesis, we explore methods to assign narrow-band sensors to various primary user bands. We then investigate efficient techniques to determine the quantization thresholds at each sensor in each primary band.

The objectives of this thesis are as follows:

• Identify possible methods used by the malicious users to degrade the CR cooperative sensing system performance. Propose techniques to reliably detect the presence of the malicious users and nullify their effect on the performance of the sensing system.

• In a CR system operating in multiple bands, identify methods to assign CR sensors to various primary user bands and determine the energy detection thresholds at each sensor, taking into consideration the rates available in each primary band along with the cost of interference with the respective primary users.
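The sensor assignment objective in the second bullet can be pictured with a small sketch. The snippet below assumes, purely for illustration, one sensor per band and a hypothetical gain table `gain[s][b]` for sensor `s` observing band `b`, and searches all one-to-one assignments by brute force. Passing `sum` maximizes the total gain, while passing `min` maximizes the worst gain. Neither the data nor the exhaustive search is taken from the thesis, and the search is only practical for very small networks.

```python
from itertools import permutations


def best_assignment(gain, objective=sum):
    """Exhaustively search one-to-one sensor-to-band assignments.

    gain[s][b] is a hypothetical gain for sensor s on band b.
    objective=sum maximizes the total gain; objective=min maximizes the worst gain.
    Returns (bands, value), where bands[s] is the band assigned to sensor s.
    """
    n = len(gain)
    best_bands, best_value = None, float("-inf")
    for bands in permutations(range(n)):
        value = objective(gain[s][bands[s]] for s in range(n))
        if value > best_value:
            best_bands, best_value = list(bands), value
    return best_bands, best_value


# Hypothetical 2-sensor, 2-band gain table (integers keep the arithmetic exact).
gain = [[10, 5],
        [4, 1]]

print(best_assignment(gain, objective=sum))
print(best_assignment(gain, objective=min))
```

With this table the two objectives choose different assignments, which illustrates why the choice of cost function matters when allocating sensors.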
1.2 Literature Survey

In this section, we give an overview of the works related to the detection of insider attacks in CR cooperative sensing networks as well as the quantization and data fusion schemes for CR sensing systems.

1.2.1 Detection of Insider Attacks

Techniques to detect insider attacks in CR cooperative sensing systems have recently received attention in the literature [15, 73–75, 77]. In [15], a technique to identify malicious users based on a weighted sequential probability test was proposed for a system in which single-bit quantization is used at the sensors. Weights were assigned based on the reputation gained from the previous sensing iterations. If a user's decision agrees with the final decision at the access point, its weight is increased; otherwise, its weight is decreased. However, [15] assumes accurate knowledge of the primary signal distribution at the CR sensors, which is not always available to the CR network. Also, the performance of the malicious user detection depends on the influence of the malicious users on the final decision. If the malicious users can influence the final global decision, then the entire scheme would fail. In [76], a reputation-based CR spectrum sensing scheme was proposed in which some of the users can be completely trusted. The malicious users are detected with the assistance of these trusted nodes in the network.

In [74], a scheme to identify the malicious nodes is proposed based on the CR sensors' past reports. Knowledge of the distance between the primary user and the CR sensors is assumed, and the suspicion level of each node is then calculated using a Bayesian criterion based on the data measurements from all sensors. However, this requires knowledge of the primary user signal distribution at the CR sensors as a function of the distance between the primary user and the CR sensors, which might be difficult to estimate.
In [73], a robust cooperative sensing scheme was developed which takes into account the possible presence of malicious user data while determining a fusion rule at the access point. The scheme assumes independent and identical primary signal fading at the sensors, which does not hold in practical scenarios due to variable shadow fading and path losses. In [75], a malicious user detection technique was proposed for ad hoc CR networks based on consensus algorithms.

In this thesis, we investigate schemes to identify the malicious users based on outlier detection techniques. An outlier is an observation which is far away from the rest of the data [29]. Outlier detection techniques have been well studied in the field of database research to identify extreme data points [2, 4, 9, 29, 39]. Their applications include video surveillance, intrusion detection and identifying fraudulent transactions. Some of the outlier detection techniques have recently been applied to sensor networks to identify suspicious sensor readings [8, 48, 58, 59]. Nevertheless, using outlier detection techniques for a CR cooperative sensing network poses a very different set of challenges compared to most sensor networks. For example, the CR sensor network is not aware whether a primary user signal is present or not. Further, it has limited knowledge of the underlying distribution of the data points when the primary signal is present. Thus, model based outlier detection schemes ([4, 29]), which assume a particular underlying data distribution, cannot be applied. Further, even when the underlying data distribution is not known, most sensor networks have a large database of sensor readings from which the outliers can be efficiently detected using non-parametric methods [9, 39].
However, in CR networks, the number of collaborating sensors is generally low (∼10–20) [43] and these non-parametric outlier detection techniques cannot be directly applied to CR cooperative sensing systems. In this thesis, we take into consideration some of these constraints imposed by the CR scenario to devise novel malicious user detection techniques.

1.2.2 Sensor Allocation and Quantization

As a part of this thesis, we also investigate sensing schemes for a CR system operating in multiple bands. In [52], a multi-band CR system operating with wide-band spectrum sensors was considered, in which each sensor measures the signal energy in all primary bands. The energy detector output from each sensor is sent to the access point assuming perfect reporting channels. The access point then calculates a weighted sum of the energy detector outputs from the sensors in each band, which is compared to a threshold in order to determine whether a primary signal is present or not. The set of equations to find the optimal weights was then presented and the optimization problem was shown to be non-convex. Sub-optimal weighting factors for each sensor's data and the energy detection threshold at the access point in each primary band were derived. In [54], the optimal weights were obtained by solving the non-convex optimization problem using a genetic algorithm.

However, the results in [52] are obtained for unquantized data. In systems with control channel bandwidth limitations, quantization is necessary to transmit the sensing result reliably to the access point. Moreover, the wide-band sensing considered in [52] might require a very high sampling rate to precisely determine the band in which the primary user is present. A quantization scheme based on controlling the false discovery rate (FDR) was proposed for sensor networks in [53]. This technique was extended to multi-band sensing in [3].
However, the CR sensing system in [3] still requires multi-band energy detection at each sensor, which would need a long sensing time.

In this thesis, we consider a multi-band CR system in which each sensor can sense only one primary band at a time. For such a system, it has been shown that a tradeoff can be achieved between the sensing time and the amount of collaborative gain obtained using cooperative sensing by dividing the sensors into clusters, with each cluster of sensors operating in an assigned primary band [28, 61]. Efficient techniques to allocate sensors to various primary bands need to be investigated. Sensor allocation belongs to a category of combinatorial optimization problems called assignment problems. The original assignment problem [46] involved optimally assigning the “tasks”, representing the jobs to be done, to the “agents”, representing the machines or the people that can do those jobs. There is a cost attached to assigning an agent to a task, and the aim is to minimize the total cost. Several variations of the assignment problem with different cost functions and constraints have been studied [49]. In this thesis, the detection of signals in the primary user bands represents the tasks and the CR sensors represent the agents. We propose various assignment algorithms to allocate CR sensors to primary users based on different cost functions.

Once the sensors are assigned to the primary users, techniques to quantize the sensing data are explored. Distributed detection and data fusion for the parallel fusion network is a well studied topic in the field of sensor networks [5, 64, 69, 70]. In general, the complexity of designing the optimum distributed detection (quantization) scheme increases exponentially with the number of sensors and the number of quantization levels.
Even in the case of independent and identical distribution of the received primary signal energy at the various sensing devices, the optimum quantization thresholds are, in general, not the same at all the sensors. Thus, the problem is highly complex to solve. Several low-complexity quantization schemes have been proposed in the literature [38, 41]. However, the design complexity of these quantizers is still very high. The design complexity can be reduced to a large extent by assuming that all sensors use identical quantization thresholds, as this decreases the dimension of the optimization problem. However, this would degrade the performance of the cooperative sensing system. Nevertheless, it has been shown in [65] that, for identical sensing data distributions, the distributed detection performance based on sensing devices using identical quantization thresholds asymptotically approaches the optimum distributed detection performance as the number of sensors goes to infinity, in the case of binary hypothesis testing. Further, in [47], it was shown that using equal thresholds at the sensors and a k-out-of-N fusion rule at the access point is also asymptotically optimal when the fading losses vary among the sensors. In a k-out-of-N fusion rule, the access point declares that the primary user is present only when k or more out of N sensors send bit ‘1’ to the access point. It has been shown in the literature that the ‘OR’ fusion rule (1-out-of-N fusion rule) is robust and gives performance close to that of the optimal k-out-of-N fusion rule for many CR cooperative sensing system models [25].

Quantization and data fusion schemes for CR sensing systems operating in a single primary band have been considered in [14, 18, 57, 68]. In [14], locally optimal multi-bit quantization schemes were analyzed. Suboptimal schemes were then proposed based on iterative estimation of likelihood ratios.
In [18], spatio-temporal quantization is considered, where both the spatial and temporal distributions of the primary signal are utilized to design dynamic quantization levels. In [57], low complexity quantization schemes based on maximizing the deflection coefficient were proposed. In [68], control channel transmission errors were taken into consideration while determining the quantization thresholds.

In this thesis, we explore multi-band CR sensing systems in which equal energy thresholds are used in all the sensors allocated to a particular primary band and a k-out-of-N fusion rule is used at the access point. We propose efficient schemes to determine the sensor quantization thresholds in each primary band, taking into account the throughput rates available in the various bands and the corresponding interference limitations.

1.3 Outline of the Thesis

The rest of this thesis is organized as follows:

• In Chapter 2, we propose malicious user detection schemes based on outlier detection techniques for a CR cooperative sensing network. We take into consideration constraints imposed by the CR scenario, such as limited knowledge of the primary signal propagation environment and the small size of the sensing data samples. Considering partial information of the primary user activity, we propose a novel method to identify the malicious users. We further propose malicious user detection schemes that take into consideration the spatial location of the CR sensors. The performance of the proposed schemes is studied using simulations.

• In Chapter 3, we consider a CR system operating in multiple primary bands. We explore methods to allocate the sensors to the various primary user bands using assignment algorithms. We then investigate efficient techniques to determine the quantization thresholds at each sensor. We initially consider the case when the ‘OR’ fusion rule is used at the access point in each primary band.
We then investigate quantization schemes for the case when the k-out-of-N fusion rule is used at the access point in each primary band. We compare the performance of the proposed schemes using simulations.

• Conclusions and possible directions for future research are discussed in Chapter 4.

Chapter 2
Malicious User Detection

2.1 Background

In this chapter, malicious user detection schemes for a CR cooperative sensing system are proposed based on outlier detection techniques [29]. Identifying malicious users in a CR cooperative sensing system is very difficult since the malicious user detection schemes do not know whether a primary signal is present or not. Thus, they are unaware of the underlying distribution of the energy detector outputs. We also take into consideration further constraints imposed by the CR scenario, such as the lack of complete information about the primary signal propagation environment, the absence of feedback from the primary user network and the small size of the sensing data samples among which the malicious user data points need to be identified (it was shown in [43] that most of the gain through cooperation is achieved by using approximately 10 to 20 users). We only consider malicious user detection schemes that are based on non-parametric outlier detection techniques and hence do not require prior knowledge of the underlying data distribution parameters. Thus, the malicious user detection schemes proposed in this chapter are not influenced by uncertainty in the noise measurement and do not require any feedback from the primary user system or knowledge of the location of the primary transmitter. The low number of spectrum sensors also makes the detection of the malicious sensors among them very challenging. Robust as well as efficient outlier detection techniques are necessary to ensure reliable detection of the malicious users based on small sensor data samples.
We later assume partial knowledge of the primary user activity and propose improved malicious user detection schemes based on this information. We also propose methods that consider the spatial location information of the CR users to further improve the performance of malicious user detection schemes, especially for CR systems spread over a wide area.

The rest of this chapter is organized as follows. In Section 2.2, we define the cooperative sensing system model and discuss the effect of malicious users on the system. In Section 2.3, we discuss techniques to assign robust and efficient outlier factors to the cognitive users based on their sensing data. In Section 2.4, we propose techniques that use these outlier factors to detect the malicious users present in the system. In Section 2.5, we propose a malicious user detection technique that takes into consideration the users’ spatial information. Section 2.6 describes the method used to compare the performances of various malicious user detection schemes for the case when equal gain combining is used as the fusion rule at the access point. Simulation results are presented in Section 2.7. Conclusions are finally drawn in Section 2.8.

2.2 System Model

We consider a group of $N$ CRs with collocated spectrum sensors in the presence of a primary transmitter. All of the sensors use energy detectors. The sensors send their sensing data to an access point through control channels, which are assumed to be perfect. Based on the data obtained from the sensors, the access point makes a decision regarding the presence or absence of the primary signal using a data fusion and detection scheme. Let $e_n[l]$ represent the output of the energy detector at the $n$th sensor during the $l$th sensing iteration. Let hypotheses $H_1$ and $H_0$ denote the presence and absence of a primary signal, respectively.
The output of the $n$th user’s energy detector in the baseband is given by [67]

\[
e_n[l] =
\begin{cases}
\int_{T_k}^{T_k+T-1} |h_n(t)s(t) + z_n(t)|^2 \, dt, & H_1 \\
\int_{T_k}^{T_k+T-1} |z_n(t)|^2 \, dt, & H_0
\end{cases}
\tag{2.1}
\]

where $T$ denotes the length of the sensing interval, $s(t)$ is the primary transmitted signal and $h_n(t)$ represents the channel between the primary transmitter and the $n$th spectrum sensor. $z_n(t)$ is the additive white Gaussian noise (AWGN) at the $n$th sensor. In this chapter, we assume a generic wide area propagation model for the primary signal [55]. However, we assume no knowledge of the distributions of the channel gains between the primary transmitter and the CR sensors.

2.2.1 Impact of Malicious Users

The presence of malicious users can significantly affect the performance of a CR cooperative sensing system [43]. A user might be malicious for selfish reasons or due to sensor malfunctioning. In the former case, a CR might detect that the primary signal is absent. However, it might force the access point to erroneously decide that a primary signal is present by sending false sensing data. The malicious user can then selfishly transmit its own signal on the free channel. If the sensor is malfunctioning, it might generate random energy values.
Since most of the data fusion schemes at the access point take into consideration that some of the sensors will have weak channels from the primary transmitter, the impact of malicious users sending low energy values when a primary signal is present will, in general, be low. However, when the malicious users send high energy values while no primary signal is present, the impact on the performance of the cooperative sensing system will be much more severe. Thus, malicious user detection schemes should be efficient in identifying malicious users that falsely send high energy values to the access point. At the same time, the scheme chosen to identify these malicious users should not misidentify a non-malicious user as malicious. When the primary signal is present, it is especially important that the data of non-malicious users that receive good signal strength from the primary transmitter are not rejected, as this would severely decrease the probability of detection of the cooperative sensing system, leading to severe interference to the primary user system.

2.3 Assigning Outlier Factors

Each user is assigned a set of outlier factors based on the energy detector outputs. The outlier factor gives a measure of the outlyingness of a data point. These outlier factors are then used to identify and nullify the effect of malicious users. In this chapter, we assume that the outlier factor assignment schemes are unaware of the additive noise variance and the location of the primary transmitter, and receive no feedback from the primary user system.
A simple way to assign outlier factors $o_n[l]$ based on the energy values obtained during the $l$th sensing iteration is as follows:

\[
o_n[l] = \frac{e_n^{dB}[l] - \mu[l]}{\sigma[l]} \tag{2.2}
\]

where $e_n^{dB}[l]$ represents the energy detector outputs in decibels (dB), and $\mu[l]$ and $\sigma[l]$ are, respectively, the sample mean and the sample standard deviation of the energy values $e_n^{dB}[l]$ of all users at a given iteration $l$. The sample mean is an estimate of the location of a distribution, and the standard deviation is an estimate of the scale. We proposed this method of outlier factor assignment in [37] to detect the malicious users in CR networks.

The energy detector outputs are considered in dB because it is desirable that the underlying data distribution be close to symmetric when assigning outlier factors as in (2.2). If the underlying distribution is highly skewed (asymmetric), then the valid data points lying on the heavy-tailed side of the skewed distribution will be assigned very high outlier factors. The distribution of $e_n[l]$ can have a high positive skew, especially in the presence of a primary signal. One way to reduce the positive skew in the data is to use a logarithmic transformation (i.e., consider the energy detector outputs in dB). A more computationally complex and widely used technique to reduce skewness in any distribution is the Box-Cox transformation [6]. However, Box-Cox transformations are not robust against outliers. Moreover, most of the channel shadow-fading models in wireless communications follow a log-normal distribution. Therefore, if the sensors are distributed over a small area in which the path-loss component can be assumed to be the same for all the sensors, taking the logarithm would make the distribution of the energy detector outputs close to a normal distribution with low skew. Also, in the case where no primary signal is present, the logarithm operation does not induce significant negative skewness in the energy distribution.
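As an illustration of (2.2), the following minimal sketch converts linear energy readings to dB and standardizes them with the sample mean and sample standard deviation. The function names and the energy readings are hypothetical and chosen only for this example; they are not taken from the thesis or its simulations.

```python
import math
import statistics


def to_db(energy):
    """Convert a linear energy value to decibels."""
    return 10.0 * math.log10(energy)


def outlier_factors(energies_db):
    """Outlier factors as in (2.2): (value - sample mean) / sample std."""
    mu = statistics.mean(energies_db)
    sigma = statistics.stdev(energies_db)  # sample standard deviation (N - 1)
    return [(e - mu) / sigma for e in energies_db]


# Hypothetical linear energy readings from 8 sensors for one iteration;
# the last sensor reports a strongly inflated value.
energies = [1.0, 1.2, 0.9, 1.1, 1.0, 0.95, 1.05, 50.0]
factors = outlier_factors([to_db(e) for e in energies])

for n, o in enumerate(factors):
    print(f"sensor {n}: outlier factor {o:+.2f}")
```

Note that with the sample mean and sample standard deviation, the magnitude of any single outlier factor is bounded by $(N-1)/\sqrt{N}$ regardless of how extreme the reported value is, because the extreme value inflates both estimates. This is one way to see why non-robust estimates make a fixed detection threshold hard to choose for small $N$.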
However, there are several issues with assigning outlier factors as in (2.2). First, the mean and the standard deviation are not robust estimates and can be easily manipulated by the data of the malicious users, especially in the case of un-quantized data fusion at the access point. Even a few malicious users can severely degrade the performance of the system without being detected when outlier detection schemes use non-robust location and scale estimates such as the mean and standard deviation. Therefore, robust alternatives to the sample mean and the sample standard deviation need to be studied. Secondly, these robust estimates of location and scale must also be efficient. The efficiency of a statistic determines the degree to which the statistic is stable from sample to sample. An estimate having low efficiency can deviate widely from the true value for the underlying distribution, especially for a low number of sample data points. Thirdly, the logarithm transformation does not completely remove the skew in the data under hypothesis $H_1$. The data might still have a high positive skew if the secondary user network size is large, with variable path loss between the primary transmitter and the sensors. Techniques to tackle skewness in the energy distribution need to be explored.

2.3.1 Alternatives to the Mean

As discussed in Section 2.3, the sample mean is highly vulnerable to outliers. A robust alternative to the sample mean to estimate the location of a distribution in (2.2) is the median ($\tilde{\mu}$). One way to measure the robustness of an estimate is by its breakdown point. The breakdown point is the minimum proportion of contaminated points (outliers) in a sample that can make the estimate unbounded. Note that the outliers can still have an impact on the estimate when their percentage is lower than the breakdown point. However, their effect would be limited and they cannot change the estimate arbitrarily.
The median has a 50% breakdown point compared to $\frac{100}{N}\%$ for the mean, where $N$ is the sample size. Even though the median has a very high breakdown point, its efficiency is low. The efficiency of a statistic is the degree to which the statistic is stable from sample to sample. Efficiency is defined as the ratio of the inverse of the Fisher information to the variance of the statistic [45]. An estimate having low efficiency can deviate widely from the true value for the underlying distribution, especially for a low number of sample data points. Therefore, high efficiency is very desirable for small sample sizes. A more efficient and robust estimate of the location is the bi-weight estimate ($\hat{\mu}$) [45], which is calculated iteratively as follows:

\[
\hat{\mu} = \frac{\sum_n w_n e_n^{dB}}{\sum_n w_n} \tag{2.3}
\]

where

\[
w_n =
\begin{cases}
\left( 1 - \left( \dfrac{e_n^{dB} - \hat{\mu}}{c_1 S} \right)^2 \right)^2, & \left( \dfrac{e_n^{dB} - \hat{\mu}}{c_1 S} \right)^2 < 1 \\
0, & \text{otherwise}
\end{cases}
\tag{2.4}
\]

and

\[
S = \mathrm{median}\{ |e_n^{dB} - \hat{\mu}| \} \tag{2.5}
\]

The bi-weight estimate calculates a weighted mean, with lower weight being given to the observations away from the estimate. $S$ is a measure of the spread of the underlying distribution. It measures the median absolute deviation from the location estimate $\hat{\mu}$. The parameter $c_1$ is called the tuning constant. Observations at a distance of more than $c_1$ times $S$ from the estimate are assigned zero weight. Thus, $c_1$ can be used to determine the impact of extreme data points on the calculation of the bi-weight estimate $\hat{\mu}$. It has been shown in the literature that the bi-weight estimate ($\hat{\mu}$) has higher efficiency than the median, is very robust and has a high breakdown point [45].

The efficiency of the bi-weight estimate can be understood better in terms of its empirical influence curve [45]. An empirical influence curve measures the influence of a new data point on an estimate calculated for a given sample of data as the new data point takes all possible values. In Fig.
2.1, we obtain the influence curves for the mean, median and bi-weight estimate for a sample of 19 data points with values $(-0.9, -0.8, \ldots, 0.8, 0.9)$. The value of the 20th data point is changed gradually from large negative values to large positive values and its influence is measured on the mean, median and bi-weight estimate.

As seen from Fig. 2.1, the sample mean can change from negative infinity to positive infinity along with the new data measurement; thus, it can be easily influenced by a malicious user. On the other hand, the median is not affected by a measurement outside a narrow range. The bi-weight estimate, however, only ignores data points that are substantially far away from the rest of the data. It is much more sensitive to data that is at a moderate distance from the location estimate. Thus, the bi-weight estimate still considers the influence of data points that are not necessarily outliers. At the same time, it restricts the outliers from having an influence beyond a certain value. Thus, it is efficient as well as robust. The values of the data points that are ignored can be adjusted using the tuning constant $c_1$. Generally, a tuning constant of $c_1 = 6$ is used for the bi-weight estimate [40].

Figure 2.1: Empirical influence curves for mean, median and bi-weight location estimate.

2.3.2 Alternatives to Standard Deviation

One possible alternative to the standard deviation for the scale estimate in (2.2) is the median absolute deviation (MAD). The median absolute deviation measures the median of the absolute distances of the data points from the sample median. The MAD ($\sigma_1$) of the $e_n^{dB}$ is given by

\[
\sigma_1 = \mathrm{median}\, |e_n^{dB} - \tilde{\mu}| \tag{2.6}
\]

MAD has a breakdown point of 50%, and is used as a robust alternative to the standard deviation in many applications. However, MAD is not an efficient estimate of the scale.
It has an efficiency of only 36.74% for Gaussian distributions [40]. A more efficient and robust measure of scale is the bi-weight scale (BWS) given by [45]

\[
\sigma_2 = \sqrt{ \frac{ N \sum_{u_n^2 < 1} (e_n^{dB} - \mu^*)^2 (1 - u_n^2)^4 }{ \left( \sum_{u_n^2 < 1} (1 - u_n^2)(1 - 5u_n^2) \right) \left( -1 + \sum_{u_n^2 < 1} (1 - u_n^2)(1 - 5u_n^2) \right) } } \tag{2.7}
\]

where

\[
u_n = \frac{e_n^{dB} - \mu^*}{c_2 \, \mathrm{median}\, |e_n^{dB} - \mu^*|} \tag{2.8}
\]

$\mu^*$ is a robust estimate of location such as the median ($\tilde{\mu}$) or the bi-weight estimate ($\hat{\mu}$), and $c_2$ is the tuning constant. $c_2$ can be used to determine the impact of the extreme data points on the BWS estimate. Note that all of the summations in (2.7) are only over the values of $n$ for which $u_n^2 < 1$. In [40], it was shown that BWS ($\sigma_2$) is very efficient for a wide range of symmetric distributions compared to other robust estimates of scale, particularly for the tuning constant $c_2 = 9$.

In Fig. 2.2, the empirical influence curves of the standard deviation, MAD and BWS are shown for the same data sample $(-0.9, -0.8, \ldots, 0.8, 0.9)$ used in Section 2.3.1. As can be seen from the figure, the standard deviation is easily influenced by the new data point, whereas the MAD is not influenced by the new data point beyond a narrow range. The BWS is sensitive to the data points that are at a moderate distance from the location estimate and only ignores the extreme data points, like the bi-weight location estimate.

Note that while calculating the bi-weight scale, a tuning constant of $c_2 = 9$ was found to be more efficient for a wide range of distributions, compared to a tuning constant of $c_1 = 6$ for the bi-weight location estimate. This is because the extreme observations contribute more substantial information about the scale than about the location. Therefore, robust scale estimators should ignore fewer of the extreme observations to attain efficiency [40]. The optimal value of $c_2 = 9$ in calculating the BWS estimate was obtained through Monte Carlo simulations in [40].
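The bi-weight location estimate of (2.3) to (2.5) and the bi-weight scale of (2.7) and (2.8) can be sketched as follows. This is a minimal illustration under stated assumptions rather than the implementation used for the thesis results: the starting point (the sample median), the stopping tolerance, the iteration cap and the function names are all choices made for this example. The data reuse the 19-point sample $(-0.9, \ldots, 0.9)$ from the influence-curve discussion, with a hypothetical 20th point placed at 100.

```python
import math
import statistics


def biweight_location(x, c1=6.0, max_iter=50, tol=1e-9):
    """Iterate (2.3)-(2.5). Starting from the median is an assumption."""
    mu = statistics.median(x)
    for _ in range(max_iter):
        s = statistics.median([abs(v - mu) for v in x])  # S in (2.5)
        if s == 0:
            return mu  # more than half the points coincide with mu
        weights = []
        for v in x:
            u2 = ((v - mu) / (c1 * s)) ** 2
            weights.append((1 - u2) ** 2 if u2 < 1 else 0.0)  # (2.4)
        new_mu = sum(w * v for w, v in zip(weights, x)) / sum(weights)  # (2.3)
        if abs(new_mu - mu) < tol:
            return new_mu
        mu = new_mu
    return mu


def biweight_scale(x, mu_star, c2=9.0):
    """Bi-weight scale as in (2.7)-(2.8) around the location mu_star."""
    n = len(x)
    mad = statistics.median([abs(v - mu_star) for v in x])
    if mad == 0:
        return 0.0
    num = 0.0
    den = 0.0
    for v in x:
        u2 = ((v - mu_star) / (c2 * mad)) ** 2
        if u2 < 1:  # the sums in (2.7) only run over u_n^2 < 1
            num += (v - mu_star) ** 2 * (1 - u2) ** 4
            den += (1 - u2) * (1 - 5 * u2)
    if den * (den - 1) <= 0:
        raise ValueError("sample too small or too spread out for (2.7)")
    return math.sqrt(n * num / (den * (den - 1)))


# 19 points from -0.9 to 0.9 plus one hypothetical extreme reading.
sample = [i / 10 for i in range(-9, 10)] + [100.0]
mu_hat = biweight_location(sample)
bws = biweight_scale(sample, mu_hat)

print("mean:", statistics.mean(sample), "bi-weight location:", mu_hat)
print("std :", statistics.stdev(sample), "bi-weight scale   :", bws)
```

In this example the extreme point receives zero weight in both estimates, so the bi-weight location stays near zero and the bi-weight scale stays close to the spread of the 19 regular points, whereas the sample mean and standard deviation are dominated by the single extreme value.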
BWS was used by Alan Gross [26] to define robust confidence intervals for the bi-weight estimate as follows:

\[
\hat{\mu} \pm t_\nu \frac{\sigma_2}{\sqrt{N}} \tag{2.9}
\]

where $\nu = 0.7(N-1)$ and $t_\nu$ is the critical value of the Student-t distribution with $\nu$ degrees of freedom. Similarly, using a bi-weight location estimate and a variant of the BWS, Horn [32] proposed a technique for robust estimation of a $(1-\alpha)100\%$ prediction interval for the next observation $X_{n+1}$ based on the observed random sample $x_1, x_2, \ldots, x_n$ drawn from a symmetric distribution. Based on these results, the percentages of sample values lying in the interval $[\hat{\mu} - \beta\sigma_2, \hat{\mu} + \beta\sigma_2]$ can be expected to be close to each other for a wide range of symmetric distributions. This means that the probability of $o_n$ being greater than $\beta$ using bi-weight estimates for location and scale would be similar for most symmetric distributions. This sort of consistency is essential in assigning outlier factors, since the outlier factors should give a consistent measure of the outlyingness of a data point, irrespective of the underlying distribution of $\{e_n^{dB}\}_{n=1}^{N}$ (which is generally short-tailed in the case of hypothesis $H_0$ and long-tailed in the case of hypothesis $H_1$).

Figure 2.2: Empirical influence curves for standard deviation, median absolute deviation (MAD) and bi-weight scale (BWS).

2.3.3 Tackling Skew in the Data

The outlier detection techniques described in Sections 2.3.1 and 2.3.2 are effective for symmetric data distributions. However, significant positive skewness could be present in the energy distribution under hypothesis $H_1$ even after the logarithm operation, particularly when the secondary user network spatial size is large, with a few users having low path loss and the others having very high path loss.
This would lead to the assignment of high outlier factors to the users having high channel gain when the primary signal is present, leading to severe degradation in the probability of detection of the sensing system. However, as mentioned in Section 2.3, other transformations to remove skewness, such as the Box-Cox method, are not very robust against outliers.

Another way to tackle skew in the data distribution is to estimate the amount of skewness present in the data and then use it to modify the outlier factor. Skew in data is generally measured by its skew factor, given by

\[
\gamma_1 = \frac{\frac{1}{N} \sum_n (e_n^{dB} - \mu)^3}{\sigma^3} \tag{2.10}
\]

However, this measure is not robust and is easily influenced by the malicious users’ data. Several robust estimates of the skew factor have been studied in the literature [10]. Recently, a robust skew estimate called the Med-Couple (MC) [11] was proposed. MC is given by

\[
MC = \underset{e_i^{dB} < \tilde{\mu} < e_j^{dB}}{\mathrm{med}} \; \frac{(e_j^{dB} - \tilde{\mu}) - (\tilde{\mu} - e_i^{dB})}{e_j^{dB} - e_i^{dB}} \tag{2.11}
\]

MC has a breakdown point of 25% and has been shown to offer a good tradeoff between robustness and efficiency compared to other robust estimates of skewness [10]. An exponential function of MC has been used in [33] to adjust the upper and lower limits of the Tukey box-plots used to detect the outliers. However, reliable estimation of MC would require a large number of data points. For small sample sizes, even a few malicious users can have a substantial effect on the MC estimate. Moreover, skew estimates exhibit significant variation from sample to sample for a low number of data points. Therefore, this measure cannot be used effectively to compensate for the skew, particularly for a low number of sensors.

2.4 Malicious User Detection

In this section, malicious user detection techniques are proposed that employ the robust and efficient outlier factor assignment techniques discussed in Section 2.3. The maximum number of malicious users that the cooperative sensing system is expected to tolerate is denoted by $M_{\max}$.
2.4.1 Method I

One method to identify the malicious users in the system is to compare the magnitudes of the outlier factors, computed using the bi-weight estimate as the location estimate and BWS as the scale estimate in (2.2), with a threshold $\theta_1$ during each iteration. The users whose outlier factors have a magnitude above the threshold are considered malicious. If the number of such users is more than $M_{\max}$, then only the $M_{\max}$ users with the largest outlier factor magnitudes are considered malicious. The data of the users identified as malicious are not used in the decision making process during that particular iteration. However, deciding whether a user is malicious or not based only on its present outlier factor can potentially degrade the performance of the system. For example, in order to reliably detect the malicious users falsely producing high energy values, a low detection threshold $\theta_1$ is needed. However, if the primary signal is present, a non-malicious cognitive user with a very good channel between its receiver and the primary user might have a high outlier factor, especially if the distribution of the primary user SNR at the CR users is skewed. Thus, a lower threshold value $\theta_1$ would increase the chances of misidentifying such a user as malicious, which might severely decrease the probability of detection of the primary user signal by the cooperative sensing system. On the other hand, if a high outlier detection threshold is used, then the malicious users can potentially report higher energy values without being identified as bad users. This could drastically increase the probability of false alarm of a system affected by ‘Always Yes’ malicious users.
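The selection rule of Method I can be sketched in a few lines. The outlier factors below are hypothetical values assumed to have already been computed from bi-weight location and scale estimates, and the function name, threshold and cap are illustrative choices only.

```python
def flag_malicious(outlier_factors, theta1, m_max):
    """Return indices of users flagged by Method I for one iteration.

    Users whose outlier factor magnitude exceeds theta1 are candidates;
    if there are more than m_max candidates, only the m_max with the
    largest magnitudes are kept.
    """
    candidates = [n for n, o in enumerate(outlier_factors) if abs(o) > theta1]
    candidates.sort(key=lambda n: abs(outlier_factors[n]), reverse=True)
    return sorted(candidates[:m_max])


# Hypothetical outlier factors for six users in one sensing iteration.
factors_l = [0.2, -0.5, 4.1, 0.9, -3.6, 7.3]

flagged = flag_malicious(factors_l, theta1=3.0, m_max=2)
print(flagged)  # users excluded from the fusion rule in this iteration
```

With a threshold of 3 and a cap of two users, three users exceed the threshold but only the two with the largest magnitudes are flagged, which illustrates how the cap limits the number of users removed from the fusion rule.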
If the primary user does not change its state over a period of time, it is not possible to determine, without a priori knowledge of the primary user signal statistics, the channel conditions between the primary user transmitter and the CR sensors, or the background noise level, whether a high outlier factor is due to a good channel between the primary user and the CR sensor or due to false data.

2.4.2 Method II

If the primary user system is dynamic, with the primary user signal appearing and disappearing after every few sensing iterations, the malicious user detection schemes can be further improved. A significant increase in the energy values of the CR users from one sensing iteration to another would, in general, imply that the primary user has started transmission during that particular sensing iteration. Similarly, when the energy values of the sensors show a significant decrease, it might be an indication that the primary user has stopped transmission. The change in the energy values of the CR users, as the state of the primary user changes over a period of time, can be used to detect those malicious users that do not exhibit behavior similar to the rest of the users. However, it is important to precisely identify the iterations during which the change in the energy values is due to a change in the state of the primary user rather than due to malicious users or fluctuations in the noise and fading components. In this subsection, we propose a technique, based on the robust statistics discussed in Section 2.3, to identify the iterations during which there was a change in the primary user state and to use them to detect the malicious users.

During each iteration, the energy values of users having outlier factor magnitudes above a certain threshold $\theta_2$ are ignored, and the adjusted bi-weight estimate $\hat{\mu}_a[l]$ and the adjusted bi-weight scale $\hat{\sigma}_a[l]$ are estimated using the remaining energy values. $\theta_2$ is generally chosen so that only extreme outliers are eliminated.
If the number of outlier factors with magnitudes above $\theta_2$ is more than $M_{\max}$, only $M_{\max}$ energy values are ignored before evaluating the adjusted bi-weight location and scale estimates. The difference between the adjusted bi-weight estimate $\hat{\mu}_a[l]$ from iteration $l$ and the adjusted bi-weight estimate from iteration $l-1$ is obtained as follows:

\[
\Delta\hat{\mu}_a[l] = \hat{\mu}_a[l] - \hat{\mu}_a[l-1] \tag{2.12}
\]

If the adjusted bi-weight estimate increases from the $(l-1)$th iteration to the $l$th iteration (i.e., if $\Delta\hat{\mu}_a[l]$ is positive), it could be due to the appearance of the primary user signal between iterations $l-1$ and $l$. It is also possible that the primary user has remained in the same state (i.e., it has not started transmission) and the increase in the bi-weight estimate is due to fluctuations in the channel fading and noise components or due to the presence of malicious users. However, malicious user data has only a limited impact on the adjusted bi-weight estimate, especially since the data of users with very large outlier factor magnitudes are eliminated. The impact of variations in the noise and fading components will not be significant compared to the increase due to the appearance of a primary signal, as long as there are a few non-malicious users with good channels between the primary user and their sensors. Similarly, if $\Delta\hat{\mu}_a[l]$ is negative, it could be due to the disappearance of the primary user signal, malicious users or variations in the channel fading and noise components. However, the magnitude of $\Delta\hat{\mu}_a[l]$ is, in general, expected to be higher if the primary user stops transmission.

At each sensing iteration, the values of $\Delta\hat{\mu}_a[l]$ from the previous $K$ iterations are taken into consideration. Among these $K$ iterations, we identify the set of $K_m/2$ iterations $S^+[l]$ such that $\Delta\hat{\mu}_a[l']$, for $l' \in S^+[l]$, are positive with the $K_m/2$ largest magnitudes, and the set of $K_m/2$ iterations $S^-[l]$ such that $\Delta\hat{\mu}_a[l']$, for $l' \in S^-[l]$, are negative with the $K_m/2$ largest magnitudes.
Thus, $S^+[l]$ represents the set of iterations during which there is a high chance that the primary user has started transmission and $S^-[l]$ represents the set of iterations during which the primary user might have stopped transmission. The penalty factors $P_n[l]$ are now assigned to each user as follows:

\[
P_n[l] = \sum_{l' \in S^+[l]} \left( o_n^+[l'-1] + o_n^-[l'] \right) + \sum_{l' \in S^-[l]} \left( o_n^-[l'-1] + o_n^+[l'] \right) \tag{2.13}
\]

where

\[
o_n^-[l'] =
\begin{cases}
-\dfrac{e_n^{dB}[l'] - \hat{\mu}_a[l']}{\hat{\sigma}_a[l']}, & e_n^{dB}[l'] < \hat{\mu}_a[l'] \\
0, & \text{otherwise}
\end{cases}
\tag{2.14}
\]

\[
o_n^+[l'] =
\begin{cases}
\dfrac{e_n^{dB}[l'] - \hat{\mu}_a[l']}{\hat{\sigma}_a[l']}, & e_n^{dB}[l'] > \hat{\mu}_a[l'] \\
0, & \text{otherwise}
\end{cases}
\tag{2.15}
\]

Therefore, for all values of $l' \in S^+[l]$, during which the primary user has most likely started transmission, the magnitudes of only the negative adjusted outlier factors $o_n^-[l']$ for iteration $l'$ and the positive adjusted outlier factors $o_n^+[l'-1]$ for iteration $l'-1$ are added to the penalty factor, and for values $l' \in S^-[l]$, the magnitudes of only the positive adjusted outlier factors $o_n^+[l']$ for iteration $l'$ and the negative adjusted outlier factors $o_n^-[l'-1]$ for iteration $l'-1$ are added to the penalty factor.

Suppose a user consistently produces high energy values irrespective of the presence or absence of the primary user. If the primary user reappears between iterations $l-1$ and $l$, then $\Delta\hat{\mu}_a[l]$ will be positive. As a result, the users producing high energy values during iteration $l-1$ will receive a penalty factor based on their adjusted outlier factors from iteration $l-1$. Also, the CR sensors with low primary SNR will be assigned a penalty factor based on their adjusted outlier factors from iteration $l$. However, these sensors will not have a significant impact on the final decision at the access point. In a similar way, malicious users and CR users with low primary user SNR will also be assigned a high penalty factor when the primary user disappears between iterations $l-1$ and $l$. Sometimes, the sensors with high primary user SNR could be assigned penalty factors.
This would usually happen when some of the $K_m$ iterations chosen from the previous $K$ iterations do not correspond to a change in the state of the primary user. In such a scenario, the adjusted bi-weight estimate might decrease due to fluctuations in the fading and noise components even though the primary user was present during both iterations $l-1$ and $l$. The choice of $K_m$ and $K$ would depend upon the number of times a primary user is expected to change its state during a given time period. For a good choice of $K_m$ and $K$, the proposed method would avoid the assignment of high penalty factors to non-malicious CR users having high primary user SNR, as long as there are a few CR users with good channels between the primary user and their sensors.

Based on these penalty factors, another set of outlier factors $\bar{o}_n[l]$ is defined as follows:

\[
\bar{o}_n[l] = \frac{P_n[l] - \hat{\mu}_P[l]}{\hat{\sigma}_P[l]} \tag{2.16}
\]

where $\hat{\mu}_P[l]$ and $\hat{\sigma}_P[l]$ are the bi-weight location and scale estimates of $P_n[l]$. A positive threshold $\theta_3$ is applied to determine the malicious users. All the users with positive outlier factors above this threshold (or the users with the $M_{\max}$ largest outlier factors, if the number of users with outlier factors above $\theta_3$ is more than $M_{\max}$) are considered malicious.

Method IIa

If a malicious user is aware that Method II is being used at the access point, it can avoid sending false values whenever the state of the primary user changes. Such a user could still be identified using Method II, since it cannot be sure whether the primary user will change its state during the next iteration, but it could nevertheless escape being assigned a high penalty factor. In this section, we propose a method to identify such smart malicious users. We define

\[
\Delta\hat{\mu}_a^{\delta}[l] = \hat{\mu}_a[l] - \hat{\mu}_a[l-\delta] \tag{2.17}
\]

$K_m^{\delta}$, $S^{\delta+}[l]$ and $S^{\delta-}[l]$ are defined based on $\Delta\hat{\mu}_a^{\delta}[l]$ in the same way as $K_m$, $S^+[l]$ and $S^-[l]$ were defined based on $\Delta\hat{\mu}_a[l]$.
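The change-point selection and the penalty sum of (2.13) to (2.15) can be sketched as below, with a lag parameter so that the same routine also uses the lagged difference of (2.17); a lag of 1 corresponds to Method II. Several simplifications are assumptions made only for this illustration: the per-iteration median and median absolute deviation stand in for the adjusted bi-weight location and scale, "previous $K$ iterations" is read as the $K$ most recent iterations including the current one, and the energy values (in dB) are hypothetical, with the last sensor always reporting a high value.

```python
import statistics


def one_sided_factors(e_db, mu_a, sigma_a):
    """Return (o_plus, o_minus) for one iteration, as in (2.15) and (2.14)."""
    o_plus = [max(0.0, (e - mu_a) / sigma_a) for e in e_db]
    o_minus = [max(0.0, (mu_a - e) / sigma_a) for e in e_db]
    return o_plus, o_minus


def penalty_factors(history_db, mu_a, sigma_a, K, Km, lag=1):
    """Penalty factors at the latest iteration; lag=1 corresponds to (2.13)."""
    last = len(history_db) - 1
    # Assumption: the window holds the K most recent iterations.
    window = range(max(lag, last - K + 1), last + 1)
    delta = {lp: mu_a[lp] - mu_a[lp - lag] for lp in window}
    half = Km // 2
    # Iterations with the largest positive / negative location changes.
    s_plus = sorted((lp for lp in window if delta[lp] > 0),
                    key=lambda lp: delta[lp], reverse=True)[:half]
    s_minus = sorted((lp for lp in window if delta[lp] < 0),
                     key=lambda lp: delta[lp])[:half]
    factors = [one_sided_factors(e, m, s)
               for e, m, s in zip(history_db, mu_a, sigma_a)]
    penalties = [0.0] * len(history_db[0])
    for lp in s_plus:
        before_plus, now_minus = factors[lp - lag][0], factors[lp][1]
        for n in range(len(penalties)):
            penalties[n] += before_plus[n] + now_minus[n]
    for lp in s_minus:
        before_minus, now_plus = factors[lp - lag][1], factors[lp][0]
        for n in range(len(penalties)):
            penalties[n] += before_minus[n] + now_plus[n]
    return penalties, s_plus, s_minus


# Hypothetical dB readings: primary absent (l=0,1), present (l=2,3), absent (l=4,5).
history = [
    [0.0, 0.5, -0.5, 0.2, 15.0],
    [0.3, -0.2, 0.1, -0.4, 15.0],
    [10.0, 12.0, 8.0, 11.0, 15.0],
    [11.0, 10.0, 9.0, 12.0, 15.0],
    [0.1, -0.3, 0.4, 0.0, 15.0],
    [-0.2, 0.3, 0.0, 0.5, 15.0],
]
# Stand-ins for the adjusted bi-weight location and scale (an assumption).
mu_a = [statistics.median(row) for row in history]
sigma_a = [statistics.median([abs(e - m) for e in row])
           for row, m in zip(history, mu_a)]

penalties, s_plus, s_minus = penalty_factors(history, mu_a, sigma_a, K=5, Km=2)
print("S+ =", s_plus, "S- =", s_minus)
print("penalties =", [round(p, 2) for p in penalties])
```

In this toy trace the selected iterations coincide with the two state changes of the primary user, and the sensor that reports a high value in every iteration accumulates by far the largest penalty, while sensors that follow the primary user activity accumulate little.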
The penalty factors $P_n^{\delta}[l]$ are assigned as follows:

$$P_n^{\delta}[l] = \sum_{l' \in S^{\delta+}[l]} \left( o_n^{+}[l'-\delta] + o_n^{-}[l'] \right) + \sum_{l' \in S^{\delta-}[l]} \left( o_n^{-}[l'-\delta] + o_n^{+}[l'] \right) \tag{2.18}$$

Final penalty factors are assigned as follows:

$$P_n[l] = \sum_{\delta \in D_{\delta}} P_n^{\delta}[l] \tag{2.19}$$

where $D_{\delta}$ is the set of $\delta$ values considered. The outlier factors $\bar{o}_n[l]$ are calculated as in (2.16). Values of $\delta > 1$ could be used to identify the smart malicious users mentioned earlier. Moreover, the $\delta$ values can also be chosen randomly by the access point. Neither Method II nor Method IIa can accurately identify malicious users that send false sensing values only once every few iterations, thereby keeping their overall penalty factors low. However, such malicious users would have less impact on the throughput of the cooperative sensing system.

2.5 Malicious User Detection Using Spatial Information

As mentioned in earlier sections, significant skewness could be present in the energy distribution under hypothesis $H_1$ even after the logarithm operation, particularly when the spatial size of the CR network is large. Another way to tackle skew is to estimate the skewness present in the data by calculating the skew factor and then use it to modify the outlier factors [10, 33]. However, for small sample sizes, robust skew estimates exhibit significant variation from sample to sample, and the false data points can have a substantial effect on the estimate. Therefore, these measures cannot be used effectively to compensate for the skew, particularly for a low number of sensors.

If the spatial location of the CR users is available at the access point, then the outlier factor can be assigned to each user based on the energy-detector outputs of its closest spatial neighbors. In wireless communication systems, the distribution of the energy-detector outputs is generally expected to be less skewed for sensors spread over a small area, compared to sensors spread over a larger area.
Spatial outlier factors $o_n^{s}[l]$ are computed as follows:

$$o_n^{s}[l] = \frac{e_n^{dB}[l] - \hat{\mu}^{s}[l]}{\hat{\sigma}^{s}[l]} \tag{2.20}$$

where $\hat{\mu}^{s}[l]$ and $\hat{\sigma}^{s}[l]$ are the bi-weight location and scale estimates of the energy values of the $A$ closest neighbors of user $n$ (including user $n$ itself). Based on $o_n^{s}[l]$ calculated as in (2.20) and $o_n[l]$ calculated as in (2.2), a final outlier factor $o_n^{f}[l]$ is assigned as follows:

$$o_n^{f}[l] = \begin{cases} \min\{|o_n^{s}[l]|, |o_n[l]|\} & ;\ o_n[l] \ge 0 \\ -\min\{|o_n^{s}[l]|, |o_n[l]|\} & ;\ o_n[l] < 0 \end{cases} \tag{2.21}$$

The minimum of the magnitudes of $o_n^{s}[l]$ and $o_n[l]$ is taken, instead of just assigning $o_n^{s}[l]$ as the outlier factor of each user, to prevent the assignment of high outlier factors to certain non-malicious users. For example, a non-malicious user might have a high channel gain from the primary transmitter compared to the rest of the sensors in its spatial neighborhood, thus getting a high spatial outlier factor $o_n^{s}$ under hypothesis $H_1$. However, when compared to the other sensors in the entire system, the channel gain of this particular user is not high enough to raise any suspicion. Taking just $o_n^{s}$ would lead to the erroneous assignment of a high outlier factor to such a non-malicious user.

Malicious users can now be identified by Method I discussed in Section 2.4.1, using the values $o_n^{f}[l]$ instead of $o_n[l]$. Alternatively, Method II discussed in Section 2.4.2 can be used. The algorithm remains the same except that $o_n^{-}[l]$ in (2.14) and $o_n^{+}[l]$ in (2.15) are assigned as:

$$o_n^{-}[l'] = \begin{cases} \min\{|\bar{o}_n^{s}[l']|, |\bar{o}_n[l']|\} & ;\ \bar{o}_n[l'] < 0 \\ 0 & ;\ \text{otherwise} \end{cases} \tag{2.22}$$

$$o_n^{+}[l'] = \begin{cases} \min\{|\bar{o}_n^{s}[l']|, |\bar{o}_n[l']|\} & ;\ \bar{o}_n[l'] \ge 0 \\ 0 & ;\ \text{otherwise} \end{cases} \tag{2.23}$$

where

$$\bar{o}_n^{s}[l'] = \frac{e_n^{dB}[l'] - \hat{\mu}_a^{s}[l']}{\hat{\sigma}_a^{s}[l']} \tag{2.24}$$

$$\bar{o}_n[l'] = \frac{e_n^{dB}[l'] - \hat{\mu}_a[l']}{\hat{\sigma}_a[l']} \tag{2.25}$$

and $\hat{\mu}_a^{s}[l']$ and $\hat{\sigma}_a^{s}[l']$ are the new spatial neighborhood bi-weight location and scale estimates obtained after eliminating users with outlier factors having magnitudes above the threshold $\theta_2$.

2.6 Performance Analysis

In this section, a method to compare the performances of the proposed malicious user detection schemes is considered.
The equal gain combining scheme is considered at the access point [71]. The equal gain combining rule is as follows:

$$\frac{1}{N} \sum_{n=1}^{N} e_n[l] \ \underset{H_0}{\overset{H_1}{\gtrless}} \ e_T \tag{2.26}$$

where $e_T$ is the detection threshold used at the access point.

The performances of the malicious user detection schemes are analyzed by defining two measures, the additional probability of false alarm $\bar{P}_f$ and the additional probability of misdetection $\bar{P}_m$, as follows:

$$\bar{P}_f = \Pr(\hat{d}_m = 1 \mid \hat{d}_0 = d = 0) \tag{2.27}$$

$$\bar{P}_m = \Pr(\hat{d}_m = 0 \mid \hat{d}_0 = d = 1) \tag{2.28}$$

where $d$ is the primary user state ($d = 1$ and $d = 0$ denote the presence and absence of the primary signal, respectively), $\hat{d}_0$ is the decision made by the ideal malicious user detection scheme that correctly identifies and ignores the data of the malicious users, and $\hat{d}_m$ is the decision made by a system, affected by the malicious users, implementing the proposed malicious user detection scheme. Thus, when the primary user is not present, $\bar{P}_f$ represents the probability that the malicious user identification scheme fails to detect the malicious users, or misdetects a non-malicious user as malicious, resulting in a wrong decision $\hat{d}_m = 1$ when the ideal malicious user detection scheme would have made the correct decision $\hat{d}_0 = 0$. Similarly, when the primary user is present, $\bar{P}_m$ represents the probability that the malicious user detection scheme fails to detect the malicious user, or misdetects a good user as malicious, resulting in a wrong decision $\hat{d}_m = 0$ when, for the same set of energy values, an ideal malicious user detection scheme would have made the correct decision $\hat{d}_0 = 1$.

In the malicious user detection Methods I and II described in Section 2.4, the values of $\bar{P}_f$ and $\bar{P}_m$ depend on the outlier detection thresholds $\theta_1$ (Method I) and $\theta_3$ (Method II). The trade-off between $\bar{P}_f$ and $\bar{P}_m$, as the values of the thresholds $\theta_1$ and $\theta_3$ are varied, is studied to analyze the performance of the malicious user detection schemes.
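The two measures in (2.27) and (2.28) can be estimated by Monte Carlo counting. The following is a minimal sketch, not the thesis's simulation code: the function names and the `(d, d0_hat, dm_hat)` tuple layout are assumptions made for illustration. Each trial supplies the true primary user state, the decision of the ideal scheme, and the decision of the scheme under test.

```python
from typing import Iterable, Sequence, Tuple


def egc_decision(energies: Sequence[float], e_T: float) -> int:
    """Equal gain combining as in (2.26): decide H1 (1) if the mean energy exceeds e_T."""
    return 1 if sum(energies) / len(energies) > e_T else 0


def additional_error_rates(trials: Iterable[Tuple[int, int, int]]) -> Tuple[float, float]:
    """Estimate (P_f_bar, P_m_bar) from (d, d0_hat, dm_hat) triples.

    Only trials in which the ideal scheme is correct (d0_hat == d) enter the
    conditioning events of (2.27) and (2.28).
    """
    n0 = n1 = false_alarms = misdetections = 0
    for d, d0_hat, dm_hat in trials:
        if d == 0 and d0_hat == 0:
            n0 += 1
            false_alarms += dm_hat == 1
        elif d == 1 and d0_hat == 1:
            n1 += 1
            misdetections += dm_hat == 0
    p_f = false_alarms / n0 if n0 else float("nan")
    p_m = misdetections / n1 if n1 else float("nan")
    return p_f, p_m
```

Sweeping the outlier threshold of the scheme under test and re-running the estimate for each value would trace out one trade-off curve between the two measures.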
2.7 Simulation Results

We consider a cooperative sensing system with $N = 20$ users. A generic wide area propagation model is considered for the primary signal [55]. The path loss constant is 5. The standard deviation of log-normal shadowing is 5 dB. The correlation between the shadowing components of two sensors is assumed to be exponentially decreasing with the distance between the sensors, with a correlation of 0.3 at a distance of 10 m [27]. Independent and identically distributed small-scale Rayleigh fading is assumed at each sensor. The sensing period at each sensor is given by $T = 5/B$, where $B$ is the channel bandwidth. The CR sensors are assumed to be stationary with fixed path loss and shadowing components. Outlier factors are calculated using bi-weight location and scale estimates. BWS is calculated using the median as the location estimate $\mu^{*}$ in (2.7) and (2.8). The threshold $e_T$ in (2.26) is chosen so that the probability of false alarm at the fusion center is 0.01. We assume that the probability of a primary user being present during a sensing iteration is 0.5 and that this probability is independent from one iteration to another. We consider ‘Always Yes’ malicious users that generate values that are randomly distributed between $4e_T$ and $8e_T$. $M$ represents the number of malicious users present in the system. We assign the spatial locations of the sensors using a two-dimensional model. The $(X, Y)$ coordinates of the primary user transmitter are $(0, 0)$.

In Fig. 2.3, we assume that the sensors are distributed in an area of 50 m × 50 m as a 5 × 4 uniform rectangular grid. The $X$ and $Y$ coordinates of the sensors lie between 100 m and 150 m. Ignoring fading effects, the SNR at (100 m, 100 m) is -5 dB. The number of malicious users is $M = 1$ and the maximum number of malicious users tolerated is $M_{max} = 2$. The location of the malicious user is chosen as (100 m, 100 m). The performances of Methods I and II are compared.
In case of Method II, the threshold $\theta_2$, which is used to eliminate extreme outliers before calculating the adjusted bi-weight location and scale estimates, is chosen to be 4. We see that Method II significantly outperforms Method I. Moreover, the performance improves as the value of $K$ increases for a given $K_m/K$ ratio. It should also be noted that $\bar{P}_f$ cannot be reduced below a certain value for each malicious user detection scheme, since at low values of the outlier detection thresholds, some of the good users are misidentified as bad users. The case when no malicious user detection scheme is used corresponds to the left end of the performance curve of Method I (at low $\bar{P}_m$), i.e., to a very high detection threshold ($\theta_1$) at which the malicious user is not detected. As we can see, the malicious user significantly increases the probability of false alarm of the system.

In Fig. 2.4 and Fig. 2.5, we assume that the sensors are distributed in an area of 225 m × 225 m as a 5 × 4 uniform rectangular grid. The $X$ and $Y$ coordinates are distributed between 25 m and 250 m. Ignoring fading, the primary user SNR at (100 m, 100 m) is -5 dB in Fig. 2.4 and 3 dB in Fig. 2.5. All other parameters are the same as those used in Fig. 2.3. The skew in the received energy distribution in dB under hypothesis $H_1$ is generally expected to be higher compared to the system considered in Fig. 2.3. We consider the performance of Methods I and II for $M = 1$ and $M_{max} = 2$. The location of the malicious user is chosen to be (25 m, 25 m). We see from Fig. 2.4 that, compared to the system considered in Fig. 2.3, achieving a similar decrease in the value of $\bar{P}_f$ results in a higher $\bar{P}_m$. This is due to the higher probability of misdetecting CR users with strong channels from the primary user as malicious. Moreover, the impact of eliminating such users on the sensing system would be higher. We also notice that at higher SNR values, as in Fig. 2.5, Method II offers significant improvement in the performance.
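The sensor layout and the correlated shadowing used in these simulations can be sketched as follows. This is an illustrative sketch under stated assumptions, not the thesis's simulation code: the grid is assumed to include the end points of the coordinate range, the "exponentially decreasing" correlation is assumed to take the form $\rho(d) = 0.3^{d/10}$ (which equals 0.3 at 10 m), and all function names are hypothetical. Only the Python standard library is used, so the Cholesky factorization is written out by hand.

```python
import math
import random


def sensor_grid(lo, hi, nx=5, ny=4):
    """Place nx * ny sensors on a uniform rectangular grid spanning [lo, hi] in X and Y."""
    xs = [lo + i * (hi - lo) / (nx - 1) for i in range(nx)]
    ys = [lo + j * (hi - lo) / (ny - 1) for j in range(ny)]
    return [(x, y) for x in xs for y in ys]


def shadowing_covariance(points, sigma_db=5.0, rho_ref=0.3, d_ref=10.0):
    """Covariance (in dB^2) with correlation rho_ref ** (distance / d_ref) (assumed form)."""
    return [[sigma_db ** 2 * rho_ref ** (math.dist(p, q) / d_ref) for q in points]
            for p in points]


def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low


def sample_shadowing(points, rng):
    """Draw one correlated log-normal shadowing realization (values in dB)."""
    low = cholesky(shadowing_covariance(points))
    z = [rng.gauss(0.0, 1.0) for _ in points]
    return [sum(low[i][k] * z[k] for k in range(i + 1)) for i in range(len(points))]


# Example: the small-area layout (coordinates between 100 m and 150 m).
sensors = sensor_grid(100.0, 150.0)
shadow_db = sample_shadowing(sensors, random.Random(0))
```

Because the sensors are stationary, one such shadowing draw would be held fixed across sensing iterations, while the Rayleigh fading is redrawn independently each time.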
[Figure 2.3: Performance of malicious user detection schemes for CR network spread over a small area in the presence of M = 1 malicious user and Mmax = 2. Axes (logarithmic): additional probability of false alarm ($\bar{P}_f$) versus additional probability of misdetection ($\bar{P}_m$). Curves: Method I; Method II with (Km, K) = (8, 16), (16, 32), (32, 64).]

[Figure 2.4: Performance of malicious node detection schemes for CR network spread over a large area in the presence of M = 1 malicious user with primary user SNR at (100m, 100m) ignoring fading effects = -5dB. Axes and curves as in Figure 2.3.]

[Figure 2.5: Performance of malicious node detection schemes for CR network spread over a large area in the presence of M = 1 malicious user with primary user SNR at (100m, 100m) ignoring fading effects = 3dB. Axes and curves as in Figure 2.3.]

In Fig. 2.6, we consider the performance of Method II for the system considered in Fig. 2.4. We vary the value of $K$ keeping $K_m$ constant at 16. We see that the performance of the malicious user detection scheme improves with increasing value of $K$. This is because, for larger values of $K$, the $K_m$ iterations during which the change in the bi-weight location estimate has been largest correspond more precisely to changes in the state of the primary user. However, an increase in $K$ also leads to latency in the malicious user detection scheme.
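The role of $K$ and $K_m$ can be made concrete with a sketch of the Method II penalty assignment in (2.13)-(2.15). This is a hedged illustration rather than the thesis's implementation: it assumes that $S^{+}[l]$ and $S^{-}[l]$ are obtained by taking, among the last $K$ iterations, the $K_m$ iterations with the largest $|\Delta\hat{\mu}_a|$ and splitting them by the sign of the change. The function names and the array layout (`e_db[n][t]`, `mu_a[t]`, `sigma_a[t]`) are hypothetical, and the adjusted bi-weight estimates are taken as given inputs.

```python
def adjusted_outlier_parts(e_db, mu_a, sigma_a):
    """Return (o_plus, o_minus): magnitudes of the positive and negative
    adjusted outlier factor, as in (2.15) and (2.14)."""
    o = (e_db - mu_a) / sigma_a
    return (o, 0.0) if o > 0 else (0.0, -o)


def penalty_factors(e_db, mu_a, sigma_a, K, K_m):
    """Penalty factor per user at the latest iteration, following (2.13).

    e_db[n][t]: energy (dB) of user n at iteration t.
    mu_a[t], sigma_a[t]: adjusted bi-weight location and scale estimates.
    Requires more than K iterations of history.
    """
    l = len(mu_a) - 1
    window = range(l - K + 1, l + 1)
    delta = {t: mu_a[t] - mu_a[t - 1] for t in window}
    # Assumed selection rule: the K_m iterations with the largest |change|.
    chosen = sorted(window, key=lambda t: abs(delta[t]), reverse=True)[:K_m]
    s_plus = [t for t in chosen if delta[t] > 0]    # primary user likely appeared
    s_minus = [t for t in chosen if delta[t] < 0]   # primary user likely disappeared
    penalties = []
    for user in e_db:
        parts = {t: adjusted_outlier_parts(user[t], mu_a[t], sigma_a[t])
                 for t in range(l - K, l + 1)}
        p = sum(parts[t - 1][0] + parts[t][1] for t in s_plus)
        p += sum(parts[t - 1][1] + parts[t][0] for t in s_minus)
        penalties.append(p)
    return penalties
```

In this sketch, a larger $K$ gives the ranking more candidate iterations to choose from, at the price of needing a longer history before a penalty can be computed, which matches the latency remark above.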
[Figure 2.6: Performance of Method II at different values of K for M = 1, Mmax = 2 and Km = 16. Axes (logarithmic): additional probability of false alarm ($\bar{P}_f$) versus additional probability of misdetection ($\bar{P}_m$). Curves: Method II with Km = 16 and K = 24, 32, 48, 64.]

In Fig. 2.7, we consider the performance of Method II at different values of $K_m$ keeping $K$ constant at 32, for the system considered in Fig. 2.4. We observe that the best performance is obtained when $K_m$ is $0.5K$. This is due to the nature of the primary user considered in these simulations. Since the probability of the primary user being in state $d = 1$ (primary user signal present) or state $d = 0$ (primary user signal absent) is assumed to be 0.5 and independent from one iteration to another, the most likely number of primary user state transitions during the $K$ iterations would be $0.5K$. Therefore, if $K_m < 0.5K$, there is a high probability that some of the iterations during which there was a change in the primary user state have not been considered in assigning the penalty factor, leading to poorer performance. If $K_m > 0.5K$, there is a high chance that some of the iterations during which there was no change of state of the primary user have been considered in assigning the penalty factor, again leading to poorer performance. Thus, more precise knowledge of the primary user activity (the expected number of state transitions in a given time interval) can be used to appropriately choose $K_m$ and $K$.

In Fig. 2.8, Fig. 2.9 and Fig. 2.10, we consider the performance of Methods I and II at different values of $M$ for $M_{max} = 20$. The system considered is similar to the system analyzed in Fig. 2.4. The primary user SNR (ignoring fading effects) at (100 m, 100 m) is assumed to be -5 dB, 0 dB and 8 dB in Fig. 2.8, Fig. 2.9 and Fig. 2.10, respectively. We assume that all malicious users collude and produce equal high energy values.
We consider the worst possible case, in which all the malicious users in the system are the ones spatially closest to the primary user. In case of Method II, we choose $K_m = 16$ and $K = 32$. We see that the performance of Method II degrades more than that of Method I as $M$ increases. This is especially true at low values of primary user SNR (Fig. 2.8). This is because at low primary user SNR values there are not enough non-malicious users with good channels from the primary user. Therefore, it is not necessarily true that the largest increase or decrease in the adjusted bi-weight estimates is due to a change in the state of the primary user, leading to severe performance degradation in case of Method II. However, as seen from Fig. 2.10, at high values of SNR, Method II still outperforms Method I even for high values of $M$. Both Methods I and II would offer a trade-off between the probability of false alarm and the probability of misdetection for a system affected by malicious users, as long as the malicious users make up less than 50% of the users. However, the trade-off might not be practical for high values of $M$ and low primary user SNR values.

[Figure 2.7: Performance of Method II at different values of Km for M = 1, Mmax = 2 and K = 32. Axes (logarithmic): additional probability of false alarm ($\bar{P}_f$) versus additional probability of misdetection ($\bar{P}_m$). Curves: Method II with K = 32 and Km = 8, 16, 24.]

In Fig. 2.11, we consider the performance of malicious user detection techniques using spatial information for the system considered in Fig. 2.4 with $M = 1$ and $M_{max} = 2$. The size of the spatial neighborhood considered is $A = 8$. We see that the performances of both Methods I and II improve substantially when spatial outlier factors are taken into consideration.
This is due to the assignment of lower magnitude outlier factors to non-malicious users with good channels from the primary user, which decreases the probability of such users having an outlier magnitude or penalty factor higher than the malicious users or the CR users with low SNR from the primary user. Even though, in this method, the chances of sensors with low primary user SNR getting a high outlier or penalty factor are higher, the effect of these sensors on the performance of the cooperative sensing system will be low. The optimal choice of $A$ would depend on the propagation environment of the primary user signal.

[Figure 2.8: Performance of malicious user detection schemes at different values of M for Mmax = 20 with primary user SNR at (100m, 100m) ignoring fading effects = -5dB. Axes (logarithmic): additional probability of false alarm ($\bar{P}_f$) versus additional probability of misdetection ($\bar{P}_m$). Curves: Method I and Method II, each for M = 1, 2, 3, 4.]

[Figure 2.9: Performance of malicious user detection schemes at different values of M for Mmax = 20 with primary user SNR at (100m, 100m) ignoring fading effects = 0dB. Axes and curves as in Figure 2.8.]

[Figure 2.10: Performance of malicious user detection schemes at different values of M for Mmax = 20 with primary user SNR at (100m, 100m) ignoring fading effects = 8dB. Axes and curves as in Figure 2.8.]

[Figure 2.11: Performance of malicious user detection schemes using spatial information of the CR network for M = 1 malicious user and Mmax = 2. Axes as in Figure 2.8. Curves: Method I; Method I, Spatial; Method II and Method II, Spatial, each with (Km, K) = (8, 16), (16, 32), (32, 64).]

In Fig. 2.12 and Fig. 2.13, we analyze the performance of Method IIa when $D_{\delta} = \{1, 2, 3, 4\}$ for the system considered in Fig. 2.4. In Fig. 2.12, we consider an ‘Always Yes’ malicious user, and in Fig. 2.13, we consider a smart malicious user that avoids sending false sensing values during the iterations in which there is a change in the primary user state. The same $K_m^{\delta}$ value is used for each $\delta$ and is denoted by $K_m$ in Fig. 2.12 and Fig. 2.13. We see that Method IIa performs close to Method II in case of the ‘Always Yes’ malicious user. At the same time, Method IIa significantly outperforms Method II in case of the smart malicious user. This is because the smart malicious user escapes getting a penalty during most iterations in case of Method II. However, for $\delta > 1$, it still receives the penalty and thus is identified using Method IIa.

2.8 Conclusions

In this chapter, we studied a CR cooperative sensing system based on a parallel fusion sensing architecture, in which all sensors send their quantized or un-quantized energy detector outputs to an access point, which then applies a data fusion and detection scheme to determine the presence of a primary signal. We investigated schemes to identify malicious CR sensors sending false sensing information to the access point, which can lead to severe degradation in the performance of the CR sensing system. We explored techniques based on outlier detection to identify such malicious users.
Several important constraints imposed by the CR scenario, such as small data sample size and limited knowledge of the primary signal propagation environment, were taken into consideration. We investigated various robust statistics that could be used to assign outlier factors to the CR users during each sensing iteration. Malicious user detection schemes based on these outlier factors were then proposed to identify users sending false sensing information and reduce their impact on the performance of the sensing system. The proposed malicious user detection schemes do not require feedback from the primary user network or knowledge of the additive noise variance and the location of the primary transmitter. We especially focused on identifying the malicious users which decrease the CR throughput by sending false high energy values when the primary user is absent. Assuming partial knowledge of the primary user activity, we proposed a novel method to improve the performance of the malicious user detection schemes.

[Figure 2.12: Performance of malicious node detection schemes for CR network spread over a large area in the presence of a single ‘Always Yes’ malicious user. Axes (logarithmic): additional probability of false alarm ($\bar{P}_f$) versus additional probability of misdetection ($\bar{P}_m$). Curves: Method I; Method II and Method IIa, each with (Km, K) = (8, 16), (16, 32).]

[Figure 2.13: Performance of malicious node detection schemes for CR network spread over a large area in the presence of a single smart malicious user. Axes as in Figure 2.12. Curves: Method I; Method II and Method IIa, each with (Km, K) = (8, 16), (16, 32), (32, 64).]
For the case of a CR cooperative sensing system spread over a wide area with significant difference in the path loss components of the channels between the primary user and the various CR sensors, we proposed improved malicious user detection schemes in which spatial location information of the sensors is taken into consideration. We analyzed the performance of the proposed schemes through simulations for a cooperative sensing system using equal gain combining as the data fusion scheme at the access point.

Chapter 3

Sensor Allocation and Quantization Schemes

3.1 Background

In this chapter, we consider a CR system operating in multiple primary bands. We assume that the CR sensors are equipped with narrow-band detectors that can only scan one primary band at a time. For such a system, we present the optimal joint sensor allocation and single-bit quantization problem when the ‘OR’ fusion rule is used in each primary band at the access point. Since the original problem is a highly complex mixed integer optimization problem, we propose to solve it sub-optimally by separating it into two subproblems. We first propose schemes to allocate sensors to various primary bands based on assignment algorithms [46, 49]. We then study the optimal single-bit quantization scheme at the sensors, assuming equal quantization thresholds at all the sensors assigned to the same primary band. We show that the optimal quantization problem is, in general, non-convex and propose a suboptimal solution based on a convex restriction of the optimal problem. We further study quantization schemes when a general k-out-of-N fusion rule is used in each primary band at the access point. In this chapter, we assume that the CR network has information about certain primary transmitter characteristics, such as the timing of the primary user pilot signals, which it can use to evaluate the channel gains between the primary transmitters and the CR sensors.

The rest of this chapter is organized as follows.
In Section 3.2, we define the system model and formulate the optimization problem when the ‘OR’ fusion rule is implemented at the access point in each primary band. In Section 3.3, we propose schemes to assign sensors to various primary user bands. In Section 3.4, we propose efficient techniques to determine the energy detection thresholds at each sensor. In Section 3.5, we extend the results to the case when a general k-out-of-N fusion rule is implemented at the access point. Simulation results are presented in Section 3.6. Conclusions are finally drawn in Section 3.7.

3.2 System Model and Problem Formulation

We consider a group of $L$ CR sensors operating in $P$ primary user bands. An energy detector is implemented at each sensor. The energy detector measures the signal energy level in the assigned primary band and sends bit ‘1’ to the access point via a reporting channel if the energy level is above a certain energy threshold, and bit ‘0’ if the energy level is below the energy threshold. We assume that channel coding is used in the reporting channels and that the effect of errors due to the reporting channels on the performance of the CR sensing system is negligible. In this section, we assume that the ‘OR’ fusion rule is used at the access point in each primary band.

We assume that all primary bands are assigned an equal number of sensors. Thus, assuming $L$ is a multiple of $P$, each primary band is assigned $N = L/P$ sensors. The optimization criteria used for assigning sensors and quantization thresholds can vary from system to system. In this chapter, we consider two optimization criteria: 1) maximize the sum throughput rate of the CR system, and 2) maximize the minimum throughput rate available to the CR system among the various primary bands.
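The ‘OR’ fusion rule and the two criteria can be illustrated with a short sketch. It assumes that the single-bit decisions of the sensors in a band are statistically independent, so that a band-level probability is one minus the product of the per-sensor complements; the function names are hypothetical and the numbers in the usage lines are made up for illustration.

```python
def or_rule(p_local):
    """Probability that at least one sensor reports '1' under the 'OR' rule,
    assuming independent sensor decisions (illustrative assumption)."""
    all_zero = 1.0
    for p in p_local:
        all_zero *= 1.0 - p
    return 1.0 - all_zero


def sum_throughput(rates, band_pf):
    """Criterion 1: sum over bands of r_i * (1 - Pf_i)."""
    return sum(r * (1.0 - pf) for r, pf in zip(rates, band_pf))


def min_idle_probability(band_pf):
    """Criterion 2 (max-min): the smallest value of (1 - Pf_i) over the bands."""
    return min(1.0 - pf for pf in band_pf)


# Example with two bands and two sensors per band (made-up local false alarm rates).
band_pf = [or_rule([0.1, 0.1]), or_rule([0.2, 0.05])]
total = sum_throughput([1.0, 2.0], band_pf)
worst = min_idle_probability(band_pf)
```

The same `or_rule` helper applies to detection probabilities, with the per-sensor detection probabilities in place of the false alarm probabilities.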
The optimal sensor assignment and detection thresholds that maximize the sum throughput rate, for an ‘OR’ fusion rule, can be obtained by solving the following optimization problem:

$$\max_{\lambda_{ij},\, x_{ij}} \ \sum_{i=1}^{P} r_i (1 - P_{f_i}) = \sum_{i=1}^{P} r_i \prod_{j=1}^{PN} \left(1 - P_{f_{ij}}(\lambda_{ij})\right)^{x_{ij}} \tag{3.1}$$

$$\text{s.t.} \quad \sum_{i=1}^{P} c_i (1 - P_{d_i}) = \sum_{i=1}^{P} c_i \prod_{j=1}^{PN} \left(1 - P_{d_{ij}}(\lambda_{ij})\right)^{x_{ij}} < C \tag{3.2}$$

$$\prod_{j=1}^{PN} \left(1 - P_{d_{ij}}(\lambda_{ij})\right)^{x_{ij}} < \bar{P}_{m_i} \quad \forall i = 1 \text{ to } P \tag{3.3}$$

$$1 - \prod_{j=1}^{PN} \left(1 - P_{f_{ij}}(\lambda_{ij})\right)^{x_{ij}} < \bar{P}_{f_i} \quad \forall i = 1 \text{ to } P \tag{3.4}$$

$$\sum_{j=1}^{PN} x_{ij} = N \tag{3.5}$$

$$\sum_{i=1}^{P} x_{ij} = 1 \tag{3.6}$$

$$x_{ij} \in \{0, 1\} \tag{3.7}$$

where $\lambda_{ij}$ is the energy detection threshold at sensor $j$ in primary band $i$. $x_{ij} = 1$ indicates that sensor $j$ has been assigned to primary band $i$, and $x_{ij} = 0$ indicates otherwise. $P_{d_{ij}}$ and $P_{f_{ij}}$ denote the probability of detection of the primary signal and the probability of false alarm, respectively, for sensor $j$ in primary band $i$. $P_{d_i}$ and $P_{f_i}$ represent the probability of detection of the primary signal and the probability of false alarm, respectively, in primary band $i$. $r_i$ represents the data throughput rate available to a CR user in band $i$ when the primary user is absent. $c_i$ represents the cost to be paid to a primary user system if the CR system fails to detect the primary user signal in band $i$ and, as a result, causes interference to the primary user. $\bar{P}_{m_i}$ represents the maximum probability of misdetection that can be tolerated by the primary user system in band $i$. $\bar{P}_{f_i}$ represents the maximum probability of false alarm that can be tolerated in band $i$, in order to ensure a minimum opportunistic spectral utilization of the band.

As in [52], we assume that each primary band has a strict limit on the amount of interference that it can tolerate, which is represented by Eq. (3.3). Even within these interference limits, we assume that each primary system further imposes a cost on the CR system proportional to the interference caused due to misdetection of the primary signal. Parameter $C$ in Eq.
(3.2) denotes the maximum total cost of misdetection, summed over all the primary bands, which the CR system is willing to pay. At the same time, it is also necessary to provide a certain quality of service to the CR users operating in each primary band. Therefore, constraints are imposed on the probability of false alarm in each primary band as in Eq. (3.4). Eq. (3.5) denotes that $N$ sensors are assigned to each primary band. Eq. (3.6) specifies that a sensor can be assigned to only one primary band.

Alternatively, it might be of interest in certain systems to guarantee max-min fairness to the CR users. In this case, the aim of the sensor system is to maximize the minimum throughput rate available to the CR system among the various bands. The max-min optimization problem is given by

$$\max_{\lambda_{ij},\, x_{ij}} \ \min_{i} \ \prod_{j=1}^{PN} \left(1 - P_{f_{ij}}(\lambda_{ij})\right)^{x_{ij}} \tag{3.8}$$

given the constraints (3.2)-(3.7).

Let $\nu$ denote the number of signal samples taken by the energy detector at each sensor in each band. Let $|h_{ij}|$ represent the effective channel gain between the primary transmitter in band $i$ and sensor $j$, assuming that the transmitter transmits its signal at unit power. The CR sensors can estimate $|h_{ij}|$ by taking energy samples during the periods when the primary transmitters are known to be transmitting (for example, when they are transmitting pilot signals) [52]. We assume additive white Gaussian noise (AWGN) with variance $\sigma^2$ in each channel. For such a system, the probabilities of detection and false alarm at each sensor are given by [43]

$$P_{d_{ij}}(\lambda_{ij}) = \Pr\left( \chi^2_{\nu}\!\left( \frac{\nu |h_{ij}|^2}{\sigma^2} \right) > \lambda_{ij} \right) \tag{3.9}$$

$$P_{f_{ij}}(\lambda_{ij}) = \Pr\left( \chi^2_{\nu} > \lambda_{ij} \right) \tag{3.10}$$

where $\chi^2_{\nu}\!\left( \frac{\nu |h_{ij}|^2}{\sigma^2} \right)$ represents the non-central chi-square distribution with $\nu$ degrees of freedom and non-centrality parameter $\frac{\nu |h_{ij}|^2}{\sigma^2}$, and $\chi^2_{\nu}$ represents the central chi-square distribution with $\nu$ degrees of freedom.

Using the central limit theorem for large $\nu$, both the central and non-central chi-square distributions can be approximated with Gaussian distributions.
This yields the following approximations for the probabilities of detection and false alarm at the sensors [52]:

$$P_{d_{ij}}(\lambda_{ij}) \approx Q\left( \frac{\lambda_{ij} - 2\nu(\sigma^2 + |h_{ij}|^2)}{\sqrt{4\nu(\sigma^2 + 2|h_{ij}|^2)\sigma^2}} \right) \tag{3.11}$$

$$P_{f_{ij}}(\lambda_{ij}) \approx Q\left( \frac{\lambda_{ij} - 2\nu\sigma^2}{\sqrt{4\nu\sigma^4}} \right) \tag{3.12}$$

where $Q(\cdot)$ represents the Gaussian Q-function.

For the probability of detection $P_{d_{ij}}(\lambda_{ij})$ and false alarm $P_{f_{ij}}(\lambda_{ij})$ given in (3.11) and (3.12), respectively, (3.1) and (3.8) are mixed integer optimization problems and are highly complex to solve. In order to reduce the complexity, we separate the problem into two subproblems. We first propose schemes to assign the sensors to different primary user bands. Once the sensors are assigned to the various primary bands, we investigate efficient quantization schemes.

3.3 Sensor Assignment

In this section, we study two possible techniques that could be used to assign the sensors based on the channel gains between the various primary transmitters and the CR sensors, the costs $c_i$ of causing interference to the primary users, and the throughput rates $r_i$ available in the primary bands.

3.3.1 Maximum Weighted Sum Channel Gain Assignment

One method to assign sensors is to maximize the cost weighted sum of the channel gains between each primary user and the sensors assigned to that primary user. The sensor allocation problem in this case is as follows:

$$\max_{x_{ij}} \ \sum_{i=1}^{P} \sum_{j=1}^{PN} \frac{r_i}{c_i} |h_{ij}|^2 x_{ij} \tag{3.13}$$

$$\sum_{i=1}^{P} x_{ij} = 1 \tag{3.14}$$

$$\sum_{j=1}^{PN} x_{ij} = N \tag{3.15}$$

$$x_{ij} \in \{0, 1\} \tag{3.16}$$

The sensor allocation problem in (3.13)-(3.16) is well studied in the literature and can be optimally solved using the Munkres algorithm [46], which has a complexity of $O((PN)^3)$.

3.3.2 Max-Min Channel Gain Assignment

The maximum weighted sum channel gain assignment scheme can, however, lead to the assignment of all good sensors to one primary user and the assignment of sensors with weak channels to another. Thus, it might not offer a good trade-off between the probability of detection and the probability of false alarm in some of the primary bands.
Especially, in case of the max-min optimization criterion as in (3.8), assigning sensors such that the minimum weighted sum of the channel gains assigned to the various primary users is maximized could offer better performance. Such a max-min assignment problem can be formulated as follows:

$$\max_{x_{ij}} \ \min_{i} \ \sum_{j=1}^{PN} \frac{r_i}{c_i} |h_{ij}|^2 x_{ij} \tag{3.17}$$

given the constraints (3.14)-(3.16). The optimization problem in (3.17) is the max-min variant of the bottleneck assignment problem under categorization (which is a min-max assignment problem) and is strongly NP-hard [51]. Therefore, we propose a suboptimal greedy algorithm to solve (3.17). The proposed algorithm is a modification of the greedy algorithm discussed in [51], in which the min-max version of the problem in (3.17) was studied.

The greedy algorithm is described in Table 3.1. As seen from Table 3.1, the greedy algorithm assigns the sensors in serial order. During each iteration, the band with the lowest cost weighted sum of channel gains is selected and assigned the best sensor available to it. In case of a tie (i.e., if two or more primary bands have the same minimum weighted sum of channel gains at a particular iteration), the primary band with the maximum available sensor channel gain among the remaining unallocated sensors is chosen and the corresponding sensor is assigned to it. The greedy algorithm is suboptimal but has a much lower complexity of $O(P^2 N \log(PN))$ [51], compared to the optimal max-min assignment algorithm based on exhaustive search. During each iteration, the greedy algorithm attempts to maximize the minimum weighted sum channel gain value assigned to each primary band. This should intuitively lead to a solution close to the optimal solution.
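The greedy procedure described above (and listed in Table 3.1) can be sketched in a few lines. This is a minimal illustration, not the thesis's implementation: the per-band weight applied to the channel gains is passed in as a generic list `weight`, and the function name and the `gain[i][j]` layout (holding $|h_{ij}|^2$) are assumptions made for the example. The sketch favors clarity over the complexity bound quoted in the text.

```python
def greedy_max_min_assignment(gain, weight, n_per_band):
    """Assign each of the P * n_per_band sensors to one band.

    gain[i][j]: squared channel gain between band i's transmitter and sensor j.
    weight[i]:  per-band weight applied to the sum of assigned gains.
    Returns assigned_to, where assigned_to[j] is the band index of sensor j.
    """
    num_bands = len(gain)
    num_sensors = len(gain[0])
    if num_sensors != num_bands * n_per_band:
        raise ValueError("number of sensors must equal P * N")
    assigned_to = [None] * num_sensors
    band_sum = [0.0] * num_bands
    band_count = [0] * num_bands
    for _ in range(num_sensors):
        open_bands = [i for i in range(num_bands) if band_count[i] < n_per_band]
        free = [j for j in range(num_sensors) if assigned_to[j] is None]
        # Step 1a: band(s) with the lowest weighted sum of assigned gains.
        lowest = min(weight[i] * band_sum[i] for i in open_bands)
        tied = [i for i in open_bands if weight[i] * band_sum[i] == lowest]
        # Tie-break: the band whose best unallocated sensor has the largest gain.
        i_hat = max(tied, key=lambda i: max(gain[i][j] for j in free))
        # Step 1b: give that band its best unallocated sensor.
        j_hat = max(free, key=lambda j: gain[i_hat][j])
        assigned_to[j_hat] = i_hat
        band_sum[i_hat] += gain[i_hat][j_hat]
        band_count[i_hat] += 1
    return assigned_to


# Example: P = 2 bands, N = 2 sensors per band, equal weights (made-up gains).
example = greedy_max_min_assignment(
    gain=[[5.0, 1.0, 4.0, 2.0], [3.0, 6.0, 1.0, 2.0]],
    weight=[1.0, 1.0],
    n_per_band=2,
)
```

In the example, both bands start tied at zero, so the second band (whose best sensor has gain 6) is served first; the first band then receives the next two picks until it is full, and the last sensor goes to the second band.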
Table 3.1: Greedy algorithm

Step 0 (Initialization):
  $l = 0$
  $\hat{x}_{ij} = 0 \;\; \forall i = 1$ to $P$ and $j = 1$ to $PN$

Step 1 (Assignment):
  $l = l + 1$
  $\hat{i} = \arg\min_{i \in R_l^p} \sum_j \frac{r_i}{c_i} |h_{ij}|^2 \hat{x}_{ij}$, where $R_l^p = \{ i : \sum_j \hat{x}_{ij} < N \}$
  $\hat{j} = \arg\max_{j \in R_l^n} |h_{\hat{i}j}|^2$, where $R_l^n = \{ j : \sum_i \hat{x}_{ij} = 0 \}$
  $\hat{x}_{\hat{i}\hat{j}} = 1$ ($\hat{x}_{ij} = 1$ implies that sensor $j$ has been assigned to primary band $i$)

Step 2 (Finish):
  Stop if $l = PN$; else go to Step 1

3.4 Quantization Thresholds

Once the sensors are assigned to each primary band, we optimize the quantization thresholds at each sensor. In general, the optimal thresholds are not equal even in the case of a single primary user with equal channel gains $|h_{ij}|$ at all sensors [65]. However, it has been shown in the literature that equal thresholds are asymptotically optimal by Neyman-Pearson or Bayesian criteria [47, 65, 69] as the number of sensors goes to infinity. Therefore, in order to reduce the complexity of the algorithm, we assume that all the sensors assigned to a single primary band use equal energy detection thresholds (i.e., $\lambda_{ij} = \lambda_i, \forall j$).

Let $S_i$ represent the set of sensors assigned to primary band $i$. Assuming the ‘OR’ fusion rule, the optimal thresholds can be determined by solving the following problem:

\[ \max_{\lambda_i} \sum_{i=1}^{P} r_i \left( 1 - Q\left( \frac{\lambda_i - 2\nu\sigma^2}{\sqrt{4\nu\sigma^4}} \right) \right)^N \tag{3.18} \]

\[ \text{s.t.} \quad \sum_{i=1}^{P} c_i \prod_{j \in S_i} \left( 1 - Q\left( \frac{\lambda_i - 2\nu(\sigma^2 + |h_{ij}|^2)}{\sqrt{4\nu(\sigma^2 + 2|h_{ij}|^2)\sigma^2}} \right) \right) < C \tag{3.19} \]

\[ \prod_{j \in S_i} \left( 1 - Q\left( \frac{\lambda_i - 2\nu(\sigma^2 + |h_{ij}|^2)}{\sqrt{4\nu(\sigma^2 + 2|h_{ij}|^2)\sigma^2}} \right) \right) < \bar{P}_{m_i} \quad \forall i \tag{3.20} \]

\[ 1 - \left( 1 - Q\left( \frac{\lambda_i - 2\nu\sigma^2}{\sqrt{4\nu\sigma^4}} \right) \right)^N < \bar{P}_{f_i} \quad \forall i \tag{3.21} \]

In the rest of this chapter, we use the following notation for convenience:

\[ \alpha_i = \frac{\lambda_i - 2\nu\sigma^2}{\sqrt{4\nu\sigma^4}} \tag{3.22} \]

\[ \beta_{ij} = \frac{\lambda_i - 2\nu(\sigma^2 + |h_{ij}|^2)}{\sqrt{4\nu(\sigma^2 + 2|h_{ij}|^2)\sigma^2}} \tag{3.23} \]

Constraint (3.21) is a linear constraint. We show in Appendix A that the objective function in (3.18) is concave for values of $\alpha_i$ satisfying (from Eq. (A.6))

\[ \frac{\sum_{j \in S_i} \alpha_i}{N} \geq \bar{x}^{(1,N)} \tag{3.24} \]

where the values of $\bar{x}^{(1,N)}$ are shown in Table 3.2.
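The ‘OR’ rule column of Table 3.2 can be checked numerically. The sketch below does not reproduce the Appendix A derivation; it assumes that $\bar{x}^{(1,N)}$ is the point at which $(1 - Q(x))^N$ switches from convex to concave, which leads to the condition $(N-1)\varphi(x) = x(1 - Q(x))$, with $\varphi$ the standard normal density. The function names are illustrative only.

```python
import math


def phi(x):
    """Standard normal probability density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def q_function(x):
    """Gaussian Q-function: Q(x) = P(Z > x) for a standard normal Z."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def x_bar_or_rule(n, tol=1e-10):
    """Bisection for the root of (n - 1) * phi(x) - x * (1 - Q(x)) on [0, 10].

    Assumption: this root is the inflection point of (1 - Q(x))**n,
    i.e. the threshold above which the 'OR' rule objective is concave.
    """
    def f(x):
        return (n - 1) * phi(x) - x * (1.0 - q_function(x))

    lo, hi = 0.0, 10.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def pf_max_or_rule(n):
    """False alarm probability of the 'OR' rule evaluated at x_bar_or_rule(n)."""
    x = x_bar_or_rule(n)
    return 1.0 - (1.0 - q_function(x)) ** n


for n in (1, 2, 5, 10):
    print(n, round(x_bar_or_rule(n), 2), round(pf_max_or_rule(n), 4))
```

Under this assumption, N = 2 gives a threshold of about 0.51 and a false alarm probability of about 0.519, in line with the first-column entries of Tables 3.2 and 3.3.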
We also show in Appendix A that condition (3.24) holds for all values of $\alpha_i$ for which the probability of false alarm in primary band $i$, $P_{f_i}$, is less than or equal to $P_{f_{\max}}^{(1,N)}$ (from Eq. (A.8)). Therefore, for $\bar{P}_{f_i} \leq P_{f_{\max}}^{(1,N)}$, the objective function in (3.18) is concave over the set of $\lambda_i$ values satisfying the constraint (3.21). The values of $P_{f_{\max}}^{(1,N)}$ are shown in Table 3.3. As seen from Table 3.3, the values of $P_{f_{\max}}^{(1,N)}$ are at least 0.5 for all values of $N$ and thus, the constraint in (3.24) is very reasonable for practical CR systems.

Table 3.2: Values of $\bar{x}^{(k,N)}$ at different values of $k$ and $N$ (rows: $N$; columns: $k$; $k = 1$ is the ‘OR’ rule)

  N      1      2      3      4      5      6      7      8      9     10
  1      0
  2   0.51  -0.51
  3   0.77      0  -0.77
  4   0.94   0.28  -0.28  -0.94
  5   1.06   0.47      0  -0.47  -1.06
  6   1.16   0.61   0.19  -0.19  -0.61  -1.16
  7   1.24   0.72   0.34      0  -0.34  -0.72  -1.24
  8   1.31   0.81   0.45   0.15  -0.15  -0.45  -0.81  -1.31
  9   1.37   0.89   0.55   0.26      0  -0.26  -0.55  -0.89  -1.37
 10   1.42   0.95   0.63   0.36   0.12  -0.12  -0.36  -0.63  -0.95  -1.42

However, constraint (3.19) is not guaranteed to be convex in general. The necessary conditions to guarantee convexity are very complex to derive and need not hold for all values of $\lambda_i$ satisfying (3.19) and (3.20). Thus, this problem is, in general, non-convex and cannot be solved to obtain a unique optimal solution [7].

In this chapter, we propose a suboptimal solution to the optimization problem in (3.18)-(3.21) by solving a convex restriction of the original optimization problem. We show in Appendix B that $Q(x)$ is a log-concave function.
Table 3.3: Values of $P_{f_{\max}}^{(k,N)}$ at different values of $k$ and $N$ (rows: $N$; columns: $k$; $k = 1$ is the ‘OR’ rule)

  N       1       2       3       4       5       6       7       8       9      10
  1     0.5
  2  0.5189  0.4811
  3  0.5292     0.5  0.4708
  4  0.5360  0.5087  0.4913  0.4640
  5  0.5410  0.5142     0.5  0.4858  0.4590
  6  0.5449  0.5181  0.5052  0.4948  0.4819  0.4551
  7  0.5481  0.5212  0.5088     0.5  0.4912  0.4788  0.4519
  8  0.5508  0.5238  0.5116  0.5036  0.4964  0.4884  0.4762  0.4492
  9  0.5530  0.5259  0.5139  0.5062     0.5  0.4938  0.4861  0.4741  0.4470
 10  0.5550  0.5277  0.5158  0.5083  0.5026  0.4974  0.4917  0.4842  0.4723  0.4450

Therefore, $1 - Q(x) = Q(-x)$ is also log-concave. Thus, we have

\[ \frac{1}{N} \sum_{j \in S_i} \log\left( 1 - Q(\beta_{ij}) \right) \leq \log\left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right) \implies \prod_{j \in S_i} \left( 1 - Q(\beta_{ij}) \right) \leq \left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right)^N \tag{3.25} \]

with equality holding when the channel gains of the sensors assigned to primary band $i$ are all equal. We show in Appendix A (see Eq. (A.7)) that $1 - \left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right)^N$ is concave and thus, $\left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right)^N$ is convex for

\[ \frac{\sum_{j \in S_i} \beta_{ij}}{N} < \bar{x}^{(1,N)} \tag{3.26} \]

For $\beta_{ij}$ satisfying (3.26), $1 - \left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right)^N$ is greater than $P_{f_{\max}}^{(1,N)}$ (from (A.9)) and hence, $\left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right)^N$ is less than $1 - P_{f_{\max}}^{(1,N)}$. As seen from Table 3.3, the values of $1 - P_{f_{\max}}^{(1,N)}$ are above 0.44 for $N \leq 10$ (in [43], it was shown that most of the gain through cooperation is achieved by using about 10 to 20 sensors). Thus, for practical CR systems, it would be reasonable to assume that $\bar{P}_{m_i}$ is less than $1 - P_{f_{\max}}^{(1,N)}$. Therefore, all the values of $\beta_{ij}$ for which $\left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right)^N < \bar{P}_{m_i}$ satisfy the constraint (3.26) and hence, $\left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right)^N$ is convex at those values of $\beta_{ij}$.

Since $\alpha_i$ and $\beta_{ij}$ are linear functions of $\lambda_i$, constraints (3.24) and (3.26) are satisfied under the following linear constraint on $\lambda_i$:

\[ \sqrt{4\nu\sigma^4}\, \bar{x}^{(1,N)} + 2\nu\sigma^2 < \lambda_i < \frac{N \bar{x}^{(1,N)} + \sum_{j \in S_i} \frac{2\nu(\sigma^2 + |h_{ij}|^2)}{\sqrt{4\nu(\sigma^2 + 2|h_{ij}|^2)\sigma^2}}}{\sum_{j \in S_i} \frac{1}{\sqrt{4\nu(\sigma^2 + 2|h_{ij}|^2)\sigma^2}}} \tag{3.27} \]

Thus, we obtain the following restricted convex optimization problem:

\[ \max_{\lambda_i} \sum_{i=1}^{P} r_i \left( 1 - Q(\alpha_i) \right)^N \tag{3.28} \]

\[ \text{s.t.} \quad \sum_{i=1}^{P} c_i \left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right)^N < C \tag{3.29} \]

\[ \left( 1 - Q\left( \frac{\sum_{j \in S_i} \beta_{ij}}{N} \right) \right)^N < \bar{P}_{m_i} \quad \forall i = 1 \text{ to } P \tag{3.30} \]

\[ 1 - \left( 1 - Q(\alpha_i) \right)^N < \bar{P}_{f_i} \quad \forall i = 1 \text{ to } P \tag{3.31} \]

Problem (3.28)-(3.31) is a convex restriction of problem (3.18)-(3.21), since the constraints (3.29) and (3.30) are more restrictive on the values of $\lambda_i$ than (3.19) and (3.20), respectively. Nevertheless, (3.28)-(3.31) is a convex optimization problem as long as $\bar{P}_{f_i} \leq P_{f_{\max}}^{(1,N)}$ and $\bar{P}_{m_i} \leq 1 - P_{f_{\max}}^{(1,N)}$, since the corresponding solution set would always satisfy the linear constraint (3.27), for which the objective function in (3.28) is concave, constraint (3.29) is convex, and constraints (3.30) and (3.31) are linear. As discussed earlier, these restrictions on $\bar{P}_{f_i}$ and $\bar{P}_{m_i}$ are very reasonable in practical systems.

The solution of the suboptimal problem (3.28)-(3.31) forms a lower bound on the solution of the original optimization problem in (3.18)-(3.21) [7]. The suboptimal solution is equal to the optimal solution if, in each primary band, all the sensors have equal channel gains from the primary transmitter. The complexity of the suboptimal solution will, in general, be lower than that of the optimum solution.

The suboptimal solution can be further simplified by solving (3.28)-(3.31) assuming that all the sensors assigned to the various primary bands are allocated equal thresholds (i.e., $\lambda_1 = \lambda_2 = \ldots = \lambda_P = \lambda$). This would reduce the complexity further, since the number of optimization variables is reduced from $P$ to 1.

3.4.1 Max-Min Optimization

If the aim of the threshold allocation algorithm is to maximize the minimum throughput rate available among the various primary bands, the optimization problem after sensor assignment is as follows:

\[ \max_{\lambda_i} \min_{i} \; r_i \left( 1 - Q(\alpha_i) \right)^N \tag{3.32} \]

given constraints (3.19), (3.20) and (3.21). The max-min optimization in (3.32) can be reformulated by introducing a new variable $\gamma$ as follows [50]:

\[ \min_{\gamma > 0,\, \lambda_i} \; -\gamma \tag{3.33} \]

\[ \text{s.t.} \quad \gamma - r_i \left( 1 - Q(\alpha_i) \right)^N < 0 \quad \forall i = 1 \text{ to } P \tag{3.34} \]

given the constraints (3.19), (3.20) and (3.21).
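Inequality (3.25), on which the restricted constraints (3.29) and (3.30) rest, can be illustrated numerically. The following sketch compares the exact ‘OR’ rule misdetection probability of one band with its restricted upper bound; the parameter values in the example are arbitrary and the function names are illustrative, not part of the thesis.

```python
import math


def q_function(x):
    """Gaussian Q-function: Q(x) = P(Z > x) for a standard normal Z."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def beta(lam, nu, sigma2, gain):
    """beta_ij of (3.23) for threshold lam, noise variance sigma2, gain |h_ij|^2."""
    mean = 2.0 * nu * (sigma2 + gain)
    std = math.sqrt(4.0 * nu * (sigma2 + 2.0 * gain) * sigma2)
    return (lam - mean) / std


def miss_probabilities(lam, nu, sigma2, gains):
    """Return (exact, restricted) 'OR' rule misdetection probabilities.

    exact      : product over sensors of (1 - Q(beta_ij)), as in (3.20)
    restricted : (1 - Q(mean of beta_ij))**N, as in (3.30)
    """
    betas = [beta(lam, nu, sigma2, g) for g in gains]
    exact = math.prod(1.0 - q_function(b) for b in betas)
    mean_beta = sum(betas) / len(betas)
    restricted = (1.0 - q_function(mean_beta)) ** len(betas)
    return exact, restricted


# Arbitrary illustration: three sensors with unequal gains.
exact, restricted = miss_probabilities(lam=25.0, nu=10, sigma2=1.0,
                                       gains=[0.2, 0.5, 1.0])
print(exact, restricted)  # exact never exceeds restricted

# With equal gains the two quantities coincide, matching the equality case.
print(miss_probabilities(lam=25.0, nu=10, sigma2=1.0, gains=[0.5, 0.5, 0.5]))
```

Because the restricted value overestimates the misdetection probability, any threshold that satisfies (3.29) and (3.30) also satisfies (3.19) and (3.20), which is why the restricted problem is conservative.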
Using the convex restriction techniques proposed earlier in this section, a suboptimal solution can be obtained by solving a convex restriction of the original problem in (3.32), as long as the constraint (3.27) is valid.

3.5 General k-out-of-N Fusion Rule

In this section, we extend the results obtained in the previous section to the case when a general k-out-of-N fusion rule is used by the access point in each primary band. The threshold optimization problem after sensor assignment for the k-out-of-N fusion rule is given by

\[ \max_{\lambda_i} \sum_{i=1}^{P} r_i \left( 1 - \sum_{r=k}^{N} \binom{N}{r} Q(\alpha_i)^r \left( 1 - Q(\alpha_i) \right)^{N-r} \right) \tag{3.35} \]

\[ \text{s.t.} \quad \sum_{i=1}^{P} c_i \left( 1 - \sum_{r=k}^{N} \sum_{s \in S_{ir}} \prod_{\substack{j \in s \\ j' \in S_i - s}} Q(\beta_{ij}) \left( 1 - Q(\beta_{ij'}) \right) \right) < C \tag{3.36} \]

\[ 1 - \sum_{r=k}^{N} \sum_{s \in S_{ir}} \prod_{\substack{j \in s \\ j' \in S_i - s}} Q(\beta_{ij}) \left( 1 - Q(\beta_{ij'}) \right) < \bar{P}_{m_i} \quad \forall i \tag{3.37} \]

\[ \sum_{r=k}^{N} \binom{N}{r} Q(\alpha_i)^r \left( 1 - Q(\alpha_i) \right)^{N-r} < \bar{P}_{f_i} \quad \forall i \tag{3.38} \]

where $S_{ir}$ represents the set of all combinations of size $r$ among the users assigned to primary band $i$, and $S_i - s$ represents the set of users in $S_i$ but not in $s$. In Appendix A, we show that the objective function in (3.35) is concave and (3.38) is a convex constraint as long as

\[ \frac{\sum_{j \in S_i} \alpha_i}{N} \geq \bar{x}^{(k,N)} \tag{3.39} \]

Values of $\bar{x}^{(k,N)}$ are given in Table 3.2. In Appendix A, we show that for the values of $\alpha_i$ satisfying (3.39), the probability of false alarm $P_{f_i}$ in each band is less than $P_{f_{\max}}^{(k,N)}$, whose values are given in Table 3.3. As seen from Table 3.3, the values of $P_{f_{\max}}^{(k,N)}$ are greater than 0.44 for values of $N$ less than or equal to 10. Therefore, the constraint in (3.39) should be reasonable for most CR systems.

The log-concavity of the Q-function, used in the case of the ‘OR’ fusion rule to obtain a convex restricted problem, cannot be used for $k > 1$. We instead take the performance of the system when all sensors assigned to a primary band have a channel gain equal to the worst among them. Thus, the suboptimal solution is obtained by solving the following problem:

\[ \max_{\lambda_i} \sum_{i=1}^{P} r_i \left( 1 - \sum_{r=k}^{N} \binom{N}{r} Q(\alpha_i)^r \left( 1 - Q(\alpha_i) \right)^{N-r} \right) \tag{3.40} \]

\[ \text{s.t.} \quad \sum_{i=1}^{P} c_i \left( 1 - \sum_{r=k}^{N} \binom{N}{r} Q(\beta_i^w)^r \left( 1 - Q(\beta_i^w) \right)^{N-r} \right) < C \tag{3.41} \]

\[ 1 - \sum_{r=k}^{N} \binom{N}{r} Q(\beta_i^w)^r \left( 1 - Q(\beta_i^w) \right)^{N-r} < \bar{P}_{m_i} \quad \forall i \tag{3.42} \]

\[ \sum_{r=k}^{N} \binom{N}{r} Q(\alpha_i)^r \left( 1 - Q(\alpha_i) \right)^{N-r} < \bar{P}_{f_i} \quad \forall i \tag{3.43} \]

where

\[ \beta_i^w = \frac{\lambda_i - 2\nu(\sigma^2 + \min_{j \in S_i} |h_{ij}|^2)}{\sqrt{4\nu(\sigma^2 + 2\min_{j \in S_i} |h_{ij}|^2)\sigma^2}} \tag{3.44} \]

Problem (3.40)-(3.43) is a convex optimization problem as long as $\beta_i^w$ satisfies the following constraint:

\[ \beta_i^w < \bar{x}^{(k,N)} \tag{3.45} \]

We show in Appendix A that constraint (3.45) is satisfied as long as the system in which the channel gains of all the sensors allocated to primary user band $i$ are equal to the worst among them has a probability of misdetection less than $1 - P_{f_{\max}}^{(k,N)}$. As can be seen from Table 3.3, the values of $1 - P_{f_{\max}}^{(k,N)}$ lie between 0.44 and 0.56 for values of $N$ less than or equal to 10.

3.6 Simulation Results

In Fig. 3.1, we consider a system with $L = 20$ CR sensors operating in $P = 4$ primary bands. Each primary band is assigned $N = 5$ CR sensors to detect the presence of a primary user in that band. We assume that the channels from the primary transmitters to the sensors undergo independent and identically distributed log-normal shadowing and small-scale Rayleigh fading. The mean signal to noise ratio (SNR) due to path loss at the sensors is assumed to be −3 dB in each primary band. The variance of log-normal shadowing between each primary user and sensor is 4 dB. The throughput rates $r_i$ available in the primary bands are randomly distributed between 1 Mbps and 2 Mbps. $c = [c_1, c_2, c_3, c_4] = [0.1, 0.2, 0.3, 0.4]$ is used as the cost vector. The maximum allowed probabilities of misdetection $\bar{P}_{m_i}$ and false alarm $\bar{P}_{f_i}$ are chosen as 0.1 and 0.4, respectively, in all primary bands. The access point uses the ‘OR’ (1-out-of-5) fusion rule in each primary band.

We study the performance of various sensor allocation and quantization schemes to solve the sum throughput rate optimization problem in (3.1). Sensors are assigned using the algorithms described in Section 3.3.
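A channel realization for a setup of this kind could be drawn as in the sketch below. It is not the simulation code behind the figures and relies on several assumptions that the text does not state: the noise power is normalized to one so that the returned gains are SNR values, the quoted 4 dB is treated as the standard deviation of the shadowing in dB, and the Rayleigh fading power is modeled as a unit-mean exponential variable. The function name and its parameters are hypothetical.

```python
import random


def simulate_channel_gains(num_bands, num_sensors, mean_snr_db=-3.0,
                           shadow_std_db=4.0, seed=0):
    """Draw |h_ij|^2 for every (band, sensor) pair.

    Assumptions (not taken from the text): unit noise power, shadowing that
    is Gaussian in dB with standard deviation shadow_std_db, and Rayleigh
    fading whose power is exponential with unit mean.
    """
    rng = random.Random(seed)  # private generator for reproducibility
    gains = []
    for _ in range(num_bands):
        row = []
        for _ in range(num_sensors):
            shadow_db = rng.gauss(0.0, shadow_std_db)
            local_mean = 10.0 ** ((mean_snr_db + shadow_db) / 10.0)
            fading_power = rng.expovariate(1.0)
            row.append(local_mean * fading_power)
        gains.append(row)
    return gains


# P = 4 bands and L = 20 sensors, as in the setup described above.
example_gains = simulate_channel_gains(num_bands=4, num_sensors=20)
print(len(example_gains), len(example_gains[0]))
```

Averaging a performance metric over many such draws with different seeds would give curves of the kind reported in this section.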
A close to optimal solution to the original non-convex optimization problem in (3.18)-(3.21) is obtained by using convex optimization algorithms starting from different initial points. In the figure, we refer to this solution as optimal, even though it is not possible to determine the optimal point since the problem is non-convex. We also present the suboptimal solutions obtained by solving the convex restriction problem in (3.28)-(3.31). In the figure, we refer to it as the suboptimal solution. We further present the performance of the suboptimal solution obtained by solving the convex restricted optimization problem (3.28)-(3.31), assuming equal thresholds at all the sensors operating in all primary bands.

From Fig. 3.1, we see that the performance of the max-min sensor assignment scheme is, in general, better than that of the maximum weighted sum channel gain assignment scheme. This is because the maximum weighted sum channel gain scheme might lead to assignment of sensors with low channel gains to certain primary bands, thus reducing the trade-off available between the probability of detection and the probability of false alarm in those primary bands. We also notice that the suboptimal solution obtained by solving the restricted convex problem is close to the optimal solution, especially at higher cost threshold $C$. We observe that using different thresholds for each band leads to much better performance compared to using equal thresholds in all the primary bands. We also compare the performance in the case when no quantization is used.

In Fig. 3.2, we consider the same system as considered in Fig. 3.1. However, the optimization criterion is to maximize the minimum throughput rate available among the various primary bands, as described in (3.8). We see from Fig. 3.2 that the max-min sensor assignment scheme significantly outperforms the maximum weighted sum channel gain assignment.
We notice that the restricted convex optimization problem performs close to the original optimization problem at higher values of the threshold $C$. We further observe that using different thresholds for different primary bands leads to better performance.

[Figure 3.1: Sum throughput rate of the CR system using ‘OR’ fusion rule for different sensor allocation and quantization schemes. Horizontal axis: cost threshold C (0.005 to 0.055); vertical axis: sum throughput rate of the CR system (in Mbps). Curves: optimal, suboptimal, and suboptimal with equal thresholds, each for max weighted sum channel gain assignment and greedy max-min assignment.]

[Figure 3.2: Min throughput rate among various bands using ‘OR’ fusion rule for different sensor allocation and quantization schemes. Horizontal axis: cost threshold C (0.005 to 0.055); vertical axis: minimum throughput rate (in Mbps). Curves: optimal, suboptimal, and suboptimal with equal thresholds, each for max weighted sum channel gain assignment and greedy max-min assignment.]

In Fig. 3.3 and Fig. 3.4, we consider the same system as in Fig. 3.1. However, ‘2’-out-of-‘5’ and ‘3’-out-of-‘5’ fusion rules are used at the access point in Fig. 3.3 and Fig. 3.4, respectively. We see that the greedy max-min sensor assignment still outperforms the weighted sum channel gain assignment. We see a slightly larger gap in the performance of the optimal and suboptimal schemes.
This is because the convex restriction in the case of the ‘2’-out-of-‘5’ and ‘3’-out-of-‘5’ fusion rules is less close to the original problem (since the channel gains of all sensors assigned to a primary band are replaced with that of the worst sensor among them) than the convex restriction obtained in the case of the ‘OR’ fusion rule.

In Fig. 3.5 and Fig. 3.6, we compare the performance of the greedy assignment algorithm to that of the exhaustive search algorithm, which optimally solves (3.17), for a system with $P = 2$ and $N = 4$, under the max-min optimization criterion in Fig. 3.5 and the sum throughput optimization criterion in Fig. 3.6. The ‘OR’ fusion rule is used in each primary band. The throughput rates $r_i$ in each band are randomly distributed between 1 Mbps and 2 Mbps. $c = [c_1, c_2] = [0.4, 0.6]$ is used as the cost vector. The maximum allowed probabilities of misdetection $\bar{P}_{m_i}$ and false alarm $\bar{P}_{f_i}$ are chosen as 0.1 and 0.4, respectively, in all primary bands. We see that the performance degradation due to the greedy algorithm, in the case of the max-min optimization criterion shown in Fig. 3.5, is marginal even though the complexity of the greedy algorithm is substantially lower than that of the optimal exhaustive search algorithm. Similarly, in the case of the sum throughput optimization criterion considered in Fig. 3.6, the performances of the greedy algorithm and the optimal max-min assignment algorithm are very close to each other.

3.7 Conclusions

In this chapter, we investigated cooperative sensing schemes to identify multiple primary signals operating in different spectrum bands.
[Figure 3.3: Sum throughput rate of the CR system for different sensor allocation and quantization schemes when ‘2’-out-of-‘5’ fusion rule is used at the access point in each primary band. Horizontal axis: cost threshold C (0.005 to 0.055); vertical axis: sum throughput rate of the CR system (in Mbps). Curves: optimal, suboptimal, and suboptimal with equal thresholds, each for max weighted sum channel gain assignment and greedy max-min assignment.]

[Figure 3.4: Sum throughput rate of the CR system for different sensor allocation and quantization schemes when ‘3’-out-of-‘5’ fusion rule is used at the access point in each primary band. Horizontal axis: cost threshold C (0.005 to 0.055); vertical axis: sum throughput rate of the CR system (in Mbps). Curves: optimal, suboptimal, and suboptimal with equal thresholds, each for max weighted sum channel gain assignment and greedy max-min assignment.]

[Figure 3.5: Comparison of the optimal and greedy max-min assignment algorithms for ‘OR’ fusion rule when maximizing the minimum throughput rate among various primary bands. Horizontal axis: cost threshold C (0.005 to 0.055); vertical axis: minimum throughput rate (in Mbps). Curves: optimal, suboptimal, and suboptimal with equal thresholds, each for optimal max-min assignment and greedy max-min assignment.]

[Figure 3.6: Comparison of the optimal and greedy max-min assignment algorithms for ‘OR’ fusion rule when maximizing the sum throughput rate of the CR system. Horizontal axis: cost threshold C (0.005 to 0.055); vertical axis: sum throughput rate of the CR system (in Mbps). Curves: optimal, suboptimal, and suboptimal with equal thresholds, each for optimal max-min assignment and greedy max-min assignment.]

We considered narrow-band CR sensors that can sense a single primary band during each sensing iteration. Further, the sensors quantize their sensing information using a single bit before sending it to the access point, due to bandwidth limitations of the reporting channels. We assumed that a certain data throughput rate is available to the CR system in each band when the primary user is absent. Further, it was assumed that each primary user has certain strict limitations on the interference that the CR system could cause due to misdetection of the primary signal. Moreover, even within these interference limits, each primary system imposes a cost on the CR system proportional to the interference caused. For such a system, we studied efficient sensor allocation and quantization schemes. We considered sum throughput rate optimization, in which the sum of the opportunistic throughput rates available to the CR users is maximized taking into account the cost, interference and QoS constraints. We also investigated the max-min rate optimization, in which the minimum rate available among the various primary bands is maximized. We initially considered the case when the ‘OR’ fusion rule is used at the access point in each primary band. The original problem, which jointly optimizes the sensor allocation and quantization thresholds, is a mixed integer optimization problem and is highly complex to solve.
Therefore, we found suboptimal solutions by solving the original problem in two steps: allocation of CR sensors to the various primary bands based on the channel gains between the CR sensors and primary transmitters, followed by determination of the quantization thresholds at the CR sensors. We considered various schemes that could be used to allocate CR sensors. A low complexity greedy algorithm to efficiently assign CR sensors to the various primary bands was proposed. After sensor allocation, we studied the optimal scheme to determine the quantization thresholds at the CR sensors, assuming equal quantization thresholds at all the sensors assigned to the same primary band. We showed that the optimal quantization, in general, involves solving a non-convex optimization problem. Therefore, we proposed a suboptimal convex restriction to the optimal problem using the log-concavity of the Q-function. We further studied quantization schemes when a general k-out-of-N fusion rule is used at the access point in each primary band.

Chapter 4

Conclusions and Future Research Directions

4.1 Conclusions

In this thesis, we studied a CR cooperative sensing system based on a parallel fusion architecture and using energy detection at the sensors. In the first part of this thesis, we investigated schemes to identify CR sensors reporting false high energy values even when the primary signal is not present, which leads to a decrease in the throughput rate of the CR system. We presented malicious user detection schemes that use outlier detection techniques based on robust statistics. The proposed malicious user detection schemes do not require knowledge of the primary transmitter location or knowledge of the additive noise variance. Assuming partial knowledge of the primary user activity, we proposed a novel method to improve the performance of the malicious user detection schemes.
We also proposed improved malicious user detection schemes assuming knowledge of the spatial location information of the CR sensors. The performance of the proposed schemes was analyzed via simulations for a cooperative sensing system using equal gain combining as the data fusion scheme at the access point.

We further considered CR systems operating in multiple primary bands. We investigated distributed detection and data fusion schemes to identify multiple primary signals operating in different spectrum bands. Considering narrow-band CR sensors that can sense a single primary band during each sensing iteration and single-bit quantization at the CR sensors, we studied efficient sensor allocation and quantization schemes. We considered sum throughput rate optimization, in which the sum of the opportunistic throughput rates available to the CR system in all the primary bands is maximized. We also investigated the max-min rate optimization, in which the minimum rate available among the various primary bands is maximized. Considering the ‘OR’ fusion rule at the access point in each primary band, we presented the optimization problem that jointly optimizes the sensor allocation and quantization thresholds. Joint sensor allocation and quantization is a mixed integer optimization problem and is NP-hard to solve. Therefore, we solved the problem in two steps. First, CR sensors were allocated to the various primary bands based on the channel gains between the CR sensors and primary transmitters. Then, the quantization thresholds were determined at the CR sensors. We considered various methods that could be used to allocate CR sensors. A low complexity greedy sensor allocation algorithm was proposed. After sensor allocation, assuming equal quantization thresholds at all the sensors assigned to the same primary band, we studied the optimal scheme to determine the quantization thresholds at the CR sensors.
We showed that the optimal quantization scheme, in general, involves solving a non-convex optimization problem and proposed a suboptimal convex restriction to the optimal problem. We further studied quantization schemes when a general k-out-of-N fusion rule is used at the access point in each primary band.

4.2 Future Research Directions

In this section, we propose possible research directions that can follow from this thesis.

4.2.1 Malicious User Detection

Malicious User Detection Techniques based on Further Information

In scenarios where CR networks have more information regarding the primary user system, such as the location of the primary user, the primary user spectral usage behavior, or the distribution of channel gains between the primary transmitter and the CR sensors, malicious user detection techniques that utilize this knowledge need to be investigated. Model based outlier detection techniques [4, 29] could be applied in such cases.

Single Bit Quantization

Even though the malicious user detection schemes discussed in Chapter 2 can be applied when the sensors quantize their data before sending it to the access point, they might not be efficient, especially when single-bit quantization is used at the sensors. Therefore, it would be interesting to investigate efficient schemes based on outlier detection techniques to detect malicious users in a CR cooperative sensing system using data quantization.

Game Theoretic Analysis

Game theory [20] has been applied for analyzing security threats in CR networks in the literature [72]. A game theoretic approach could be used to investigate various methods by which a group of malicious users can avoid detection by the malicious user detection schemes presented in Chapter 2. This can be helpful in further improving the performance of the malicious user detection schemes by devising algorithms to identify such malicious users.
4.2.2 Sensor Allocation and Quantization Schemes

Channel Estimation Errors and Reporting Channel Errors

In Chapter 3, for sensor allocation and determination of the quantization thresholds at the sensors, we assumed perfect knowledge of the channel gains between the primary transmitters and the CR users. However, there might be errors in the estimation of these channel gains. The effects of channel estimation errors on the performance of the sensor allocation and quantization schemes need to be analyzed. Further, the errors due to reporting channels between the CR sensors and the access point were assumed to be negligible in Chapter 3. In some CR systems, with deep fading between the sensors and the access point, this might not be the case. For such systems, the errors due to the reporting channels must be taken into consideration while allocating sensors and determining the quantization thresholds.

Non-Gaussian Noise

In this thesis, we assumed that the additive noise is white and Gaussian distributed. However, this assumption may not be true in general for CR systems due to the presence of interference from other secondary users [62]. The sensor allocation and quantization schemes obtained in this thesis could be extended to the case of colored non-Gaussian noise.

Bibliography

[1] O. Afolabi, K. Kim, and A. Ahmad. On secure spectrum sensing in cognitive radio networks using emitters electromagnetic signature. In Proceedings of 18th International Conference on Computer Communications and Networks (ICCCN), pages 1–5, 2009.
[2] C. Aggarwal and S. Yu. An effective and efficient algorithm for high-dimensional outlier detection. In The International Journal on Very Large Data Bases, volume 14, pages 211–221, 2005.
[3] G. Atia, E. Ermis, and V. Saligrama. Robust energy efficient cooperative spectrum sensing in cognitive radios. In IEEE/SP 14th Workshop on Statistical Signal Processing (SSP) 2007, pages 502–507, 2007.
[4] V. Barnett and T. Lewis. Outliers in Statistical Data. John Wiley & Sons, 3rd edition, 1994.
[5] R. S. Blum, S. A. Kassam, and H. V. Poor. Distributed detection with multiple sensors: Part II-advanced topics. In Proceedings of IEEE, volume 85, pages 64–79, Jan 1997.
[6] G. E. P. Box and D. R. Cox. An analysis of transformations. In Journal of Royal Statistical Society, volume B28, pages 211–252, 1964.
[7] S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge, U.K.: Cambridge Univ. Press, 2003.
[8] J. Branch, B. Szymanski, C. Giannella, R. Wolff, and H. Kargupta. In-network outlier detection in wireless sensor networks. In 26th IEEE International Conference on Distributed Computing Systems (ICDCS) 2006, pages 51–51, 2006.
[9] M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander. LOF: identifying density-based local outliers. In ACM SIGMOD, volume 29, pages 93–104, 2000.
[10] G. Brys, M. Hubert, and A. Struyf. A comparison of some new measures of skewness. In Developments in Robust Statistics, ICORS 2001, pages 98–113, 2001.
[11] G. Brys, M. Hubert, and A. Struyf. A robust measure of skewness. In Journal of Computational and Graphical Statistics, volume 13, pages 996–1017, 2004.
[12] J. L. Burbank. Security in cognitive radio networks: The required evolution in approaches to wireless network security. In Third International Conference on Cognitive Radio Oriented Wireless Networks and Communications (CrownCom), pages 1–7, 2008.
[13] D. Cabric, S. M. Mishra, and R. W. Brodersen. Implementation issues in spectrum sensing for cognitive radio. In Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004, volume 1, pages 772–776, Nov 2004.
[14] L. Chen, J. Wang, and S. Li. Cooperative spectrum sensing with multi-bits local sensing decisions in cognitive radio context. In IEEE Wireless Communications and Networking Conference (WCNC) 2008, pages 570–575, 2008.
[15] R. Chen, J.-M. Park, and K. Bian. Robust distributed spectrum sensing in cognitive radio networks. In The 27th IEEE Conference on Computer Communications (INFOCOM), pages 1876–1884, 2008.
[16] R. Chen, J.-M. Park, and J. Reed. Defense against PU emulation attacks in CR networks. In IEEE Journal on Selected Areas in Communications, volume 26, pages 25–37, Jan 2008.
[17] T. Clancy and N. Goergen. Security in cognitive radio networks: Threats and mitigation. In Third International Conference on Cognitive Radio Oriented Wireless Networks and Communications (CrownCom), pages 1–8, 2008.
[18] T. Do and B. L. Mark. Joint spatial-temporal spectrum sensing for cognitive radio networks. In 43rd Annual Conference on Information Sciences and Systems (CISS) 2009, pages 124–129, 2009.
[19] FCC. Spectrum policy task force report. In Technical Report 02-135, Nov 2002.
[20] D. Fudenberg and J. Tirole. Game Theory. MIT Press, 1st edition, 1991.
[21] G. Ganesan and Y. Li. Cooperative spectrum sensing in cognitive radio: Part I: two user networks. In IEEE Transactions on Wireless Communications, volume 6, pages 2204–2213, June 2007.
[22] G. Ganesan and Y. Li. Cooperative spectrum sensing in cognitive radio: Part II: multiuser networks. In IEEE Transactions on Wireless Communications, volume 6, pages 2214–2222, June 2007.
[23] G. Ganesan and Y. Li. Cooperative spectrum sensing in CR networks. In IEEE Conference on Dynamic Spectrum Access Networks (DYSPAN’05), pages 137–143, Nov 2005.
[24] W. A. Gardner. Cyclostationarity in Communications and Signal Processing. New Jersey, NY, USA: IEEE Press, 1993.
[25] A. Ghasemi and E. S. Sousa. Opportunistic spectrum access in fading channels through collaborative sensing. In Journal of Communications (JCM), volume 2, pages 71–82, March 2007.
[26] A. M. Gross. Confidence interval robustness with long-tailed symmetric distributions. In Journal of the American Statistical Association, volume 71, pages 409–416, June 1976.
[27] M. Gudmundson. Correlation model for shadow fading in mobile radio systems. In Electronics Letters, volume 27, pages 2145–2146, 1991.
[28] C. Guo, T. Peng, S. Xu, H. Wang, and W. Wang. Cooperative spectrum sensing with cluster-based architecture in cognitive radio networks. In Proceedings of IEEE Vehicular Technology Conference (VTC), Spring 2009, pages 1–5, April 2009.
[29] D. Hawkins. Identification of Outliers. Chapman and Hall, London, 1980.
[30] S. Haykin. Cognitive radio: Brain-empowered wireless communications. In IEEE Journal on Selected Areas in Communications, volume 23, pages 201–220, 2005.
[31] C. W. Helstrom. Gradient algorithm for quantization levels in distributed detection systems. In IEEE Transactions on Aerospace and Electronic Systems, volume 31, pages 390–398, Jan 1995.
[32] P. S. Horn. A biweight prediction interval for random samples. In Journal of the American Statistical Association, volume 83, pages 249–256, 1988.
[33] M. Hubert and E. Vandervieren. An adjusted boxplot for skewed distributions. In Computational Statistics and Data Analysis, volume 52, pages 5186–5201, 2008.
[34] P. Kaligineedi and V. K. Bhargava. Sensor allocation and quantization schemes for multi-band cognitive radio cooperative sensing system. In IEEE Transactions on Wireless Communications (Accepted).
[35] P. Kaligineedi and V. K. Bhargava. Distributed detection of primary signals in fading channels for cognitive radio networks. In Proceedings of IEEE Global Communications Conference (Globecom) 2008, pages 1–5, 2008.
[36] P. Kaligineedi, M. Khabbazian, and V. K. Bhargava. Malicious user detection for cognitive radio systems. In IEEE Transactions on Wireless Communications, volume 9, pages 2488–2497, August 2010.
[37] P. Kaligineedi, M. Khabbazian, and V. K. Bhargava. Secure cooperative sensing techniques for cognitive radio systems. In IEEE International Conference on Communications (ICC) 2008, pages 3406–3410, May 2008.
[38] D. Kazakos. New error bounds and optimum quantization for multisensor distributed signal detection. In IEEE Transactions on Communications, volume 40, pages 1144–1151, July 1992.
[39] E. M. Knorr and R. T. Ng. Algorithms for mining distance-based outliers in large datasets. In Proceedings of the 24th International Conference on Very Large Databases (VLDB), pages 392–403, 1998.
[40] D. A. Lax. Robust estimators of scale: Finite-sample performance in long-tailed symmetric distributions. In Journal of the American Statistical Association, volume 80, pages 736–741, Sept. 1985.
[41] M. Longo, T. D. Lookabaugh, and R. M. Gray. Quantization for decentralized hypothesis testing under communication constraints. In IEEE Transactions on Information Theory, volume 36, pages 241–255, March 1990.
[42] S. M. Mishra, R. Tandra, and A. Sahai. The case for multiband sensing. In Proceedings of the Allerton Conference on Communications, Control and Computing, 2007.
[43] S. M. Mishra, A. Sahai, and R. W. Brodersen. Cooperative sensing among CRs. In IEEE International Conference on Communications (ICC) 2006, pages 1658–1663, June 2006.
[44] J. Mitola. Software Radio Architecture. John Wiley & Sons, 2000.
[45] F. Mosteller and J. W. Tukey. Data Analysis and Regression: A Second Course in Statistics. Reading, MA: Addison-Wesley, 1978.
[46] J. Munkres. Algorithms for the assignment and transportation problems.
In Journal of the Society for Industrial and Applied Mathematics, volume 5, pages 32–38, March 1957. → pages 11, 54, 59 [47] R. Niu, P. K. Varshney, and Q. Cheng. Distributed detection in a large wireless sensor network. In International Journal on Information Fusion, volume 7, pages 380–394, 2006. → pages 12, 61 [48] T. Palpanas, D. Papadopoulos, V. Kalogeraki, and D. Gunopulos. Distributed deviation detection in sensor networks. In ACM SIGMOD, volume 32, pages 77–82, 2003. → pages 9 [49] D. W. Pentico. Assignment problems: A golden anniversary survey. In European Journal of Operational Research, volume 176, pages 774–793, 2008. → pages 11, 54 [50] K. T. Phan, L. B. Le, S. A. Vorobyov, and T. Le-Ngoc. Centralized and distributed power allocation in multi-user wireless relay networks. In Proceedings of IEEE International Conference on Communications (ICC) 2009, pages 1–5, 2009. → pages 67 [51] A. P. Punnen and Y. P. Aneja. Categorized assignment scheduling: a tabu search approach. In Journal on Operational Research Society, volume 44, pages 673–679, 1993. → pages 60 [52] Z. Quan, S. Cui, A. H. Sayed, and H. V. Poor. Optimal multiband joint detection for spectrum sensing in cognitive radio networks. In IEEE Transactions on Signal Processing, volume 57, pages 1128–1140, 2009. → pages 9, 10, 57, 58 [53] P. Ray and P. K. Varshney. Distributed detection in wireless sensor networks using dynamic sensor thresholds. In International Journal of Distributed Sensor Networks, volume 4, pages 5–12, 2010. → pages 10 [54] M. Sanna and M. Murroni. Optimization of non-convex multiband cooperative sensing with genetic algorithms. In IEEE Journal of Selected Topics in Signal Processing, 2010. → pages 10 [55] S. Saunders and A. Aragon-Zavala. Antennas and Propagation for Wireless Communication Systems. John Wiley & Sons., 2nd edition, 2007. → pages 16, 37 88 [56] N. S. Shankar, C. Cordeiro, and K. Challapali. Spectrum agile radios: utilization and sensing architectures. 
In IEEE Conference on Dynamic Spectrum Access Networks (DYSPAN’05), pages 160–169, Nov 2005. → pages 4 [57] B. Shen, T. Cui, K. Kwak, C. Zhao, and Z. Zhou. An optimal soft fusion scheme for cooperative spectrum sensing in cognitive radio network. In IEEE Wireless Communications and Networking Conference (WCNC), 2009, pages 1–5, 2009. → pages 12 [58] B. Sheng, Q. Li, W. Mao, and W. Jin. Outlier detection in sensor networks. In Proceedings of the 8th ACM international symposium on Mobile ad hoc networking and computing, pages 219–228, 2007. → pages 9 [59] A. Silberstein, K. Munagala, and J. Yang. Energy-efficient monitoring of extreme values in sensor networks. In Proceedings of the 2006 ACM SIGMOD international conference on Management of data, pages 169–180, 2006. → pages 9 [60] C. Stevenson, G. Chouinard, Z. Lei, W. Hu, S. Shellhammer, and W. Caldwell. Ieee 802.22: The first cognitive radio wireless regional area network standard. In IEEE Communications Magazine, volume 47, pages 130–138, Jan. 2009. → pages 2 [61] C. Sun, W. Zhang, and K. B. Lataief. Cluster-based cooperative spectrum sensing in cognitive radio systems. In Proceedings of IEEE International conference on Communications (ICC) 2007, pages 2511–2515, June 2007. → pages 10 [62] R. Tandra and A. Sahai. Snr walls for signal detection. In IEEE Journal on Selected Topics in Signal Processing, volume 2, pages 4–17, Feb 2008. → pages 3, 83 [63] R. Tandra, A. Sahai, and S. M. Mishra. What is a spectrum hole and what does it take to recognize one? In Proceedings of the IEEE, volume 97, pages 822–848, 2009. → pages 2, 3, 4 [64] S. Thomopoulos, R. Viswanathan, and D. Bougoulias. Optimal distributed decision fusion. In IEEE Transactions on Aerospace and Electronic Systems, volume 25, pages 761–765, Sep. 1989. → pages 11 [65] J. N. Tsitsiklis. Decentralized detection by a large number of sensors. In Mathematics of Control, Signals and Systems, volume 1, pages 167–182, 1988. → pages 11, 61 [66] J. 
Unnikrishnan and V. Veeravalli. Cooperative sensing for primary detection in cognitive radios. In IEEE Journal on Selected Topics in Signal Processing, volume 2, pages 18–27, Feb 2008. → pages 4, 5 89 [67] H. Urkowitz. Energy detection of unknown deterministic signals. In Proceedings of IEEE, volume 55, pages 523–531, April 1967. → pages 2, 16 [68] O. van den Biggelaar, J.-M. Dricot, P. De Doncker, and F. Horlin. Quantization and transmission of the energy measures for cooperative spectrum sensing. In IEEE 71st Vehicular Technology Conference (VTC 2010-Spring), pages 1–5, 2010. → pages 12 [69] P. K. Varshney. Distributed Detection and Data fusion. New York: Springer-Verlag, 1996. → pages 11, 61 [70] V. V. Veeravalli. Sequential decision fusion: theory and applications. In Journal of the Franklin Institute, volume 336, pages 301–322, 1999. → pages 5, 11 [71] F. E. Visser, G. M. Janssen, and P. Paweczak. Multinode spectrum sensing based on energy detection for dynamic spectrum access. In IEEE Vehicular Technology Conference, pages 1394–1398, May 2008. → pages 35 [72] B. Wang, Y. Wu, Z. Ji, K. J. R. Liu, and T. C. Clancy. Game theoretical mechanism design methods: suppressing cheating in cognitive radio networks. In IEEE Signal Processing Magazine, volume 25, pages 74–84, 2008. → pages 82 [73] H. Wang, L. Lightfoot, and T. Li. On phy-layer security of cognitive radio: Collaborative sensing under malicious attacks. In 44th Annual Conference on Information Sciences and Systems (CISS), pages 1–6, 2010. → pages 8 [74] W. Wang, H. Ki, Y. Sun, and Z. Han. Securing collaborative spectrum sensing against untrustworthy secondary users in cognitive radio networks. In EURASIP Journal on Advances in Signal Processing, pages 1–15, 2010. → pages 8 [75] F. R. Yu, H. Tang, M. Huang, Z. Li, and P. C. Mason. Defense against spectrum sensing data falsification attacks in mobile ad hoc networks with cognitive radios. In IEEE Military Communications Conference (MILCOM), pages 1–7, 2009. 
→ pages 8
[76] K. Zeng, P. Pawełczak, and D. Cabric. Reputation-based cooperative spectrum sensing with trusted nodes assistance. In IEEE Communications Letters, volume 14, pages 226–228, March 2010. → pages 8
[77] Y. Zeng and Y.-C. Liang. Maximum-minimum eigenvalue detection for cognitive radio. In Proceedings of the IEEE 18th International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), pages 1–15, Sept. 2007. → pages 3, 8
[78] Y. Zhang, G. Xu, and X. Geng. Security threats in cognitive radio networks. In 10th IEEE International Conference on High Performance Computing and Communications (HPCC), pages 1036–1041, Sept. 2008. → pages 6

Appendix A
Convexity Conditions for the Objective Functions (3.18) and (3.35)

Consider the function
$$P_f^{(k,N)}(x) = \sum_{r=k}^{N} \binom{N}{r} Q(x)^r (1-Q(x))^{N-r}.$$
$P_f^{(k,N)}(\alpha_i)$ represents the probability of false alarm $P_{fi}$ when a k-out-of-N fusion rule is used at the access point in primary band $i$. Note that, for the ‘OR’ fusion rule,
$$P_f^{(k,N)}(x) = P_f^{(1,N)}(x) = \sum_{r=1}^{N} \binom{N}{r} Q(x)^r (1-Q(x))^{N-r} = 1-(1-Q(x))^N.$$
The derivative of $P_f^{(k,N)}(x)$ is given by
$$\frac{d}{dx} P_f^{(k,N)}(x) = N \binom{N-1}{k-1} Q(x)^{k-1} (1-Q(x))^{N-k} \frac{d}{dx}Q(x) \tag{A.1}$$
Using the facts that $\frac{d}{dx}Q(x) = -\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}$ and $\frac{d^2}{dx^2}Q(x) = \frac{x}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}$, the second derivative of $P_f^{(k,N)}(x)$ is given by
$$\frac{d^2}{dx^2} P_f^{(k,N)}(x) = N \binom{N-1}{k-1} Q(x)^{k-1} (1-Q(x))^{N-k-1} \frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} \left[ x(1-Q(x)) - (N-1)\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} + (k-1)\frac{\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}}{Q(x)} \right] \tag{A.2}$$
Notice that the terms outside the square brackets on the RHS of (A.2) are all positive. We denote the term within the square brackets as
$$g(x) = x(1-Q(x)) - (N-1)\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} + (k-1)\frac{\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}}{Q(x)} \tag{A.3}$$
We consider two different cases, Case I: $k \le \frac{N+1}{2}$ and Case II: $k > \frac{N+1}{2}$, as follows.

Case I: $k \le \frac{N+1}{2}$

For $x < 0$,
$$\begin{aligned}
g(x) &= x(1-Q(x)) - (N-1)\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} + (k-1)\frac{\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}}{Q(x)} \\
&\le x(1-Q(x)) - \big((N-1)-2(k-1)\big)\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} \quad (\text{since } Q(x) > 0.5 \text{ for } x < 0) \\
&< 0 \quad (\text{since } x < 0 \text{ and } (N-1)-2(k-1) \ge 0)
\end{aligned} \tag{A.4}$$
Thus, $g(x) < 0$ for $x < 0$.
Now consider the derivative of $g(x)$ for $x \ge 0$:
$$\begin{aligned}
\frac{dg}{dx} &= 1 - Q(x) + x\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} + x(N-1)\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} + (k-1)\frac{1}{Q(x)^2}\left( -Q(x)\,x\,\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} + \left(\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}\right)^2 \right) \\
&= 1 - Q(x) + N x \frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} + (k-1)\frac{\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}}{Q(x)^2}\left( -Q(x)\,x + \frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} \right)
\end{aligned} \tag{A.5}$$
Using the fact that $Q(x) \le \frac{1}{x\sqrt{2\pi}} e^{-\frac{x^2}{2}}$ for $x > 0$, the last bracket in (A.5) is non-negative, so it can be easily seen from (A.5) that $\frac{dg}{dx} > 0$ for $x > 0$. Thus, $g(x)$ is monotonically increasing for $x > 0$. Now, since $Q(0) = \frac{1}{2}$, we have $g(0) = \frac{2(k-1)-(N-1)}{\sqrt{2\pi}} \le 0$ for $k \le \frac{N+1}{2}$, and $\lim_{x\to\infty} g(x) = \infty$. Therefore, there exists a point $\bar{x}^{(k,N)} \ge 0$ such that
$$\frac{d^2}{dx^2} P_f^{(k,N)}(x) \ge 0 : \quad x \ge \bar{x}^{(k,N)} \tag{A.6}$$
$$\frac{d^2}{dx^2} P_f^{(k,N)}(x) < 0 : \quad x < \bar{x}^{(k,N)} \tag{A.7}$$
$\bar{x}^{(k,N)}$ can be found by evaluating the root of $g(x)$ using a false position algorithm [31]. Since $P_f^{(k,N)}(x)$ is a decreasing function of $x$, it follows that there exists a $P_{f\max}^{(k,N)}$ corresponding to $\bar{x}^{(k,N)}$ such that
$$P_f^{(k,N)}(x) \text{ is convex} : \quad P_f^{(k,N)}(x) \le P_{f\max}^{(k,N)} \tag{A.8}$$
$$P_f^{(k,N)}(x) \text{ is concave} : \quad P_f^{(k,N)}(x) > P_{f\max}^{(k,N)} \tag{A.9}$$

Case II: $k > \frac{N+1}{2}$

Using the fact that $\binom{N}{r} Q(x)^r (1-Q(x))^{N-r}$ are the terms of a binomial probability function $B(N, Q(x))$ and that $Q(-x) = 1-Q(x)$, it can be shown that
$$P_f^{(k,N)}(x) = 1 - P_f^{(N-k+1,N)}(-x) \tag{A.10}$$
Since $N-k+1 \le \frac{N+1}{2}$, from Case I it follows that $P_f^{(N-k+1,N)}(-x)$ is concave for $-x \le \bar{x}^{(N-k+1,N)}$ and thus $P_f^{(k,N)}(x)$ is convex for $x \ge -\bar{x}^{(N-k+1,N)}$. Thus, for $k > \frac{N+1}{2}$,
$$\bar{x}^{(k,N)} = -\bar{x}^{(N-k+1,N)} \tag{A.11}$$
$$P_{f\max}^{(k,N)} = 1 - P_{f\max}^{(N-k+1,N)} \tag{A.12}$$
Table 3.2 shows the values of $\bar{x}^{(k,N)}$ for which $P_f^{(k,N)}(x)$ is convex for $x > \bar{x}^{(k,N)}$ and concave for $x < \bar{x}^{(k,N)}$, for various values of $k$ and $N$. Table 3.3 shows the maximum probability of false alarm $P_{f\max}^{(k,N)}$ below which $P_{fi}$ is convex, for a k-out-of-N fusion rule, at different values of $k$ and $N$. From Table 3.3, it can be seen that the values of $P_{f\max}^{(k,N)}$ are greater than 0.44 for all values of $N$ less than 10.
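As an illustration of the procedure in this appendix, the following Python sketch locates $\bar{x}^{(k,N)}$ as the root of $g(x)$ in (A.3) with a plain false position iteration and then evaluates the corresponding $P_{f\max}^{(k,N)}$. The function names (`q_func`, `x_bar`, `p_f_max`), the bracket-doubling step, the tolerance and the iteration cap are assumptions made for this example; it is not the program used to generate Tables 3.2 and 3.3.

```python
import math


def q_func(x: float) -> float:
    """Gaussian tail probability Q(x) = P(Z > x) for a standard normal Z."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def phi(x: float) -> float:
    """Standard normal density (1/sqrt(2*pi)) * exp(-x^2/2), i.e. -dQ/dx."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def p_false_alarm(x: float, k: int, n: int) -> float:
    """P_f^{(k,N)}(x): binomial tail sum over r = k..N with success prob Q(x)."""
    q = q_func(x)
    return sum(math.comb(n, r) * q**r * (1.0 - q) ** (n - r)
               for r in range(k, n + 1))


def g(x: float, k: int, n: int) -> float:
    """Bracketed term of (A.2), as defined in (A.3)."""
    return (x * (1.0 - q_func(x))
            - (n - 1) * phi(x)
            + (k - 1) * phi(x) / q_func(x))


def false_position(f, lo: float, hi: float,
                   tol: float = 1e-12, max_iter: int = 1000) -> float:
    """Plain false position (regula falsi) on a bracket with f(lo) < 0 < f(hi)."""
    f_lo, f_hi = f(lo), f(hi)
    x = lo
    for _ in range(max_iter):
        # Root of the secant line through (lo, f_lo) and (hi, f_hi).
        x = hi - f_hi * (hi - lo) / (f_hi - f_lo)
        f_x = f(x)
        if abs(f_x) < tol:
            break
        if f_x < 0.0:
            lo, f_lo = x, f_x
        else:
            hi, f_hi = x, f_x
    return x  # best estimate, even if max_iter was reached


def x_bar(k: int, n: int) -> float:
    """Inflection point of P_f^{(k,N)}: convex for x >= x_bar, concave below."""
    if 2 * k > n + 1:
        # Case II: reduce to Case I through the symmetry (A.11).
        return -x_bar(n - k + 1, n)
    if g(0.0, k, n) >= 0.0:
        # g(0) = 0 exactly when 2(k-1) = N-1, so the root is at zero.
        return 0.0
    hi = 1.0
    while g(hi, k, n) <= 0.0:  # assumed bracketing step: double until g > 0
        hi *= 2.0
    return false_position(lambda x: g(x, k, n), 0.0, hi)


def p_f_max(k: int, n: int) -> float:
    """Largest false-alarm probability for which P_f^{(k,N)} is still convex."""
    return p_false_alarm(x_bar(k, n), k, n)
```

For instance, `x_bar(1, 2)` and `p_f_max(1, 2)` give the threshold and the convexity limit for the ‘OR’ rule with two sensors, and calling `p_f_max(k, n)` over a grid of `k` and `n` produces values that can be compared against Table 3.3.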
Appendix B
Log-Concavity of Q-function

The second derivative of the logarithm of the Q-function is given by
$$\begin{aligned}
\frac{d^2}{dx^2} \log Q(x) &= \frac{Q(x)\frac{d^2}{dx^2}Q(x) - \left(\frac{d}{dx}Q(x)\right)^2}{Q(x)^2} && \text{(B.1)} \\
&= \frac{Q(x)\,x\,\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} - \left(-\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}\right)^2}{Q(x)^2} && \text{(B.2)} \\
&= \frac{\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}}{Q(x)^2}\left[ xQ(x) - \frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} \right] && \text{(B.3)}
\end{aligned}$$
It is easy to see that the terms outside the square brackets on the RHS of (B.3) are positive. Now, the term inside the square brackets, $xQ(x) - \frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}$, is strictly negative for $x \le 0$, since $xQ(x) \le 0$ and $-\frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}} < 0$. For $x > 0$, it is well known that $Q(x) < \frac{1}{x\sqrt{2\pi}} e^{-\frac{x^2}{2}}$. Therefore, $xQ(x) - \frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}$ is less than zero for $x > 0$. Hence, from (B.3), it follows that the second derivative of $\log Q(x)$ is negative for all $x$ and thus $\log Q(x)$ is a concave function. Therefore, $Q(x)$ is a log-concave function.
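The argument above can be checked numerically. The Python sketch below evaluates the closed form (B.3) and compares it with a central second difference of $\log Q(x)$; the function names, the step size `h` and the sample grid are assumptions chosen for this illustration and are not part of the original derivation.

```python
import math


def q_func(x: float) -> float:
    """Gaussian tail probability Q(x) = P(Z > x) for a standard normal Z."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def phi(x: float) -> float:
    """Standard normal density (1/sqrt(2*pi)) * exp(-x^2/2)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def d2_log_q(x: float) -> float:
    """Closed form (B.3) for the second derivative of log Q(x)."""
    q = q_func(x)
    return phi(x) / (q * q) * (x * q - phi(x))


def d2_log_q_numeric(x: float, h: float = 1e-3) -> float:
    """Central second difference of log Q(x), used only as a cross-check."""
    def log_q(t: float) -> float:
        return math.log(q_func(t))
    return (log_q(x + h) - 2.0 * log_q(x) + log_q(x - h)) / (h * h)


# Assumed sample grid: x from -6 to 6 in steps of 0.25.
GRID = [0.25 * i for i in range(-24, 25)]

if __name__ == "__main__":
    for x in GRID:
        print(f"x = {x:6.2f}  closed form = {d2_log_q(x): .6e}  "
              f"finite difference = {d2_log_q_numeric(x): .6e}")
```

Every value printed for the closed form should be negative, in line with the conclusion that $\log Q(x)$ is concave; a finite grid is of course only a sanity check and does not replace the proof.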
Cooperative spectrum sensing for cognitive radio networks Kaligineedi, Praveen 2010
Page Metadata
Item Metadata
Title | Cooperative spectrum sensing for cognitive radio networks |
Creator |
Kaligineedi, Praveen |
Publisher | University of British Columbia |
Date Issued | 2010 |
Description | Radio spectrum is a very scarce and important resource for wireless communication systems. However, a recent study conducted by Federal Communications Commission (FCC) found that most of the currently allocated radio spectrum is not efficiently utilized by the licensed primary users. Granting opportunistic access of the spectrum to unlicensed secondary users has been suggested as a possible way to improve the utilization of the radio spectrum. Cognitive Radio (CR) is an emerging technology that would allow an unlicensed (cognitive) radio to sense and efficiently use any available spectrum at a given time. Reliable detection of the primary users is an important task for CR systems. Cooperation among a few sensors can offer significant gains in the performance of the CR spectrum sensing system by countering shadow-fading effects. In this thesis, we consider a parallel fusion based cooperative sensing network, in which the sensors send their sensing information to an access point, which makes the final decision regarding presence or absence of the primary signal. We assume that energy detection is used at each sensor. Presence of few malicious users sending false sensing data can severely degrade the performance of such a cooperative sensing system. In this thesis, we investigate schemes to identify malicious users based on outlier detection techniques. We take into consideration constraints imposed by the CR scenario, such as limited information about the primary signal propagation environment and small sensing data sample size. Considering partial knowledge of the primary user activity, we propose a novel method to identify malicious users. We further propose malicious user detection schemes that take into consideration the spatial location of the sensors. We then investigate efficient sensor allocation and quantization techniques for a CR network operating in multiple primary bands. 
We explore different methods to assign CR sensors to various primary bands. We then study efficient single-bit quantization schemes at the sensors. We show that the optimal quantization scheme is, in general, non-convex and propose a suboptimal solution based on a convex restriction of the original problem. We compare the performance of the proposed schemes using simulations. |
Genre |
Thesis/Dissertation |
Type |
Text |
Language | eng |
Date Available | 2010-12-02 |
Provider | Vancouver : University of British Columbia Library |
Rights | Attribution-NonCommercial-NoDerivatives 4.0 International |
DOI | 10.14288/1.0071475 |
URI | http://hdl.handle.net/2429/30261 |
Degree |
Doctor of Philosophy - PhD |
Program |
Electrical and Computer Engineering |
Affiliation |
Applied Science, Faculty of Electrical and Computer Engineering, Department of |
Degree Grantor | University of British Columbia |
GraduationDate | 2011-05 |
Campus |
UBCV |
Scholarly Level | Graduate |
Rights URI | http://creativecommons.org/licenses/by-nc-nd/4.0/ |
AggregatedSourceRepository | DSpace |