Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Eduard A. Jorswieck

Two-Hop Age of Information Scheduling for Multi-UAV Assisted Mobile Edge Computing: FRL vs MADDPG

Jun 19, 2022
Marjan Tajik, Mohammadreza Maleki, Nader Mokari, Mohammad Reza Javan, Hamid Saeedi, Bile Peng, Eduard A. Jorswieck

Figure 1 for Two-Hop Age of Information Scheduling for Multi-UAV Assisted Mobile Edge Computing: FRL vs MADDPG

Figure 2 for Two-Hop Age of Information Scheduling for Multi-UAV Assisted Mobile Edge Computing: FRL vs MADDPG

Figure 3 for Two-Hop Age of Information Scheduling for Multi-UAV Assisted Mobile Edge Computing: FRL vs MADDPG

Figure 4 for Two-Hop Age of Information Scheduling for Multi-UAV Assisted Mobile Edge Computing: FRL vs MADDPG

In this work, we adopt the emerging technology of mobile edge computing (MEC) in the Unmanned aerial vehicles (UAVs) for communication-computing systems, to optimize the age of information (AoI) in the network. We assume that tasks are processed jointly on UAVs and BS to enhance edge performance with limited connectivity and computing. Using UAVs and BS jointly with MEC can reduce AoI on the network. To maintain the freshness of the tasks, we formulate the AoI minimization in two-hop communication framework, the first hop at the UAVs and the second hop at the BS. To approach the challenge, we optimize the problem using a deep reinforcement learning (DRL) framework, called federated reinforcement learning (FRL). In our network we have two types of agents with different states and actions but with the same policy. Our FRL enables us to handle the two-step AoI minimization and UAV trajectory problems. In addition, we compare our proposed algorithm, which has a centralized processing unit to update the weights, with fully decentralized multi-agent deep deterministic policy gradient (MADDPG), which enhances the agent's performance. As a result, the suggested algorithm outperforms the MADDPG by about 38\%

Via

Access Paper or Ask Questions

Toward a Smart Resource Allocation Policy via Artificial Intelligence in 6G Networks: Centralized or Decentralized?

Feb 18, 2022
Ali Nouruzi, Atefeh Rezaei, Ata Khalili, Nader Mokari, Mohammad Reza Javan, Eduard A. Jorswieck, Halim Yanikomeroglu

Figure 1 for Toward a Smart Resource Allocation Policy via Artificial Intelligence in 6G Networks: Centralized or Decentralized?

Figure 2 for Toward a Smart Resource Allocation Policy via Artificial Intelligence in 6G Networks: Centralized or Decentralized?

Figure 3 for Toward a Smart Resource Allocation Policy via Artificial Intelligence in 6G Networks: Centralized or Decentralized?

Figure 4 for Toward a Smart Resource Allocation Policy via Artificial Intelligence in 6G Networks: Centralized or Decentralized?

In this paper, we design a new smart softwaredefined radio access network (RAN) architecture with important properties like flexibility and traffic awareness for sixth generation (6G) wireless networks. In particular, we consider a hierarchical resource allocation framework for the proposed smart soft-RAN model, where the software-defined network (SDN) controller is the first and foremost layer of the framework. This unit dynamically monitors the network to select a network operation type on the basis of distributed or centralized resource allocation architectures to perform decision-making intelligently. In this paper, our aim is to make the network more scalable and more flexible in terms of achievable data rate, overhead, and complexity indicators. To this end, we introduce a new metric, throughput overhead complexity (TOC), for the proposed machine learning-based algorithm, which makes a trade-off between these performance indicators. In particular, the decision making based on TOC is solved via deep reinforcement learning (DRL), which determines an appropriate resource allocation policy. Furthermore, for the selected algorithm, we employ the soft actor-critic method, which is more accurate, scalable, and robust than other learning methods. Simulation results demonstrate that the proposed smart network achieves better performance in terms of TOC compared to fixed centralized or distributed resource management schemes that lack dynamism. Moreover, our proposed algorithm outperforms conventional learning methods employed in other state-of-the-art network designs.

* Submitted to IEEE for possible publications

Via

Access Paper or Ask Questions

AI-based Robust Resource Allocation in End-to-End Network Slicing under Demand and CSI Uncertainties

Feb 10, 2022
Amir Gharehgoli, Ali Nouruzi, Nader Mokari, Paeiz Azmi, Mohamad Reza Javan, Eduard A. Jorswieck

Network slicing (NwS) is one of the main technologies in the fifth-generation of mobile communication and beyond (5G+). One of the important challenges in the NwS is information uncertainty which mainly involves demand and channel state information (CSI). Demand uncertainty is divided into three types: number of users requests, amount of bandwidth, and requested virtual network functions workloads. Moreover, the CSI uncertainty is modeled by three methods: worst-case, probabilistic, and hybrid. In this paper, our goal is to maximize the utility of the infrastructure provider by exploiting deep reinforcement learning algorithms in end-to-end NwS resource allocation under demand and CSI uncertainties. The proposed formulation is a nonconvex mixed-integer non-linear programming problem. To perform robust resource allocation in problems that involve uncertainty, we need a history of previous information. To this end, we use a recurrent deterministic policy gradient (RDPG) algorithm, a recurrent and memory-based approach in deep reinforcement learning. Then, we compare the RDPG method in different scenarios with soft actor-critic (SAC), deep deterministic policy gradient (DDPG), distributed, and greedy algorithms. The simulation results show that the SAC method is better than the DDPG, distributed, and greedy methods, respectively. Moreover, the RDPG method out performs the SAC approach on average by 70%.

Via

Access Paper or Ask Questions

Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network

Jan 08, 2022
Bile Peng, Jan-Aike Termöhlen, Cong Sun, Danping He, Ke Guan, Tim Fingscheidt, Eduard A. Jorswieck

Figure 1 for Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network

Figure 2 for Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network

Figure 3 for Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network

Figure 4 for Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network

Reconfigurable intelligent surface (RIS) is an emerging technology for future wireless communication systems. In this work, we consider downlink spatial multiplexing enabled by the RIS for weighted sum-rate (WSR) maximization. In the literature, most solutions use alternating gradient-based optimization, which has moderate performance, high complexity, and limited scalability. We propose to apply a fully convolutional network (FCN) to solve this problem, which was originally designed for semantic segmentation of images. The rectangular shape of the RIS and the spatial correlation of channels with adjacent RIS antennas due to the short distance between them encourage us to apply it for the RIS configuration. We design a set of channel features that includes both cascaded channels via the RIS and the direct channel. In the base station (BS), the differentiable minimum mean squared error (MMSE) precoder is used for pretraining and the weighted minimum mean squared error (WMMSE) precoder is then applied for fine-tuning, which is nondifferentiable, more complex, but achieves a better performance. Evaluation results show that the proposed solution has higher performance and allows for a faster evaluation than the baselines. Hence it scales better to a large number of antennas, advancing the RIS one step closer to practical deployment.

Via

Access Paper or Ask Questions

Multi Agent Reinforcement Learning Trajectory Design and Two-Stage Resource Management in CoMP UAV VLC Networks

Dec 03, 2021
Mohammad Reza Maleki, Mohammad Robat Mili, Mohammad Reza Javan, Nader Mokari, Eduard A. Jorswieck

Figure 1 for Multi Agent Reinforcement Learning Trajectory Design and Two-Stage Resource Management in CoMP UAV VLC Networks

Figure 2 for Multi Agent Reinforcement Learning Trajectory Design and Two-Stage Resource Management in CoMP UAV VLC Networks

Figure 3 for Multi Agent Reinforcement Learning Trajectory Design and Two-Stage Resource Management in CoMP UAV VLC Networks

Figure 4 for Multi Agent Reinforcement Learning Trajectory Design and Two-Stage Resource Management in CoMP UAV VLC Networks

In this paper, we consider unmanned aerial vehicles (UAVs) equipped with a visible light communication (VLC) access point and coordinated multipoint (CoMP) capability that allows users to connect to more than one UAV. UAVs can move in 3-dimensional (3D) at a constant acceleration, where a central server is responsible for synchronization and cooperation among UAVs. The effect of accelerated motion in UAV is necessary to be considered. Unlike most existing works, we examine the effects of variable speed on kinetics and radio resource allocations. For the proposed system model, we define two different time frames. In the frame, the acceleration of each UAV is specified, and in each slot, radio resources are allocated. Our goal is to formulate a multiobjective optimization problem where the total data rate is maximized, and the total communication power consumption is minimized simultaneously. To handle this multiobjective optimization, we first apply the scalarization method and then apply multi-agent deep deterministic policy gradient (MADDPG). We improve this solution method by adding two critic networks together with two-stage resources allocation. Simulation results indicate that the constant acceleration motion of UAVs shows about 8\% better results than conventional motion systems in terms of performance.

* 12 pages, 11 figures

Via

Access Paper or Ask Questions

AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks

Nov 22, 2021
Mohammad Reza Maleki, Mohammad Robat Mili, Mohammad Reza Javan, Nader Mokari, Eduard A. Jorswieck

Figure 1 for AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks

Figure 2 for AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks

Figure 3 for AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks

Figure 4 for AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks

In this paper, we consider unmanned aerial vehicles (UAVs) equipped with a visible light communication (VLC) access point and coordinated multipoint (CoMP) capability that allows users to connect to more than one UAV. UAVs can move in 3-dimensional (3D) at a constant acceleration in each time scale, where a central server is responsible for synchronization and cooperation among UAVs. The effect of accelerated motion in UAV is necessary to be considered. We define the data rate for each user type, CoMP and non-CoMP. Unlike most existing works, we see the effect of variable speed on kinetic and allocation formulas. For the proposed system model, we define two different timescales. In the master timescale, the acceleration of each UAV is specified, and in each short timescale, radio resources are allocated. The initial velocity in each small time slot is obtained from the previous time slot's velocity. Our goal is to formulate a multiobjective optimization problem where the total data rate is maximized and the total communication power consumption is minimized simultaneously. To handle this multiobjective optimization, we first apply the scalarization method and then apply multi-agent deep deterministic policy gradient (MADDPG) which is a multi-agent method based on deep deterministic policy gradient (DDPG) that ensures stable and fast convergence. We improve this solution method by adding two critic networks together with allocating the two step acceleration. Simulation results indicate that the constant acceleration motion of UAVs shows about 8% better results than conventional motion systems in terms of performance. Furthermore, CoMP supports the system to achieve an average 12% average higher rates compared to a non-CoMP system.

* 12 pages, 11 figures

Via

Access Paper or Ask Questions

AI-based Radio Resource Management and Trajectory Design for PD-NOMA Communication in IRS-UAV Assisted Networks

Nov 06, 2021
Hussein M. Hariz, Saeed Sheikhzadeh, Nader Mokari, Mohammad R. Javan, B. Abbasi-Arand, Eduard A. Jorswieck

Figure 1 for AI-based Radio Resource Management and Trajectory Design for PD-NOMA Communication in IRS-UAV Assisted Networks

Figure 2 for AI-based Radio Resource Management and Trajectory Design for PD-NOMA Communication in IRS-UAV Assisted Networks

Figure 3 for AI-based Radio Resource Management and Trajectory Design for PD-NOMA Communication in IRS-UAV Assisted Networks

Figure 4 for AI-based Radio Resource Management and Trajectory Design for PD-NOMA Communication in IRS-UAV Assisted Networks

In this paper, we consider that the unmanned aerial vehicles (UAVs) with attached intelligent reflecting surfaces (IRSs) play the role of flying reflectors that reflect the signal of users to the destination, and utilize the power-domain non-orthogonal multiple access (PD-NOMA) scheme in the uplink. We investigate the benefits of the UAV-IRS on the internet of things (IoT) networks that improve the freshness of collected data of the IoT devices via optimizing power, sub-carrier, and trajectory variables, as well as, the phase shift matrix elements. We consider minimizing the average age-of-information (AAoI) of users subject to the maximum transmit power limitations, PD-NOMA-related restriction, and the constraints related to UAV's movement. The optimization problem consists of discrete and continuous variables. Hence, we divide the resource allocation problem into two sub-problems and use two different reinforcement learning (RL) based algorithms to solve them, namely the double deep Qnetwork (DDQN) and a proximal policy optimization (PPO). Our numerical results illustrate the performance gains that can be achieved for IRS enabled UAV communication systems. Moreover, we compare our deep RL (DRL) based algorithm with matching algorithm and random trajectory, showing the combination of DDQN and PPO algorithm proposed in this paper performs 10% and 15% better than matching algorithm and random-trajectory algorithm, respectively.

Via

Access Paper or Ask Questions

AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks: Constant Velocity Vs. Constant Acceleration

Nov 06, 2021
Mohammad Reza Maleki, Mohammad Robat Mili, Mohammad Reza Javan, Nader Mokari, Eduard A. Jorswieck

Figure 1 for AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks: Constant Velocity Vs. Constant Acceleration

Figure 2 for AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks: Constant Velocity Vs. Constant Acceleration

Figure 3 for AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks: Constant Velocity Vs. Constant Acceleration

Figure 4 for AI-Based Radio Resource Management and Trajectory Design in CoMP UAV VLC Networks: Constant Velocity Vs. Constant Acceleration

In this paper, we consider UAVs equipped with a VLC access point and coordinated multipoint (CoMP) capability that allows users to connect to more than one UAV. UAVs can move in 3-dimensional (3D) at a constant acceleration in each master timescale, where a central server is responsible for synchronization and cooperation among UAVs. We define the data rate for each user type, CoMP and non-CoMP. The constant speed in UAVs' motion is not practical, and the effect of acceleration on the movement of UAVs is necessary to be considered. Unlike most existing works, we see the effect of variable speed on kinetic and allocation formulas. For the proposed system model, we define timescales for two different slots in which resources are allocated. In the master timescale, the acceleration of each UAV is specified, and in each short timescale, radio resources are allocated. The initial velocity in each small time slot is obtained from the previous time slot's velocity. Our goal is to formulate a multiobjective optimization problem where the total data rate is maximized and the total communication power consumption is minimized simultaneously. To deal this multiobjective optimization, we first apply the weighted method and then apply multi-agent deep deterministic policy gradient (MADDPG) which is a multi-agent method based on deep deterministic policy gradient (DDPG) that ensures more stable and faster convergence. We improve this solution method by adding two critic networks as well as allocating the two step acceleration. Simulation results indicate that the constant acceleration motion of UAVs gives about 8\% better results than conventional motion systems in terms of performance. Furthermore, CoMP supports the system to achieve an average of about 12\% higher rates comparing with non-CoMP system.

* 12 pages, 12 figures

Via

Access Paper or Ask Questions

Reconfigurable Intelligent Surface Phase Hopping for Ultra-Reliable Communications

Jul 25, 2021
Karl-Ludwig Besser, Eduard A. Jorswieck

Figure 1 for Reconfigurable Intelligent Surface Phase Hopping for Ultra-Reliable Communications

Figure 2 for Reconfigurable Intelligent Surface Phase Hopping for Ultra-Reliable Communications

Figure 3 for Reconfigurable Intelligent Surface Phase Hopping for Ultra-Reliable Communications

Figure 4 for Reconfigurable Intelligent Surface Phase Hopping for Ultra-Reliable Communications

We introduce a phase hopping scheme for reconfigurable intelligent surfaces (RISs) in which the phases of the individual RIS elements are randomly varied with each transmitted symbol. This effectively converts slow fading into fast fading. We show how this can be leveraged to significantly improve the outage performance and even achieve an outage probability of zero at a positive data rate without channel state information (CSI) at the transmitter and RIS. Furthermore, the same result can be accomplished even if only two possible phase values are available. Since we do not require perfect CSI at the transmitter or RIS, the proposed scheme has no additional communication overhead for adjusting the phases. This enables robust ultra-reliable communications with a reduced effort for channel estimation.

* 30 pages, 12 figures

Via

Access Paper or Ask Questions

Optimal Power Allocation in Downlink NOMA

Jul 14, 2021
Sepehr Rezvani, Eduard A. Jorswieck, Roghayeh Joda, Halim Yanikomeroglu

Figure 1 for Optimal Power Allocation in Downlink NOMA

Figure 2 for Optimal Power Allocation in Downlink NOMA

Figure 3 for Optimal Power Allocation in Downlink NOMA

Figure 4 for Optimal Power Allocation in Downlink NOMA

Power-domain non-orthogonal multiple access (NOMA) has arisen as a promising multiple access technique for the next-generation wireless networks. In this work, we address the problem of finding globally optimal power allocation strategies for the downlink of a generic single-cell NOMA system including multiple NOMA clusters each operating in an isolated resource block. Each cluster includes a set of users in which the well-known superposition coding (SC) combined with successive interference cancellation (SIC) technique (called SC-SIC) is applied among them. Interestingly, we prove that in both the sum-rate and energy efficiency maximization problems, network-NOMA can be equivalently transformed to a virtual network-OMA system, where the effective channel gain of these virtual OMA users are obtained in closed-form. Then, the latter problems are solved by using very fast water-filling and Dinkelbach algorithms, respectively. The equivalent transformation of NOMA to the virtual OMA system brings new insights, which are discussed throughout the paper. Extensive numerical results are provided to show the performance gap between fully SC-SIC, NOMA, and OMA in terms of system outage probability, BS's power consumption, users sum-rate, and system energy efficiency.

* 15 pages, 30 figures. arXiv admin note: text overlap with arXiv:2106.08636

Via

Access Paper or Ask Questions