and Multiple Robot Path Coordination Today’s Class 1. Motion Planning in Dynamic Environme

合集下载

激光振镜扫描图形失真及校正方法

y
采用二次多项式代替原来的线性关系，可将式 (8)式(9)转化为： x æ ì ⋅ e ± d 2 + y 2 ö + m 1 x + m 2 y + m 3 xy ïx ′ = ø k è í ï x x = n + n + n ′ 1 2y 3 y îy (10) 这就是理论点和失真点之间的二阶关系。
光束经过镜 2 后的偏转角为 2θ y 。当激光束垂直射到工作台时，激光束在工作台面上的投影为 (0， 0) 点，此时镜 1 与镜 2 的偏转角 θ x 和 θ 分别为零。当
y
激光束射到工作面上的任一点 ( x，y ) 时，可以得出
1 振镜扫描系统失真原因分析
1.1 映射关系非线性如图 1 所示的坐标系中，激光束入射方向平行于 X 轴，其偏 X 轴振镜(镜 1)以平行于 Z 的轴转动， θ θ Y 转角为 x ，光束经过镜 1 反射后的偏转角为 2 x ；轴振镜(镜 2)以平行于 X 的轴转动，其偏转角为 θ y ，
2( p， - q)、正方形顶点的位置坐标为别为 1( p，q)、（x，y） 3( - p，q) 、 4( - p， - q) ，畸变坐标为，以x
若镜 1 固定且 θ x ≠0，令 2 θ x =a，则由式（1）有：
( x - e ⋅ a) y2 - 2 =1 2 d (d ⋅ a)
（6）
n2 = k3 (m1 x + m2) + k 4 。由此得出点 ( x，y) 和 ( x ′ ， y ′ )的关系式：
此时，光束在 y 轴没有畸变。由式（5）和式（6）得出失真图形的校正函数为： x æ 2 ì ⋅ d + y 2 + eö ïx ′ = ø k è （7） í ï = ′ îy y 假设理论上的直线为 y = k1 x + k2 ，失真后的直线为 y ′ = k3 x ′ + k 4 ，两条直线经过平移后得到 n1 y +

Coordination strategies for multi-robot exploration and mapping

Coordination strategies for multi-robot exploration andmappingJohn G.Rogers III,Carlos Nieto-Granda,and Henrik I.ChristensenCenter for Robotics and Intelligent MachineGeorgia Institute of Technology{jgrogers,carlos.nieto,hic}@Abstract.Situational awareness in rescue operations can be provided by teamsof autonomous mobile robots.Human operators are required to teleoperate thecurrent generation of mobile robots for this application;however,teleoperationis increasingly difﬁcult as the number of robots is expanded.As the number ofrobots is increased,each robot may interfere with one another and eventuallydecrease mapping performance.Through careful consideration of robot team co-ordination and exploration strategy,large numbers of mobile robots be allocatedto accomplish the mapping task more quickly and accurately.1MotivationProjects like the Army Research Laboratory’s Micro-Autonomous Systems Technol-ogy(MAST)[1]seek to introduce the application of large numbers of inexpensive and simple mobile robots for situational awareness in urban military and rescue operations. Human operators are required to teleoperate the current generation of mobile robots for this application;however,teleoperation is increasingly difﬁcult as the number of robots is expanded.There is evidence in human factors research which indicates that the cognitive load on a human operator is signiﬁcantly increased when they are asked to teleoperate more than one robot[18].Autonomy will make it possible to manage larger numbers of small robots for map-ping.There is a continuum of options as to the degree of shared autonomy between robot and human operator[11].Current robots employed in explosive ordinance dis-posal(EOD)missions are fully tele-operated.At the other extreme,robots can be given high-level tasks by the operator,while autonomously handling low-level tasks[3]such as obstacle avoidance or balance maintenance.In this paper,our robot teams occupy the latter end of the spectrum;we imagine that the operator has tasked the robot team to autonomously explore and map an unknown environment while focusing on the high level task of looking for survivors.In the multi-robot scenario,resources are distributed amongst a team of robots in-stead of concentrated on one large and expensive machine.This distribution offers a number of advantages and disadvantages over the single robot case.The distributed team is able to continue its mission even if some of the robots are disabled or de-stroyed.A single robot can only explore or monitor at one location at a time;however, the multi-robot team can provide situational awareness in many locations at once.Un-less the single robot is able to move much faster than the multi-robot agents,the lone2robot will be slower in performing the exploration and mapping task.These advantages are taken for a multi-robot team at the cost of increased complexity in communication and coordination.As the number of robots is increased,each robot may interfere with one another and eventually decrease the performance of the mapping task.Careful consideration of exploration strategy and coordination of large numbers of mobile robots can efﬁciently allocate resources to perform the mapping task more quickly and more accurately.Mobile robot simultaneous localization and mapping(SLAM)has been thoroughly addressed in the literature,see[2]and[6]for a detailed review of the history and state-of-the-art in SLAM research.The speciﬁc techniques used in this paper are based upon the Square Root SAM algorithm[4][5]which uses the well-known algorithms of linear algebra least-squares system solving to compute the map and robot trajectory based on a set of measurements.Multi-robot mapping and exploration was addressed in[9]and[17].These papers build a map using up to3robots with a decision-theoretic planner which trades off robot rendezvous operations with frontier exploration.These robots rendezvous to determine their relative pose transforms to provide constraints to recover theﬁnal map.In contrast, our approach does not require this rendezvous step because landmarks are globally data associated between each robot on a central map coordinator.The exploration strategy used is similar to our strategy called Reserve;however,we will not use a rendezvous step and do not require a decision-theoretic planner.2Technical ApproachWe use the Robot Operating System(ROS)from[12].ROS provides interprocess com-munication as well as coordination of sensor data with pose information.Our robot algorithms are implemented as a distributed set of programs which run in the ROS sys-tem.In addition,we make use of several implementations of common mobile robot software components which are provided in the ROS distribution such as motion plan-ning,obstacle avoidance,platform control,and IMU and odometryﬁltering.2.1Mapping SystemOur mapping system is based upon the GTsam library developed at Georgia Tech.This library extends the Square Root SAM technique in[5]with sparse linear algebra in a nonlinear optimization engine.We have extended the GTsam library with a frame-work based upon the M-space formulation of Folkesson and Christensen[8]called OmniMapper.OmniMapper is a map library based upon a system of plugins which handle multiple landmark types simultaneously.We have used the OmniMapper in the past to build maps using multiple types of landmarks such as walls,doors,and ob-jects[14][13][16].This implementation builds maps of planar regions corresponding to walls and tables from[15].Each robot in the team builds a map locally with the OmniMapper and sends map data to the map coordinator.Each robot can incorporate new landmark measurements whenever it has moved far enough from the last pose where measurements were made.3Fig.1.OmniMapper.In the current implementation this is set to10cm.When a robotﬁnishes optimizing its local map with new landmark measurements,all relevant information needed by the map coordinator is packaged and transmitted.The information which is needed by the map coordinator to incorporate a new piece of information from a team member consists of many components.First,the sensor measurement data is needed.In the current implementation,this consists of the ex-tracted plane information consisting of a plane equation along with a convex hull of points along the perimeter of the plane.This represents a signiﬁcant compression over an alternative scheme where all point-cloud data could be transmitted and processed at the master node.Secondly,the team member’s integrated odometry is transmitted.This allows the master node to compute the odometric relative pose since the prior landmark measurement data was incorporated;this is used to insert a relative pose factor and also give initial conditions for data association.Finally,the team member’s local map pose is transmitted.This is used by the master node to compute a map pose correction.This correction is sent back to the team member so that it knows it’s relative pose in the global map frame.This knowledge is needed so that the team member can interpret exploration goals correctly.The map coordinator maintains trajectories for each of the robots in the team.Mea-surements from each robot are merged into one global view of the landmarks.This is realized through a simple modiﬁcation to the standard OmniMapper through duplica-tion of data structures tracking indexing data and pose information used for interaction with GTsam into arrays.This implementation potentially allows for an unlimited num-ber of team members to build a map together.4Most modern SLAM approaches use a pose graph[10]which is generated via laser scan matching in2D or point-cloud ICP in3D.This approach is effective for single robot mapping;however,it has some drawbacks for larger multirobot mapping.Scan matching and ICP algorithms are computationally intensive and matching across many robots would rapidly become intractable.Also,point cloud representations are large and their transport over a wireless link could be prohibitive if the link is limited in capacity due to mesh network routing or environmental interference.To address these limitations,our robots extract relevant,parsimonious features from the environment and transmit them to the master node.Each turtlebot in these experiments maps planar wall structures using a Microsoft Kinect sensor.Planar segments corresponding to walls are extracted from point clouds via a RANSAC[7]based algorithm[15].Points are uniformly sampled from the point cloud and any sufﬁciently large set of points coplanar with these three points are se-lected as a plane and are removed from the point cloud.This process is repeated until up to four planes are extracted or aﬁxed number of iterations is reached.To improve the speed of plane extraction,the Kinect point cloud is computed at QQVGA(f rac18) resolution,which achieves˜1Hz frame rate.The Kinect sensor on each robot has a narrowﬁeld-of-view which is not ideal for detecting exploration frontiers.To alleviate this problem,we incorporated a strategy by which each robot will rotate periodically to get a360degree view of its surroundings. This data is synchronized with robot odometry to synthesize a360degree laser scan. This synthesized laser scan is sent to the local mapper and forwarded to the global map-per.At the global mapper,it is linked to a trajectory pose element and used to populate an occupancy grid.This occupancy grid is re-computed after every map optimization so that a loop closure will result in a correct occupancy grid map.The frontier based explo-ration strategies detailed below use this occupancy grid toﬁnd the boundary between clear and unknown grid cells.2.2Exploration StrategyEach robot team leader uses a frontier based exploration strategy similar to the one used in[17].An exploration frontier is deﬁned on a costmap cellular decomposition where each cell has one of three labels:Clear,Obstacle,and Unknown.The costmap is initialized as Unknown.Costmap cells are set to Obstacle corresponding to locations where the Kinect sensor detects an obstacle in the environment.The cells on a line between the obstacle cell and the robot’s current location are set to Clear.Exploration frontiers are deﬁned as Clear cells which are adjacent to at least one neighbor where the label is Unknown.The high level robot exploration goal allocation is centrally planned on the same workstation where the global map is constructed.There are many choices which can be made by the exploration planner when choosing which robot or group of robots should move towards an exploration goal.We have chosen to employ a greedy strategy by which the nearest robot or team is allocated to a goal instead of a more sophisticated traveling-salesman type of algorithm.We believe that this is appropriate because the exploration goals will change as the robots move through the environment;re-planning will be required after each robot or team reaches an exploration goal.5Fig.2.Global maps using the Reserve coordination algorithm described in this paper.2.3Coordination StrategyThe coordination strategy used between robot agents as well as the number of robots arethe independent variables in the experiments performed in this paper.The coordinationstrategy refers to the proportion of robots which are dispatched to each exploration goal.On one extreme,a single robot can be sent to explore a new goal;at the other extreme allavailable robots can be sent to a new rger robot teams sent to a new explorationgoal will improve availability of new agents at the location of new exploration goals arediscovered.The larger group has spare robots which can be quickly allocated to explorenew goals,such as those discovered when the team moves past a corridor intersectionor t-junction.If the group of robots allocated to a navigation goal is too large,then therobots can interfere with each other due to local reactive control of multiple agents withrespect to dynamic obstacles and limited space in corridors.The strategies selected fortesting trade off availability (robots are close and able to explore branching structurequickly)with non-interference (robots do not get in each other’s way).The ﬁrst coordination algorithm is called Reserve .In this algorithm,all unallocatedrobots remain a the starting locations until new exploration goals are uncovered.Whena branching point is detected by an active robot,the closest reserve robot will be re-cruited into active status to explore the other path.This strategy has low availabilitybecause all of the reserve robots remain far away at the entrance;however,it has min-imal interference because the exploring robots will usually be further away from otherrobots.The second coordination algorithm is Divide and Conquer .In this strategy,the en-tire robot group follows the leader until a branching point is detected.The group splits in half,with the ﬁrst n 2robots following the original leader,robot n 2+1is selected as theleader of the second group,and robots n 2+2through n are now members of its squad.Once there are n squads with one robot,no further divide operations can be made andnew exploration goals will only be allocated once a robot has reached a dead-end or6Fig.3.A map built by three robots using the Reserve cooperative mapping strategy. looped back into a previously explored area.This algorithm maximizes availability,but potentially causes signiﬁcant interference between robots.An example3D map built by two robots as they approach a branch point can be seen inﬁgure4(a).At this point,the robot team splits and each team member takes a separate path,as seen inﬁgure4(b).The map shown is built concurrently with local maps built on each robot.The global map is used to establish a global frame of reference for robot collaboration message coordinates.(a)Two robots approach the intersection.(b)Two robots split and move past the in-tersectionFig.4.An illustration of the Divide and Conquer exploration strategy.As the robots approach an intersection,the team must split and recruit new partner robots from the reserved units.3ExperimentsThe setting for the multi-robot mapping task for this series of experiments consists of a team of robots being introduced into a single entrance in an unknown environment. Each robot is an inexpensive Willow Garage TurtleBot;a team of nine of these robots is shown inﬁgure3.The TurtleBot was chosen for this application due to its low cost and7(a)A map built by seven robots in an experiment using the Reserve coopera-tive mapping strategy.(b)The same map shown from a different angle to demonstrate3Dplane features which are used for map landmarks.Fig.5.Global maps gathered by a team of seven mobile robots.8the ease of integrating large numbers of robots through ROS.The TurtleBot platform is based on the iRobot Create base.The robots make measurements of planes with a Kinect sensor,and use an onboard IMU together with odometry to estimate ego-motion.Fig.6.Our nine TurtleBots used in these experiments.We evaluated the performance of various robot coordination strategies in the multi-robot exploration and mapping task.An example scenario for the Divide and Conquer cooperative mapping strategy can be seen in the panorama image inﬁgure3.Fig.7.An example scenario for the experiments described in this paper.Three teams of two robots are exploring the branching hallway structure in an ofﬁce environment.In this illustration, the robots are using the Divide and Conquer cooperative mapping strategy.We performed a series of experiments to demonstrate the performance of our two cooperative mapping strategies.A total of6runs were performed for each cooperation strategy,team size,and starting location.For each experiment run,the TurtleBot team9 explored the environment from a wedge-shaped starting conﬁguration,which can be seen inﬁgure3.These experiments were performed in an ofﬁce environment.In order to measure the exploration and mapping performance in each location,we chose spe-ciﬁc starting locations which are labeled Base1and Base2inﬁgure3.These starting locations were chosen because the area around the robot teams could be blocked off so there is only one initial exploration frontier,directly in front of the lead robot.This ini-tial conﬁguration was chosen to represent a breaching behavior which would be needed for implementation of collaborative mapping in a hostile environment.Fig.8.Our ofﬁce environment where the experiments were performed.The areas labeled Base1 and Base2are the initial position of the robots.Red lines indicate artiﬁcial barricades to restrict the initial exploration of the robot teams to simulate a breach entrance into a hostile environment. 4ResultsWe performed a series of experiments for this paper which demonstrate team perfor-mance based upon coverage in a mapping task on an unknown ofﬁce environment. Robot team sizes were varied from2to9robots.An map built with7robots at Turtle-Bots using the Reserve strategy is seen in Figure5(a).An image showing the sameﬁnal global map from a side view demonstrates the3D plane features inﬁgure2.3.Each of the collaboration strategy and robot team size experiments were performed from two starting locations.These starting locations are labeled Base1and Base2in ﬁgure3.A series of interesting locations was determined in advance by examining the buildingﬂoor-plan;these points of interest are also marked inﬁgure3.Each experiment run gets a score based on how many of these points of interest are visited and mapped before a time limit is reached.This score represents the effectiveness of that algorithm and team size at providing coverage while exploring an unknown map.ﬁrst experiment series from Base1in ﬁgure 3,both strategies achieve reducedexploration coverage per robot as the team size is increased,as can be seen in the graphsin ﬁgure 9.In this starting location,there is limited space to maneuver,so both strate-gies generate signiﬁcant interference between robots trying to move to their goals.Inseveral instances,pairs of robots even crashed into each other due to the limited ﬁeld-of-view of their sensors.We believe that the Divide and Conquer strategy results inﬁgure 9(b)indicate that the team was slightly more effective than the Reserves strategyin ﬁgure 9(a).At the largest team size of 9robots,the Divide and Conquer strategyusually visited one additional point-of-interest more than the Reserves strategy.Addi-tional qualitative impressions are that the Divide and Conquer strategy explored thepoints-of-interest that it reached more quickly than with the Reserves strategy.For bothstrategies,the best team size appears to be 6robots in this starting location.(a)Reserves (b)Divide and ConquerFig.9.Results from the ﬁrst starting areaIn the second set of experiments,the robot teams were placed in the starting arealabeled Base2in ﬁgure 3.As in the ﬁrst experiment,the per-robot performance of bothstrategies decreased as the number of robots were increased.This series of experimentsdemonstrates a marked improvement of the Divide and Conquer strategy over the Re-serves strategy as can be seen in ﬁgure 10.The Divide and Conquer strategy causesmore robots to be making observations of exploration frontiers due to the fact thatgroups contain more than one robot.These additional observations of the frontier allowthe Divide and Conquer strategy to ﬁnd exploration frontiers faster than the Reservesstrategy,and therefore explore more points-of-interest.The second experiment startedfrom an area where there is more room to maneuver.This allowed the Divide and Con-quer strategy to have less interference since the entire team moved together out of thestarting area into the larger area before any divide operations were performed.The Re-serves strategy still had to initially maneuver from the cramped starting location.As inthe ﬁrst experiment,the Divide and Conquer strategy qualitatively explored the envi-ronment faster than the Reserves strategy.The best value for the number of robots is 6,which is the same value found in the ﬁrst experiment.11(a)Reserves(b)Divide and ConquerFig.10.Results from the second starting area5DiscussionWe have presented experiments which evaluate two collaboration strategies which can be used by teams of mobile robots to map and explore an unknown environment.We have also evaluated the impact of the number of robots on coverage in the exploration and mapping task.Theﬁrst collaboration strategy,called Reserves keeps a pool of unallocated robots at the starting location.A new robot is activated when there are more exploration frontiers than currently active robots.This strategy was intended to minimize the amount of in-terference between robot agents since robots would be far away from each other during exploration.The results from our experiments do not indicate that this strategy results in less interference than other strategies since performance decreases more when more robots are added in some environments.The Reserves strategy is signiﬁcantly slower at exploring the environment than other strategies.The second collaboration strategy,called Divide and Conquer has all available robots proceed in one large group.Once there are two exploration frontiers,at a corridor t-junction for example,the team will divide in half and each sub-team will follow one of the exploration frontiers.This process will be repeated with teams dividing in half each time they see branching structure in the environment.It was anticipated that this strategy would result in higher interference since robots would be maneuvering close together;however,the increased availability of robots near new exploration frontiers offsets this phenomenon.Divide and Conquer appears to be a more effective strategy than Reserves for ex-ploring and mapping an unknown environment.There are additional hybrid strategies which could now be considered such as the Buddy System,which modiﬁes the Reserves strategy with teams of2robots instead of1.We believe that this strategy will mitigate much of the slowness of the Reserves strategy while still minimizing interference.12AcknowledgmentsThis work was made possible through generous support from the Army Research Lab (ARL)MAST CTA project,and The Boeing Corporation.References1.ARL:Army Research Lab Micro Autonomous Systems and Technology Collaborative Tech-nology Alliance MAST CTA./www/default.cfm?page=332(2006) 2.Bailey,T.,Durrant-Whyte,H.:Simultaneous localisation and mapping(SLAM):Part II stateof the art.Robotics and Automation Magazine(September2006)3.Chipalkatty,R.,Daepp,H.,Egerstedt,M.,Book,W.:Human-in-the-loop:Mpc for sharedcontrol of a quadruped rescue robot.In:Intelligent Robots and Systems(IROS),2011 IEEE/RSJ International Conference on.pp.4556–4561.IEEE(2011)4.Dellaert,F.:Square root SAM:Simultaneous localization and mapping via square root infor-mation smoothing.Robotics:Science and Systems(2005)5.Dellaert,F.,Kaess,M.:Square root SAM:Simultaneous localization and mapping via squareroot information smoothing.International Journal of Robotics Research(2006)6.Durrant-Whyte,H.,Bailey,T.:Simultaneous localisation and mapping(SLAM):Part I theessential algorithms.Robotics and Automation Magazine(June2006)7.Fischler,M.A.,Bolles,R.C.:Random sample consensus:A paradigm for modelﬁtting withapplications to image analysis and automated munications of the ACM 24(6),381–395(1981)8.Folkesson,J.,Christensen,H.:Graphical SLAM-a self-correcting map.IEEE InternationalConference on Robotics and Automation(2004)9.Fox,D.,Ko,J.,Konolige,K.,Limketkai,B.,Schulz,D.,Stewart,B.:Distributed multirobotexploration and mapping.Proceedings of the IEEE94(7),1325–1339(2006)10.Grisetti,G.,Grzonka,S.,Stachniss,C.,Pfaff,P.,Burgard,W.:Efﬁcient estimation of accuratemaximum likelihood maps in3D.In:Intelligent Robots and Systems,2007.IROS2007.IEEE/RSJ International Conference on.pp.3472–3478.IEEE(2007)11.Heger,F.,Singh,S.:Sliding autonomy for complex coordinated multi-robot tasks:Analy-sis and experiments.In:Proceedings of Robotics:Science and Systems.Philadelphia,USA (August2006)12.Quigley,M.,Gerkey,B.,Conley,K.,Faust,J.,Foote,T.,Leibs,J.,Berger,E.,Wheeler,R.,Ng,A.:ROS:an open-source robot operating system.In:ICRA Workshop on Open Source Software(2009)13.Rogers,J.,Trevor,A.,Nieto,C.,Cunningham,A.,Paluri,M.,Michael,N.,Dellaert,F.,Chris-tensen,H.,Kumar,V.:Effects of sensory perception on mobile robot localization and map-ping.In:International Symposium on Experimental Robotics ISER(2010)14.Rogers III,J.G.,Trevor,A.J.B.,Nieto-Granda,C.,Christensen,H.I.:Simultaneous localiza-tion and mapping with learned object recognition and semantic data association.In:IEEE International Conference on Intelligent RObots and Systems(IROS)(2011)15.Rusu,R.B.,Cousins,S.:3D is here:Point Cloud Library(PCL).In:IEEE International Con-ference on Robotics and Automation(ICRA).Shanghai,China(20112011)16.Trevor,A.J.B.,Rogers III,J.G.,Nieto-Granda,C.,Christensen,H.I.:Tables,counters,andshelves:Semantic mapping of surfaces in3D.In:IROS Workshop on Semantic Mapping and Autonomous Knowledge Acquisition(2010)17.Vincent,R.,Fox,D.,Ko,J.,Konolige,K.,Limketkai,B.,Morisset,B.,Ortiz,C.,Schulz,D.,Stewart,B.:Distributed multirobot exploration,mapping,and task allocation.Annals ofMathematics and Artiﬁcial Intelligence52(2),229–255(2008)13 18.Zheng,K.,Glas,D.,Kanda,T.,Ishiguro,H.,Hagita,N.:How many social robots can oneoperator control?In:Proceedings of the6th international conference on Human-robot inter-action.pp.379–386.ACM(2011)。

cs148_2009_slides_topic15

Computation
Robot 2
Assignment problem:
assign tasks to robots to maximize utility
Embodiment
y2[t]
Perception
^2[t] x
Sensing
x[t+1]
objective G2 G1
Decision Making Decision Making
planning3
)
Final Tournament
double elimination larger ﬁeld (6x4m) and goals (1.5m) I will attack
600
I will defend
A* search on Wikipedia as well as many other sources. 2-on-2 matches ble on the internet, you are free to use these just be sure
and equipment to be used (e.g., the accuracy of the in terms of jobs and workers. tially Observable Markov Decision Processes (Kaelbling, map that will be produced using a laser range-ﬁnder), stand what kind ofLittman task allocation & Cassandraproblem 1998), bothis of present which are in highly expressive. Unfortunately, such models tend to be too comDeﬁnition (Optimal Assignment Problem). reGiv order to solve it in a principled manner. • expected resource cost, given the spatio-temporal plex to be directly analyzed or solved, even for modestof the for taskone (e.g.,job the and power will be each looking nthat prioritized We propose the following three axes for use in describ- workers, quirements sized systems. Another formal model is that of the hybrid required to drive the motors and laser range-ﬁnder in one worker. Also given for each work ng MRTA problems: system (Alur, Courcoubetis, Halbwachs, Henzinger, each Ho, requiring order to map the building). nonnegative skill rating (i.e., utility estimate) that pr Nicollin, Olivero, Sifakis & Yovine 1995), which characGiven a robot R and a task T , if R is capable of executing • single-task robots (ST) systems vs. multi-task terizes discrete operating in an robots analog environhis/her performance for each job; if a worker is inca T , then one can deﬁne, on some standardized scale, QRT ment. Hybrid systems can also become complex and are (MT): ST means that each robot is capable of exe- of undertaking a job, then the worker is assigned a usually used to describe or control the behavior of a single and CRT as the quality and cost, respectively, expected cuting as most one task at a time, while MT means of zero to result fromjob. the execution of T . This results in a t for that The goal isby toRassign workers robot, via a so-called three-layer architecture (Gat 1998). formation of utility matrix nonnegative utility measure: that some robotsOur can execute multiple simultagoal in this paper is to ﬁlltasks a gap in the existing literso as combined, to maximize overall expected performance, t ature on multi-robot coordination. We neither construct a neously. into account priorities of jobs and the skill r QRT − CRT if Rthe is capable of executing the formal model in support of a particular coordination archiT and QRT > CRT RT = of theUworkers. compare different architectures in a particular • single-robot tecture, tasks nor (SR) vs. multi-robot tasks 0 otherwise task domain. Rather, we develop a task- and architecture(MR): SR means that each task requires exactly one The OAP cangiven be cast inAmany ways, independent taxonomy, based on optimization theory, in For example, a robot that can achieveincluding a task T which to study task allocation problems. robot to achieve it, while MR means that some tasks integral linear (Gale 1960): mn nonne with qualityprogram QAT = 20 at cost CAT =ﬁnd 10 and a robot ij

不确定动态环境下移动机器人的完全遍历路径规划 - 副本

Keywords: comp lete coverage path p lanning; biologically insp ired neural network; rolling w indow; heuristic p lanning
1 引言 ( In troduction )
移动机器人路径规划是实现移动机器人自主导航和控制的基本问题之一 , 许多学者对其进行了广 [ 1, 2 ] 泛的研究 ,并提出了多种路径规划方法 . 根据移动机器人的运动目标 ,路径规划主要有两种 , 一种是点到点的路径规划 ,另一种是完全遍历路径规划 . 点到点的路径规划是为移动体在工作空间寻找一条从始点到终点、能完全避障的最优或合理的路径 . 完全遍历路径规划是二维工作环境中的一种特殊的路径规划 , 寻找一条在工作区域内从始点到终点且经过
第 28 卷第 6 期 2006 年 11 月文章编号 : 1002 2 0446 (2006) 06 2 0586 2 07
机器人 ROBO T
Vol . 28, No. 6 Nov . , 2006
不确定动态环境下移动机器人的完全遍历路径规划
邱雪娜
1, 2
3
, 刘士荣 , 宋加涛 , Simon X. YANG
587
知和环境信息未知两种情况 , 相应的路径规划问题就有 : 全局路径规划和局部路径规划 . 点到点的路径规划方法主要有 : 栅格法、人工势场法、切线法、启发 3 式方法、单元分解法、 A 算法、四叉树、模糊逻辑、神经网络等 ; 由于完全遍历路径规划的特殊性 , 通常在以上方法的基础引入了基于模板的模型、基于行为的路径规划和其他混合方法 . 但这些方法单独运用时 ,对于不同的实际问题总是存在一定的缺陷 . 在不确定动态环境下 , 移动机器人对于环境信息、特别是动态障碍物的运动很难具有先验的知识 . 移动机器人在未知环境中运动时 , 只能根据其传感器探测到的有限范围内的环境信息 . 因此在这种情况下 , 移动机器人只靠在运动中获取的信息来决定自己的行走路径 . 传统的基于反馈的“ 导航 ” 算法正是基于这一思想 ,但这种算法往往只着眼于避障 , 算法的实施并不能保证顺利到达终点 , 更谈不上规划较优的路径 . 受生物启发的神经网络方法已用于移动机器人的工作环境建模、路径规划和运动控制 [ 3, 4 ] 等 ,该方法在完全遍历路径规划中具有一定的优势 ,也存在一些不足 . 本文将生物激励神经网络、滚动窗口、启发式规划等概念与方法集成 , 提出了基于生物激励与滚动启发式规划的遍历路径规划方法 . 该方法依据移动机器人实时探测的环境信息 , 构成滚动窗口信息 , 运用生物激励神经网络产生实时局部地图信息 , 用启发式规则得到可行的安全路径 . 在障碍物突然出现或消失的环境中 ,或存在运动障碍物的环境中 , 移动机器人能安全避障并能完全遍历所有的工作空间 . 仿真实验表明该方法简单有效 , 并具有较高的规划效率 .

机器人路径规划的多目标二次优化方法

机器人路径规划的多目标二次优化方法吴军;史美萍;宋金泽;贺汉根【期刊名称】《计算机仿真》【年(卷),期】2009(0)10【摘要】A method named Multi - objective Quadratic Optimization (MOQO) is proposed for robot path - planning to raise the computation efficiency of the former multi - objective evolutionary algorithm. Firstly, a quadratic optimization idea is introduced into MOQO, namely, through an offline global optimization process to provide a suboptimal initial solution set for the online local optimization process, thus using the apriori knowledge and evolutionary information in the former optimization process more effectively, and accelerating later the local optimization process. Secondly, a modified multi - objective evolutionary algorithm is proposed in MOQO to improve the real - time performance of optimization process. Lastly, the simulations and trials show that the MOQO can accelerate the evolutionary operation process effectively, and the feasibility and validation of MOQO are confirmed.%基于多目标进化算法的机器人路径规划问题存在求解速度慢的不足,为提高求解效率,提出了一种多目标二次优化方法(MOQO).在MOQO方法中首先引入二次优化的思想.通过离线的全局优化为在线的局部优化提供一个次优的初始解集,以更有效利用全局优化过程的先验信息和进化信息,从而加速后期的局部优化操作;其次,在MOQO方法中还提出了一种改进的多目标进化算法PPMOEA,以进一步提高优化求解的实时性能.最终的机器人路径规划仿真实验测试了MOQO方法对进化求解操作的加速效果,证明了方法的有效件和可行性.【总页数】4页(P179-181,220)【作者】吴军;史美萍;宋金泽;贺汉根【作者单位】国防科技大学机电工程和自动化学院,湖南,长沙,410073;国防科技大学机电工程和自动化学院,湖南,长沙,410073;国防科技大学机电工程和自动化学院,湖南,长沙,410073;国防科技大学机电工程和自动化学院,湖南,长沙,410073【正文语种】中文【中图分类】TP273【相关文献】1.空间机器人路径规划综合优化方法 [J], 金宗耀;谭春林2.传统多目标优化方法和多目标遗传算法的比较综述 [J], 马小姝;李宇龙;严浪3.复杂环境下农业机器人路径规划优化方法 [J], 殷建军;董文龙;梁利华;谢伟东;项祖丰4.行人二次过街交叉口信号相序多目标优化方法 [J], 王艳丽; 卢建涛; 吴兵5.一种移动机器人路径规划的优化方法 [J], 李燊;韩志平;李颖因版权原因，仅展示原文概要，查看原文内容请购买。

多智能体机器人协调控制研究及稳定性分析

多智能体机器人协调控制研究及稳定性分析一、本文概述Overview of this article随着科技的飞速发展，多智能体机器人系统在工业、军事、医疗等领域的应用日益广泛。

多智能体机器人协调控制作为该领域的核心问题，对于提升机器人系统的整体性能、稳定性和适应性具有至关重要的意义。

本文旨在深入研究多智能体机器人协调控制的相关理论与技术，并对其稳定性进行深入分析。

With the rapid development of technology, the application of multi-agent robot systems in industrial, military, medical and other fields is becoming increasingly widespread. The coordinated control of multi-agent robots, as a core issue in this field, is of great significance for improving the overall performance, stability, and adaptability of robot systems. This article aims to conduct in-depth research on the relevant theories and technologies of coordinated control ofmulti-agent robots, and analyze their stability in depth.文章首先回顾了多智能体机器人协调控制的发展历程和研究现状，概述了当前面临的主要挑战和问题。

在此基础上，文章提出了一种基于优化算法和一致性理论的多智能体机器人协调控制策略，并对该策略的实现细节进行了详细介绍。

Polygon Area Decomposition for Multiple-Robot Workspace Division

Polygon Area Decomposition for Multiple-Robot Workspace Division
Susan Herty Vladimir Lumelskyz
We present a new polygon decomposition problem, the anchored area partition problem, which has applications to a multiple-robot terrain-covering problem. This problem concerns dividing a given polygon P into n polygonal pieces, each of a speci ed area and each containing a certain point (site) on its boundary. We rst present the algorithm for the case when P is convex and contains no holes. Then the generalized version that handles nonconvex and nonsimply connected polygons is presented. The algorithm uses sweep-line and divide-andconquer techniques to construct the polygon partition. The algorithm assumes the input polygon P has been divided into a set of p convex pieces (p = 1 when P is convex) and runs in time O(pn2 + vn), where v is the sum of the number of vertices of the convex pieces.

基于多目标遗传算法的双机器人协调焊接路径规划

基于多目标遗传算法的双机器人协调焊接路径规划侯仰强;王天琪;岳建锋;贾振威【摘要】对双机器人协调焊接复杂空间焊缝路径规划问题进行研究,提出了一种基于多目标遗传算法的焊接路径规划方案.通过分析影响焊缝质量及机器人运动平稳性的各参数,建立了评价焊接路径的目标函数:焊接质量函数、机器人运动平稳性函数以及双机器人碰撞函数.利用多目标遗传算法对协调焊接路径进行求解,实现对双机器人最佳焊接路径的规划.以"马鞍形"空间焊缝为例,对双机器人协调焊接路径进行了试验验证.结果表明,该方案可实现双机器人协调焊接路径的规划.【期刊名称】《中国机械工程》【年(卷),期】2018(029)016【总页数】6页(P1984-1989)【关键词】多目标遗传算法;双机器人;焊接;路径规划【作者】侯仰强;王天琪;岳建锋;贾振威【作者单位】天津工业大学天津市现代机电装备技术重点实验室,天津,300387;天津工业大学天津市现代机电装备技术重点实验室,天津,300387;天津工业大学天津市现代机电装备技术重点实验室,天津,300387;天津工业大学天津市现代机电装备技术重点实验室,天津,300387【正文语种】中文【中图分类】TP2730 引言双机器人协调焊接系统在提高焊接效率、保证焊接质量等方面具有独特的优势，已广泛运用于各种工业焊接生产线［1］。

为使双机器人可以快速、无碰撞地完成高质量焊缝的焊接任务，需要事先通过一定的算法对其进行焊接路径的规划，将规划好的机器人姿态信息导入机器人控制柜中，使其控制焊接机器人按照规划的路径进行焊接。

路径规划技术作为人工智能研究中的经典问题，众多学者对其开展了广泛的研究，提出了解决此类问题的多种算法，如人工势场法、遗传算法、模拟退火算法等［2⁃3］。

针对双机器人协调焊接作业问题，在路径规划时需要考虑焊枪姿态、焊接位置、机器人运动平稳性等多种因素，既要保证焊缝质量良好，又要保证机器人运动平稳、双机器人之间无碰撞，因此双机器人协调焊接路径规划可看为多目标规划问题［4］。

multibot

No single point of failure
Distributed Methods: Cons

Not all problems can be decomposed well Plans based only on local information Result: solutions are often highly sub-optimal
with transition rates
p (n, t Dt | n, t ) W (n | n; t ) lim Dt 0 Dt
In semi-Markov systems, transition rates are time dependent
From One Robot to Many
Simulation Results
Dealing with Uncertainty: Distributed Reactive Robots as Markov Processes

Individual robot state p(n,t) = probability robot is in state n at time t Markov property: robot’s state at time t+Dt depends only on its state at time t

Robust: no single point of failure Optimized, even under dynamic conditions Quick to respond to changes Able to deal with imperfect communication Able to allocate limited resources Heterogeneous and able to make use of different robot skills

基于强化学习的多机器人路径规划方法

２，１８１６，４
Ｒ２１１，１７３，５
２，４１３，１８
Ｒ３
３，１７１６，４１８，１９１１，３
Ｒ４
／
／
１９，１２１，１２
（ａ）
（ｂ）图１ＰＵＭＳＱ算法的多机器人移动轨迹在对分段再励函数、加权和分段再励函数及数学模型的分段再励函数的仿真实验中，其参数设置为：
在机器人领域，很多学者对路径规划作了大量的研究并提出一些方法，如四叉树法［１］、人工势场法［２］等等，然而此类规划方法很大一部分是基于
计思想，并在相同的仿真环境下［５］，证明了此方法的有效性。
环境模型已知的，当环境模型未知或动态变化时就不再适用。而多机器人协调避障方法，如主从控制法、动态优先级法［３］等等，大多建立在机器人间的通讯协商上，由于系统整体性能常常受通讯瓶颈的限制，因此上述方法由于各自的局限性将很难继续适用。
根据上述定义，分段再励函数的形式如下：
!１趋向目标
##－１远离目标
Ｒｍｏｖｅｔｏｇｏａｌ＝ "##－５
发生碰撞
（４）
$０其他
Ｒ＝ａｖｏｉｄｏｂｓｔａｃｌｅ１
!０．５１．５ｍ%ｄ０＜３ｍ且远离障碍物
#
"－０．５１．５ｍ＜ｄ０＜３ｍ且靠近障碍物（５）
#
$０无障碍物
器人时，首先检测当前障碍物或机器人与其的距
离状态信息，并以此状态信息为标准来选择再励
函数，在此距离下，若当前动作使得与障碍物或其
他机器人的距离加大，则给以正的奖励信号，否则

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。