arXiv Paper Daily: Mon, 19 Jun 2017

栏目: 数据库 · 发布时间: 7年前

内容简介：arXiv Paper Daily: Mon, 19 Jun 2017

Neural and Evolutionary Computing

The Evolution of Neural Network-Based Chart Patterns: A Preliminary Study

Myoung Hoon Ha , Byung-Ro Moon

Comments: 8 pages, In proceedings of Genetic and Evolutionary Computation Conference (GECCO 2017), Berlin, Germany

Subjects

Neural and Evolutionary Computing (cs.NE)

A neural network-based chart pattern represents adaptive parametric features,

including non-linear transformations, and a template that can be applied in the

feature space. The search of neural network-based chart patterns has been

unexplored despite its potential expressiveness. In this paper, we formulate a

general chart pattern search problem to enable cross-representational

quantitative comparison of various search schemes. We suggest a HyperNEAT

framework applying state-of-the-art deep neural network techniques to find

attractive neural network-based chart patterns; These techniques enable a fast

evaluation and search of robust patterns, as well as bringing a performance

gain. The proposed framework successfully found attractive patterns on the

Korean stock market. We compared newly found patterns with those found by

different search schemes, showing the proposed approach has potential.

Evaluating Noisy Optimisation Algorithms: First Hitting Time is Problematic

Simon M. Lucas , JIalin Liu , Diego Pérez-Liébana

Comments: 4 pages, 4 figurs, 1 table

Subjects

Neural and Evolutionary Computing (cs.NE)

; Artificial Intelligence (cs.AI)

A key part of any evolutionary algorithm is fitness evaluation. When fitness

evaluations are corrupted by noise, as happens in many real-world problems as a

consequence of various types of uncertainty, a strategy is needed in order to

cope with this. Resampling is one of the most common strategies, whereby each

solution is evaluated many times in order to reduce the variance of the fitness

estimates. When evaluating the performance of a noisy optimisation algorithm, a

key consideration is the stopping condition for the algorithm. A frequently

used stopping condition in runtime analysis, known as “First Hitting Time”, is

to stop the algorithm as soon as it encounters the optimal solution. However,

this is unrealistic for real-world problems, as if the optimal solution were

already known, there would be no need to search for it. This paper argues that

the use of First Hitting Time, despite being a commonly used approach, is

significantly flawed and overestimates the quality of many algorithms in

real-world cases, where the optimum is not known in advance and has to be

genuinely searched for. A better alternative is to measure the quality of the

solution an algorithm returns after a fixed evaluation budget, i.e., to focus

on final solution quality. This paper argues that focussing on final solution

quality is more realistic and demonstrates cases where the results produced by

each algorithm evaluation method lead to very different conclusions regarding

the quality of each noisy optimisation algorithm.

Plan, Attend, Generate: Character-level Neural Machine Translation with Planning in the Decoder

Caglar Gulcehre , Francis Dutil , Adam Trischler , Yoshua Bengio

Comments: Accepted to Rep4NLP 2017 Workshop at ACL 2017 Conference

Subjects

Computation and Language (cs.CL)

; Neural and Evolutionary Computing (cs.NE)

We investigate the integration of a planning mechanism into an

encoder-decoder architecture with attention for character-level machine

translation. We develop a model that plans ahead when it computes alignments

between the source and target sequences, constructing a matrix of proposed

future alignments and a commitment vector that governs whether to follow or

recompute the plan. This mechanism is inspired by the strategic attentive

reader and writer (STRAW) model. Our proposed model is end-to-end trainable

with fully differentiable operations. We show that it outperforms a strong

baseline on three character-level decoder neural machine translation on WMT’15

corpus. Our analysis demonstrates that our model can compute qualitatively

intuitive alignments and achieves superior performance with fewer parameters.

Gradient Descent for Spiking Neural Networks

Dongsung Huh , Terrence J. Sejnowski Subjects : Neurons and Cognition (q-bio.NC) ; Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)

Much of studies on neural computation are based on network models of static

neurons that produce analog output, despite the fact that information

processing in the brain is predominantly carried out by dynamic neurons that

produce discrete pulses called spikes. Research in spike-based computation has

been impeded by the lack of efficient supervised learning algorithm for spiking

networks. Here, we present a gradient descent method for optimizing spiking

network models by introducing a differentiable formulation of spiking networks

and deriving the exact gradient calculation. For demonstration, we trained

recurrent spiking networks on two dynamic tasks: one that requires optimizing

fast (~millisecond) spike-based interactions for efficient encoding of

information, and a delayed memory XOR task over extended duration (~second).

The results show that our method indeed optimizes the spiking network dynamics

on the time scale of individual spikes as well as behavioral time scales. In

conclusion, our result offers a general purpose supervised learning algorithm

for spiking neural networks, thus advancing further investigations on

spike-based computation.

Computer Vision and Pattern Recognition

Perceptual Generative Adversarial Networks for Small Object Detection

Jianan Li , Xiaodan Liang , Yunchao Wei , Tingfa Xu , Jiashi Feng , Shuicheng Yan Subjects : Computer Vision and Pattern Recognition (cs.CV)

Detecting small objects is notoriously challenging due to their low

resolution and noisy representation. Existing object detection pipelines

usually detect small objects through learning representations of all the

objects at multiple scales. However, the performance gain of such ad hoc

architectures is usually limited to pay off the computational cost. In this

work, we address the small object detection problem by developing a single

architecture that internally lifts representations of small objects to

“super-resolved” ones, achieving similar characteristics as large objects and

thus more discriminative for detection. For this purpose, we propose a new

Perceptual Generative Adversarial Network (Perceptual GAN) model that improves

small object detection through narrowing representation difference of small

objects from the large ones. Specifically, its generator learns to transfer

perceived poor representations of the small objects to super-resolved ones that

are similar enough to real large objects to fool a competing discriminator.

Meanwhile its discriminator competes with the generator to identify the

generated representation and imposes an additional perceptual requirement –

generated representations of small objects must be beneficial for detection

purpose – on the generator. Extensive evaluations on the challenging

Tsinghua-Tencent 100K and the Caltech benchmark well demonstrate the

superiority of Perceptual GAN in detecting small objects, including traffic

signs and pedestrians, over well-established state-of-the-arts.

Multispectral and Hyperspectral Image Fusion Using a 3-D-Convolutional Neural Network

Frosti Palsson , Johannes R. Sveinsson , Magnus O. Ulfarsson Subjects : Computer Vision and Pattern Recognition (cs.CV) ; Machine Learning (stat.ML)

In this paper, we propose a method using a three dimensional convolutional

neural network (3-D-CNN) to fuse together multispectral (MS) and hyperspectral

(HS) images to obtain a high resolution hyperspectral image. Dimensionality

reduction of the hyperspectral image is performed prior to fusion in order to

significantly reduce the computational time and make the method more robust to

noise. Experiments are performed on a data set simulated using a real

hyperspectral image. The results obtained show that the proposed approach is

very promising when compared to conventional methods. This is especially true

when the hyperspectral image is corrupted by additive noise.

Self-ensembling for domain adaptation

Geoffrey French , Michal Mackiewicz , Mark Fisher

Comments: 15 pages, 1 figure, submitted to BMVC 2017

Subjects

Computer Vision and Pattern Recognition (cs.CV)

This paper explores the use of self-ensembling with random image augmentation

— a technique that has achieved impressive results in the area of

semi-supervised learning — for visual domain adaptation problems. We modify

the approach of Laine et al. to improve stability and ease of use. Our approach

demonstrates state of the art results when performing adaptation between the

following pairs of datasets: MNIST and USPS, CIFAR-10 and STL, SVHN and MNIST,

Syn-Digits to SVHN and Syn-Signs to GTSRB. We also explore the use of richer

data augmentation to solve the challenging MNIST to SVHN adaptation path.

Dynamic Filters in Graph Convolutional Networks

Nitika Verma , Edmond Boyer , Jakob Verbeek Subjects : Computer Vision and Pattern Recognition (cs.CV)

Convolutional neural networks (CNNs) have massively impacted visual

recognition in 2D images, and are now ubiquitous in state-of-the-art

approaches. While CNNs naturally extend to other domains, such as audio and

video, where data is also organized in rectangular grids, they do not easily

generalize to other types of data such as 3D shape meshes, social network

graphs or molecular graphs. To handle such data, we propose a novel

graph-convolutional network architecture that builds on a generic formulation

that relaxes the 1-to-1 correspondence between filter weights and data elements

around the center of the convolution. The main novelty of our architecture is

that the shape of the filter is a function of the features in the previous

network layer, which is learned as an integral part of the neural network.

Experimental evaluations on digit recognition, semi-supervised document

classification, and 3D shape correspondence yield state-of-the-art results,

significantly improving over previous work for shape correspondence.

Interactive 3D Modeling with a Generative Adversarial Network

Jerry Liu , Fisher Yu , Thomas Funkhouser Subjects : Computer Vision and Pattern Recognition (cs.CV) ; Graphics (cs.GR)

This paper proposes the idea of using a generative adversarial network (GAN)

to assist a novice user in designing real-world shapes with a simple interface.

The user edits a voxel grid with a painting interface (like Minecraft). Yet, at

any time, he/she can execute a SNAP command, which projects the current voxel

grid onto a latent shape manifold with a learned projection operator and then

generates a similar, but more realistic, shape using a learned generator

network. Then the user can edit the resulting shape and snap again until he/she

is satisfied with the result. The main advantage of this approach is that the

projection and generation operators assist novice users to create 3D models

characteristic of a background distribution of object shapes, but without

having to specify all the details. The core new research idea is to use a GAN

to support this application. 3D GANs have previously been used for shape

generation, interpolation, and completion, but never for interactive modeling.

The new challenge for this application is to learn a projection operator that

takes an arbitrary 3D voxel model and produces a latent vector on the shape

manifold from which a similar and realistic shape can be generated. We develop

algorithms for this and other steps of the SNAP processing pipeline and

integrate them into a simple modeling tool. Experiments with these algorithms

and tool suggest that GANs provide a promising approach to computer-assisted

interactive modeling.

A Fully Trainable Network with RNN-based Pooling

Shuai Li , Wanqing Li , Chris Cook , Ce Zhu , Yanbo Gao

Comments: 17 pages, 5 figures, 4 tables

Subjects

Computer Vision and Pattern Recognition (cs.CV)

Pooling is an important component in convolutional neural networks (CNNs) for

aggregating features and reducing computational burden. Compared with other

components such as convolutional layers and fully connected layers which are

completely learned from data, the pooling component is still handcrafted such

as max pooling and average pooling. This paper proposes a learnable pooling

function using recurrent neural networks (RNN) so that the pooling can be fully

adapted to data and other components of the network, leading to an improved

performance. Such a network with learnable pooling function is referred to as a

fully trainable network (FTN). Experimental results have demonstrated that the

proposed RNN-based pooling can well approximate the existing pooling functions

and improve the performance of the network. Especially for small networks, the

proposed FTN can improve the performance by seven percentage points in terms of

error rate on the CIFAR-10 dataset compared with the traditional CNN.

The Monkeytyping Solution to the YouTube-8M Video Understanding Challenge

He-Da Wang , Teng Zhang , Ji Wu

Comments: Submitted to the CVPR 2017 Workshop on YouTube-8M Large-Scale Video Understanding

Subjects

Computer Vision and Pattern Recognition (cs.CV)

This article describes the final solution of team monkeytyping, who finished

in second place in the YouTube-8M video understanding challenge. The dataset

used in this challenge is a large-scale benchmark for multi-label video

classification. We extend the work in [1] and propose several improvements for

frame sequence modeling. We propose a network structure called Chaining that

can better capture the interactions between labels. Also, we report our

approaches in dealing with multi-scale information and attention pooling. In

addition, We find that using the output of model ensemble as a side target in

training can boost single model performance. We report our experiments in

bagging, boosting, cascade, and stacking, and propose a stacking algorithm

called attention weighted stacking. Our final submission is an ensemble that

consists of 74 sub models, all of which are listed in the appendix.

Symplectomorphic registration with phase space regularization by entropy spectrum pathways

Vitaly L. Galinsky , Lawrence R. Frank

Comments: 26 pages, 7 figures

Subjects

Computer Vision and Pattern Recognition (cs.CV)

The ability to register image data to a common coordinate system is a

critical feature of virtually all imaging studies that require multiple subject

analysis, combining single subject data from multiple modalities, or both.

However, in spite of the abundance of literature on the subject and the

existence of several variants of registration algorithms, their practical

utility remains problematic, as commonly acknowledged even by developers of

these methods because the complexity of the problem has resisted a general,

flexible, and robust theoretical and computational framework.

To address this issue, we present a new registration method that is similar

in spirit to the current state-of-the-art technique of diffeomorphic mapping,

but is more general and flexible. The method utilizes a Hamiltonian formalism

and constructs registration as a sequence of symplectomorphic maps in

conjunction with a novel phase space regularization based on the powerful

entropy spectrum pathways (ESP) framework.

The method is demonstrated on the three different magnetic resonance imaging

(MRI) modalities routinely used for human neuroimaging applications by mapping

between high resolution anatomical (HRA) volumes, medium resolution diffusion

weighted MRI (DW-MRI) and HRA volumes, and low resolution functional MRI (fMRI)

and HRA volumes. The typical processing time for high quality mapping ranges

from less than a minute to several minutes on a modern multi core CPU for

typical high resolution anatomical (~256x256x256 voxels) MRI volumes.

Face Clustering: Representation and Pairwise Constraints

Yichun Shi , Charles Otto , Anil K. Jain

Comments: 13 pages, journal paper

Subjects

Computer Vision and Pattern Recognition (cs.CV)

Clustering face images according to their identity has two important

applications: (i) grouping a collection of face images when no external labels

are associated with images, and (ii) indexing for efficient large scale face

retrieval. The clustering problem is composed of two key parts: face

representation and choice of similarity for grouping faces. We first propose a

representation based on ResNet, which has been shown to perform very well in

image classification problems. Given this representation, we design a

clustering algorithm, Conditional Pairwise Clustering (ConPaC), which directly

estimates the adjacency matrix only based on the similarity between face

images. This allows a dynamic selection of number of clusters and retains

pairwise similarity between faces. ConPaC formulates the clustering problem as

a Conditional Random Field (CRF) model and uses Loopy Belief Propagation to

find an approximate solution for maximizing the posterior probability of the

adjacency matrix. Experimental results on two benchmark face datasets (LFW and

IJB-B) show that ConPaC outperforms well known clustering algorithms such as

k-means, spectral clustering and approximate rank-order. Additionally, our

algorithm can naturally incorporate pairwise constraints to obtain a

semi-supervised version that leads to improved clustering performance. We also

propose an k-NN variant of ConPaC, which has a linear time complexity given a

k-NN graph, suitable for large datasets.

Hierarchical Label Inference for Video Classification

Nelson Nauata , Jonathan Smith , Greg Mori Subjects : Computer Vision and Pattern Recognition (cs.CV)

Videos are a rich source of high-dimensional structured data, with a wide

range of interacting components at varying levels of granularity. In order to

improve understanding of unconstrained internet videos, it is important to

consider the role of labels at separate levels of abstraction. In this paper,

we consider the use of the Bidirectional Inference Neural Network (BINN) for

performing graph-based inference in label space for the task of video

classification. We take advantage of the inherent hierarchy between labels at

increasing granularity. The BINN is evaluated on the first and second release

of the YouTube-8M large scale multilabel video dataset. Our results demonstrate

the effectiveness of BINN, achieving significant improvements against baseline

models.

Robotic Ironing with 3D Perception and Force/Torque Feedback in Household Environments

David Estevez , Juan G. Victores , Raul Fernandez-Fernandez , Carlos Balaguer

Comments: Accepted and to be published on the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2017) that will be held in Vancouver, Canada, September 24-28, 2017

Subjects

Robotics (cs.RO)

; Computer Vision and Pattern Recognition (cs.CV)

As robotic systems become more popular in household environments, the

complexity of required tasks also increases. In this work we focus on a

domestic chore deemed dull by a majority of the population, the task of

ironing. The presented algorithm improves on the limited number of previous

works by joining 3D perception with force/torque sensing, with emphasis on

finding a practical solution with a feasible implementation in a domestic

setting. Our algorithm obtains a point cloud representation of the working

environment. From this point cloud, the garment is segmented and a custom

Wrinkleness Local Descriptor (WiLD) is computed to determine the location of

the present wrinkles. Using this descriptor, the most suitable ironing path is

computed and, based on it, the manipulation algorithm performs the

force-controlled ironing operation. Experiments have been performed with a

humanoid robot platform, proving that our algorithm is able to detect

successfully wrinkles present in garments and iteratively reduce the

wrinkleness using an unmodified iron.

A new look at clustering through the lens of deep convolutional neural networks

Ali Borji , Aysegul Dundar Subjects : Learning (cs.LG) ; Computer Vision and Pattern Recognition (cs.CV)

Classification and clustering have been studied separately in machine

learning and computer vision. Inspired by the recent success of deep learning

models in solving various vision problems (e.g., object recognition, semantic

segmentation) and the fact that humans serve as the gold standard in assessing

clustering algorithms, here, we advocate for a unified treatment of the two

problems and suggest that hierarchical frameworks that progressively build

complex patterns on top of the simpler ones (e.g., convolutional neural

networks) offer a promising solution. We do not dwell much on the learning

mechanisms in these frameworks as they are still a matter of debate, with

respect to biological constraints. Instead, we emphasize on the

compositionality of the real world structures and objects. In particular, we

show that CNNs, trained end to end using back propagation with noisy labels,

are able to cluster data points belonging to several overlapping shapes, and do

so much better than the state of the art algorithms. The main takeaway lesson

from our study is that mechanisms of human vision, particularly the hierarchal

organization of the visual ventral stream should be taken into account in

clustering algorithms (e.g., for learning representations in an unsupervised

manner or with minimum supervision) to reach human level clustering

performance. This, by no means, suggests that other methods do not hold merits.

For example, methods relying on pairwise affinities (e.g., spectral clustering)

have been very successful in many cases but still fail in some cases (e.g.,

overlapping clusters).

Distance weighted discrimination of face images for gender classification

Mónica Benito , Eduardo García-Portugués , J. S. Marron , Daniel Peña

Comments: 9 pages, 4 figures, 1 table

Subjects

Applications (stat.AP)

; Computer Vision and Pattern Recognition (cs.CV); Methodology (stat.ME)

We illustrate the advantages of distance weighted discrimination for

classification and feature extraction in a High Dimension Low Sample Size

(HDLSS) situation. The HDLSS context is a gender classification problem of face

images in which the dimension of the data is several orders of magnitude larger

than the sample size. We compare distance weighted discrimination with Fisher’s

linear discriminant, support vector machines, and principal component analysis

by exploring their classification interpretation through insightful

visuanimations and by examining the classifiers’ discriminant errors. This

analysis enables us to make new contributions to the understanding of the

drivers of human discrimination between males and females.

Artificial Intelligence

Value-Decomposition Networks For Cooperative Multi-Agent Learning

Peter Sunehag , Guy Lever , Audrunas Gruslys , Wojciech Marian Czarnecki , Vinicius Zambaldi , Max Jaderberg , Marc Lanctot , Nicolas Sonnerat , Joel Z. Leibo , Karl Tuyls , Thore Graepel Subjects : Artificial Intelligence (cs.AI)

We study the problem of cooperative multi-agent reinforcement learning with a

single joint reward signal. This class of learning problems is difficult

because of the often large combined action and observation spaces. In the fully

centralized and decentralized approaches, we find the problem of spurious

rewards and a phenomenon we call the “lazy agent” problem, which arises due to

partial observability. We address these problems by training individual agents

with a novel value decomposition network architecture, which learns to

decompose the team value function into agent-wise value functions. We perform

an experimental evaluation across a range of partially-observable multi-agent

domains and show that learning such value-decompositions leads to superior

results, in particular when combined with weight sharing, role information and

information channels.

From Propositional Logic to Plausible Reasoning: A Uniqueness Theorem

Kevin S. Van Horn

Comments: Submitted to Int’l Journal of Approximate Reasoning

Subjects

Artificial Intelligence (cs.AI)

; Logic in Computer Science (cs.LO)

We consider the question of extending propositional logic to a logic of

plausible reasoning, and posit four requirements that any such extension should

satisfy. Each is a requirement that some property of classical propositional

logic be preserved in the extended logic; as such, the requirements are simpler

and less problematic than those used in Cox’s Theorem and its variants. As with

Cox’s Theorem, our requirements imply that the extended logic must be

isomorphic to (finite-set) probability theory. We also obtain specific

numerical values for the probabilities, recovering the classical definition of

probability as a theorem, with truth assignments that satisfy the premise

playing the role of the “possible cases.”

Improving Scalability of Inductive Logic Programming via Pruning and Best-Effort Optimisation

Mishal Kazmi , Peter Schüller , Yücel Saygın

Comments: 24 pages, preprint of article accepted at Expert Systems With Applications

Subjects

Artificial Intelligence (cs.AI)

Inductive Logic Programming (ILP) combines rule-based and statistical

artificial intelligence methods, by learning a hypothesis comprising a set of

rules given background knowledge and constraints for the search space. We focus

on extending the XHAIL algorithm for ILP which is based on Answer Set

Programming and we evaluate our extensions using the Natural Language

Processing application of sentence chunking. With respect to processing natural

language, ILP can cater for the constant change in how we use language on a

daily basis. At the same time, ILP does not require huge amounts of training

examples such as other statistical methods and produces interpretable results,

that means a set of rules, which can be analysed and tweaked if necessary. As

contributions we extend XHAIL with (i) a pruning mechanism within the

hypothesis generalisation algorithm which enables learning from larger

datasets, (ii) a better usage of modern solver technology using recently

developed optimisation methods, and (iii) a time budget that permits the usage

of suboptimal results. We evaluate these improvements on the task of sentence

chunking using three datasets from a recent SemEval competition. Results show

that our improvements allow for learning on bigger datasets with results that

are of similar quality to state-of-the-art systems on the same task. Moreover,

we compare the hypotheses obtained on datasets to gain insights on the

structure of each dataset.

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Mike Lewis , Denis Yarats , Yann N. Dauphin , Devi Parikh , Dhruv Batra Subjects : Artificial Intelligence (cs.AI) ; Computation and Language (cs.CL)

Much of human dialogue occurs in semi-cooperative settings, where agents with

different goals attempt to agree on common decisions. Negotiations require

complex communication and reasoning skills, but success is easy to measure,

making this an interesting task for AI. We gather a large dataset of

human-human negotiations on a multi-issue bargaining task, where agents who

cannot observe each other’s reward functions must reach an agreement (or a

deal) via natural language dialogue. For the first time, we show it is possible

to train end-to-end models for negotiation, which must learn both linguistic

and reasoning skills with no annotated dialogue states. We also introduce

dialogue rollouts, in which the model plans ahead by simulating possible

complete continuations of the conversation, and find that this technique

dramatically improves performance. Our code and dataset are publicly available

( this https URL ).

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Junhyuk Oh , Satinder Singh , Honglak Lee , Pushmeet Kohli

Comments: ICML 2017

Subjects

Artificial Intelligence (cs.AI)

; Learning (cs.LG)

As a step towards developing zero-shot task generalization capabilities in

reinforcement learning (RL), we introduce a new RL problem where the agent

should learn to execute sequences of instructions after learning useful skills

that solve subtasks. In this problem, we consider two types of generalizations:

to previously unseen instructions and to longer sequences of instructions. For

generalization over unseen instructions, we propose a new objective which

encourages learning correspondences between similar subtasks by making

analogies. For generalization over sequential instructions, we present a

hierarchical architecture where a meta controller learns to use the acquired

skills for executing the instructions. To deal with delayed reward, we propose

a new neural architecture in the meta controller that learns when to update the

subtask, which makes learning more efficient. Experimental results on a

stochastic 3D domain show that the proposed ideas are crucial for

generalization to longer instructions as well as unseen instructions.

Conjunctions of Among Constraints

Victor Dalmau

Comments: 15 pages plus appendix

Subjects

Artificial Intelligence (cs.AI)

; Logic in Computer Science (cs.LO)

Many existing global constraints can be encoded as a conjunction of among

constraints. An among constraint holds if the number of the variables in its

scope whose value belongs to a prespecified set, which we call its range, is

within some given bounds. It is known that domain filtering algorithms can

benefit from reasoning about the interaction of among constraints so that

values can be filtered out taking into consideration several among constraints

simultaneously. The present pa- per embarks into a systematic investigation on

the circumstances under which it is possible to obtain efficient and complete

domain filtering algorithms for conjunctions of among constraints. We start by

observing that restrictions on both the scope and the range of the among

constraints are necessary to obtain meaningful results. Then, we derive a

domain flow-based filtering algorithm and present several applications. In

particular, it is shown that the algorithm unifies and generalizes several

previous existing results.

Collaborative vehicle routing: a survey

Margaretha Gansterer , Richard F. Hartl Subjects : Multiagent Systems (cs.MA) ; Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Optimization and Control (math.OC); Physics and Society (physics.soc-ph)

In horizontal collaborations, carriers form coalitions in order to perform

parts of their logistics operations jointly. By exchanging transportation

requests among each other, they can operate more efficiently and in a more

sustainable way. Collaborative vehicle routing has been extensively discussed

in the literature. We identify three major streams of research: (i) centralized

collaborative planning, (ii) decentralized planning without auctions, and (ii)

auction-based decentralized planning. For each of them we give a structured

overview on the state of knowledge and discuss future research directions.

Structured Best Arm Identification with Fixed Confidence

Ruitong Huang , Mohammad M. Ajallooeian , Csaba Szepesvári , Martin Müller Subjects : Learning (cs.LG) ; Artificial Intelligence (cs.AI)

We study the problem of identifying the best action among a set of possible

options when the value of each action is given by a mapping from a number of

noisy micro-observables in the so-called fixed confidence setting. Our main

motivation is the application to the minimax game search, which has been a

major topic of interest in artificial intelligence. In this paper we introduce

an abstract setting to clearly describe the essential properties of the

problem. While previous work only considered a two-move game tree search

problem, our abstract setting can be applied to the general minimax games where

the depth can be non-uniform and arbitrary, and transpositions are allowed. We

introduce a new algorithm (LUCB-micro) for the abstract setting, and give its

lower and upper sample complexity results. Our bounds recover some previous

results, which were only available in more limited settings, while they also

shed further light on how the structure of minimax problems influence sample

complexity.

AI-Powered Social Bots

Terrence Adams

Comments: 2 figures

Subjects

Social and Information Networks (cs.SI)

; Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Multimedia (cs.MM)

This paper gives an overview of impersonation bots that generate output in

one, or possibly, multiple modalities. We also discuss rapidly advancing areas

of machine learning and artificial intelligence that could lead to

frighteningly powerful new multi-modal social bots. Our main conclusion is that

most commonly known bots are one dimensional (i.e., chatterbot), and far from

deceiving serious interrogators. However, using recent advances in machine

learning, it is possible to unleash incredibly powerful, human-like armies of

social bots, in potentially well coordinated campaigns of deception and

influence.

Bib2vec: An Embedding-based Search System for Bibliographic Information

Takuma Yoneda , Koki Mori , Makoto Miwa , Yutaka Sasaki

Comments: EACL2017 extended version

Journal-ref: Proceedings of the EACL 2017 Software Demonstrations, Valencia,

Spain, April 3-7 2017, pages 112-115

Subjects

Computation and Language (cs.CL)

; Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)

We propose a novel embedding model that represents relationships among

several elements in bibliographic information with high representation ability

and flexibility. Based on this model, we present a novel search system that

shows the relationships among the elements in the ACL Anthology Reference

Corpus. The evaluation results show that our model can achieve a high

prediction ability and produce reasonable search results.

An Overview of Multi-Task Learning in Deep Neural Networks

Sebastian Ruder

Comments: 14 pages, 8 figures

Subjects

Learning (cs.LG)

; Artificial Intelligence (cs.AI); Machine Learning (stat.ML)

Multi-task learning (MTL) has led to successes in many applications of

machine learning, from natural language processing and speech recognition to

computer vision and drug discovery. This article aims to give a general

overview of MTL, particularly in deep neural networks. It introduces the two

most common methods for MTL in Deep Learning, gives an overview of the

literature, and discusses recent advances. In particular, it seeks to help ML

practitioners apply MTL by shedding light on how MTL works and providing

guidelines for choosing appropriate auxiliary tasks.