I need a server to hold my primary (raw) data, that has not been subjected to processing or any other manipulation. I need this because i don't know what tool i will use in the future. At the moment i might use Splunk and Graylog2. But if i decide to use other tooling, it might be difficult to export the data. When the primary data is still available i can import that in the new tooling.
Starting points:
Incoming data will be stored locally on RAID10 redundant storage. It will then be synced over to a NFS share on a NAS for archiving and back-up. This way if the NAS suddenly isn't available, we will retry the rsync. If the NAS is available again the rsync will continue where it left off.
Configure a backup job on the NAS to remote storage.
CentOS 6.5
rsyslogd
vsftp
ssh/sftp
rsync