Big Data for DBAs

Big Data is such a wide scope term so I want to know what exactly is DBAs role in Big Data world?
Thanks,
Abhi

Hello Rajendra,
There are several article / tutorials regarding this topic online available, e.g. at MVA = Microsoft Virtual Academy:
http://www.microsoftvirtualacademy.com/training-courses/big-data-analytics
Olaf Helper
[ Blog] [ Xing] [ MVP]

Similar Messages

  • Big Data example

    Hi all, 
    I hear the term big data for quite some time now...
    Whenever I look on the web I only find infrastructure explanation...
    What does it mean in terms of T-Sql coding? structural storage on sql server (are tables used?)? Is there a "Hello World" example for big data.
    Sorry if my question is a bit wierd but that's how I've always started learning any new programming language or API.
    Thanks in advance, 
    Dror

    Hi,
    what is big data?
    from wiki
    Big data is an all-encompassing term for any collection of dataset so large and complex that it becomes difficult to process using on-hand data management tools or traditional data processing applications.
    http://en.wikipedia.org/wiki/Big_data
    What is Hadoop?
    Hadoop is designed to efficiently process large volumes of information by connecting many commodity computers together to work in parallel
    https://developer.yahoo.com/hadoop/tutorial/
    Hadoop distribution for Microsoft
    http://hortonworks.com/partner/microsoft/
    Microsoft PDW
    http://gnanadurai.blogspot.in/

  • OEID Big Data Connector Integration

    In the oracle big data solution , it shows OEID can source from Oracle Big Data connectors to ingest Big Data for exploration and discovery.
    Is there any documentation around as to how to achieve this ?
    Can anyone share any information ?

    It depends on the connector. Oracle Loader for Hadoop runs on the Hadoop cluster, and does not reside on the Oracle Database. It can reside on a node from where you submit your map reduce nodes - which might or might be a node on the cluster. Oracle Direct Connector for HDFS resides on Oracle Database.

  • Certifications for "Appliances" & more Big Data certifications Please

    It seems Oracle is committing towards appliances . . . database appliance, big data appliance, Exadata and Exalytics In-Memory Machine.
    Will there be a shift in certification to match?
    For example there is a appliance certification out there already . . . the Exadata certification . . . so extending this idea there could (or should be) certifications related to how to support the other appliances
    I would like to see a certification related to Big Data . . . how about a NoSQL certification?

    910086 wrote:
    It seems Oracle is committing towards appliances . . . database appliance, big data appliance, Exadata and Exalytics In-Memory Machine.
    Will there be a shift in certification to match?
    For example there is a appliance certification out there already . . . the Exadata certification . . . so extending this idea there could (or should be) certifications related to how to support the other appliances
    I would like to see a certification related to Big Data . . . how about a NoSQL certification?On the certification website there is a Oracle Certified Database Cloud Administrator track published ( http://education.oracle.com/pls/web_prod-plq-dad/db_pages.getpage?page_id=458&get_params=p_track_id:CLOUDOCP) .... (some relevant exams in beta (exadata) or just about to go beta ) ... There is Oracle Cloud Application Foundation Certified Implementation Specialist ... the 12c DBA OCP Database upgrade exam (from 11g DBA OCP) is present but not bookable on Pearson VUE (last time I looked) ... waiting for 12c Database release (should be soon).
    There's some matching things on the systems side too: http://education.oracle.com/pls/web_prod-plq-dad/db_pages.getpage?page_id=632 (see under systems).
    If I goto the 'big data' technology home page on OTN (http://www.oracle.com/us/technologies/big-data/index.html) ... and quite frankly its the first time I've been there as far as I remember, I see a lot of technologies brought togther ... some are (relatively) old , some seem relatively new. Some have associated certifications already ... some do not ... and I suspect some never will (not everything has to have a certication).
    When a technology is in its infancy the developers, champions and first users will be getting to grips with it, learning it, developing it, learning how to appy it. When this has matured enough the expertise will also have developed enough to take stock and produce a certification.
    I am also aware Oracle are aware of the ability of a certification to incentivise candiates (and oracle partners) to get onto evaluating or adopting a particular technology or new features (overall this probably benefits both sides ... but it can mean a certification may be slightly skewed towards new features or aspects Oracle wishes to promote ... especially for OCS).

  • I am using the big date calendar template and when I submit it to apple for printing I lose the name of two months. These names are not text boxes. I see the names when I send it in but something happens during the transmission to apple. It was suggested

    I am using the big date calendar template in iPhoto. I am on Lion 10.7.2, macbook air. The names of the months are on each calendar page but something happens when I send the data to Apple. The names are part of the template. They are not text boxes. I lose two names on the calendar after it is sent to Apple. Apple suggested I make a pdf file of my calendar before sending it in and check to make sure every name shows. I did this with a calendar I just sent in. The calendar was correct. All names of the months were showing. After sending the data two month names disappeard because when it arrived by mail, it was incorrect. Apple looked at my calendar via a pdf file and it was incorrect.  This is second time this has happened. I called Apple and they had me delete several folders in the Library folder, some preferences and do a complete reinstall of iPhoto.  I have not yet remade the defective calendar. I am wondering if anyone else has had this problem?
    kathy

    Control-click on the background of the view all pages window and select "Preview Calendar" from the contextual menu.
    You can also save the pdf as a file to compare to the printed calendar.  If the two names are visible in the pdf file then the printed copy should show them.  Contact Apple for a refund.  Apple Print Products - Apple Store (U.S.)

  • Best practices for administering Oracle Big Data Appliance

    -        Best practices as part of administration of Oracle Big Data Infrastructure
    -        How do we lock down max space usage per project
    Eg: Project team A can have a max limit of 10 TB space allocated
    -        Restricting roles, access ( Read, Write), place holder for common shared artifacts
    -        Template/procedure for code migration across dev,qa and prod environments etc

    Your data is bigger than I run, but what I have done in the past is to restrict their accounts to a separate datafile and limit its size to the max that I want for them to use: create objects restricted to accommodate the location.

  • What is the best big data solution for interactive queries of rows with up?

    0 down vote favorite
    We have a simple table such as follows:
    | Name | Attribute1 | Attribute2 | Attribute3 | ... | Attribute200 |
    | Name1 | Value1 | Value2 | null | ... | Value3 |
    | Name2 | null | Value4 | null | ... | Value5 |
    | Name3 | Value6 | null | Value7 | ... | null |
    | ... |
    But there could be up to hundreds of millions of rows/names. The data will be populated every hour or so.
    The goal is to get results for interactive queries on the data within a couple of seconds.
    Most queries look like:
    select count(*) from table
    where Attribute1 = Value1 and Attribute3 = Value3 and Attribute113 = Value113;
    The where clause contains arbitrary number of attribute name-value pairs.
    I'm new in big data and wondering what the best option is in terms of data store (MySQL, HBase, Cassandra, etc) and processing engine (Hadoop, Drill, Storm, etc) for interactive queries like above.

    Hi,
    As always, the correct answer is "it depends".
    - Will there be more reads (queries) or writes (INSERTs)?
    - Will there be any UPDATEs?
    - Does the use case require any of the ACID guarantees, or would "eventual consistency" be fine?
    At first glance, Hadoop (HDFS + MapReduce) doesn't look like a viable option, since you require "interactive queries". Also, if you require any level of ACID guarantees or UPDATE capabilities the best (and arguably only) solution is a RDBMS. Also, keep in mind that Millions of rows is pocket change for modern RDBMSs on average hardware.
    On the other hand, if there'll be a lot more queries than inserts, VERY few or no updates at all, and eventual consistency will not be a problem, I'd probably recommend you to test a Key-Value store (such as Oracle NoSQL Database). The idea would be to use (AttributeX,ValueY) as the Key, and a Sorted List of Names that have ValueY for their AttributeX. This way you only do as many reads as attributes you have in the WHERE clause, and then compute the intersection (very easy and fast with sorted lists).
    Also, I'd do this computation manually. SQL may be comfortable, but I don't think It's Big Data ready yet (unless you chose the RDBMS way, of course).
    I hope it helped,
    Joan
    Edited by: JPuig on Apr 23, 2013 1:45 AM

  • Strategy for big data

    Dear experts,
    Currently i'm facing Big Data problem. We have an about 1TB transaction record for Per Month.
    Now I'm trying to create Data Marts for that. And Install Obiee. What is the Strategy And Steps?
    Please Advice...
    BR,
    Eba

    Denis,
    In this case you can do it two ways.
    1. Proxies - You will have to develop a custom report which will collect all the data that needs to be sent and call the PROXY will the collected as input.
    2. IDOCs - If you are dealing with standard IDOCS, this is easier. You can activate the configuration to send the IDOCS for contracts for all the operations that you have mentioned. Do the required outbound configuration in WE20 to mention the target system as XI.
    I am not sure why are you even thinking of scheduling a BPM in XI that will invoke the RFC. SAP as such has got the scheduling capabilities. I would rather suggest you to use that.
    Regards,
    Ravi

  • Working with R packages for Big Data

    Hi ,
    I wonder which R package from it big data an parallel processing family are relevent to work with in ML Studio ?
    It depends on if ML Studio uses Map Reduce during R script ? If yes , RHadoop package seems not useful .
    If using snowfall package for parallel processing will help for high volume datasets . If it will exploit several CPU ?
    Thanks in advance

    Currently, the R scripts are executed on single VM. You can manually set up map-reduce pattern by splitting the data and having multiple Execute R Script modules in parallel in your experiment graph.
    -Roope

  • Scripts for OTN Developer Day - Big Data

    Hi,
    I was wondering if anybody knows where to find the scripts used for "OTN Developer Day - Big Data" which was held during February/March 2014?
    The "Lab Guide" can be found in: https://www.oracle.com/webfolder/s/delivery_production/docs/FY14h1/BigDataWorkshop.pdf
    Regards,
    Babak.

    Not sure if these are the exact same ones, but it should be very close: Oracle Big Data Lite Virtual Machine
    It will also get you to the page where the VM is frequently updated and the HOL sections come with the updated VM.
    JP

  • GoldenGate for Big Data 12c for Win x64?

    I was looking for the GoldenGate for Big Data download for Win x64 and all I found on edelivery was Linux, Solaris, HP-UX and AIX platforms, but no Windows at all (see the screenshot below). I wonder if it's been released yet? Or, is it just an unfortunate omission?
    Thanks
    Andy

    Thanks, for your reply, Karan!
    I tried following your advice, but bumped into yet another similar problem. I've installed OGG 12c and now I can't seem to be able to find the matching version of the GG Application Adapters for JMS and Flat File for the Win x64 platform. The latest version of Application Adapter available on edelivery is 11.1.1.0.0 which means I need to downgrade OGG to the same version. No big deal but I wanted to make sure I'm not missing anything.
    I wonder if anybody has any idea as to whether Application Adapters 12c for JMS and Flat File is available for the Win x64 platform, and if so, where can I download it from?
    Thanks
    Andy

  • Big Data - Learn - Easy? for Microsoft SQL Server Deveoloper and MSBI?

    I Posted it already, but reposting here as hoping that i will get more response in this forum .  Can I leverage my existing MSBI skills to learn Big Data or something I have learn it from scratch. is learning
    Big Data would helpful to my career ?
    Thanks, Rajendra

    Hello Rajendra,
    There are several article / tutorials regarding this topic online available, e.g. at MVA = Microsoft Virtual Academy:
    http://www.microsoftvirtualacademy.com/training-courses/big-data-analytics
    Olaf Helper
    [ Blog] [ Xing] [ MVP]

  • Big Data

    Really , this new technology could not have another name . In Portuguese , in a literal translation , can we say big data , or even analytic function of a large volume of data , structured or unstructured , which are determined by sound, images , numbers and even personalities , guys . This analytic function , which is a statistical function can determine trends for a given sequence of actions on the internet . For example , in my case , in a certain period of time , I created a sequence of groups with the same configuration and the same format in use. Therefore, the analysis may indicate a tendency to create new groups with the same goal . Soon after , this analysis indicates the creation of a page as an author and a daily event . Thus , including the analysis of texts and interactions of tastes and commitments of the texts , we can come to a conclusion : this guy writes every day.
    Well, so far we see that whosoever will be effective in the use of this new technology , analysis tool , should have basic knowledge of statistics, just math .
    So here comes the question : professionals in the exact sciences will be ahead in this technology field ? Or human issue , referring to the tastes and engagements also involve professionals in the humanities ?
    I understand that without the two views , the analysis will be distorted .
    With the two views we can say that in my case , the analysis may indicate that social groups , are actually short stories written in the social network , also called " social books " ( " social book " in English ) .
    Recommendation : Spot staff, learn to hear about human feelings , human and personal , learn to such statistics.
    So you can collaborate , work together as a team .

    975791 wrote:
    Hi All,
    I have around few years experience ( L2  level) in Oracle DBA.I recently joined in a company.
    You realize, of course, that "L2 level" is not an industry-wide term an means nothing outside of your own organization's usage.
    When I first started in this business (1981) my company had 3 "levels' of "programmer-analys" -- PA-1, PA-2, and PA-3.  The PA-1 was the most junior and the PA-3 was the most senior.  In my next job they had exactly the same job titles.  But there PA-1 was most senior and PA-3 was most junior.
    "When I use a word," Humpty Dumpty said in rather a scornful tone, "it means just what I choose it to mean -- neither more nor less."
    (Lewis Carroll - Through the Looking Glass)
    They asked to study Big Data.Could you please share about Big data Future in IT industry.
    You do realize, of course, that "Big Data" is just a concept, little more than a marketing term.  It does not necessarily refer to anything that is in conflict with your oracle experience.  I
    I  hesitate to learn Big Data because i spent a lot of  time/energy in oracle dba.
    Why do you think learning MORE will negate what you've already learned?  The more you know about anything, the more you will know about everything, and the more valuable you will be.  There's no such thing as wasted education.
    Share about Oracle dba Future. Because Oracle  automated everything in database.
    No they have not "automated everything in database".  They have not automated intelligence into design.  They have not automated intelligence in problem solving.  Sure, the database doesn't require as much day-to-day hand-holding as it used to, but that just frees you up for more intelligent work.
    Please anyone advise me.
    Don't expect your career 30 years from now to be the same as it is today.
    Don't expect technology 30 years from now to be the same as it is today.

  • Big data and database administration

    Hi,
    I am working as a Oracle DBA. I would like to know what is dba role for Big data & Nosql.
    Is it really useful for learning bigdata.
    Thanks,

    . Are
    there any relationship between these two fields?You are comparing cheese with chalk.
    how
    I can learn more about the data wherehousing?Start with Oracle doc,
    Oracle® Database Data Warehousing Guide
    http://download.oracle.com/docs/cd/B19306_01/server.102/b14223/toc.htm

  • What does "use cellular data for" mean?

    In the settings, what happens if I turn on/off "use cellular data for?"

    For apps that require a data connection, if you enable that and you are not within range of a wifi node to use, the app will use cellular data which will count towards your cell plan's data usage.  If you turn that OFF, and there is no wifi available, the app will not work, but you also won't use any cellular data.
    so in the iTunes and App store settings, if you want to limit the stores to only use wifi, turn the "Use Cellular Data" option OFF (e.g. to avoid big downloads over cellular).

Maybe you are looking for

  • File transmission, file conversion in OS400

    Hello Gurus, We are sending the file from  our Quality SAP to other AS400 server but we get an error. From: SAP Quality as400 To: External server user SAPTEST The file we are sending is SAP9530001_20100813from folder /usr/sap/Q00/INTERFACES/OUTBOUND/

  • Explorer and icon Missing in the Workspace

    Hello All, I am using Rapid Deployment document to install HFM on windows 2008 R2 64 bit server. I installed Foundation service , RA / Financial reporting services and HFM. After I start the services, I don't see the explorer and icons for HFM under

  • I can't sync 515 calendar with Outlook

    The Nokia 515 is one of the simplest phones on the market, but the most complex to use. I would like to be able to see my Calendar on my iPad, 515 and Laptop. Vodaphoe support told me that I could only synchronise the phone with a Microsoft Exchange

  • Test tab giving correct result but msg to receiver is incorrect?

    Hi Experts, When i test the message (in test tab)taken from the source payload from moni, i get the correct output as expected. But the message send to receiver is not correct. why is this deviation i dont understand. Pls help !! Regards, Suresh.

  • Can macs use pc monitors?

    I don't know where to ask this newbie question but I couldn't find the answer in the site. I am considering switching to mac but I recently got a nice flatscreen monitor for my PC and mac monitors are seemingly really expensive.. I imagine you can't