Extended Star Schema doubt
Hi ALL,
Fact table --- Dimension table -
Sid Table -- master data values.
Fact table:
Contains Key Figures and Dimensional id's
Dimension table:
Contains Dim Id & SID
SID table:
SID and Master object ID ( eg: Customer ID)
Master Data Table:
contains Customer Attributes , Texts , Hierarchies for that Customer ID.
THis is the Extended Star Schema design.
MY DOUBT:
1)In Dimension table itself if we place that Customer ID(No SID table next) what will happen?
Fact table --> Dimension table --> Master data Table
2)Instead of that SID table we can directly place that CustomerID in Dimension table , so we can reduce one layer inbetween Dimendion table and Master data table.Is it correct or not?
Any one can clarify my doubt.
Regards,
Arun.M.D
SID means 'surrogate ID'. That is an system created id as you know. Main purpose is fastening the search.
Mostly, there exists a rule for Customer or Material ID's.
Like it should be CHAR 10 or CHAR 16.
This kind of alpha-numeric fields are harder to search when compared to integers. Moreover, your customer id can be 10 digits but, this does not mean you will have 1000000000 customers. This is the main reason that, an internal ID is produced. If you have 10000 customers, your SID will be at most 10000.
However, if your customer ID's are starting from 1 and growing up like integers, then your argument would be true. ( but still no way to skip SID creation and direct usage of characteristic ID in fact table)
Also, as mentioned by other friends, there exist the Line Item dimension property if you have only one characteristic in one dimension. That simply does skip the DIM ID creation step, and puts your SID into the Fact table. ( Since you have only one char in the dimension, no combination is possible)
Hope this helps.
Derya
Similar Messages
-
Regarding Extended star schema
Hi Friends,
In Extended star schema,master data will load separately ,which will connect through sid's to dimension table .
My question is.. This master data tables can be used other than this cube ?
Please tell me i am in confusion.
Thanks in advace,
Regards,
ramnaresh.Hi
InfoCubes are made up of a number of InfoObjects. All InfoObjects (characteristics and key figures) are available independent of the InfoCube. Characteristics refer to master data with their attributes and text descriptions.
An InfoCube consists of several InfoObjects and is structured according to the star schema. This means there is a (large) fact table that contains the key figures for the InfoCube, as well as several (smaller) dimension tables which surround it. The characteristics of the InfoCube are stored in these dimensions.
An InfoCube fact table only contains key figures, in contrast to a DataStore object, whose data part can also contain characteristics. The characteristics of an InfoCube are stored in its dimensions.
The dimensions and the fact table are linked to one another using abstract identification numbers (dimension IDs) which are contained in the key part of the particular database table. As a result, the key figures of the InfoCube relate to the characteristics of the dimension. The characteristics determine the granularity (the degree of detail) at which the key figures are stored in the InfoCube.
Characteristics that logically belong together (for example, district and area belong to the regional dimension) are grouped together in a dimension. By adhering to this design criterion, dimensions are to a large extent independent of each other, and dimension tables remain small with regards to data volume. This is beneficial in terms of performance. This InfoCube structure is optimized for data analysis.
The fact table and dimension tables are both relational database tables.
Characteristics refer to the master data with their attributes and text descriptions. All InfoObjects (characteristics with their master data as well as key figures) are available for all InfoCubes, unlike dimensions, which represent the specific organizational form of characteristics in one InfoCube.
Integration
You can create aggregates to access data quickly. Here, the InfoCube data is stored redundantly and in an aggregated form.
You can either use an InfoCube directly as an InfoProvider for analysis and reporting, or use it with other InfoProviders as the basis of a MultiProvider or InfoSet.
See also:
Checking the Data Loaded in the InfoCube
If the above info is useful, please grant me points -
Wat is the use of sid in extended star schema rather than linking masterdat
hi bw gurus,
wat is the use of sid in extended star schema rather than linking masterdata with dimention tables?
thanx in advace,
i will assign points,
srinivashi,
the sid are used instead of the data in order to avoid the redunduncy of data.
and reduce the datastorage size.
the data will be present in the sid table and the data is linked used the corerponding sid in the dimension.
regards -
Hi
Right now I'm trying to implement SAP HR Infotypes into SAP BW. I try to combine 10 infotypes (all Personal Administration) into one infocube. The confusing one is that when I see my dimension, there something strange (please see below)
Dimension 1 (from PA0016)
ZDATEFROM16
ZDATETO16
ZXXX (60 character, don't have master date attribute or text)
ZYYY (30 character, don't have master date attribute or text)
ZAAA(3 character, only master data text)
Dimension 2 (from PA0023)
ZDATEFROM23
ZDATETO23
ZCCC (60 character, don't have master date attribute or text)
ZBBB (30 character, don't have master date attribute or text)
ZDDD (20 character, don't have master date attribute or text)
If i see my dimension, it seems strange because most of them don't have master data (text,attr,hierarchy).
Is this violate the extended star schema design? Thank you.
Regards,
SatriaYou can use characteristics such as ZXXX without master data (text, attr, hier) in extended star schema. It is not a violation. This means in business, for this characteristic you really don't need its text, attribute or hierarchy.
When perform data modeling for you HR cube, please do consider the design, can some characteristic be attribute of another one? Will you use hierarchy such organization level for some characteristic? characteristic with text, attribute or hierachy can provide more flexibility in reporting. -
Need help with an extended star schema.
I have the following data in spreadsheet form and I need to transpose this into an extended star schema in order to prdouce my characteristics and key figures, however I'm having a little trouble understanding where to include the Product Group in the star schema and also what to do with the Sales Doc (Invoice Number) in the sales information spreadsheet.
I need to summarise the data into the following also:
1. Total quantity and revenue for each Channel and product group
2. Total quantity and revenue for each Channel and product group (July 2009 to September 2009)
3. Total Quantity and Revenue for iphone sales
So my question in short is what would the star schema look like? I'm hoping somebody can help me over this hurdle.
The following are the details from the spreadsheet:
Product Details (Attributes)
File Name: Product_Details.csv
Structure FieldName Length Description
ProdID 5 Uniquely identifies a product
PG_Group 20 Identifies products in specific groups (Navigational field)
Sample Data
ProdID,PG_Group
PDS01,COMPUTER
PDS02,COMPUTER
PDS03,COMPUTER
PDS04,ACCESSORIES
PDS05,ACCESSORIES
PDS06,ACCESSORIES
PDS07,ACCESSORIES
PDS08,HARDWARE
Product Description (Text)
File Name: Product.csv
Structure FieldName Length Description
ProdID 5 Uniquely identifies a product
Prod_Desc Medium Text Represents the product description
(Navigational field)
Language 2 Represents the language descriptor
Sample Data
ProdID,Prod_Desc,Language
PDS01,Notebook Speedy I,EN
PDS02,Notebook Speedy II,EN
PDS03,PC Thinkbox I,EN
PDS04,USB Adaptor,EN
PDS05,iPhones PX2,EN
PDS06, Stereo Kit,EN
PDS07,Camera Connector,EN
PDS08,USB Storage,EN
Transactional Data
Sales information
File Name: Sales.txt
Structure FieldName Length Description
Product 5 Uniquely identifies a product
Sales Doc 4 Sales Document number (Invoice Number)
Calendar Year 8 YYYYMMDD format
Channel 1 Represents the distribution channel the product was sold by
Quantity 4 Numeric value indicating the number of each product sold on a specific sales document
Unit 2 Refers the unit specified by the quantity
Revenue 8 Refers the sales amount of a specific product on a specific sales document
Currency 3 Relates the revenue currency type
Sample Data
Product;Sales Doc ,Calendar Day,Channel,Quantity,Unit;Revenue,Currency
ZD_PROD,0DOC_NUMBER,0CALDAY,ZD_CHAN,ZD_QTY,0UNIT,ZD_REV,0CURRENCY
PDS07,7389,20090705,A,95,EA,151905,USD
PDS08,7389,20090705,A,115,EA,264385,USD
PDS02,7473,20090706,B,55,EA,3580445,USD
PDS01,7472,20090706,C,110,EA,604989,USD
PDS02,7472,20090706,C,43,EA,2799257,USD
PDS03,7472,20090706,C,75,EA,4507425,USD
PDS04,7472,20090706,A,339,EA,301032,USD
PDS06,7472,20090706,A,120,EA,47976,USD
PDS01,7473,20090718,D,55,EA,3024945,USD
PDS02,6845,20090806,B,30,EA,195297,USD
PDS07,6877,20090807,C,800,EA,12792,USD
PDS08,6877,20090807,C,250,EA,57475,USD
PDS03,6912,20090812,C,250,EA,1502475,USD
PDS04,6900,20090812,B,70,EA,6216,USD
PDS01,6899,20090812,C,12,EA,659988,USD
PDS04,6899,20090812,D,55,EA,4884,USD
PDS04,7479,20090815,C,80,EA,7104,USD
PDS05,7479,20090815,C,300,EA,70497,USD
PDS05,6809,20090818,A,1000,EA,234990,USD
PDS06,6809,20090818,A,130,EA,51974,USD
PDS03,6831,20090822,C,15,EA,901485,USD
PDS04,6831,20090822,C,500,EA,4440,USD
PDS03,6830,20090822,C,125,EA,7512375,USD
PDS06,6830,20090822,C,400,EA,15992,USD
PDS04,6841,20090825,C,53,EA,47064,USD
PDS05,6841,20090825,C,350,EA,822465,USD
Distribution Channel
Channel (text)
File Name: Channel.csv
Structure FieldName Length Description
ChanID 1 Uniquely identifies a specific distribution channel
Chan_Desc Medium
Text Full name of each distribution channel
(Navigational field)
Sample Data
ChanID, Chan_Desc
A,Phone
B,Fax
C,Internet
D,Other
Star Schema Model:Potentially the issue may be the combination of actual distance and the presence of any Wi-Fi interference at the location of the AirPort Express that may be the culprit here. The bottom line is you would want the Express to be within a 25+ dB range of the Extreme to be sure that it can consistently stay connected to it.
For more details on proper base station placement, check out the following AirPort User tip. Please post back your results. -
Why was the extended star schema scrapped?
Hi,
BW on HANA takes out the dimension tables and gives us back the original star schema structure.
Now my questions are
- How did the original extended star schema help the old BW on oracle or DB2 systems? I mean what was the need for such a design
- If it offered some kind of optimization .. then why was it scrapped in BW on HANA? Couldn't it have added on to the performance that HANA DB brought?
Regards,
SamCan anyone please explain.. ? Krishna Tangudu Thomas Jung Shyam Uthaman .. Any thoughts?
-
What happens when i remove Dimension table in extended star schema
*how to integrate the flex builder web dynpro and xcelsius?
i found some scenarios like when the content of the table change the chart will changes for the changed data in the table using flash islands and web dynpro.
my scenario is like this.
when i changed the chart with sliders automatically the content in the table will update to the chart level ? but both table and chart will be in the same dashboard. how can i work out this scenario using flash islands, flex and web dynpro>
thanks in Advance*Hi Sateesh,
Please go through this URL you will get overall Idea about Extended Star schema with DIM ID's,SID's and Data flow.
http://help.sap.com/saphelp_nw70/helpdata/EN/8d/2b4e3cb7f4d83ee10000000a114084/frameset.htm
http://help.sap.com/saphelp_nw70/helpdata/EN/8d/2b4e3cb7f4d83ee10000000a114084/frameset.htm
http://help.sap.com/saphelp_nw70/helpdata/EN/8d/2b4e3cb7f4d83ee10000000a114084/frameset.htm
I hope it will help you.
Regards
Suresh B.G. -
Relation between different tables in extended star schema
Hi Experts,
I searched all many threads to understand the how Attribute change run is used for
But a small question is going still.
Is attribute change run used for getting consistent data from aggregates or from infocubes.
Attribute change run is used to update the master data .Then what is the use of "activate master data "
in context menu of the master data infoobject.
Somewhere I have read that Attribute change run is used for updating Aggregates but it has got nothing to
do with Infocubes,Is it right ?
Please clarify me
Thanks in advance
SundarHi Sundar,
Is attribute change run used for getting consistent data from aggregates or from infocubes.
The purpose of ACR is to update the changes happened in master data to aggregates not for aggregates.
Attribute change run is used to update the master data .Then what is the use of "activate master data "
in context menu of the master data infoobject.
when you click on activate master data, the data in P or Q or X or Y tables will get activated(if any records with version M will turned to version A)
ACR is used to update the same changes to aggregates
Somewhere I have read that Attribute change run is used for updating Aggregates but it has got nothing to
do with Infocubes,Is it right ?
Yes you are wright.here we have small logic when ACR is running if you start any rollup step it will get fail.
While ACR is running aggregates will get locked
Regards,
Venkatesh -
Exepert's,
I have cube ZSAL_ANA .I am having Two dimensions 1.Sales and 2.Customer.
Sales Region contains Sales Region and Sales Location.Customer Dimension contains Customer Type and Customer location.Ok
Now Correct me if I am wrong?
In Fact table ie BIC/FZSAL_ANA we will be having keyfigures and Dimension IDs ie for Sales and Customer.
Ok ,Now The Dimension Sales (BIC/DFZSAL_ANA51) Contains only Dim Ids and
also Sids ok./BIC/SZSAL_RGN is a SID for Sales Region and it contains SID and also Sales regions. ./BIC/SZSAL_LOC is a SID for Sales Location and it contains SID and Sales Locations.
similarly for Customer Dimension (BIC/DFZSAL_ANA52).
Now /BI0/SZCUST_TPE is a SID table for Customer type.
Similarly /BI0/SZCUST_LOC is a SID table for Customer Location.
Please correct me if I am wrong?Hi Vasu,
The 'actual' char values are stored in the S table, like SE16> /BI0/SMATERIAL. The value is not stored in the SID, but in the S table you can see each char value and the corresponding SID, like:
Material SID
0200000000 4
0200000001 5
0200000010 6
0200000011 7
0200000020 8
0200000030 9
0200000031 10
0200000032 11
0200000033 12
Hope this helps... -
Extended star schema implementation in rpd
joins of the tables in physical model is as follows
dim a->fact b
dim c->dim a
joins in logical table
i have created a logical table with dim a & dim c as sources ... and connected this to logical fact with fact b as one of the sources
now i want to create a measure in the fact table in which i have to use the column from dim c..
plz suggest me an approach to achieve this ...
thnks in advanceregarding aggregation rule,
it depends on your code.. that you want to write.. because...
let's go with an example...
i want to calculate A + B, where A & B are logical columns with aggregation rule sum...
Now, if you go with logical mapping (A+B), as A & B are already having Aggregation rule this will be carried to the new derived column also... this can be checked by looking at your column image(has aggregation - Sigma symbol)
better provide your code want to put in..
And, if it's sum or difference, there is no much difference if you are using logical mapping or physical mapping...
Edited by: Kishore Guggilla on Sep 20, 2010 1:01 PM -
Doubt on star schema..
Hi all ,
I have a smaal doubt on Extended star schema . Like Dimension table stores only the Dim ids and corresponding SIDs and through this SID values these are linked to SID table which contains the SID for a master data . My doubt is where is our characteristics value for dimension tables . Like if one doimension if we have Employee name , college are the dimension and its master data let us assume is Employee code. CAn somebody explain me the scenario..
Thanks in advance..Niyati,
To find the employee name, check the sid value, i.e :-
/BI0/SEMPLOYEE
Which will give you the employee number
Put this in :-
/BI0/TEMPLOYEE
and you will succeed.
Cheers,
Pom -
Hi,
I know that in classical star schema the dimension tables sits within the info cube and so we cannot use this dimension table in any other cube we need to have separate dimension table for that cube thought it might be having same data. I also know to over come this redundancy extended star schema came into picture where we have SID table and we keep the dimension table out of the cube and reuse the dimension tables across many cubes.
Now what i don't understand is that instead of having Separate SID tables for linking the dimension and fact tables why cant we make the DIMENSION table generic and keep them out of the infocube so that we can same the same dimension table for many infocube in this case we wont need SID tables.
suppose i have one info cube which has dimension vendor material and customer and its keyfigure is quantity and price and i have a separate infocube which has dimesnion material customer and location and its key figure is something else ......so here in why cant i keep the dimensions out of the infocube and use the dimension material customer for both infocube.Your dimension tables are filled based on your transaction data - which is why dimension table design is very important you decide to group related data for the incoming transaction data into your dimension tables .
The dimension tables have SIDs which in turn point to master data = in the classic star schema - the dimension tables are outside the cube but the dim tables have the master data within them whhich is overcome using the extended star schema.
The reason why dimension tables can be reused is that the dim IDs and SIDs in the simension table correspond to the transaction data in the cube - and unless the dim IDs in both your cubes match you cannot reuse the dim tables - which means that you have exactly the same data in both the cubes - which means you need not have two cubes with the same data.
Example :
Cube 1 : Fact Table
Dim1ID | DIM2ID | KF1
1|01|100
2|02|200
Dimension Table : Dim 1 ( Assumin that there are 2 characteristics in this dimension ) - here the DIM1ID is Key
Dim1ID | SID1 | SID2
1|20|25
2|30|35
Dimension Table Dim 2 - Here the Dim2ID field is key
Dim2ID| SID1 | SID2| SID3
01| 30| 45
02|45|40
Here the Dim IDs for the cube Fact table are generated at the time of load and this is generated from the NRIV Table ( read material on Number Ranges ) - this meanns that you cannot control DIM ID generation across cubes which means that you cannot reuse Dimension Tables -
Do we use direct star schema concept anywhere in sap bw
i know about extended star schema,and where sap uses this concept.
my question is do we use normal star schema concept any where in sap bw, apart from extended star schema concept.
if yes specify the answer briefly .
thanks in advance
with regards
yash.bHi,
If I'm not mistaken an Analytic view in Hana is more like the normal star schema, it is definitely not extended and can be consumed by BW for OLAP processing.
Regards,
Michael Devine -
BW Star Scheme & Multi dimensional Data Modelling
Hi BW Experts,
Can any one please let me know when i have to check in help.sap or serivices.sap
for detailed info on BW Star Scheema and Multi dimensional Data Modelling and how it is used in BW.
Please update me where i have to check for this info
Thankshi...
star schema..
Please check the threads below..
Differences between Star Schema and extended Star Schem
What is the difference between Fact tables F & E?
Invalid characters erros
mdm..
http://help.sap.com/bp_biv133/documentation/Multi-dimensional_modeling_EN.doc
hope this helps,... -
hi friends,
in extrenal starschema fact tabel is connected dimension table dimension table is connected sid sid is connetced with master data. here sid means setid or surrogated id. why we are udning sid for this?
Thanking u
suneel.hi suneel
SID means Surrogate ID.
BIW is completed based on extended star schema. The fact table is connected to the dimension tables and the dimension table is connected to the master data, Attributes, Texts and hierarchies.The reason why we connect with SID is multiple infocube dimension characteristics can access the same master data at the same time.
Hope it helps...
Maybe you are looking for
-
Disable Driver Enforcement for Virtual Audio Cable?
Hi, I'm trying to get this thing called "Virtual Audio Cable" Working, but It is apparently not signed, so I cant even use it. Is there a fix to this? I've tried to use bcdedit -set. I have tried to use DSEO Too, but I dont know what file it wants me
-
Can't create a personal email account on my Blackberry Curve 8520.
I cannot create an personal email account on my Blackberry Curve 8520. I've already followed some of the instructions from this forum and on the main website of Blackberry but still can't create an email account for my phone. These are the steps I'm
-
Call dropping in middle of converstation
Hi, I have an issue between a Cisco AS5400 and Mitel phones. Randomly in the middle of a conversation the outbound voice seems to drop. We can still hear the person on the other end of the phone, but they cannot here us. We have two E1 as out outboun
-
I have created a Flex App with out a Server now I need one
So I'm new to Flex and didn't think the project needed a server at first (poor planning on my part) now I'll need one. I have alot built and would like to just conevert the app if I can. The project has been sourced in to Subversion, I would like not
-
Residual files from uninstall procedure for a HP Scanjet Scanner
I have used 'AppDelete' program to uninstall HP Scanjet software on my MacBook. I don't see the files in the Applications folder, but I do see the app: HP Scanjet Manager.app in the Activitiy Monitor. This is after running scrips, permissions and a f