February 2021 - Page 11 of 117 - Dr. Ware Technology Services

Cisco Releases Security Updates 

by Scott Muniz | Feb 25, 2021 | Security, Technology

This article is contributed. See the original author and article here.

Build Near Real Time Power BI reports using Synapse Link and SQL On-Demand easily

by Contributed | Feb 25, 2021 | Technology

This article is contributed. See the original author and article here.

This guide describes how to build near real time Power BI reports leveraging Synapse Link and SQL On-Demand. The intent is to demonstrate the simplicity of using these technologies.

Now let’s start!

Ensure you have the new Synapse Workspace enabled in your subscription:

Ensure you have Synapse Link enabled at your Cosmos DB account

Create your Database and container, verify the container has the Analytical Store enabled

As a prerequisite you need to ensure you are running Cosmos DB Python SDK v4.1.0 by executing the code below in a notebook:

import azure.cosmos as cosmos
print (cosmos.__version__)

Result should be 4.1.0, if it’s below then run this command in a new cell:

pip install --force-reinstall azure-cosmos

You’ll then need to open a new notebook to get the new version taken into account and run the following code:

import azure.cosmos
from azure.cosmos.partition_key import PartitionKey

database = cosmos_client.create_database_if_not_exists('RetailDemo')
print('Database RetailDemo created')

container = database.create_container_if_not_exists(id='WebsiteData', partition_key=PartitionKey(path='/CartID'),analytical_storage_ttl=-1)


print('Container WebsiteData created')

Note that you have created in the database RetailDemo a container named WebsiteData partitioned on CartID and you enabled the Analytical Store with the the parameter “analytical_storage_ttl=-1”

Once the container is created you can check the Analytical Store is enabled by default:

Then let start to load a small sample of data, for this you just need to create a new notebook in your Cosmos DB Data Explorer as follow:

Code to run in your notebook:

%%upload --databaseName RetailDemo --containerName WebsiteData --url https://cosmosnotebooksdata.blob.core.windows.net/notebookdata/websiteData-small.json

Here is how the data look like now:

Once this first step is complete you have a container with a few items and an Analytical Store on it.

Next Step is to go to Synapse Workspace and from there to Synapse Studio, create a SQL On Demand Database and test querying the Cosmos DB Analytical Store from there.

You can either use an existing Synapse Workspace or create a new one and launch Synapse Studio directly on your Workspace.

To discover how to create a SQL On-Demand Database and start learning how it works simply use this link:

https://docs.microsoft.com/en-us/azure/synapse-analytics/quickstart-sql-on-demand

Now you can create a view in the On-Demand Database with the following syntax:

Click on the Develop icon on the left side to access the SQL script and Notebooks, click on the ‘+’ sign to get a new SQL script then connected to the SQL On-Demand engine and ‘myondemanddb’ database.

Code to run in your SQL script:

CREATE VIEW CosmosDBTest
AS 
SELECT
*
FROM OPENROWSET
    (

     'CosmosDB',
       'account=cosmosdblp2;database=RetailDemo;region=northeurope;key=your_key',
       WebsiteData
    )

AS q1

In my my current example region is northeurope.

Once the view is created, you can run simplistic queries such as the ones below and get the results from the CosmosDB container WebsiteData:

Code to run in your SQL script:

select * from CosmosDBTest;

select country,sum(price) 
from CosmosDBTest

group by country;

Note that Synapse Link take care of the JSON document flattening into a table format for you.

Now that the general mechanism is now in place so you are ready to build a Power BI report on top of this. In case you don’t have PBI Desktop already you can get it from there:

https://www.microsoft.com/en-us/download/details.aspx?id=58494

Start PBI Desktop and select the Azure SQL Database source and put the SQL On-Demand endpoint as the server name:

Do not forget to specify port 1433 (yours-ondemand.sql.azuresynapse.net,1433) and DirectQuery mode:

After giving your credentials let’s pick up the view CosmosDBTest that you created precedingly:

Here is the simple report we can build and let focus on Guinea-Bissau, the current price value is 7.5:

Let’s go back to CosmosDB and select the corresponding items where we can raise the price of the corresponding item by 100 for instance:

After a latency of around a minute the price increase is reflected on the report:

And what if you load a larger set of Data into your Cosmos DB container?

Let’s then take a larger data set and bulk load it into your container as follow:

How is this rendered in Power BI?

Almost there!

What if you publish to your Power BI Tenant?

This is it!

To wrap this up:

I enabled the Analytical Store on a Cosmos DB container

I created a SQL on-demand view on this container Analytical Store

I created a Power BI report connected to the SQL on-demand database as a regular Azure SQL Database

Call to Action:

Please try now the Cosmos DB to Spark integration by following these examples:

Synapse/Notebooks/PySpark/Synapse Link for Cosmos DB samples at main · Azure-Samples/Synapse (github.com)

Azure SQL Database or SQL Managed Instance Database used data space is much larger than expected

by Contributed | Feb 25, 2021 | Technology

This article is contributed. See the original author and article here.

The data space used in an Azure SQL database or SQL Managed Instance database can be larger than expected – and on occasions significantly larger than expected – when compared with the actual number of records in the individual tables. This can lead to the impression of a problem with the database storage itself. However, this is almost certainly never the case and the issue can be resolved by carrying out a few maintenance procedures.

Storage space types for a database

To understand why this happens we should first review the different types of storage space used to the manage the file space of a database:

Used Data space – This is the amount of space used to store the database data, which is stored in 8 KB pages. Generally, the space used increases and decreases due to inserts and deletes respectively. In some cases, the space used does not change on inserts or deletes depending on the amount and pattern of data involved in the operation and any fragmentation. For example, deleting one row from every data page does not necessarily decreasing the space used.

Allocated Data space – The amount of formatted file space made available for storing database data. The amount of space allocated grows automatically, but never decreases after deletes. This behavior ensures that future inserts are faster since space does not need to be reformatted.

Data space allocated but unused – The difference between the amount of data space allocated and data space used. This quantity represents the maximum amount of free space that can be reclaimed by shrinking database data files.

Data max size – The maximum amount of space that can be used for storing database data.

The following diagram illustrates the relationship between the different types of storage space for a database:

Maintenance plan

Now we know that the used data space does not always change when inserts and deletes are performed and therefore can be greater than what could be expected when considering the number of records in the tables. How do we resolve this? This can be achieved by following the maintenance steps below to reduce index fragmentation, cleaning up any ghost records and then cleaning the Persisted Version Store:

1) Index fragmentation

Fragmentation exists when indexes have pages in which the logical ordering within the index, based on the key value of the index, does not match the physical ordering inside the index pages. The following example finds the average fragmentation percentage of all indexes in the Sales.SalesOrderDetail table in the AdventureWorks2012 database:

SELECT a.index_id, name, avg_fragmentation_in_percent, fragment_count, avg_fragment_size_in_pages  FROM sys.dm_db_index_physical_stats (DB_ID('AdventureWorks2012'), object_id('Sales.SalesOrderDetail'), NULL, NULL, NULL) AS a
JOIN sys.indexes AS b ON a.object_id = b.object_id AND a.index_id = b.index_id

The following links detail how to rebuild indexes to reduce the fragmentation (the second link includes an index and statistics maintenance script you can download):

Resolve index fragmentation by reorganizing or rebuilding indexes

How to maintain Azure SQL Indexes and Statistics

2) Ghost Records

Ghost records are records that are deleted from a leaf level of an index page but aren’t physically removed from the page. Instead, the record is marked as ghosted meaning to be deleted. This means that the row stays on the page, but the row header is modified to indicate the row is a confirmed ghost record. The reason behind this is to optimize performance during a delete operation. Ghosts are necessary for row-level locking, but also necessary for snapshot isolation where we need to maintain the older versions of rows. The number of ghost records can build up in a database until they are cleaned. The database engine runs a ghost cleanup process in the background that sometime after the delete transaction is committed, physically removes ghosted records from pages.

It is also possible the ghost cleanup process is disabled (not generally recommended). Disabling the ghost cleanup process can cause your database to grow unnecessarily large and can lead to performance issues. You can check if the ghost cleanup process is disabled by running the following command:

DBCC Tracestatus (661)

If the “Status” flag is set to 0, then this indicates that the ghost clean-up is enabled. If “Status” flag is set 1, then the process has been disabled.

To confirm if there are ghost records on your database execute this T-SQL:

SELECT sum(ghost_record_count) total_ghost_records, db_name(database_id)
FROM sys.dm_db_index_physical_stats (NULL, NULL, NULL, NULL, 'SAMPLED')
GROUP BY database_id 
ORDER BY total_ghost_records DESC

If there are ghost records, you can delete the ghost records manually from the database by executing an index rebuild. This process reclaims disk space by compacting the pages based on the specified or existing fill factor setting and reorders the index rows in adjoining pages.

Another option is to use sp_clean_db_file_free_space to clean all pages in all files of the database. For example, this T-SQL will clean the ghost records from the AdventureWorks2012 database:

USE master;  
GO  
EXEC sp_clean_db_free_space @dbname = N'AdventureWorks2012';

For more details about the Ghost clean process refer to the following guide: Ghost cleanup process guide – SQL Server | Microsoft Docs

3) Persisted Version Store (PVS)

PVS is a database engine mechanism for persisting the row versions generated in the database itself instead of the traditional tempdb version store. PVS enables resource isolation and improves availability of readable secondaries. The accelerated database recovery (ADR) feature uses PVS.

You can check the database PVS size by running the following T-SQL:

SELECT DB_Name(database_id), persistent_version_store_size_kb
FROM sys.dm_tran_persistent_version_store_stats 
WHERE database_id = add your database ID

If PVS size is large you can enforce the PVS cleanup by executing the following T-SQL:

EXEC sys.sp_persistent_version_cleanup [database_name]

The links below contains more information about PVS and ADR:

Accelerated Database Recovery in Azure SQL

Manage accelerated database recovery

Database shrink process

If required, the DBCC SHRINKFILE command can be executed after the above maintenance procedures to release allocated space.

The following example shrinks the size of a data file named DataFile1 in the UserDB user database to 7 MB.

USE UserDB;  
GO  
DBCC SHRINKFILE (DataFile1, 7);  
GO

However, there are several best practices to be aware of when considering using DBCC SHRINKFILE:

A shrink operation is most effective after an operation that creates a large amount of unused space, such as a truncate table or a drop table operation.

Most databases require some available free space for regular day-to-day operations. If you shrink a database repeatedly and its size grows again, then it’s likely that regular operations require the shrunk space. In these cases, repeatedly shrinking the database is a wasted operation.

A shrink operation doesn’t preserve the fragmentation state of indexes in the database, and generally increases fragmentation to a certain degree. This is another reason not to repeatedly shrink the database.

Shrink multiple files in the same database sequentially instead of concurrently. Contention on system tables can cause blocking and lead to delays.

More details about DBCC SHRINKFILE are contained in this link: DBCC SHRINKFILE (Transact-SQL) – SQL Server | Microsoft Docs

Conclusion

In this article we have considered the scenario where the used size of an Azure SQL Database or SQL Managed Instance Database is much larger than expected when compared with the actual number of records in the tables.

This can be resolved by carrying out a few maintenance procedures such as rebuilding indexes to reduce index fragmentation, cleaning up ghost records and cleaning the Persisted Version Store. If required, the DBCC SHRINKFILE command can also be executed afterwards to release allocated space.

I hope this article was helpful for you, please feel free to share your feedback in the comments section.

Sabrin Alsahsah

Experiencing Data Latency Issue in Azure portal for Log Search Alerts – 02/25 – Investigating

by Contributed | Feb 25, 2021 | Technology

This article is contributed. See the original author and article here.

Initial Update: Thursday, 25 February 2021 10:32 UTC

We are aware of issues within Log Search Alerts and are actively investigating. Some Customers may experience issues with missed or delayed Log Search alerts or experienced difficulties accessing data for resources hosted in East US region.

Work Around: None
Next Update: Before 02/25 14:00 UTC

We are working hard to resolve this issue and apologize for any inconvenience.
-Harshita

Introducing list sharing from personal accounts to work accounts in Microsoft To Do

by Contributed | Feb 25, 2021 | Technology

This article is contributed. See the original author and article here.

Microsoft To Do gives you a personal and intuitive way to stay organized and make the most of every day. Today, most of our tasks involve collaboration with multiple people – something that list sharing can make easier. Whether you want to share a list of work items with your colleagues or a grocery list with a loved one, To Do makes it easy to collaborate and get things done together.

List sharing in To Do was originally restricted to sharing between personal accounts and sharing between work (or school) accounts within the same organization. However, many of you wanted to be able to share between personal and work accounts.

We listened to your feedback and are pleased to announce that you can now share lists from personal accounts to work accounts in Microsoft To Do.

To Do supports list sharing for the following scenarios:

Sharing between personal Microsoft accounts.

Sharing between accounts within the same place of work or education.

Sharing between personal accounts and work accounts. Work accounts can join lists owned by personal accounts provided enterprise admins have enabled this feature for their respective organizations. However, personal accounts cannot join lists owned by work accounts.

Whether it’s a grocery list that your spouse wants to share from a personal account with your work account or a work-related list that an external vendor team can share from their personal accounts with your work account, collaborating on To Do has never been easier.

Want to know more? You can read up on list sharing here.

We can’t wait to hear what you think about this new feature – let us know in the comments below or over on Twitter and Facebook. You can also write to us at todofeedback@microsoft.com.

« Older Entries

Next Entries »

Cisco Releases Security Updates

Build Near Real Time Power BI reports using Synapse Link and SQL On-Demand easily

Azure SQL Database or SQL Managed Instance Database used data space is much larger than expected

Experiencing Data Latency Issue in Azure portal for Log Search Alerts – 02/25 – Investigating

Introducing list sharing from personal accounts to work accounts in Microsoft To Do

Recent Posts

Recent Comments

Archives

Categories

Meta

We look forward to meeting you