Reporting Tales

Back in February, at the London Pentaho User-Group meeting, I promised to make the contents of that presentation available as blog entries. This is the first of three articles. · “What are the five top growing products over the last year?”

org.pentaho.reporting.engine.classic.core.ProfileReportProcessing=true
org.pentaho.reporting.engine.classic.core.performance.LogPageProgress=true
org.pentaho.reporting.engine.classic.core.performance.LogLevelProgress=true
org.pentaho.reporting.engine.classic.core.performance.LogRowProgress=true
org.pentaho.reporting.engine.classic.core.DebugDataSources=true
org.pentaho.reporting.engine.classic.core.ProfileDataSources=true
org.pentaho.reporting.engine.classic.core.modules.misc.tablemodel.TableFactoryMode=simple
# Controls the mandatory lifetime of HTTP objects before we check for updates on the object 
org.pentaho.reporting.libraries.resourceloader.config.url.FixedCacheDelay=10000

# Disable cache updates completely. Once a resource is loaded it stays loaded until its 
# end of life in the underlying resource cache has been reached.
org.pentaho.reporting.libraries.resourceloader.config.url.FixBrokenWebServiceDateHeader=false
# Config settings for the 'classic-engine.properties' file
org.pentaho.reporting.engine.classic.core.modules.output.table.base.ReportCellConflicts=false
org.pentaho.reporting.engine.classic.core.modules.output.table.base.VerboseCellMarkers=false
org.pentaho.reporting.engine.classic.core.modules.output.table.html.CopyExternalImages=true
org.pentaho.reporting.engine.classic.core.modules.output.table.html.InlineStyles=false
org.pentaho.reporting.engine.classic.core.modules.output.table.html.ExternalStyle=true
org.pentaho.reporting.engine.classic.core.modules.output.table.html.ForceBufferedWriting=true
="runtime-query"
jdbc:mysql://[host:port]/[database][?propertyName1][=propertyValue1][&propertyName2][=propertyValue2]...
jdbc:mysql://localhost:3306/sampledata
Host [...] is not allowed to connect to this MySQL server
Access denied for user 'pentaho_user'@'localhost' (using password: YES)
ERROR 1044 (42000): Access denied for user 'pentaho_user'@'localhost' to database 'sampledata'
SELECT User, Host FROM mysql.user;
+------------------+-----------+
| User             | Host      |
+------------------+-----------+
| pentaho_user     | localhost | 
+------------------+-----------+

+------------------+-----------+
| User             | Host      |
+------------------+-----------+
| pentaho_user     | %         | 
+------------------+-----------+

+---------------+
| PRD-Host      |
+---------------+
| Web-Server    |
+---------------+
| My-SQL Server |
+---------------+

+------------------+-----------+
| User             | Host      |
+------------------+-----------+
| pentaho_user     | localhost | 
+------------------+-----------+

+----------+          +------------+         +---------------+
| PRD-Host |  --+-->  | Web-Server |  ---->  | My-SQL Server |
+----------+    |     +------------+         +---------------+
                |                                 /|\ 
                +----------------------------------+

+------------------+-----------------------------+
| User             | Host                        |
+------------------+-----------------------------+
| pentaho_user     | IP or hostname of webserver | 
+------------------+-----------------------------+
| pentaho_user     | IP or hostname of PRD-host  | 
+------------------+-----------------------------+

+------------------+-----------+
| User             | Host      |
+------------------+-----------+
| pentaho_user     | %         | 
+------------------+-----------+

+----------+          +---------------+ 
| PRD-Host |  --+-->  | Web-Server    |
+----------+    |     +---------------+
                +-->  | My-SQL server |
                      +---------------+

+------------------+-----------------------------+
| User             | Host                        |
+------------------+-----------------------------+
| pentaho_user     | localhost                   | 
+------------------+-----------------------------+
| pentaho_user     | IP or hostname of PRD-host  | 
+------------------+-----------------------------+
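The Host column in the tables above decides which client machines may log in as `pentaho_user`. As a rough illustration, the sketch below mimics that matching in plain Java, assuming only MySQL's documented wildcards (`%` for any run of characters, `_` for a single character). The class `MySqlHostMatch` and the method `hostMatches` are hypothetical names for this example; they are not part of MySQL or Pentaho, and the real server additionally resolves host names and prefers the most specific matching row.

```java
import java.util.regex.Pattern;

/** Hypothetical helper for illustration only; not part of MySQL or Pentaho. */
final class MySqlHostMatch {
    private MySqlHostMatch() {
    }

    /**
     * Returns true if clientHost is covered by a value from the mysql.user Host column.
     * '%' matches any run of characters, '_' matches exactly one character.
     */
    static boolean hostMatches(String hostPattern, String clientHost) {
        StringBuilder regex = new StringBuilder();
        for (char c : hostPattern.toCharArray()) {
            if (c == '%') {
                regex.append(".*");
            } else if (c == '_') {
                regex.append('.');
            } else {
                // Quote everything else so dots in IP addresses stay literal.
                regex.append(Pattern.quote(String.valueOf(c)));
            }
        }
        return Pattern.compile(regex.toString(), Pattern.CASE_INSENSITIVE)
                .matcher(clientHost)
                .matches();
    }
}
```

With this sketch, a `localhost` entry covers only clients on the database machine itself, while a `%` entry covers the PRD host and the web server alike.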

GRANT SELECT ON database.* TO 'pentaho_user'@'localhost';
SINGLEVALUEQUERY()
SINGLEVALUEQUERY([query:string]; [column:string])
SINGLEVALUEQUERY([query:string]; [column:string]; [querytimeout:integer])
MULTIVALUEQUERY([query:string])
MULTIVALUEQUERY([query:string]; [column:string])
MULTIVALUEQUERY([query:string]; [column:string]; [querytimeout:integer])
MULTIVALUEQUERY([query:string]; [column:string]; [querytimeout:integer]; [limit:integer] )
SINGLEVALUEQUERY("sales-top-performer")
SINGLEVALUEQUERY("SELECT name, SUM(Sales) AS Sales FROM SalesData GROUP BY name ORDER BY Sales DESC LIMIT 1")
SELECT
     SUM("ORDERDETAILS"."QUANTITYORDERED" * 
     "ORDERDETAILS"."PRICEEACH") AS "Sales"
FROM
     "ORDERS" INNER JOIN "ORDERDETAILS" ON "ORDERS"."ORDERNUMBER" = "ORDERDETAILS"."ORDERNUMBER"
     INNER JOIN "CUSTOMERS" ON "ORDERS"."CUSTOMERNUMBER" = "CUSTOMERS"."CUSTOMERNUMBER"
WHERE 
     "CUSTOMERS"."CUSTOMERNUMBER" = ${CUSTOMERNUMBER}
GROUP BY 
     "CUSTOMERS"."CUSTOMERNUMBER"     

=SINGLEVALUEQUERY("Sales-For-Customer")
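The `${CUSTOMERNUMBER}` placeholder in the query above is filled in from the report's current values when the formula runs. The sketch below shows one way such a placeholder can be turned into a JDBC-style `?` parameter, assuming the value is bound as a prepared-statement parameter rather than pasted into the SQL text. The class `NamedParameterSketch` and its `translate` method are hypothetical and are not the reporting engine's own code.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical illustration; the reporting engine's real implementation may differ. */
final class NamedParameterSketch {
    // Matches ${NAME} and captures NAME.
    private static final Pattern PARAM = Pattern.compile("\\$\\{([^}]+)\\}");

    /** The query with every ${NAME} replaced by a '?' placeholder. */
    final String sql;
    /** The placeholder names in the order in which they must be bound. */
    final List<String> parameterNames;

    private NamedParameterSketch(String sql, List<String> parameterNames) {
        this.sql = sql;
        this.parameterNames = Collections.unmodifiableList(parameterNames);
    }

    static NamedParameterSketch translate(String query) {
        List<String> names = new ArrayList<String>();
        Matcher matcher = PARAM.matcher(query);
        StringBuffer out = new StringBuffer();
        while (matcher.find()) {
            names.add(matcher.group(1));
            matcher.appendReplacement(out, "?");
        }
        matcher.appendTail(out);
        return new NamedParameterSketch(out.toString(), names);
    }
}
```

The translated text can then be handed to `java.sql.Connection.prepareStatement`, with the values bound in the order given by `parameterNames`.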
import javax.print.DocFlavor;
import javax.print.PrintService;
import javax.print.PrintServiceLookup;
import javax.print.attribute.standard.PrinterName;

import org.pentaho.reporting.engine.classic.core.modules.misc.tablemodel.TableModelInfo;
import org.pentaho.reporting.engine.classic.core.util.TypedTableModel;

    // Look up all print services that accept pageable documents.
    PrintService[] services = PrintServiceLookup.lookupPrintServices(
        DocFlavor.SERVICE_FORMATTED.PAGEABLE, null);
    // Build a two-column table: "ID" holds the service name, "Value" the display name.
    TypedTableModel tt = new TypedTableModel();
    tt.addColumn("ID", String.class);
    tt.addColumn("Value", String.class);
    for (int i = 0; i < services.length; i++)
    {
      PrintService service = services[i];
      PrinterName displayName = service.getAttribute(PrinterName.class);
      if (displayName != null)
      {
        tt.addRow(new Object[]{service.getName(), displayName.getValue()});
      }
      else
      {
        // No PrinterName attribute available: fall back to the service name.
        tt.addRow(new Object[]{service.getName(), service.getName()});
      }
    }
    return tt;
 
http://127.0.0.1:8080/

http://127.0.0.1:8080/pentaho

localhost:8080

/pentaho

=ENV("pentahoBaseURL")

=ENV("pentahoBaseURL") & "/content/reporting/report.html?solution=steel-wheels&path=reports&name=BuyerReport.prpt"

org.pentaho.reporting.engine.classic.core.env-mapping.=

org.pentaho.reporting.engine.classic.core.env-mapping.-array=

Setting	Value
Driver	The driver class to load. The name of the class that implements java.sql.Driver in MySQL Connector/J is `com.mysql.jdbc.Driver`.
JDBC-URL	The JDBC URL for the MySQL JDBC Driver uses the following format. Items in square brackets ([, ]) are optional. `jdbc:mysql://[host:port]/[database][?propertyName1][=propertyValue1][&propertyName2][=propertyValue2]...` If the host name is not specified, it defaults to 127.0.0.1. If the port is not specified, it defaults to 3306, the default port number for MySQL servers.
Username	Your user name as given to you by your database administrator. See below for how MySQL interprets that information and matches it against its internal user database.
Password	Your password for the connection.

Property	Definition	Comment
useCompression	Use zlib compression when communicating with the server (true/false)? Defaults to ‘false’.	Useful if you access a remote server through a low bandwidth connection. Disable for production systems.
passwordCharacterEncoding	What character encoding is used for passwords? Leaving this set to the default value (null) uses the platform character set, which works for ISO8859_1 (i.e. “latin1”) passwords. For passwords in other character encodings, the encoding will have to be specified with this property, as it’s not possible for the driver to auto-detect this.	If you are not using an all-ASCII password and work outside the area of Western European languages, knowing about this is a life-saver.
characterEncoding	If ‘useUnicode’ is set to true (which is the default), what Java character encoding should the driver use when dealing with strings? (The default is to ‘autodetect’.) If the encoding cannot be determined, then an exception will be raised.	Sometimes the MySQL server does not detect the character set correctly and therefore returns garbage when querying tables with non-ASCII data. In this case this property can be used to override the autodetection of character sets.
functionsNeverReturnBlobs	Should the driver always treat data from functions returning BLOBs as Strings – specifically to work around dubious metadata returned by the server for GROUP BY clauses?	The MySQL JDBC-Driver returns string results of functions in SQL statements as byte-arrays. If you use any functions in SELECT statements, set this property to “true” or you will see garbage in the results.
emptyStringsConvertToZero	Should the driver allow conversions from empty string fields to numeric values of ‘0’?	MySQL treats empty strings or values as zero for numeric columns by default. If you need to see the values as they are stored in the database without helpful corrections from the JDBC driver, then set this flag to false.
zeroDateTimeBehavior	What should happen when the driver encounters DATETIME values that are composed entirely of zeros (used by MySQL to represent invalid dates)? Valid values are “exception”, “round” and “convertToNull”.	Use this if your select statement fails with an exception and you have a zero-date in it. Set it to ‘convertToNull’ to get a more reasonable result.
serverTimezone	Override detection/mapping of timezone. Used when timezone from server doesn’t map to Java timezone	Use this if your dates returned by the database seem to be offset by some hours. Also see ‘useGmtMillisForDatetimes’, ‘useJDBCCompliantTimezoneShift’, ‘useLegacyDatetimeCode’ and ‘useTimezone’ for more time zone related functions.
holdResultsOpenOverStatementClose	Should the driver leave the result sets open on Statement.close() (enabling violates JDBC specification)	This can be a very helpful property for reports with subreports. It allows the Pentaho Reporting engine to process several server-side resultsets in parallel. Without this setting, the reporting engine will fully buffer result-sets in memory instead, which increases the memory consumption of the reporting system.
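The properties in the table above are appended to the JDBC URL as `name=value` pairs, introduced by `?` and separated by `&`. A minimal sketch of assembling such a URL follows, assuming simple property values that need no escaping. The class `MySqlUrlSketch` and the method `buildUrl` are hypothetical helpers, not part of MySQL Connector/J or Pentaho Reporting.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

/** Hypothetical helper for illustration; not part of Connector/J or Pentaho Reporting. */
final class MySqlUrlSketch {
    private MySqlUrlSketch() {
    }

    /** Builds jdbc:mysql://host:port/database[?name=value[&name=value]...]. */
    static String buildUrl(String host, int port, String database, Map<String, String> properties) {
        StringBuilder url = new StringBuilder("jdbc:mysql://")
                .append(host).append(':').append(port)
                .append('/').append(database);
        if (!properties.isEmpty()) {
            // The first property follows '?', all later ones are joined with '&'.
            StringJoiner query = new StringJoiner("&", "?", "");
            for (Map.Entry<String, String> entry : properties.entrySet()) {
                query.add(entry.getKey() + "=" + entry.getValue());
            }
            url.append(query.toString());
        }
        return url.toString();
    }

    public static void main(String[] args) {
        // A LinkedHashMap keeps the properties in insertion order.
        Map<String, String> properties = new LinkedHashMap<String, String>();
        properties.put("zeroDateTimeBehavior", "convertToNull");
        properties.put("holdResultsOpenOverStatementClose", "true");
        System.out.println(buildUrl("localhost", 3306, "sampledata", properties));
    }
}
```

The resulting string goes into the JDBC-URL field of the connection definition; the user name and password stay in their own fields.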

Output-target	Description
table/html;page-mode=stream	HTML as a single page, all report pagebreaks are ignored
table/html;page-mode=page	HTML as a sequence of physical pages, manual and automatic pagebreaks are active
application/vnd.openxmlformats-officedocument.spreadsheetml.sheet;page-mode=flow	Excel 2007 XLSX Workbook
table/excel;page-mode=flow	Excel 97 Workbook
table/csv;page-mode=stream	CSV output
table/rtf;page-mode=flow	Rich-Text-Format
pageable/pdf	PDF output
pageable/text	Plain text
pageable/xml	Pageable XML with layout information
table/xml	Table-XML output
pageable/X-AWT-Graphics;image-type=png	A single report page as PNG
mime-message/text/html	Mime-Email with HTML as body text and all style and images as inline attachments
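Each output-target in the table above consists of an export type, optionally followed by `;name=value` options such as `page-mode=stream`. The sketch below splits such a string into its parts, which can help when filtering a list of output types. The class `OutputTargetSketch` is a hypothetical helper and not an API of the reporting engine.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper for illustration; not an API of the reporting engine. */
final class OutputTargetSketch {
    /** The part before the first ';', for example "table/html". */
    final String exportType;
    /** The ';name=value' options, for example page-mode -> stream. */
    final Map<String, String> options;

    private OutputTargetSketch(String exportType, Map<String, String> options) {
        this.exportType = exportType;
        this.options = options;
    }

    static OutputTargetSketch parse(String outputTarget) {
        String[] parts = outputTarget.split(";");
        Map<String, String> options = new LinkedHashMap<String, String>();
        for (int i = 1; i < parts.length; i++) {
            int eq = parts[i].indexOf('=');
            if (eq > 0) {
                // Text before '=' is the option name, text after it the value.
                options.put(parts[i].substring(0, eq).trim(), parts[i].substring(eq + 1).trim());
            }
        }
        return new OutputTargetSketch(parts[0].trim(), options);
    }
}
```

For `table/html;page-mode=stream` this yields the export type `table/html` with a single `page-mode` option, while `pageable/pdf` has no options at all.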

ID	Value
false	Do Not Print
true	Print

The database as bottleneck

The layout as bottleneck

Subreports: Inline vs Banded

Images loaded from HTTP-URLs

Misconfigured caching as source of slow-downs

Output targets as bottleneck

Pageable outputs

Table outputs

Streaming HTML exports

Calculations override static values

How to avoid slow queries while designing the report

Avoid any database access at all while designing reports in the Report Designer

Connecting to a MySQL database – the basics

User management and access control in MySQL

Listing the defined users

Connections from Pentaho Report Designer vs. Connections from PhpMyAdmin or other Server Side Software

Pentaho Reporting, the Web-Server and MySQL run on the same machine

Pentaho Reporting, the Web-Server and MySQL all use different machines

The Web-Server and MySQL share the same machine

Fine-grained access permissions

MySQL connection properties you should know

Further reading

So what is that ominous “query” parameter about?

How do I parametrize the SINGLEVALUEQUERY or MULTIVALUEQUERY formula function?

How do I limit the visible output types for a particular report?

Server Side Printing

Canvas-Layout

Block-Layout

Inline-Layout

Row-Layout

Combining layout strategies for better effects

How to prevent dynamic-height elements from overlapping other elements

Analytical Reports

Operational Reports

Well-Known Keys

Session keys

User-Defined Environment settings

How to use them

Auto-mapping

Where to use Report-Environment fields

Setting up session values

Setting up global user-defined report-environment values