Jump to content

Recommended Posts

Posted (edited)

Greetings, folks!

Here is a way to optimise the HTML length by up to 30%, depending on the nesting.

The examples I give are taken from an old Interlude core, but the general idea is applicable to all java-based cores.

Proof of concept!
The two important bits in the screenshot below are the (1) Original HTML size row and the (4) No Tabs/Rows size row, which represent the before and after, respectively.

image.png.79a2d793c47dfd3fabd786d0b6cd2399.png

STEPS:

 1. locate the `setHtml(String text)` method inside `NpcHtmlMessage.java`.

 2. add the following bit of code as a standalone method:

/**
 * Replaces all occurrences of New Rows and Tab spaces in the string.
 * @param text the string that may contain invalid characters
 * @return the string with invalid character replaced by underscores
 */
public static String removeTabSpacesAndNewRows(String text)
{
	return text.replaceAll("[\\t\\n]", "");
}


 3. find the last ocurence of the `text` String variable in the `setHtml()` method, and push it through this newly created method like so: `removeTabSpacesAndNewRows(text);`.
 
4. example of the final result.
 

/**
 * Sets the html.
 * @param text the new html
 */
public void setHtml(String text)
{
	if (text == null)
	{
		LOGGER.warning("Html is null! this will crash the client!");
		_html = "<html><body></body></html>";
		return;
	}
	
	if (text.length() > 8192)
	{
		LOGGER.warning("Html is too long! this will crash the client!");
		_html = "<html><body>Html was too long,<br>Try to use DB for this action</body></html>";
		return;
	}
		
	_html = removeTabSpacesAndNewRows(text); // html code must not exceed 8192 bytes
}

/**
 * Replaces all occurrences of New Rows and Tab spaces in the string.
 * @param text the string that may contain invalid characters
 * @return the string with invalid character replaced by underscores
 */
public static String removeTabSpacesAndNewRows(String text)
{
	return text.replaceAll("[\\t\\n]", "");
}



5. You could further refine it by processing the `text` variable between the two IF clauses by creating a new local variable, assigning it a value of `text` and then replacing the subsequent  `text` mentions in the `setHtml()` method with this new local var. Here's what it would look like:
 

/**
 * Sets the html.
 * @param text the new html
 */
public void setHtml(String text)
{
	if (text == null)
	{
		LOGGER.warning("Html is null! this will crash the client!");
		_html = "<html><body></body></html>";
		return;
	}

	String refinedText = removeTabSpacesAndNewRows(text);
	if (refinedText.length() > 8192)
	{
		LOGGER.warning("Html is too long! this will crash the client!");
		_html = "<html><body>Html was too long,<br>Try to use DB for this action</body></html>";
		return;
	}
		
	_html = refinedText; // html code must not exceed 8192 bytes
}



UPDATE:
(thanks to @xdem for pointing it out)

You can instead apply the same logic/method to the HTMCache.java
To do so, locate the loadFile() method and recycle the String/Text/Content through the removeTabSpacesAndNewRows method.

KEEP IN MIND that you might want to move the removeTabSpacesAndNewRows method in another java class, preferrably in some UTILITY class.
Here is an example:
 

	public String loadFile(File file)
	{
		final HtmFilter filter = new HtmFilter();
		String content = null;
		if (file.exists() && filter.accept(file) && !file.isDirectory())
		{
			FileInputStream fis = null;
			BufferedInputStream bis = null;
			try
			{
				fis = new FileInputStream(file);
				bis = new BufferedInputStream(fis);
				final int bytes = bis.available();
				final byte[] raw = new byte[bytes];
				bis.read(raw);
				
				content = new String(raw, StandardCharsets.UTF_8);
				content = content.replaceAll("\r\n", "\n");
				content = content.replaceAll("(?s)<!--.*?-->", ""); // Remove html comments
				content = NpcHtmlMessage.removeTabSpacesAndNewRows(content);
				
				final String relpath = Util.getRelativePath(Config.DATAPACK_ROOT, file);
				final int hashcode = relpath.hashCode();
				if (Config.CHECK_HTML_ENCODING && !StandardCharsets.US_ASCII.newEncoder().canEncode(content))
				{
					LOGGER.warning("HTML encoding check: File " + relpath + " contains non ASCII content.");
				}
				
				final String oldContent = _cache.get(hashcode);
				if (oldContent == null)
				{
					_bytesBuffLen += bytes;
					_loadedFiles++;
				}
				else
				{
					_bytesBuffLen = (_bytesBuffLen - oldContent.length()) + bytes;
				}
				
				_cache.put(hashcode, content);
			}
			catch (Exception e)
			{
				LOGGER.warning("Problem with htm file " + e);
			}
			finally
			{
				if (bis != null)
				{
					try
					{
						bis.close();
					}
					catch (Exception e1)
					{
						LOGGER.warning("Problem with HtmCache: " + e1.getMessage());
					}
				}
				
				if (fis != null)
				{
					try
					{
						fis.close();
					}
					catch (Exception e1)
					{
						LOGGER.warning("Problem with HtmCache: " + e1.getMessage());
					}
				}
			}
		}
		
		return content;
	}


 

Edited by Bru7aLMike
headline
  • Like 1
  • Thanks 1
Posted

okay its simple and it will work, but you better save the minified html directly in the HTML Cache during the load phase!

Posted
9 minutes ago, xdem said:

okay its simple and it will work, but you better save the minified html directly in the HTML Cache during the load phase!


Ok, I will update the guide then.

Posted

I would also advice you to experiment with

 

replaceAll("\\s+", " ");

 

s+ will automatically match any whitespace character (including \r \n etc)
 

Posted
25 minutes ago, xdem said:

I would also advice you to experiment with

 

replaceAll("\\s+", " ");

 

s+ will automatically match any whitespace character (including \r \n etc)
 



I have, as evident from the posted screenshot, see a copy below.

The bottom-most line is No White-Spaces, at all.
However, the 5% gain is not worth the risk, in my personal opinion. The Tab-Spaces and New Rows reduces the overall length by quite a lot as is.

image.png.748e32247416b582de08a1491e88a5da.png

Here is a list of all four methods that I used in the screenshot.

	public static String removeTabSpaces(String text)
	{
		return text.replaceAll("\\t", "");
	}

	public static String removeNewRows(String text)
	{
		return text.replaceAll("\\n", "");
	}

	public static String removeTabSpacesAndNewRows(String text)
	{
		return text.replaceAll("[\\t\\n]", "");
	}

	public static String removeAllWhiteSpacing(String text)
	{
		return text.replaceAll("\\s+", "");
	}

 

Posted

That's so useful in interlude clients since the html length is very low and you can't have a full menu page.

For example, this is in interlude but html code after editing to remove spaces and etc is unreadable.
It will make editing way much easier.
lootboxes-armors-nocturn-protojah.png

Thank you for your share 😇

  • Salty Mike changed the title to How to: reduce HTML weight/length from the core/java
  • 4 weeks later...

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now


  • Posts

    • https://web.archive.org/web/20260306183214/https://maxcheaters.com/topic/241828-l2j-l2damage/page/3/ https://l2topzone.com/forum/l2-server-support-problems/9/l2damage-stopped/30514 Also we will try to push longer seasons ever ! (1135-100)/9 = 115 online
    • ONE SIDE – AND EVERYTHING BREAKS ▪ Looks like a simple case: Florida DL, back side, barcode – “clean and minimal”. ▪ In reality, these are exactly the tasks that fail most often. – data provided as plain text – request only for the back side – focus on the barcode (PDF417) ▪ And here’s the key point: ▪ A barcode is not just a “picture on the back”. It’s compressed logic of the entire document. ▪ If it doesn’t match the front, format, and data structure – the system flags it instantly. ▪ Many create a “similar-looking” code. But systems don’t read “similar” – they read by specification. ▪ In cases like this, it’s not about design. It’s about correct data assembly and how it behaves inside the format. ▪ Today only – 15% off for verification cases. ▪ Want it to pass, not just look right? Describe your case – we’ll show where even clean files break. › TG: @mustang_service ( https:// t.me/ mustang_service ) › Channel: Mustang Service ( https:// t.me/ +JPpJCETg-xM1NjNl ) #editing #photoshop #documents #verification #case
    • Your anonymity is a corpse. Blockchain forgets nothing. Your transactions are direct footprints in the hands of anyone who takes an interest. [✘] Still believe in "mixing"? Forget it. Classic Bitcoin mixers are an illusion of security. For Chainalysis and Elliptic algorithms, any attempt to hide tracks in the ledger is transparent. Your "mixing" is an artifact that gets filtered out in seconds. Every transaction leaves a trail that leads to frozen assets or unwanted questions from exchanges.  We don't mix. We break the link. [-] Input: Your "dirty" coins (Dirty BTC/ETH) with all their history and digital markers stay with us. [+] Output: You receive absolutely clean assets (Clean Crypto) from our reserves, which have never intersected with your past. This isn't a game of hide and seek. This is the surgical removal of your financial history from the system.   ------------------------------------------------------------------- Technical indexing: Bitcoin Mixer, Crypto Mixer, Clean BTC, Clean ETH, Anti-Chainalysis, Best Bitcoin Mixer, Anonymous Crypto Exchange, NoLog Mixing Service.
    • Here you are: https://l2crypt.com/l2-tools/l2editor-source/
  • Topics

×
×
  • Create New...

Important Information

This community uses essential cookies to function properly. Non-essential cookies and third-party services are used only with your consent. Read our Privacy Policy and We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue..