▶ No.895727>>895728 >>895737 >>895804 >>895846 >>896072 >>897671
https://blog.ircmaxell.com/2015/03/security-issue-combining-bcrypt-with.html
When will this stupidity finally come to an end?
Even in C, it's absolutely possible to use a struct with pointer and length, and add a library with replacements for the functions which worked with zero terminated strings.
Why would anyone still use zero terminated "strings"? They make no fucking sense, almost the worst idea ever.
▶ No.895728>>895732 >>896203 >>897515
>>895727 (OP)
But anon, null terminated strings take up less space and are the same format regardless of CPU architecture.
▶ No.895732>>895734
>>895728
they don't.
3 bytes difference is irrelevant for long strings.
if you want to make short strings small, then there is such a thing as variable length coding for integers, but this would be so little gain, it's not worth it.
CPU architecture is irrelevant, the pointer size would change in both cases. and the internal representation of the length doesn't mean jack shit anyway.
also zero terminated strings are less efficient because calculating length is O(n).
▶ No.895734>>895738
>>895732
>they don't.
Ah so that 8 byte size prefix is not a waste of space then?
>there is such a thing as variable length coding for integers
Yeah we will just do a big num hack to size our strings! Good idea.
▶ No.895735>>895736 >>895744 >>896031
What if my string is more than 4,294,967,295 characters!
▶ No.895736>>895744 >>896516
>>895735
No one would ever need to store a file more than 4 gigabytes. We don't need to design our system to handle that.
▶ No.895737>>895739
>>895727 (OP)
You know windows internally uses sized strings. Look how well that turned out.
▶ No.895738>>895740 >>895743
>>895734
4 bytes are enough for strings for all practical purposes. if you have more, obviously it's time to use specialized data structures anyway.
it's not a waste of space, it's only 3 bytes more than the terminating null byte. which is about 1-2 characters on average if you use UTF-8, and less than 1 character if you use fixed size Unicode.
unless you store single characters in strings, this doesn't fucking matter, and it's a lot better than turning all code dealing with strings into a potential minefield + sacrificing run time wherever you can't reuse length information for some reason.
>>895734
>Yeah we will just do a big num hack to size our strings! Good idea.
it's only big if your brain is small. it's not gonna be used in most application level code.
anyway, in the same sentence I also said that the gain is minimal so it is not worth it. learn to read. still, that would be better than zero terminated strings.
▶ No.895739>>895743
>>895737
which exactly of windows problems are a consequence of this?
▶ No.895740
>>895738
>unless you store single characters in strings a lot of times
a few words escaped, fixed
▶ No.895742>>895795 >>896253
With any kind of input you usually don't even know the string length beforehand, so some kind of string termination is necessary.
and you don't always need to know the string length either
▶ No.895743>>895748 >>896509
>>895738
>4 bytes are enough for strings for all practical purposes
I agree, no one could possibly need a hard drive more than 16 megabytes. A 200Mhz CPU is blazing fast.
>it's not a waste of space, it's only 3 bytes more than the terminating null byte
For 32bit max length strings. For small strings especially it's a waste of space.
>if you use fixed size Unicode.
Well good thing no one uses that for the same reason no one uses size prefixed strings. Because it has a different representation on different machines. Byte order and what not.
>you can't reuse length information
Well thats one particular operation
> it's not gonna be used in the most of application level code
Do you not know what bignum is? You are saying that everyone is going to be using a bignum implementation for basic fucking strings.
>>895739
>windows problems are a consequence of this?
Idk its proprietary they only let us know so much
▶ No.895744>>895745 >>895803
>>895736
if you read a file that's as big, and need to keep all of the content in RAM for some reason, you aren't using a fucking string type for it. it will be almost useless
in this form anyway.
>>895735
a legit use case, please.
also keep in mind that scanning 4 GiB for the zero byte would really take a lot of time.
▶ No.895745>>895750
>>895744
>you aren't using a fucking string type for it.
So no one reads files into char* then? lol
▶ No.895748>>895751
>>895743
>Do you not know what bignum is? You are saying that everyone is going to be using a bignum implementation for basic fucking strings.
I'm saying no one will re-implement it.
And using bignum already implemented in a library doesn't make shit any more complex.
>>895743
>Well thats one particular operation
when it is used somewhere and turns an O(n) algorithm into O(n^2) that would be a big deal and a PITA to fix.
>>895743
>Well good thing no one uses that for the same reason no one uses size prefixed strings. Because it has a different representation on different machines. Byte order and what not.
For most use cases it's a bullshit reason. Byte order only matters for data exchange --- files and network. Files do not need to store their size, their size is known. So prefixing size in files would be simply excessive, just as adding zero byte to the end. On network, size counts even more, so it's a normal practice to use variable length coding (protobuf, etc.).
But we are talking about in-memory representation. And you know, you don't swap a fucking CPU on a running machine while keeping RAM and CPU cache and registers, etc.
▶ No.895749>>895795 >>895806
>+3 bytes is a waste of space for strings
Meanwhile people are happily making programs in javascript where every variable is some abominable super object.
▶ No.895750>>895751
>>895745
char* doesn't say anything about whether the referenced content is zero terminated.
it's just a fucking pointer.
▶ No.895751>>895754 >>895755 >>895795
>>895748
>And using bignum already implemented in a library doesn't make shit any more complex.
You think linking in foreign dependencies to use strings does not increase the complexity? HAHAHA
>Byte order only matters for data exchange
Like uhhhh text files, websites, spreadsheets, literally everything.
>are happily making programs in javascript where every variable is some abominable super object.
And this is a bad thing
>On network, size counts even more, so it's a normal practice to use variable length coding (protobuf, etc.).
Not for strings where the smallest representation is null terminated.
>>895750
>He does not have 10 gigabyte log files he wants to parse
▶ No.895754>>895757 >>895764
>>895751
>>He does not have 10 gigabyte log files he wants to parse
you never wrote software which was able to do that, obviously.
>>895751
>Like uhhhh text files, websites, spreadsheets, literally everything.
and? they don't use zero terminated strings.
and we are talking about in-memory representation.
>>895751
>You think linking in foreign dependencies to use strings does not increase the complexity? HAHAHA
HAHAHA, go program some stuff without libc.
▶ No.895755>>895776
>>895751
>And this is a bad thing
Of course. My point was that 3 bytes on a string is nothing. It's the difference between "Hello world!" and "Hello world!gay". You're more likely to waste bytes on shit software design than a structured string.
▶ No.895757>>895759 >>895761
>>895754
>go program some stuff without libc.
String copy is like 5 lines of code to write yourself, you want to add on a bunch of big num shit to make it 100.
>and we are talking about in-memory representation.
You think file formats don't use null termination literally everywhere?
>you never wrote software which was able to do that, obviously.
It's really fucking easy, you just mmap it into memory and start reading. The OS will load and unload the pages for you.
>You're more likely to waste bytes on shit software design than a structured string.
People wasting space at the high level is no excuse to waste shit at the low level. Even worse wasting space at the low level is going to make all that high level shit even shittier.
▶ No.895759
>>895757
>You think file formats don't use null termination literally everywhere?
Yes.
>>895757
>just mmap it into memory and start reading
Then you aren't using any null termination.
▶ No.895761>>895763
>>895757
>excuse to waste shit at the low level
You're free to null terminate your own esoteric string use cases if you really need to squeeze the fuck out of every byte of memory, but the chances that you do are probably 0. I don't think you understand how meaningless those 3 bytes really are in this case.
▶ No.895763
>>895761
>I don't think you understand how meaningless those 3 bytes really are in this case.
combining that with the other text, it's pretty much obvious that he doesn't.
▶ No.895764>>895770
>>895754
>Then you aren't using any null termination.
mmap is not dependent on the file type you dolt
>I don't think you understand how meaningless those 3 bytes really are in this case.
No reason to optimize! Not like computers process hundreds of billions of strings a day! That would not be beneficial at all!
▶ No.895770
>>895764
>mmap is not dependent on the file type you dolt
which retarded file format are you using which has zero termination at the end?
▶ No.895776>>895798
Null-terminated strings have the advantage of behaving extremely well with recursive functions, as constructing a suffix substring is just a matter of incrementing a pointer.
Think about how Lisp handles lists: they are chained cons cells, with the last cons having an empty (null) list as its CDR. This is how you work with recursion in Lisp.
The fact that you think null-terminated strings are useless proves you still have much to learn, youngling.
>>895755
>3 bytes on a string is nothing
Wrong. First of all, a size_t variable in a 64 bit environment is 64 bits, so we're talking about 7 extra bytes. Second of all, those bytes are nothing in a text, but when you have a plethora of max 10 character strings (quite frequent), on a 32 bit system it increases their size by about 27%, and on a 64 bit system it's about a 64% memory usage increase.
Imagine storing an associative array that uses, as keys, 5-char strings with an 8 byte size variable. That's 13 bytes per key. With null-terminated strings, it's 6 bytes. That's about 117% more space used by the size+string solution.
▶ No.895777>>895778 >>895779 >>895781
>ITT: strings should be null terminated because some filetypes use null characters, also I don't stat my mmaps and EOF aren't a thing, and I prefer to waste cycles scanning for the null terminator rather than wasting an extra 1% of memory that would speed up most use cases and solve 95% of the silliest and most common types of bugs ever
The absolute state of C idiots in this board, everyone.
▶ No.895778>>896348
>>895777
Go back to your javascript bullshit. Clearly you don't care about efficiency and interchangeability.
▶ No.895779>>895810
>>895777
Do you know that EOF is an integer value, not an unsigned char, that most strings in a system are very small, that strlen() is rarely needed, and that you can still store a string's length in C, while not losing the advantages of the NULL termination?
▶ No.895781
▶ No.895785>>895788 >>895793 >>895801 >>896156 >>896219
the virgin FOReskin
[code]
void map(char *str, size_t str_len, char (*func)(char)) {
    for (size_t i = 0; i < str_len; ++i) {
        str[i] = func(str[i]);
    }
}
[/code]
The Chad Elegant Recursion
[code]
void map(char *str, char (*func)(char)) {
    if (*str) {
        *str = func(*str);
        map(str, func);
    }
}
[/code]
▶ No.895788>>895790
>>895785
should be map(str + 1, func), LOL (str++ would pass the unincremented pointer)
▶ No.895790>>895811 >>896038
>>895788
>stack overflow
This is why you don't use languages without proper tail recursion.
▶ No.895793>>895811
>>895785
>it took less to type therefore it's better as a program
Remove yourself from premises.
▶ No.895795>>895797 >>895811
Null-terminated strings suck. C weenies defend it because that's what C uses. Common Lisp strings are arrays, and they can be adjustable (grow and shrink) and have a fill pointer (anything less than it is the currently used part). This covers all the uses of dynamically sized strings, length-prefixed strings, and fixed-length strings. Lisp strings are arrays, so all arrays can have these properties.
>>895742
Bullshit. You always need to know the length. If you really added up all the waste from C and UNIX "comparing characters to zero and adding one to pointers", it would be more efficient to have GC and dynamic typing and store files as arrays of strings. I'm not kidding. C malloc overhead is huge too, but on a Lisp machine, allocating a list only uses one word of memory per element. Allocating a 1D array only uses one header word to store the actual length of the array (which malloc has to do too, but it doesn't provide useful information to you) followed by the words for the array data. Lisp machine overhead is much smaller than C overhead, and the GC compacts to eliminate memory fragmentation.
>>895749
>>+3 bytes is a waste of space for strings
>Meanwhile people are happily making programs in javascript where every variable is some abominable super object.
That's because C sucks. malloc in C has more than 3 bytes of waste. JavaScript is a better language than C even though it sucks too.
>>895751
>>He does not have 10 gigabyte log files he wants to parse
You're going to read an entire 10 GB file into memory (not memory mapping) and stick a 0 byte on the end, but you think an 8 byte length is wasteful? I have no idea why anyone would do things like that.
> Subject: More On Compiler Jibberings...
>
> ...
> There's nothing wrong with C as it was originally
> designed,
> ...
bullshite.
Since when is it acceptable for a language to incorporate
two entirely diverse concepts such as setf and cadr into the
same operator (=), the sole semantic distinction being that
if you mean cadr and not setf, you have to bracket your
variable with the characters that are used to represent
swearing in cartoons? Or do you have to do that if you mean
setf, not cadr? Sigh.
Wouldn't hurt to have an error handling hook, real memory
allocation (and garbage collection) routines, real data
types with machine independent sizes (and string data types
that don't barf if you have a NUL in them), reasonable
equality testing for all types of variables without having
to call some heinous library routine like strncmp,
and... and... and... Sheesh.
I've always loved the "elevator controller" paradigm,
because C is well suited to programming embedded controllers
and not much else. Not that I'd knowingly risk my life in
an elevator that was controlled by a program written in C,
mind you...
And what can you say about a language which is largely used
for processing strings (how much time does Unix spend
comparing characters to zero and adding one to pointers?)
but which has no string data type? Can't decide if an array
is an aggregate or an address? Doesn't know if strings are
constants or variables? Allows them as initializers
sometimes but not others?
(I realize this does not really address the original topic,
but who really cares. "There's nothing wrong with C as it
was originally designed" is a dangerously positive sweeping
statement to be found in a message posted to this list.)
▶ No.895797
>>895795
>Spams copy pasta
dropped
▶ No.895798>>895800 >>895811
>>895776
>max 10 characters strings (quite frequent)
citation needed
>First of all, a size_t variable in a 64 bits environment is 64 bits
what about uint32_t?
▶ No.895800>>895803
>>895798
>4 gigabyte limit
▶ No.895801>>895802 >>895811 >>896088
>>895785
>char (*func)(char)
lol, what a retarded syntax
▶ No.895802>>895808 >>896088
>>895801
I bet you are the type of larper that gets autistic about where parentheses are placed or using spaces vs tabs, wasting all our fucking time.
▶ No.895803>>895805
▶ No.895804
>>895727 (OP)
>misuse null-terminated strings
<FUCK NULL TERMINATED STRINGS, IT WASN'T MY OWN STUPIDITY
The code for the hash in your link is shit, and it's not because of the string. It's because the bcrypt writer played with fire and got burned. If you work directly with pointer logic, you need to be very careful. The language does offer you ways of solving the problem with safer, easier to use tools. The problem is, when you need to be efficient, you're going to have to write code closer to the hardware level. You might as well ban chainsaws because idiots get hurt by them.
▶ No.895805>>895808
>>895803
Who said anything about keeping it in RAM? You can process something without it being in RAM. Ever heard of streams? Ever heard of memory mapped files? Guess not lol.
▶ No.895806>>895809
>>895749
And there's absolutely nothing wrong with that. Having been in both worlds, it's such a pleasure to write software in the more abstract languages.
▶ No.895808>>895812 >>896088
>>895802
not.
it's a lot less clear than `char -> char` for example, or even `Function<char, char>`.
try to spell (in C) a type of a variable which is a function which takes a char and returns a function which returns a function which returns a function which returns a char, for example.
>>895805
if you read from file, you already know the size, because files have size. adding 1 useless byte is useless and stupid.
▶ No.895809>>895815 >>895818
>>895806
Javascript is the C of high level languages
▶ No.895810>>895814 >>895816 >>895827
>>895779
>Do you know that EOF is an integer value
People ITT apparently don't know null terminated strings and reading from files have nothing to do with each other at all, that's my point.
>that most strings in a system are very small
Did you know SQL databases solved this ages ago with fixed size char fields, variable size char fields and text fields? Fuck, we could solve this the same way we solved numeric types of different sizes, with short strings, regular strings, long strings, etc.
<but that's not YOONIKS-y and simple!
It's about as obtuse as integer sizes. Read: not at all if you care the littlest bit about muh autistic efficiency. Not only that, but the compiler could infer the most adjusted type for literals, so you should only worry about user inputs and files, which should have a fixed length anyway.
<but muh length promotion would waste too much!
Go write assembly then, fag.
>that strlen() is rarely needed,
Rarely my ass, unless you use buffers and increase the complexity of your program by doing this.
▶ No.895811>>895813 >>895819 >>895822 >>896921
>>895793
That alone is a reason to make it better, but it's also clearer, the function takes one less argument, and it doesn't need to push a new variable onto the stack.
>>895790
While it is true that ANSI C says nothing about tail call recursion, GCC does it.
>>895795
Mr. Common Lisp[1] here apparently does not understand the value of a null terminator in a linear collection of elements (like a string), even though it is the principle upon which cons cell lists are constructed.
[1] yuck!
>>895798
>citation needed
Look up any software, and see how long most strings are.
>what about uint32_t?
Though there is no reason for it not to be used, it is not recommended to hardcode the width of your size_t. Also, uint32_t is not defined by ANSI standards older than C99.
>>895801
>return type (name) (arguments)
How would you do it, Mr. Smart Man?
▶ No.895812
>>895808
<Not
>Goes on to larp about syntax
▶ No.895813>>895827
>>895811
I bet you think a linked list is good too because it doesn't need an iterator variable to loop through.
▶ No.895814>>895820
>>895810
>People ITT apparently don't know null terminated strings and reading from files have nothing to do with each other at all, that's my point.
People ITT don't know that null terminated strings are used in file formats all the time.
▶ No.895815>>895817
>>895809
>Javascript is the crap of high level languages
ftfy
although C is crap too, so… not a big difference after all.
▶ No.895816>>895823 >>896348
>>895810
>Go write assembly then, fag.
Go write in javascript faggot, its where you belong.
▶ No.895817
>>895815
Thats the point dingus
▶ No.895819>>895827
>>895811
>and it doesn't need to push a new variable onto the stack
who told you so?
>what is registers?
▶ No.895820
>>895814
>People ITT don't know that null terminated strings are used in file formats all the time.
and the most widely used example is … ?
▶ No.895821
so many newfag CS undergrads ITT smh
▶ No.895822>>895827 >>895836
>>895811
>Though there is no reason for it not to be used, is not recommended to hardcode your size_t. Also, uint32_t is not defined by ANSI standards older than C99.
Older standards than C99 belong to the garbage bin.
▶ No.895823>>895824
>>895816
Your beloved C does size promotion all the time. Fuck, getchar(), which is used to read a single character from a file, which is about as wasteful of a function as it gets, performs promotions with every single call. And it's negligible.
Really, fuck off. You don't even want assembly, your autism should only allow you to use ASICs that waste zero cycles at all.
▶ No.895824>>895827 >>895828
>>895823
I hate C, I just like null termination.
▶ No.895827>>895831 >>895833 >>895888
>>895810
SQL databases are much different from C storage. For starters, the length of a VARCHAR is stored only once, in the column definition. When the length is dynamic, we're talking about text, which will indeed make the extra 8 bytes literally nothing.
><but that's not YOONIKS-y and simple!
I don't like Unix, please do not put words in my mouth.
>some rambling on stuff I haven't mentioned
ok
>Rarely my ass
Rarely indeed. For hard-coded strings, the length is simply sizeof(myString) (which counts the null terminator). For strings that you receive as input, the size is calculated while receiving it, or is pre-given.
Null-terminated dynamic-size strings are good for manipulation, sized dynamic-size strings are good for interchange (databases, network, file formats, etc.)
You should use fixed-length strings as much as possible anyway.
>>895819
If it is not pushed onto the stack, it's a compiler optimization, that you shouldn't rely on, or you need to specify the variable as volatile.
>>895813
Linked lists are excellent as lists. If you try to use them when you should use fixed-size arrays or vectors, maybe you should take an IQ test, and based on that, decide if you should kill yourself or retake Data Structures 101.
>>895822
If you're not a LARPer, surely you have heard of legacy codebases.
>>895824
Same. C sucks, but most of its detractors just don't understand the real reasons why.
▶ No.895828>>895830
>>895824
Well tough shit then…
▶ No.895830
>>895828
Tough shit for you, attacking a strawman this whole time.
▶ No.895831>>895832
>>895827
>Linked lists are excellent as lists
>you should take an IQ test
You sure you're not projecting m8?
▶ No.895832>>895835
>>895831
>I don't understand why you would possibly want a linked list.
How many years of programming do you have on your CV, again?
▶ No.895833>>895834 >>895844
>>895827
>Linked lists are excellent as lists.
Linked lists waste all that space on pointers though, and have terrible cache properties, jumping around to different pages all the time. Big O time complexity has little to do with real-world performance when the constant factors dominate.
▶ No.895834>>895837 >>895843
>>895833 (checked)
Is an array of pointers that get reallocated all the time a better solution when the list is not changing often?
▶ No.895835>>895861
>>895832
Well I've never heard of a situation where a linked list is the best solution, so here's your chance to educate me.
▶ No.895836>>895839 >>895845 >>896048
>>895822
That's where you're wrong, kiddo. C99 is one of the worst standards to come, and everyone in the industry uses C95 exclusively.
▶ No.895837>>895841 >>895852 >>895968
>>895834
If that array of pointers fits within a few pages then it's absolutely faster compared to chasing down pages wherever they get allocated.
▶ No.895839
>>895836
>The furry c programmer knows all
▶ No.895841
>>895837
Thx. Is this the best solution for small lists? Is there a special list type you'd recommend?
▶ No.895843>>895852
>>895834
Have you ever benchmarked this shit on relatively modern computers?
▶ No.895844>>895848 >>895849 >>895850 >>895855
>>895833
You and I must have different definitions of "real world". When you need to constantly resize (queues, lists, stacks), using vectors is extremely expensive. When the size of your vector remains constant, or is changed very little, using a vector is better.
You wouldn't cut a steak with a wood saw, or cut a plank with a steak knife. Two different tools serve two different purposes, and so do two different data structures.
▶ No.895845>>895858
▶ No.895846>>895851
>>895727 (OP)
That vulnerability mentioned in that blog post is developer error. The function takes in a string, but you pass in a byte array. Why would you expect it to work? If you pass in the wrong type of variable then of course it might not work right.
▶ No.895848>>895861
>>895844
>You and I must have different definitions of "real world". When you need to constantly resize (queues, lists, stack), using vectors is extremely expensive
Any evidence?
▶ No.895849>>895853 >>895861
>>895844
> using vectors is extremely expensive
That's just it, it's not extremely expensive. It has an expensive big O cost, but almost every benchmark will show that vectors are faster. This is because caches exist. The cache changes how all of this works.
▶ No.895850>>895861
>>895844
Look your CS 101 data structures class using big O notation is not an accurate description of how caches work.
▶ No.895851>>895985
>>895846
in C, char* is also used for byte arrays.
this is a programmer error, but it could be prevented if the design of the language and the stdlib was less shit.
programmers will always make some errors, but some of them can be prevented entirely as a class.
▶ No.895852
>>895843
No. But performance always takes priority.
And I think we should listen to
>>895837
's practical advice and not some stupid theory developed by java shitcoders at some university.
▶ No.895853>>895854
>>895849
>It have a expensive big O cost
it doesn't.
amortized cost of adding an item is still O(1).
▶ No.895854>>895856 >>895861
>>895853
Adding an item to the middle of a vector is not amortized to O(1).
▶ No.895855
>>895844
You can change how often a vector reallocates itself, but really, the default behavior is sufficient for most implementations.
▶ No.895856>>895859
>>895854
neither is it in a linked list if you first need to find the place to insert --- you'll need an O(n) traversal first.
▶ No.895858>>895862
>>895845
Yours neither, loser. You literally made a bold statement without backing it, or providing proof. Your nodev ass can't even write a reverse polish calculator, LOL.
▶ No.895859>>895861 >>895867
>>895856
Again you keep using this fucking big O notation when talking about the speed of these data structures. The real world does not follow big O. Iterating over a vector that's all in one page is thousands of times faster than jumping between pages where linked list nodes are allocated, despite the same time complexity.
▶ No.895861>>895864 >>895867
>>895842
Yup. This is why compiler warnings exist when you try to do implicit conversion, and this is why Apps Hungarian Notation is useful.
>>895848
>evidence
>of a math problem
1st year CS theory that you ought to know if you want to be taken seriously here.
>>895849
>benchmarks
That use cases where vectors are indeed better.
>>895850
>cache
Do you think data structures stop existing outside of RAM?
>>895835
Filesystems make extensive use of linked data structures.
>>895854
In the middle, or anywhere besides the end. Dynamic vectors can be used somewhat effectively as stacks because of that, but that's about it.
>>895859
>The real world does not follow big O
L M A O
M M
A A
O O
▶ No.895862>>895863
>>895858
>Your nodev ass can't even write a reverse polish calculator, LOL
I can write even infix calculator without any problem.
I actually wrote a compiler for a simple language and a lot of other shit too. Fix your detector.
▶ No.895863
>>895862
You're still claiming shit you've never done, and don't provide proof.
>>>/reddit/
▶ No.895864>>895866
>>895861
Look here retard. Iterating over a list and a vector has the same big O cost. In the case of an actual list though you will be chasing down pointers in different pages. Big O does not model this cost at all. If you knew more about CS theory than an undergrad simpleton you would understand this.
▶ No.895865>>895870
oh shit watch out there's a troll in here.
▶ No.895866>>895869
>>895864
>muh iterations
Insert a new value at the head of a 10 million records vector.
Now do it at the head of a 10 million records linked list.
Come back and tell everyone how it went.
▶ No.895867>>895868 >>895872
>>895861
>1st year CS theory that you ought to know if you want to be taken seriously here.
When you make claims based on your invalid mental model of the modern computing hardware, of course you need to prove your bullshit to be taken seriously.
>>895859
Lol, are you a brainlet or what?
>>895861
>Filesystems make extensive use of linked data structures.
For different reasons altogether.
We are talking about in-memory data structures.
▶ No.895868>>895870
>>895867
>Lol, are you a brainlet or what?
<standard Big O notation always correctly models hardware
what the fuck are you on about
▶ No.895869>>895872
>>895866
>Insert a new value at the head of a 10 million records vector.
if you need to insert at head, you use deque and not vector.
for a deque, this is not a problem at all and it will be faster than a linked list (amortized)
▶ No.895870>>895871
>>895865
don't worry I got him right here: >>895868
▶ No.895872>>895873 >>895874
>>895867
>We are talking about in-memory data structures.
Who says so? I defended that linked lists had very valid use cases, and everyone and their nodev asses have come to shit on what is basic knowledge.
>invalid mental model of the modern computing hardware
I know how cache works, thank you.
>>895869
>deque
Not always.
▶ No.895873>>895875
>>895872
>not always
I see you don't know what amortized means then
▶ No.895874>>895875 >>895876 >>895881 >>897445
>>895872
TFW your linked list is slower for the one thing it should be better at because of how hardware actually works
https://baptiste-wicht.com/posts/2012/12/cpp-benchmark-vector-list-deque.html
▶ No.895875>>895878 >>895883
>>895873
If you need to frequently mutate the order of your data, deques can still prove too slow, or their overhead too big.
>>895874
>muh benchmarks
Filesystems, do you understand them?
▶ No.895876
>>895874
Note that the only case where the list is actually faster is when they happen to store very large values in each node instead of a pointer to them, which is a retarded contrived use case.
▶ No.895878
▶ No.895881
>>895874
>The random position is found by linear search.
Gee!
▶ No.895882>>895898
>not just implementing a linked list with a lookup table for fast iteration
Kiss and make up, gentlemen. Try not to touch balls though, that's gay.
▶ No.895883>>895884
>>895875
Yeah no one is actually going to ever have to do a linear search on their data to find what they need
▶ No.895884>>895890
>>895883
Lists are not intended for linear searches.
▶ No.895887>>895890
>>895877
>they will likely behind a pointer then, so even then it loses.
Only if you're a terrible programmer.
▶ No.895888
>>895827
>SQL databases are much different than C storage. For starters, the length of VARCHAR is stored only once, in the column definition. When the length is dynamic, we're talking about text, which will indeed make the extra 8 bytes literally nothing.
That really matters nothing at all. The compiler should be able to handle this, along with the promotion rules. My point is that fixed/limited size strings are nothing new and people know how to handle them. The reason most modern programming languages use the same type of string for everything is that C hacked strings in as simple pointers to chars, when that type actually has another property: it is null terminated. So even though later languages knew null terminated strings were bad because they caused all sorts of problems, they didn't think making a distinction was worth it, and just used a size_t for every string, be it 2 or 20000 characters long.
Riddle me this: what would be so wrong about using structs for strings, where one member is a pointer and the other is an unsigned integer whose width adjusts itself to the minimum number of bytes that can hold the length of the string? This way, strings up to the maximum unsigned char value occupy the same space as null terminated strings, and strings up to the maximum unsigned short int value would occupy a measly single extra byte. In addition, by manipulating pointer and length you could generate a view into a string, which is more or less what Rust already does, and save memory in the process.
▶ No.895890>>895894 >>895895
>>895884
Okay so we can agree then that lists are useless for almost everything?
>>895887
How dare someone store an object bigger than the size of a page behind a pointer!
▶ No.895894>>895899
>>895890
lists DO have uses, though. Pretending that they don't is cargo cult programming
▶ No.895895>>895899 >>895900
>>895890
>so we can agree then that lists are useless for almost everything?
They are not useless, they are slower. And sure, they're slower almost every time, but not every time, which is the point I'm making from the beginning.
>How dare someone store and object bigger than the size of a page behind a pointer!
>2048 bytes
>bigger than a page
nigguh
▶ No.895898
>>895882
this table will need to be updated each time you insert or remove something, defeating the purpose.
what you actually probably want is https://bitbucket.org/astrieanna/bitmapped-vector-trie.
▶ No.895899>>895901
>>895895
>>895894
Most things have uses, and the less useful should not be the default.
▶ No.895900>>895901
>>895895
>And sure, they're slower almost every time, but not every time, which is the point I'm making from the beginning
still useless for realtime, as memory allocation is generally unpredictable.
▶ No.895901>>895903 >>895907
>>895899
I don't think I said they were or should be the default, did I?
>>895900
Man, I've mentioned filesystems three times already.
▶ No.895903>>895905 >>895922 >>895930
>>895901
They are the default in schema
▶ No.895904>>895906 >>895913
>hurr durr lets LARP about irrelevant shit
/tech/ in a nutshell. I bet most of you fags haven't even programmed anything except fizzbuzz tier shit.
▶ No.895905>>895922
▶ No.895906>>895910
▶ No.895907
>>895901
>Man, I've mentioned filesystems three times already.
filesystems like the FAT? :^)
I've seen better filesystems use more clever data structures.
▶ No.895910>>895911 >>895913
>>895906
not an argument XDDDDDDDDDDDDDDDDDDDDDDD
▶ No.895911
▶ No.895913>>895915
>>895910
>>895904
>I-I bet you guyz hasnt even program! L O L
"well reasoned argument"
>n-not an argument
shhh. The grown NEETs are talking.
▶ No.895915
>>895913
>grown NEETs
LOL. Keep LARPing faggots.
▶ No.895922>>895924
▶ No.895925>>895926
▶ No.895929>>895932
>>895926
>Ctrl+F
>"default"
>0 results
▶ No.895930>>895931
>>895903
what does it even mean for them to be "default"?
▶ No.895931
>>895930
It means "I know nothing about programming and I need to read my SICP".
▶ No.895932>>895933 >>895935
>>895929
>A more streamlined notation can be used for lists: the elements of the list are simply enclosed in parentheses and separated by spaces. The empty list is written (). For example, the following are equivalent notations for a list of symbols:
>(a b c d e)
>(a . (b . (c . (d . (e . ())))))
LOL. It is the default
▶ No.895933>>895936
>>895932
>this means they're the default DS
▶ No.895935>>895937
>>895932
You are confusing the abstract concept of lists with the particular implementation of linked lists. Scheme uses lists heavily, but that doesn't mean it's based on linked lists.
▶ No.895936>>895938
>>895933
They are. If you write (a b c) you have a linked list.
▶ No.895938>>895940
>>895936
You don't, and you're retarded, and you don't understand the difference between an actual list structure, and the concept of list used in the representation of Scheme programs.
If you type (a b c), you are calling the function 'a' with the arguments 'b' and 'c'.
▶ No.895939>>895942
>>895937
proof of what? I'm just pointing out a distinction.
>Ctrl+F
>link
>0 results
▶ No.895940>>895943
>>895938
>>A more streamlined notation can be used for lists: the elements of the list are simply enclosed in parentheses and separated by spaces. The empty list is written (). For example, the following are equivalent notations for a list of symbols:
>>(a b c d e)
>>(a . (b . (c . (d . (e . ())))))
https://www.gnu.org/software/mit-scheme/documentation/mit-scheme-ref/Lists.html#Lists
▶ No.895942>>896308
>>895939
>7 Lists
>A pair (sometimes called a dotted pair) is a data structure with two fields called the car and cdr fields (for historical reasons). Pairs are created by the procedure cons. The car and cdr fields are accessed by the procedures car and cdr. The car and cdr fields are assigned by the procedures set-car! and set-cdr!.
>Pairs are used primarily to represent lists. A list can be defined recursively as either the empty list or a pair whose cdr is a list. More precisely, the set of lists is defined as the smallest set X such that
> The empty list is in X.
> If list is in X, then any pair whose cdr field contains list is also in X.
▶ No.895943>>895945
>>895940
>I still don't understand Scheme
You could have said "oh, ok, I thought so" when I told you they weren't the default, and you would have just appeared as someone who doesn't know Scheme, which is in itself not a bad thing. Now you're just making an ass out of yourself.
▶ No.895946>>895947
▶ No.895947>>895949
>>895946
>A more streamlined notation can be used for lists: the elements of the list are simply enclosed in parentheses and separated by spaces.
>Pairs are used primarily to represent lists.
>A more streamlined notation can be used for lists: the elements of the list are simply enclosed in parentheses and separated by spaces.
>Pairs are used primarily to represent lists.
>A more streamlined notation can be used for lists: the elements of the list are simply enclosed in parentheses and separated by spaces.
>Pairs are used primarily to represent lists.
>A more streamlined notation can be used for lists: the elements of the list are simply enclosed in parentheses and separated by spaces.
>Pairs are used primarily to represent lists.
>A more streamlined notation can be used for lists: the elements of the list are simply enclosed in parentheses and separated by spaces.
>Pairs are used primarily to represent lists.
>A more streamlined notation can be used for lists: the elements of the list are simply enclosed in parentheses and separated by spaces.
>Pairs are used primarily to represent lists.
you don't understand english.
▶ No.895948
anyways have fun tripfagging.
▶ No.895949>>895950
>>895947
You haven't answered my question. Are lists the default data structure of C?
▶ No.895950>>895955 >>895998
>>895949
there are no built-in data structures in C at all.
▶ No.895955>>895958 >>895983
>>895950
There are arrays. Now not only do you have to read your SICP, but you also have to read your 2nd edition of "The C Programming Language".
▶ No.895958>>895959 >>895960
>>895955
Bullshit. There are no arrays.
▶ No.895959
>>895958
I mean, there's only a joke of an array which must fit into the stack or the constant pool, and it can't be used for anything "big".
What you probably actually mean is just pointers plus the ability to request some memory dynamically if your environment has it in the API.
▶ No.895960>>895961 >>895983
>>895958
By that logic everything is "just pointers". C has arrays: there's a syntax for array initialization, there's a syntax to specify arrays of types, and they are mentioned in the standard.
▶ No.895961>>895962
>>895960
Pro tip: if you can't query the size of the array, it's not an array.
▶ No.895962>>895963 >>895966
>>895961
1) That's a definition you pulled out of your ass
2)
sizeof(myArray);
▶ No.895963>>895972
>>895962
the latter will not work for a pointer to a dynamically allocated "array".
▶ No.895968>>895971
>>895837
I still don't understand why it's faster.
An entry-point pointer and a bunch of structs would, in my head, have the same paging performance when cycling through.
Both have the distance of one pointer to the next struct, and the contents are allocated dynamically.
▶ No.895971>>895974
>>895968
then go ahead and learn instead of making a clown out of yourself here
▶ No.895972>>895973 >>895983
>>895963
Because dynamically allocated memory is not an array as per the C definition of array. Read your books.
▶ No.895973>>895975
>>895972
yeah that's what I said from the beginning.
the arrays that are in C are almost useless, and there are no true arrays.
▶ No.895974
>>895971
Don't be mad at me. Learning from you is simply faster.
▶ No.895975>>895979 >>895983
>>895973
My god, kiddo. Read your fucking book.
char *a;
char b[512];
sizeof(a); /* size of a pointer, e.g. 8 on a 64-bit target */
sizeof(b); /* 512: the size of the whole array */
Those yield different results, because arrays and pointers in C are, in fact, not the same thing.
https://eli.thegreenplace.net/2009/10/21/are-pointers-and-arrays-equivalent-in-c
▶ No.895979>>895981 >>896075
>>895975
>not the same thing.
<but they should be
one more piece of c bloat
▶ No.895981>>895982
>>895979
>I was wrong but I should be right
▶ No.895982
>>895981
Don't assume that anyone that responds is the same person
▶ No.895983>>895984
>>895955
>>895960
>>895972
>>895975
C does not have arrays. It has something called arrays, something called strings, and something called unions, but it does not have any of those things. If I built an array processor, languages with arrays would be faster, but C wouldn't be able to run on it.
▶ No.895984>>896002
>>895983
And I suppose you think lisp cannot exist on x86 because it does not have cons cells at the hardware level?
▶ No.895985
>>895851
>in C, char* is also used for byte arrays.
Byte arrays should be unsigned char*. When the char datatype is used without being explicitly signed or unsigned, it means that it represents characters (as in characters in a string).
This is a problem with PHP itself and has happened with other functions too.
▶ No.895998>>896001
>>895950
>there are no built-in data structures in C at all.
wew lad
▶ No.896001
>>895998
data structures? I think you mean bloat.
▶ No.896002>>896005 >>896006
>>895984
It's harder to make a computer that can't run Lisp than it is to make a computer that can't run C because ISO C cannot run on certain hardware. There are requirements about the kind of hardware C can run on, like it must use binary integers in one of certain number of formats, and "strings" and "arrays" also have a lot of requirements, even though they're horrible. There's no way to use "arrays" and "strings" in C without pointer arithmetic, so C must specify how data appears in memory, which prevents a lot of optimizations and faster hardware designs.
▶ No.896005>>896007
>>896002
>It's harder to make a computer that can't run Lisp than it is to make a computer that can't run C because ISO C cannot run on certain hardware.
>Lisp is so shit, you can't build an emulator or compatibility layer with it.
LISPFAGS BTFO
▶ No.896006
>>896002
Are the existence of C and C++ one of the reasons why progress stopped and all computer architectures are so mediocre?
▶ No.896007>>896008
>>896005
you can emulate everything but running C with shit speed is pointless, as it's only ever used for speed gains.
if C becomes slower than Python for example, then it's fucking pointless to use, as it sucks donkey balls on everything else too.
▶ No.896008>>896009
>>896007
>C becomes slower than Python
That wouldn't happen because C is very specific about everything.
▶ No.896009>>896012
>>896008
it will only make matters worse, dude.
▶ No.896012
>>896009
No. Things can always be abstracted but reduction is impossible after some point which is why more specific languages are faster by default and that DOESN'T CHANGE.
▶ No.896031
>>895735
Then you really don't want to scan for that null terminator.
▶ No.896038
>>895790
Newer revisions of the ISO C standard require the compiler to implement tail recursion properly.
▶ No.896048
>>895836
you are a humongous faggot
▶ No.896072>>896082 >>896091
>>895727 (OP)
It's easier to put a NUL byte at the end (just *one* byte) than to calculate the new length every time it's modified and put it into another variable (presumably at least an int, i.e. probably at least four bytes).
▶ No.896075
>>895979
>arrays and pointers should be the same thing
No. Their underlying concepts are very different. The fact that C arrays are implemented such that an array identifier represents a pointer to its first element in almost all contexts (using the array identifier as a sizeof operand is a case where it does *not*, and for good reasons) is just because it's convenient that way, nothing else.
▶ No.896082
>>896072
What data structure could you possibly be using where you wouldn't already know the length?
▶ No.896088>>896091 >>896093
>>895801
>>895802
>>895808
The syntax is retarded because it's invalid. func being a pointer to a function taking a char argument and returning char is "char (*func)(char)".
▶ No.896091>>896142
>>896072
>calculate the new length every time it's modified
... which isn't how you're supposed to use this stuff.
>>896088
>asterisk is on the wrong side
>INVALID!!1!!!
Yeah this totally changes how it looks.
▶ No.896093>>896142
>>896088
Are you seriously complaining that he put the * on the other side?
▶ No.896095>>896096
>Hey kid, send me the word "hello". That'll be 2 megabytes long, by the way.
▶ No.896096>>896105 >>896122
>>896095
This tbh.
>Store 50,000 5-char identifiers
>650kB
▶ No.896105>>896109
>>896096
>Storing 5 char identifiers
>Not mapping them to an enum
smh tbh famalamadingdong. I already told you how to handle that usecase with almost zero overhead, anyway.
▶ No.896109>>896110
>>896105
>want to store or delete identifier
>need to recompile entire application
>user-submitted identifier
>make script that modifies the sources and recompile
>giant enum {...} with 50 000 entries
▶ No.896110>>896112
>>896109
You could store them in a config file. Oh wait, that's bloat, amirite.
▶ No.896112>>896124
>>896110
>get user query based on identifier
>load up a 640kB file every time
▶ No.896122>>896127
>>896096
I was referring more to the potential security problems of having an arbitrary length value, but yeah, that too.
▶ No.896124>>896126 >>896658
>>896112
You could mmap that file and seek it on demand.
<ooh, but seeking these on demand is so expensive
>get user query based on identifier
>search a million entries
>compare a million 5 char long strings one by one
>not expensive
<but my program will only search for 5 entries tops!
Which is why you need 50k 5 char long strings, right. Two can play the "arbitrarily specific specifications" game, too.
▶ No.896126>>896130
>>896124
>What is a search tree
▶ No.896127>>896128 >>896144
>>896122
>arbitrary length value,
Null terminated strings are a-okay tho.
▶ No.896128
▶ No.896130>>896137
>>896126
Something you could apply to the file as well. Or something you could apply to enum-tagged entries for extra speed. I dunno, I'm not the one making up the stupid limitations.
▶ No.896137>>896151
>>896130
>calls it stupid limitations
>when he's the one calling for 8 extra bytes appended to EVERY string.
▶ No.896142>>896145
>>896091
>>896093
Are you literal brainlets? "putting the asterisk on the wrong side" is invalid syntax and doesn't work.
▶ No.896144
>>896127
<Null terminated
>A one-L NUL, it ends a string
>A two-L NULL points to no thing
>But I will bet a golden bull
>That there is no three-L NULLL
(char)'\0' != (void *)0
▶ No.896145>>896146 >>896149
>>896142
You are complaining about such a minor irrelevant typo.
▶ No.896146>>896150
>>896145
>syntax error (proving him to be a larper who has actually not much of a clue of how function pointers are used correctly) preventing the code from even compiling
>"minor irrelevant typo"
dat desperate damage control of yours tho
▶ No.896149
>>896145
C isn't nignog transgender studies where nothing that is said actually ever matters. The compiler is autist extraordinaire and is merciless to syntax errors of any kind.
▶ No.896150>>896156
>>896146
I'm not the one that made the post. I'm calling you a faggot for sperging out about it.
▶ No.896151>>896155 >>896158
>>896137
>when he's the one calling for 8 extra bytes appended to EVERY string.
Never said that, you just conveniently ignored my post on how to properly handle this, while also adding a feature that would simply save memory in the long run.
▶ No.896155>>896178
>>896151
The one where you advocated adding a complicated BigNum system to system that needs to process strings?
▶ No.896156>>896160 >>896161
>>896150
Desperately playing down a stupid mistake (yea no, it wasn't a "typo" because the larper consistently repeated it, go back to >>895785 and check if unsure) just because (you) yourself didn't notice it is far worse than pointing it out (which you cared to inaptly call "sperging out about it").
▶ No.896158>>896178
>>896151
>posts non-solutions
>how to properly handle this
Very simple use case, kid. You have a map of arbitrary-but-usually-small length strings to whatever (let's say 1 int), and you have 50,000 of those records. You have to serve numerous update and query requests.
You cannot
>use an enum (lm fucking ao)
>use a file (actually you can but it changes nothing)
>mmap enormous chunks of data
It's simple: you have a map of strings. Do you more than double their size?
▶ No.896160>>896166
>>896156
It's an asterisk on the wrong side of the "func" identifier, no one cares about it actually, because it doesn't change the look of it overall. You're sperging over it because you absolutely can't stand losing an argument, or maybe just crave validation. Talk about LARPing.
▶ No.896161
>>896156
>just because (you) yourself didn't notice it is far worse than pointing it out
I guess if you don't autistically rant for multiple posts about a minor syntax error, that means you did not know it was wrong. If you don't correct every little grammar error in a person's post, you must have no concept of proper English usage. The only option is to be a massive faggot about everything.
▶ No.896166>>896167
>>896160
Why am I even coming back to this place chock-full of bitter and confused larpers
▶ No.896167
▶ No.896178>>896179 >>896180
>>896155
Why the fuck would you need bignum when you couldn't hold strings bigger than size_t anyway? And if you could, you would indeed need other access mechanisms. No, you just need unsigned chars, shorts, ints and longs. If this feature, already included in C, is too complicated for you, you should go back to scripting languages.
>>896158
>arbitrary-but-usually-small length strings
>arbitrary length
Oh sure, you never said that before, but okay. If what you need is to hold SQL-like text fields, you would indeed waste 7 bytes per entry using my method, but if you are short on resources (considering you are complaining about 650k being too much) you shouldn't be doing this anyway; at the very least use a varchar. You are also implying these keys are probably unique, non-system-defined and dynamic, which weren't part of the original requirements, while I was assuming your dataset was larger and that these keys were not primary and non-unique, in which case they would benefit from compression if mapped to integers.
Your requirements are stupid anyway. That's about the only use case in which NUL terminated strings would win, and it rests on assumptions such as users submitting arbitrarily large keys and fucking over your limited resources, which could be solved by offloading your keys onto another table held in a file and organized as a radix tree.
▶ No.896179>>896182 >>896196
>>896178
Not actual bignum, you dumb fuck. Variable-length encoding of the size-prefix number. It's very similar.
▶ No.896180
>>896178
NULL-terminated strings also win in ease of processing.
▶ No.896182>>896183
>>896179
Good solution for compact storage and transmission, not so good for processing, as it introduces more branching.
▶ No.896183>>896191
>>896182
Scheme uses actual bignum for all its calculations in its numeric tower, clearly you don't care about a little extra branching.
▶ No.896191>>896193
>>896183
Did I just hear a non-argument?
▶ No.896193
▶ No.896196>>896198
>>896179
>variable length
Not variable at all, just like the size of a char, a short, an int or a long are not variable and they are simply different types. micro strings (minimum addressable size, equivalent to NUL terminated strings; could also be named short short string if you are into retarded modifier naming schemes), short strings, strings, long strings, macro strings (or long long strings) would be different types, and the language would just know how to promote them when appropriate, just like it already promotes these numeric types.
▶ No.896198>>896200
>>896196
Just what we need, c with even more type coercion, only this time with dynamically allocated values.
▶ No.896200>>896202
>>896198
>c with even more type coercion,
Okay, you can tell the fucking compiler you really want to cast that short to an int when passing them to functions accepting ints by explicitly stating it, Java pedantry style, but considering it makes no difference to you or the resource allocations required you may as well let the compiler do it for you.
If you don't want the compiler to automatize anything at all, why don't you fucking code it in assembly?
>with dynamically allocated values.
you wot m8
▶ No.896202>>896214
>>896200
>If you don't want the compiler to automatize anything at all,
I want the compiler to automate all kinds of things. I want it to automate ways to ensure my code is correct. I don't want it to just randomly fuck with the types of the numbers because I am using them. It's like JavaScript fuckery where ints get turned into strings based on the operation.
▶ No.896203>>896205
>>895728
That actually depends on the platform (i.e. what is a char actually stored as). Unarguably, though, they take up more processing, because to detect the end of the string, you have to check each and every character as you are parsing it.
▶ No.896205>>896207 >>896214
>>896203
>they take up more processing
*for a single particular operation
copying a null terminated string takes less processing time
▶ No.896207>>896208
>>896205
No it doesn't. Consider this: how do you know when you are at the end of the source string? With a null terminator, you must check every character.
With an integer counting up to the length of the string, you get the potential benefit of loop unrolling, i.e. compare the index to the string length every 16 elements or so instead of comparing each char to '\0'.
▶ No.896208>>896227
>>896207
>loop unrolling
If you want to blow your instruction cache and slow down your whole program
▶ No.896214
>>896202
>don't want it to just randomly fuck with the types of the numbers because I am using them
<int fug(long thing) { /* fug the long thing */ }
<unsigned short short benis = 255;
<int colonDDDDD = fug(benis);
<fug(benis) == fug(255)
<(int) benis == (int) 255
No data is lost at all, because you just told it to take 8 bytes rather than 1: 255 in 8 bytes is still 255, and the function was already accepting a long. You could try to introduce some sort of generics into C that allowed one function to work with short shorts and another with longs, but that's fucking bloat and would help you nothing at all.
Since you cannot divide or multiply (possible float or double casting) or add or subtract (possible int promotion) a string, all you could do is concat them, which would be implemented via functions returning appropriately sized strings that raise warnings if assigned to smaller string types. No harmful type coercion at all, and miles simpler than any other integer operation system ever devised in a mainstream language.
> Its like javascript fuckery where ints get turned into strings based on the operation.
For what it's for, JS more often than not does what you want it to do, considering it is meant for text processing. I agree JS's type system is retarded, but C can never be as retarded as JS since it is not dynamically typed, and type coercion is really one of your smallest problems: it only bites because the DOM doesn't define a universally enforced type for input values (i.e. numeric fields generally return a Number in modern browsers if the field is supported, but IE returns a string even though it claims to support numeric fields, so you have to cast to Number anyway; and depending on the locale you may not parse it correctly if it uses commas instead of dots for decimal positions), so you can't really know which types you are working with.
>>896205
As long as you know the size of the string you are allocating to, which may be smaller or bigger than the string you are getting (in which case you have to check both that you stay inside your target string's bounds and for your source string's NUL terminator). If the source is bigger, it might get cut, which is undesirable, so you would want your target string to be at least as big as your source string, which implies malloc and also seeking the last position of your source string. Or you could use buffers and do some retarded realloc-ing, or an array of char * to grow your target string as you read your source string, but that's all sorts of retarded and would be more wasteful than if you just knew their lengths.
▶ No.896219>>896222
>>895785
Congratulations, you blew the stack.
▶ No.896222>>896230 >>896236
>>896219
Shouldn't do that since it should trigger tail call optimization, but using function pointers is wasteful if we are talking about autistically optimizing shit.
▶ No.896227
>>896208
Compilers insert loop unrolling for the exact opposite reason: it increases I-cache hits.
▶ No.896230>>896238
>>896222
Neat, but I'm not finding any way to guarantee it, and with small stacks it'll blow quickly. I use function pointers all the time though
▶ No.896236>>896238
>>896222
>autistically optimizing shit
lmfao how about writing a function that actually works instead.
▶ No.896238>>896245
>>896230
>I use function pointers all the time though
It's not a bad thing and the performance cost is negligible, but we are talking about people who feel the need to install Gentoo to cut 2 MB of total RAM usage here. If it's gonna save you several lines of code, or worse, a gigantic switch of death, by all means do it.
>>896236
Tell the tripfag. I personally wouldn't bother writing a single line of C (or C with syntax errors :^) )due to hipsterism.
▶ No.896245>>896348
>>896238
You wouldn't write it because you couldn't. You're just another retarded larper.
▶ No.896253>>896520 >>896524 >>896535
We won't ever be truly free of this moronic C string business until people stop using stdio.
>>895742
The read system call stops reading when input data is exhausted. It returns exactly how many bytes it has read, so at the end of the process you know exactly how long the data is.
NUL-terminated strings are, like errno, a C concept. Linux gives approximately zero fucks about your NUL, just like there is no actual errno global variable (the system call interface simply returns the error code like a sane implementation would). This is stdlib garbage.
▶ No.896308>>896315 >>896352
>>895942
Is there a Lisp out there where the list data structure isn't actually a linked list? Can it be a dynamic array, for example?
▶ No.896315>>896522 >>896656
>>896308
>list data structure
>but not an actual list
durr
▶ No.896348>>896513
>>896245
>>895778
>>895816
Feel like a hero yet, Reddit?
▶ No.896352
>>896308
Clojure kind of applies: they're not the default, but they exist, and you can create a fork where they will be the default.
This "list" should be immutable, so it's relatively tough shit to implement if you want something better than a linked list, but Clojure already has a couple of persistent data structures; either use them, or study the algorithms and re-implement them in your language of choice.
▶ No.896509
>>895743
>200MHz CPU is blazing fast
but anon, a 200MHz CPU is blazing fast. You only need more if you plan to run the latest Macroshit Wangblows craperating cistern wid' Enpantsed Jewgle Crowd Pthtoorage Gapeability.
▶ No.896513
▶ No.896516>>896519
▶ No.896519
▶ No.896520
>>896253
are there any viable alternatives to shill?
getting rid of stdio sounds like a good idea.
▶ No.896522>>896526
>>896315
>not knowing the difference between an interface and an implementation
▶ No.896524>>896535
>>896253
>until people stop using stdio
aren't plain "string" literals in C also generating an extra zero byte at the end?
it seems people also need to eschew "string" literals in C and use something like a macro which expands to a constant array of bytes or something else.
I mean it's doable but not really ergonomic in plain C.
it's easier to get rid of this shit in C++ perhaps.
▶ No.896526>>896527
>>896522
>hurr durr just make the memory a linked list
literal brainlet
▶ No.896527
>>896526
can you even read plain English?
▶ No.896535>>896538 >>896546 >>896655
>>896253
>We won't ever be truly free of this moronic C string business until people stop using stdio.
Except null-terminated strings are embedded right in the language, with string literals being null-terminated.
>>896524
>use something like a macro which expands to a constant array of bytes or something else
Alas, I am afraid such a thing is not possible with the C preprocessor.
However, for all the LARPers out there, keep in mind GCC and Clang/LLVM are Free as In Freedom™ software, that anyone can modify. You're all such C expert senior engineers, writing a GNU extension for non-asciiz strings should be TRIVIAL.
▶ No.896538>>896542
>>896535
I will stick with LLVM. A compiler that respects my freedom, unlike the restrictive GCC.
▶ No.896542>>896543 >>896545
>>896538
Cuck, with a cuck license. How's your wife's son?
▶ No.896543>>896592
>>896542
What's cucked about it? I can go sell it to random fucks and no one can stop me. The developers are cucked. The users are the least cucked possible. This is in comparison to your GPL compiler. Under the GPL the developers are not as cucked as BSD developers, but they are still cucks compared to proprietary. The GPL users are more cucked than BSD users because they are bound by more terms.
▶ No.896545
>>896542
>Open Source
lol the cucks are fighting again
▶ No.896546>>896547 >>896634
>>896535
>Except null-terminated strings are embedded right in the language
No. There is no string datatype at all in C. It's a convention to use NUL-terminated arrays of char, and that's what standard library functions expect. You're free to implement your own functions and libraries which handle whatever string datatype equivalent you come up with in whatever way you like.
▶ No.896547>>896548 >>896554 >>896709
>>896546
>It's a convention to use NUL-terminated arrays of char
So how do you explain the fact that this is null-terminated by the compiler?
const char* foo = "bar";
▶ No.896548
▶ No.896554>>896555 >>896707
>>896547
String literals are syntactic sugar.
▶ No.896555
>>896554
So no language has anything. Got it. Its all just syntactic sugar on top of machinecode.
▶ No.896567
>A character string literal has static storage duration and type "array of char", and is initialized with the given characters. A wide string literal has static storage duration and type "array of wchar_t", and is initialized with the wide characters corresponding to the given multibyte characters. Character string literals that are adjacent tokens are concatenated into a single character string literal. A null character is then appended.
>A string is a contiguous sequence of characters terminated by and including the first null character. It is represented by a pointer to its initial (lowest addressed) character and its length is the number of characters preceding the null character.
>A character string literal need not be a string (...), because a null character may be embedded in it by a \0 escape sequence.
https://port70.net/~nsz/c/c89/c89-draft.html
▶ No.896585
>null
>NULL
This is how fucktarded this guy actually is.
▶ No.896592>>896593 >>896594 >>896725
>>896543
>I can go sell it to random fucks and no one can stop me.
Can you please remind me of which part of the GNU Public License (version 2 or 3) forbids the user from selling a copy?
▶ No.896593
>>896592
>the user from selling a copy?
Look, we both know that's bullshit. It's theoretically possible to have someone pay for GPL code, but when they get the source you are gonna have a real hard time charging for it a second time.
▶ No.896594>>896613
>>896592
>one person buys
>can legally upload it to every other person
lmao
▶ No.896613
>>896594
>"NOT EVEN MERCHANTABILITY"
▶ No.896634>>896638
>>896546
>that's what standard library functions expect. You're free to implement your own functions and libraries which handle whatever string datatype equivalent you come up with in whatever way you like.
THIS is what I think we need. Quite frankly, C stdlib is pure garbage. I'm working on and off on a project of this type, a custom freestanding C library based on Linux. Once it has a reasonable set of features to make it useful, I will publish it under MIT.
▶ No.896638>>896645 >>896649
▶ No.896645>>896649
>>896638
Yes. I don't particularly care about improvements being sent back to me. I just want to stop using libc and start using Linux directly because frankly the Linux interfaces are a LOT better. If other people think my code is useful, I want them to please use it.
▶ No.896649>>896706
>>896638
>>896645
In fact, I'm personally rather wary of "improvements" that get sent since they can be a curse in disguise. Have you SEEN glibc source code? It's a mess. Even something as simple as an strlen implementation is huge and needs truckloads of comments to explain what the fuck is happening, all so it can scan lots of data at once to improve performance while looking for the NUL.
I want my code to be simple so that I, and maybe even other people, can immediately understand it when reading it. The license allows you to do whatever you want, so you can just supply your own highly optimized functions if it matters that much.
▶ No.896655
>>896535
>Except null-terminated strings are embedded right in the language, with string literals being null-terminated.
Just because string literals are NUL terminated doesn't mean that you can't keep count yourself. As far as I/O goes, only stdio requires NUL terminated strings; the kernel interfaces do not.
▶ No.896656
>>896315
What stops car from returning
array[0]
and cdr from returning the slice
array[1..-1]
?
▶ No.896658>>896674 >>897048
>>896124
>You could mmap that file and seek it on demand
How to recover from SIGSEGV properly?
▶ No.896674>>896770
>>896658
You just define a signal handler and keep on reading
▶ No.896706>>896711 >>896748 >>896773
>>896649
>Even something as simple as an strlen implementation is huge
size_t strlen(char *s)
{
    int i = 0;
    while (s[i++]);
    return (size_t)(i - 1);
}
>huge
▶ No.896707>>896773 >>897174
>>896554
It's not like "bar" is syntactic sugar for {'b', 'a', 'r', '\0'}, because
char *s = {'b', 'a', 'r', '\0'}
doesn't work. Then what is "bar" exactly syntactic sugar for?
▶ No.896709
>>896547
>const char* foo = "bar";
>char* foo
In the following line
int* foo, bar, baz;
the retarded style you gave an example of suggests that all three declared variables are pointers to int, which is not the case. That's why the asterisk is supposed to stick to the identifier and not to the type, like this:
int *foo, bar, baz;
so it's obvious what is what.
▶ No.896711>>896716 >>896773
>>896706
What is the shortest possible strlen implementation that werks? Anything shorter than the one below (52 bytes)?
int strlen(char*s){int i=-1;while(s[++i]);return i;}
▶ No.896716>>896764 >>896773
>>896711
Here are three alternatives, but all still same length:
int strlen(char*s){int i=0;while(*s++)++i;return i;}
int strlen(char*s){int i=-1;for(;s[++i];);return i;}
int strlen(char*s){int i=0;for(;*s++;++i);return i;}
Looks like 52 byte strlen might be tough to beat.
▶ No.896725
>>896592
Have fun selling binaries right next to the readable source code.
▶ No.896748
>>896706
It's huge in glibc, yes. 78 lines not including the license comment at the top.
▶ No.896764
>>896716
Here's my attempt.
int strlen(char*s){return*s?strlen(s+1)+1:0;}
It clocks in at 45 characters. It's kind of cool how I was able to remove all the white space from the function's body.
▶ No.896770
>>896674
Nope. Returning from the signal handler means returning to the point in the code that triggered the SEGV.
▶ No.896773>>897162
>>896706
>>896711
>>896716
Mine looks similar to that, but it uses pointer difference and checks for NULL.
Now check out the glibc strlen function. It's fucking huge.
>>896707
It's sugar for a const char array.
▶ No.896779>>896781 >>896830 >>897048
>Have advanced string object with separate size variable
>Check size and be ready to add data at the end
>The actual string is much shorter
>Either segfault or security fun
▶ No.896781>>896829
>>896779
When would that happen?
Is it a bigger danger than, for example, unterminated classical C strings?
Many languages before and after C have had counted strings.
▶ No.896829>>896859
>>896781
>When would that happen?
Even less chance than a classic buffer overflow.
Someone has to write exceptionally stupid code to produce an invalid string object (that is, one with an invalid length)
▶ No.896830
>>896779
>>The actual string is much shorter
This won't happen unless someone deliberately tries to change length value to some nonsense.
It's no different than producing any other kind of array with invalid length.
The string in this case is simply a special kind of an array.
Do you mean people should not use arrays as well and use some sentinel to terminate them? And what if there is no value available to act as a terminator, for example, if the array stores bytes and every byte value can validly occur in the content?
Do you understand you just added more bullshit to this already drowning in bullshit thread?
▶ No.896859>>896889 >>896892 >>896896
>>896829
>Someone has to write exceptionally stupid code to produce invalid string object (that is, which has invalid length)
Given that C does not allow for private structure members, it is indeed perfectly possible to indicate a wrong length. A classic case of off-by-one when implementing some kind of concatenation function would do that.
▶ No.896889
>>896859
The solution is to not touch the struct directly. Just because you can doesn't mean you have to.
▶ No.896892>>896952 >>896988
>>896859
>Given that C does not allow for private structure members, it is indeed perfectly possible to indicate a wrong length.
Come on, how hard is it not to screw up a size variable? You almost never touch them anyway. You get them as parameters. If your programmers can't stop themselves from accidentally overwriting a pointer + size pair, they should probably be writing Java. Lengths are explicit and really hard to screw up, completely unlike the "hurr I forgot a NUL terminator" bugs.
Strictly speaking, you could easily provide an opaque structure (incomplete, forward-declared type accessible only through pointers) with accessor functions.
/* string.h */
struct string;
size_t string_length(struct string *);
char *string_data(struct string *);
/* string.c */
struct string { size_t s; char *p; };
size_t string_length(struct string *s) { return s->s; }
char *string_data(struct string *s) { return s->p; }
But this would forbid stack allocation of the string structure and force people to use functions to access member variables when it's just not necessary. In fact, this would be nearly indistinguishable from a generic dynamic array library. Indeed, most C stdlib str* functions are pretty much equivalent to their mem* counterparts, save for the NUL terminator handling.
▶ No.896896>>896898 >>896988
>>896859
>A classic case of off-by-one when implementing some kind of concatenation function would do that.
Yes, but that happens in virtually every other language as well. If you calculated an index or length incorrectly, it's a logical/mathematical error. If you failed to NUL-terminate the C string, it's a simple human forgetfulness error.
Using explicit lengths gets rid of the latter class of errors, while only Haskell might be truly immune to the former.
▶ No.896898>>896907 >>896911
>>896896
>If you failed to NUL-terminate the C string, it's a simple human forgetfulness error
you know what?
it's also possible to accidentally include an extra NUL where it's not allowed to be, and that is harder to detect (no out of bounds access, no segfault) and can have even worse implications; actually the article in the OP is about exactly this.
▶ No.896907>>896908 >>896910 >>896911
>>896898
wow anon what about the implications of accidentally including NULL in the middle of a linked list. oh shit someone call blackhat.
▶ No.896908>>896909
>>896907
zero byte is a valid character in many encodings.
now go learn something about computing and programming, ffs.
▶ No.896909>>896925
>>896908
Yeah which encodings.
▶ No.896910>>896916
>>896907
Retard. A zero in a cadr doesn't terminate the list.
▶ No.896911>>896916 >>896922
>>896898
True. That's why I don't like encoding data in-band. Arrays are simple, they are just a memory segment, a pointer to the start and the length, and they're a general data structure that can hold anything. C strings and other similar stuff constrain this simple concept. Suddenly, it can't hold arbitrary data anymore; it can hold anything except a zero byte, because the zero byte is now a special value that encodes the length of the array within the array itself, and if you accidentally put a zero value anywhere you end up cutting the string into pieces.
>>896907
NULL is a pointer. The data contained by the linked list is completely oblivious to the NULL pointer handling.
▶ No.896916>>897028
>>896910
lol
>>896911
hmm if only strings were more like arrays
▶ No.896921>>896922
>>895811
>but it's also clearer
So clear that you made an error while writing it.
▶ No.896922>>896924 >>897028 >>897038
>>896911
>C strings and other similar stuff constrain this simple concept
Like?
>Suddenly, it can't hold arbitrary data anymore
Well no shit, retard, it's a fucking string. Strings are supposed to encode human-readable text, not arbitrary data, unless that data is itself encoded in a human-readable (or at least printable like base64) format. Want arrays of arbitrary data? Use arrays instead of being a retard. I bet you're the kind of person who complains a stack interface doesn't have a function to access an arbitrary element of it.
>>896921
Autist
▶ No.896924
▶ No.896925>>896930 >>897185
>>896909
ASCII and UTF-8, for example.
C is unusual in having trouble with null bytes, and some parts of C and Unix do support them. Try this to see Python, C/Unix and UTF-8 work with a null byte:
python3 -c 'print("foo\0bar")' | cat -v
▶ No.896930
>>896925
thank you for freeing me from the burden of explaining this to him. really appreciate that.
▶ No.896952>>897009 >>897038
>>896892
>Everyone is playing nice
>Nobody will try to put invalid data in something you will have to rely on
I can see CIA rubbing its hands
▶ No.896988>>896996 >>896998 >>897043
>>896892
>how hard is it not to screw up a size variable?
A single one? Not hard. However, consider the following game of statistics: say there is a 95% chance that you get a line of code right without needing to touch it. If you implement a complete, modern string library, the chances that you will not commit ANY error in 1,000 lines are 0.95^1000, which is to say, 0. On average, it means 50 bugs per 1,000 lines.
That's how easy it is to screw up. Even when writing trivial shit.
>>896896
it happens in every other language, yes, but those languages have bounds checking and hide the internals of the string implementation behind a structure.
Also, not NULL-terminating a string can easily be a logical or mathematical error as well (typical example of appending a string to another, but copying 1 byte less than its length, thus removing the null character).
If you want safe strings, hide them behind an interface that does bounds checking, but that's NOT going to be fast.
▶ No.896996>>897010
>>896988
>That's how easy it is to screw up.
those statistics would have to include pajeet and the soyfags.
▶ No.896998>>897010 >>897048
>>896988
If you've got one string library that'll be used in thousands of programs it's reasonable to invest ten times the effort to make sure that it works the way it's supposed to. Making mistakes is easy, but when you use a library instead of C-style strings there's a lot less code using unsafe primitives when you add it all up.
Do you have some sort of benchmark or other source to support the claim that safe strings aren't fast? I'd expect slight performance loss in most (but not all) cases, but usually nothing significant. Safe strings aren't something modern that they started using when computers became faster. BCPL had them.
▶ No.897009
>>896952
we are talking about in-memory data.
when you get the string from somewhere, you of course allocate the right amount of memory and the right length is stored. do you worry about someone overwriting the memory of your process? then you have bigger threats to worry about, and you probably want to look for techniques for radiation-safe software development, that is, software that is resilient to radiation-induced bit flips.
otherwise, it's obvious bullshit.
▶ No.897010>>897013 >>897015
>>896996
Admittedly, it's a number I pulled out of my ass, but if you've ever done any development beyond a fizzbuzz, you know that getting a program right from the start is impossible. Even if the actual statistics of errors was 1 error per 200 lines (99.5% correct lines), it still accumulates pretty fast.
>>896998
>Do you have some sort of benchmark or other source to support the claim that safe strings aren't fast?
Array bounds checking is slower than not checking, although admittedly usually not by much, as modern processors include branch prediction. And the larger the string, the lower the impact of branch prediction misses.
▶ No.897013
>>897010
>Array bound checkings is slower than not checking
for the most critical parts it can be optimized away, but it's not a reason to eschew checking everywhere; the golden rule of optimization is to optimize at the bottleneck.
also, modern compilers can prove correctness of access in some places and optimize it away. I remember that Java's JIT does that in many cases.
▶ No.897015>>897018 >>897019
>>897010
lines of code don't mean anything.
you can split and join lines arbitrarily without changing the meaning of code at all.
▶ No.897018
>>897015
And you could write a book with one fucking sentence. I bet that's meaningless too.
▶ No.897019>>897022
>>897015
Either you didn't read or don't know what you're talking about.
▶ No.897022>>897034
>>897019
>1 error per 200 lines
One really long line of code = no errors
▶ No.897028>>897048
>>896922
>Like?
Other sentinel-using data structures.
>>896916
They are. Just keep their length around in a variable.
▶ No.897034
>>897022
I think you just hacked programming.
▶ No.897038>>897040
>>896922
>Well no shit, retard, it's a fucking string. Strings are supposed to encode human-readable text
Kill yourself brainlet. C strings have absolutely no notion of an encoding, and even if they had, the encodings themselves usually attribute some meaning to code point zero. In ASCII it is a control code which functions as a no-op. It is absolutely valid for an ASCII- or UTF-8-encoded string to have a null byte in it. The fact that including a 0 byte in a string makes software written in C truncate it is a bug.
>>896952
Most exploits happen in the input handling layer, actually. Unless the CIA can subvert the kernel's most fundamental I/O syscalls and make them return bogus values, I'd say you will always know the exact size of your data.
▶ No.897040>>897046 >>897056
>>897038
>Most exploits happen in the input handling layer
this
>C strings have absolutely no notion of an encoding
not op but I like having a generic unicode type that is encoding-independent. Haskell does this for example: you have a type called Text with its own operations, and you can encode/decode it to whatever you want, while the abstraction is independent of it.
▶ No.897043
>>896988
>those languages have bounds checking and hide the internals of the string implementation behind a structure.
These internals are essentially just a length and a pointer. Out of bounds access is still a bug, even with bounds-checking; the only difference is the runtime won't actually allow the access to take place, raising an exception instead.
>Also, not NULL-terminating a string can easily be a logical or mathematical error as well (typical example of appending a string to another, but copying 1 byte less than its length, thus removing the null character).
Use explicit lengths and you simply don't have to think about this at all.
>If you want safe strings, hide them behind an interface that does bounds checking, but that's NOT going to be fast.
Obviously, checking your bounds on every single access is going to be safer than not checking. Nobody even said anything about access here. We're talking about the different approaches to encoding the length of the string.
▶ No.897046>>897048 >>897142
>>897040
>Haskell does this for example where you have a type called Text and you have operations for it and can then encode / decode it to whatever you want, while the abstraction is independent of it.
This is how I would model it as well.
The fact is the folks who made C and Unix made A LOT of assumptions about these things. C strings are supposed to be "text" but are in fact just 0-terminated byte arrays containing arbitrary non-null data. There's no actual string type; C string literals are just array literals with an implicit zero at the end, and to complement that constrained array type C has a whole roster of str* functions that take zero-termination into account and are otherwise equivalent to their mem* counterparts. It's clearly a specialized array. This is why a dynamic array is sufficient for 100% of C string operations.
If I were to write a string library, I'd do something like:
struct memory {
    size_t size;
    char *pointer;
};
enum encoding {
    UTF8 = 0,
    UTF16,
    UTF32,
    ASCII,
    /* ... */
};
struct text {
    struct memory memory;
    enum encoding encoding;
};
The fact is text is just encoded memory. C's entire notion of 0-terminators makes it so only a (large) subset of encodings is supported, and requires memory handling functions to be duplicated as 0-terminator handling versions. The only advantage of this design is the sheer minimalism of the data structure itself.
▶ No.897048>>897049 >>897062
>>896658
>How to recover from SIGSEGV properly?
The computer does it properly, so it's the OS's fault. C and UNIX can't do it because they suck. Look up what segmentation fault means in Multics.
>>896779
>Check size and be ready to add data at the end
>The actual string is much shorter
You're talking about data corruption. If you mixed those strings with C strings, you might get a buffer overflow that changes the length, but C buffer overflows could corrupt anything.
>>896998
>I'd expect slight performance loss in most (but not all) cases, but usually nothing significant.
I'd expect a huge performance increase in all cases except for one, which is when you are parsing a string one character at a time, parallelism won't help you in any way, and you don't care about how many characters are remaining. Everything else is much faster when you know the length.
>Safe strings aren't something modern they started using when computers became faster. BCPL had them.
FORTRAN had Hollerith strings in source code because knowing the length ahead of time is much faster. Later on, readability and not having to manually count the length of every string became more important than raw speed. Strings in most languages were safer and faster than C strings. I'd say that the acceptance of null-terminated strings is because people don't care as much about efficiency as they used to. Lisp machines were about making dynamic languages faster and simpler by checking type tags in hardware, but people today don't care as much if they're fast.
The source of UNIX stupidity, B, used EOT to terminate a string, so the use of null is totally arbitrary. If you couldn't put ASCII character 04 in a string, UNIX weenies would say it's "stupid" to want to use that character in a string.
>>897028
>Just keep their length around in a variable.
That sounds like a good idea, but you should drop the null or you'll end up with multiple "length" variables and they won't all be equal because someone will put a null character somewhere.
>>897046
>C has a whole roster of str* functions that take zero-termination into account and are otherwise equivalent to their mem* counterparts.
Except there's no way to give many of them a string length at all.
BTW, I had to replace the NUL characters (etc.) in the above
line with caret-atsigns because when I tried to send the
message the first time the line did not appear since some
Berserkely C blabberer with less than two fingers of
forehead decided to write a mailer that reads messages with
gets or some equally braindead substitute for an input
reader and just drop the non-printable characters (rather
than bounce and complain or something semi-reasonable).
▶ No.897049
>>897048
ffs again if you are going to block quote things include text describing who / what its from. quotes with no context are no authority
▶ No.897056>>897076
Here's a fun little test:
$ python3 -m timeit -s 'a = "c" * 1_000_000; b = "c" * 1_000_001' 'a == b'
10000000 loops, best of 3: 0.0265 usec per loop
Those are one-megabyte strings. Try that in C.
>>897040
That's also the route Python went with version 3. Strings in Python 2 were just sequences of bytes, but it now has a bytes data type and a string (unicode) data type.
The underlying representation of the text is abstracted away. You can use ord and chr to go to and from unicode code points, if you want, and you can use .encode() and .decode() to go to and from a bytes representation, but unless you explicitly ask for it, you're never confronted with the gory details. It keeps you sane.
▶ No.897062>>897071
>>897048
>If you mixed those strings with C strings,
>>you should drop the null or you'll end up with multiple "length" variables and they won't all be equal because someone will put a null character somewhere.
One shouldn't mix these different types. It's clear that general arrays don't have the same constraints as 0-terminated char arrays. Custom string types have even more elaborate semantics; they could have separate length and capacity fields to track the length of the actual text and of the allocated memory.
In practice, most string libraries out there use that length+capacity design, and allocate an extra byte for a "hidden" null byte at the end of the memory they maintain and also take care to correctly set the null byte after every operation. They do this so you can pass their pointers to C stdlib functions. Personally, I don't care very much about this because I advocate talking to the kernel directly instead of using the extremely limited C stdlib, but I can understand why that'd make their string library better.
>The computer does it properly, so it's the OS's fault. C and UNIX can't do it because they suck.
Yeah, signals in Unix were pretty much a huge mistake. Still, I'd love to see a way to recover from SIGSEGV reliably, at least in Linux. Imagine I'm writing a JIT compiler by mmaping executable pages and the generated code causes a segmentation fault or even illegal instruction errors; I'd like to handle those errors.
I ask this question every single time SIGSEGV is mentioned and to this day nobody answered...
>Except there's no way to give many of them a string length at all.
Yeah. I think it's hilarious how they had to make things like strn* versions of functions so that it would be safer to use those functions. It's just backwards. The mem* functions are the right thing.
▶ No.897071>>897072 >>897076
>>897062
I am not familiar with the mem* functions. Which functions do you mean?
▶ No.897072>>897076
>>897071
oh do you mean like memcpy?
▶ No.897076>>897091
>>897056
God I love Python3, and I used to think Python was PHP tier in the Python2 days. The new version really cleaned up the language.
The new string type is amazing. It uses unicode so it does the right thing by default, and there's even unicode metadata integration. Best of all is how they don't treat strings as just arrays of bytes/code points anymore.
'ほげほげ'[::-1]
# => 'げほげほ'
'čšž'[::-1]
# => 'žšč'
The old string type simply became the bytes type, which is actually appropriate. Lots of people just did I/O and used whatever came in or out as opaque data, and the bytes data type is absolutely appropriate for this use.
>>897071
>>897072
Yes.
▶ No.897104>>897110
▶ No.897108>>897163
>>897091
Yeah memcpy has that stupid limitation because muh efficiency. Always use memmove whenever possible.
▶ No.897110>>897131
>>897104
I thought you asked what memcpy did lmao
▶ No.897131
>>897110
the literacy of this one wew
▶ No.897142>>897144
>>897046
>The only advantage of this design is the sheer minimalism of the data structure itself.
You should now be aware that the entire edifice of Unix was built on this.
▶ No.897144
>>897142
He is, and this is a bad thing. Lisp machines are also shit tho.
▶ No.897162>>897167
>>896773
>It's sugar for a const char array.
If ptr[n] is sugar for *(ptr+n) and ptr->m is sugar for (*ptr).m then "foo" is sugar for... what exactly?
▶ No.897163
>>897108
>muh efficiency
Always use asm (specifically MOV) whenever possible.
▶ No.897164>>897175 >>897548
Speaking of C, why do bit fields "look good on paper" but are so bad in practice? The implementation-dependent issues related to endianess/order, alignment, padding etc. make them basically a non-contender compared to just using standard datatypes and adressing specific bits with bitmasks etc.
▶ No.897167>>897168 >>897175
>>897162
{'f', 'o', 'o', '\0'}, more or less.
▶ No.897168>>897174 >>897175
>>897167
(it's not equivalent, so it's not quite syntactic sugar, but it's close enough to serve the broader point that C strings are a convention supported by the syntax)
▶ No.897174>>897176
>>897168
>it's not equivalent
Which was already pointed out here >>896707
Using a string literal such as "foo" puts a const array of char in the heap but itself stands for a pointer to its first element (and thus can be assigned to a pointer variable), while {'f', 'o', 'o', '\0'} represents an actual array and can basically only be used to initialize an array variable on the stack (or a struct).
▶ No.897175>>897184 >>897196
>>897164
You basically answered your own question.
>>897167
>>897168
It's more like
(const char[]) {'f', 'o', 'o', '\0'};
https://ideone.com/CbWfE2
▶ No.897176>>897180
>>897174
>Using a string literal such as "foo" puts a const array of char in the heap
>heap
It's allocated in a read-only ELF section for constants.
▶ No.897180>>897183
>>897176
>read-only ELF section for constants.
And where do you think the segments of the ELF file are loaded?
▶ No.897183
>>897180
In the process's address space.
▶ No.897184>>897186
>>897175
>casting shit to array type
I'm afraid I cannot do that, Dave
▶ No.897185>>897343
>>896925
>C has trouble
>cat -v doesn't
hmm
▶ No.897186
▶ No.897196>>897198 >>897203 >>897209 >>897217
>>897175
#include <stdio.h>
int main(void)
{
    const char *s1 = "foo";
    const char *s2 = (const char[]){'f', 'o', 'o', '\0'};
    const char s3[] = {'f', 'o', 'o', '\0'};
    printf("%p\n", s1); /* heap */
    printf("%p\n", s2); /* stack */
    printf("%p\n", s3); /* stack */
    return 0;
}
The above shows how "foo" and (const char[]){'f', 'o', 'o', '\0'} are still not the same.
▶ No.897198>>897200
>>897196
That's an implementation detail not the standard.
▶ No.897200>>897201
>>897198
But if you want to call one piece of syntax "sugar" for another piece of syntax then you must be able to rely on them both to work exactly the same in all contexts.
▶ No.897201
>>897200
I don't call it sugar, not OP.
▶ No.897203>>897204
>>897196
Brainlet detected, are you the same anon who still thinks const char* literals are in the heap?
▶ No.897204>>897205
>>897203
They literally are tho.
▶ No.897205>>897206
>>897204
They're in .rodata, maybe learn about C before you try to write code in it
▶ No.897206>>897207
>>897205
And where do you think that is stored lol?
▶ No.897207>>897211
>>897206
In the address space, usually contiguous with the other data segments. Do you even know what a heap is lol?
▶ No.897209>>897218
>>897196
>heap
>stack
Kill yourself, brainlet.
▶ No.897211
>>897207
Not only that, this kind of process image data is usually allocated at one end of the address space, while the kernel is at the other end, and the processor stack and the process break grow in opposite directions towards the "middle" and each other.
Memory map lets you assign any part of the address space, though. It's no longer neatly sequential like process break and stack. Virtually all memory allocation libraries use mmap.
Nowhere in this picture does a "heap" exist.
▶ No.897214>>897216
#include <stdio.h>
#include <string.h>
#include <stdlib.h>
const int x = 0xf00;
int main(void) {
    const char *s1 = "foo",
               *s2 = (const char[]){'f', 'o', 'o', '\0'},
               s3[] = {'f', 'o', 'o', '\0'};
    char *s4 = malloc(sizeof(s3));
    memcpy(s4, s3, sizeof(s3));
    printf("%p\n", &x); /* rodata */
    printf("%p\n", s1); /* rodata */
    printf("%p\n", s2); /* stack */
    printf("%p\n", s3); /* stack */
    printf("%p\n", s4); /* heap */
}
Roast me.
▶ No.897216>>897217 >>897238
>>897214
Yes, and? They are all 0-terminated arrays of char. Where they are allocated does not matter.
▶ No.897217
>>897216
I was just illustrating the problem with >>897196, not making an argument although I find nul-termination distasteful
▶ No.897218>>897221 >>897227 >>897228 >>897292
>>897209
Books should off themselves too?
▶ No.897221
>>897218
Maybe you should read that book to find out what those words mean :^)
▶ No.897227>>897233 >>897265 >>897269 >>897271
>>897218
Yes.
The "stack" is just a pointer maintained by the processor in a register. Managing stack memory consists of incrementing or decrementing this pointer. That's it. The memory lies on the address space. It's fast, but ephemeral due to the nature of functions. You can execute multiple threads/functions/coroutines all with separate stacks on one processor; simply adjust the pointer. You can have split stacks by allocating more memory (from the "heap") and adjusting the stack pointer to point there. It's not just "the stack", it can be much more depending on the language.
A "heap" is essentially a buzzword for dynamic memory allocation. As you demonstrated yourself with your confusion, the definition is so vague it engulfs all non-stack memory, even the kernel and process images mapped onto the ends of the address space. People actually believe that all non-local variables are allocated on some "heap". The world doesn't actually work like that. There is no "heap". The "heap" is some kind of elaborate lie told in the name of abstraction. I don't know who invented this garbage but he should be shot.
The real, simple truth is you can allocate memory by extending the process image break (just a pointer, similar to stack) or by mapping new sections of the address space. Most libraries use mmap, so in practice the "heap" is just a big block of memory malloc asked Linux to map onto the process's address space. All sorts of complicated setups can be created, though. For example, you can create an unreadable, unwritable, execute-only memory region for JIT compiled code.
You're not alone. This shitty buzzword is firmly entrenched in the minds of most programmers, especially those who use virtualized languages. They use it to refer to this mythical heap thing, which is where memory magically comes out of. It's why the people who wrote your book used the word. Nobody really asks why these things are the way they are.
▶ No.897228>>897233 >>897263 >>897265
>>897218
Another thing about your book that's ridiculous and a major pain in the ass is the whole dichotomy between values and references. People simply don't understand that pointers are themselves values. They think some magic happens when you "pass by reference" and that it somehow lets you access things outside your scope.
The simple truth is you're always passing around values. There is no way to call functions with anything but values as arguments. Pointers are values that represent the address of your data. The value of the pointer itself is copied.
▶ No.897233>>897264
>>897227
There's no need for this confusing shit, it's pretty much a given that when someone says "heap" they're referring to the memory pool used by malloc.
>>897228
There's no magic, but it's important to distinguish values from addresses. On an architecture with separate data and address lines the references would be passed using entirely different registers.
▶ No.897238>>897242
>>897216
>const char s[] = {};
Does not look like it's stored as an array of char at all.
▶ No.897242
>>897238
nvm looks like they are just stored as stack offset instructions.
▶ No.897263>>897264
>>897228
>pointers are themselves values
Yes, but values of a different kind: memory addresses (along with an element size specific to the pointed-to type, which enables pointer arithmetic, unless it's a (void *), in which case it's just a bare contextless address) rather than things like ints, floats etc. I like to think of pointers as meta-variables, i.e. variables that can store/represent different variables: a pointer to int can be assigned the addresses of different int variables at different times, a pointer to a function with certain parameters and return type can represent different functions at different times, and so on.
▶ No.897264>>897266
>>897233
It's a "given" yet the guy above called .rodata the "heap", and even when used to mean malloc memory pools it's still wrong given other data structures are used to keep track of memory.
It's hard enough to name things in computer science, last thing we need is awfully vague terminology consuming all meaning.
>>897263
Not really. Specifically, on x86_64, a pointer is pretty much an unsigned long, i.e. a 64-bit register-sized integer. The number represents an address in the 64-bit address space. Look at the assembly: the instructions are the same, the pointed-to type just scales the pointer arithmetic by its size. It's just a number.
▶ No.897265>>897286
>>897227
>>897228
So what books do you recommend that teach proper memory management terminology and practices? Preferably something concise and to the point (no pun intended) rather than one of the usual 800 or 1300 page mammoth doorstop behemoths where anything interesting only starts shining through well after page 150 (after all of the contents, detailed contents, forewords, prefaces to the first as well as various following editions, introductions, acknowledgments, etc. etc.).
▶ No.897266>>897286
>>897264
>Specifically, on x86_64
Whoa there. That's implementation-specific. The standard doesn't even specify what the internal representation of a pointer is (hence the %p format specifier and the NULL macro, among other things), so let's not just assume things, m'kay?
▶ No.897269>>897310
>>897227
>there is no spoon, the cake is a lie, ur heap a shit
▶ No.897271>>897286
>>897227
So all of these search results are basically mostly confused and misguided people talking out of their asses?
▶ No.897286
>>897265
Sys V ABI documents and the ELF specification. The Linux Programming Interface. Linux system calls (partly POSIX compatible).
>>897266
Standards don't run code, processors do. Odds are you're running either x86_64 or ARM. I'm very much interested in the semantics of these platforms.
>>897271
>stack variables can't be accessed by other functions
>heap variables are global in scope
>stack is static memory allocation
>I memorized that objects allocated with new go on the heap
Yeah these people are pretty confused. At least they stay on the topic of dynamic memory allocation.
▶ No.897292>>897299
>>897218
>introduction to a proprietary botnet in the last chapter
>in a fucking book
yes.
▶ No.897299>>897301
>>897292
>proprietary botnet
The book's title is "Programming for Engineers - A Foundational Approach to Learning C and Matlab", so there's no surprises I guess. It focuses mostly on C though.
Also, it's not even its final chapter.
▶ No.897301>>897303
>>897299
you didn't reveal the title until this point.
▶ No.897303>>897306
>>897301
The point being? The fact it has some chapters on Matlab towards the end was irrelevant to the discussion on memory from a C point of view.
▶ No.897306>>897309
>>897303
Without knowing the title of the book, it's harder to evaluate if it's worth its salt or not. I know we can search by chapter names, etc., but there can be collisions and whatnot
▶ No.897309
>>897306
It's certainly unusual for a "foundational approach to learning C" type book in that it immediately introduces pointers and under-the-hood memory concerns. The overwhelming majority of texts talk about pointers no earlier than halfway through the book (though that delay does bring about that revealing moment when the reader finally understands how arrays really work and why he didn't need the & operator with the %s specifier in scanf()), and many books on the language don't talk about the stack or memory organization at all, just about how to use pointers etc.
▶ No.897310
>>897269
>1999: "there is no spoon"
>2007: "the cake is a lie"
>2018: "you're \"\"\"heap\"\"\" a shit"
kek
how will we ever recover
▶ No.897343
>>897185
>>and some parts of C and Unix do support them
putchar('\0') works fine, to name one. It's a problem if you work with C-style strings but not even everything in the standard library works with C-style strings - it's only a convention.
▶ No.897347>>897352 >>897465
$ cat test.c
#include <stdio.h>
const char s[] = {'h', 'e', 'l', 'l', 'o', '\0'};
int main (void)
{
puts(s);
}
$ gcc test.c; and md5sum a.out
5f6bdc1973f14a557f104df5e44cb259 a.out
$ cat test.c
#include <stdio.h>
const char s[] = "hello";
int main (void)
{
puts(s);
}
$ gcc test.c; and md5sum a.out
5f6bdc1973f14a557f104df5e44cb259 a.out
▶ No.897352>>897357
>>897347
Nice post, I now realize that the other anon was trying to demonstrate that C char* literals are syntactic sugar.
▶ No.897357
▶ No.897445>>897452
>>895874
>data from 2012
lol
▶ No.897452
▶ No.897465
>>897347
>md5'ing the output binary
Just because it happens to compile to the exact same binary doesn't mean the two expressions in the source code are the same. It was already proven above that you cannot directly "assign" something like {'h', 'e', 'l', 'l', 'o', '\0'} to a char pointer (and if you coerce it by casting to an array type, it's going to be allocated on the stack like an array rather than elsewhere, as "hello" would be). So they're NOT exactly the same.
▶ No.897515
>>895728
malloc technically records the length of your object somewhere in the heap for free.
everything being aligned to 4/8 byte offsets means there can be up to 3/7 bytes of padding.
▶ No.897547>>897551
>393 replies
And so this larper playground of a thread is slowly drawing to a close.
▶ No.897548>>897555 >>897805
So you people opposing ^@-terminated strings would like some sort of convoluted clusterfuck of a format in its place (which would likely be clunky, messy and mostly nonportable, like the bit fields mentioned by >>897164)?
▶ No.897555>>897558
>>897548
It would be enough to have a struct containing a char array and a length and some functions and syntactic sugar so that you don't have to touch the struct directly in 90% of programs. What's so nonportable about that?
Bitfields are poorly portable because they have too little abstraction. This is a proposal to add more abstraction.
▶ No.897558>>897559
>>897555
structs are very implementation dependent vs a cstring which you can just push over the network with no extra parsing on the other end
▶ No.897559>>897567 >>897589
>>897558
Nobody is going to push the actual struct over the network, moron.
▶ No.897567>>897581
>>897559
Which is why it's shit. You can do that with C strings. BTW people actually do that all the time with structs.
▶ No.897581
>>897567
Chances are you can do that just fine. Length and pointer together are 128 bits and naturally aligned, so there shouldn't be any padding. You can also just send exactly length bytes of the data referred to by the pointer.
▶ No.897589>>897592 >>897593
>>897559
How do you think network protocol headers are implemented? And you need a guarantee that every implementation of the protocol stores every data unit (be it a 64-bit int or a 1-bit bitfield) at the exact same place, regardless of implementation-specific shit like endianness, alignment, padding etc.
▶ No.897592
>>897589
What stops you from simply
send(socket, string.pointer, string.length, 0);
like a normal person?
▶ No.897593>>897595
>>897589
You know these structures aren't dumped as-is onto the network, right? For starters, there's endianness concerns, as two machines might not share the same, and then there's the smaller problem of C compilers padding structures depending on a lot of architecture-dependent factors, making two structures containing the same data on two different machines potentially different.
▶ No.897595>>897604 >>897706
>>897593
Yeah we have compiler directives for creating packed structs that can be sent out on the network, but I don't know of any way to deal with endianness that doesn't involve conditional compilation.
▶ No.897604
>>897595
>I don't know of any way to deal with endianness that doesn't involve conditional compilation
Because I don't think there is any, unless you limit yourself to using strictly 1-byte data units, such as in a text-based protocol.
▶ No.897671>>897851
>>895727 (OP)
The difference between sentinel-terminated and length-tagged structures in C is negligible for most cases. The real issue with C is memory safety. PHP had some retarded issues because it had functions that treat input as a zero-terminated string and others that treat input as a length-tagged string. But that's because PHP is, and always was, fucking retarded. No other language which claims to be high level is full of basic issues like this.
>be on tor for 10 years
>no ad blocker
>never seen an ad
>just see stuff like this instead:
>If you're using an adblocker, please consider supporting this site via Patreon or PayPal
▶ No.897706
>>897595
>deal with endianness that doesn't involve conditional compilation
htonl htons / ntohl ntohs
▶ No.897805>>897851
>>897548
They probably also hate jagged arrays and linked lists terminated by NULL pointers, and would demand those structures store their length/number of elements at all times instead. Go figure.
▶ No.897851>>897957
>>897805
>linked lists terminated by NULL pointers
pointers in linked lists are not in-band with data, that's a stupidly bad analogy
(someone already explained it ITT by the way, it's sad that it needs to be repeated)
>jagged arrays
who said they need to use sentinel values to encode the length?
>>897671
>The difference between sentinel-terminated and length-tagged structures in C is negligable for most cases
It is not negligible with regard to correctness and code complexity. You seem to miss the bigger picture.
▶ No.897957
>>897851
The bigger picture is that non-trivial C programs are absolutely full of memory errors that even experts who have been doing it for 30 years have trouble with.