system/freebsd-src

mirror of https://github.com/freebsd/freebsd-src synced 2024-10-04 07:31:11 +00:00

Author	SHA1	Message	Date
Robert Watson	395a08c904	Extend coverage of SOCK_LOCK(so) to include so_count, the socket reference count: - Assert SOCK_LOCK(so) macros that directly manipulate so_count: soref(), sorele(). - Assert SOCK_LOCK(so) in macros/functions that rely on the state of so_count: sofree(), sotryfree(). - Acquire SOCK_LOCK(so) before calling these functions or macros in various contexts in the stack, both at the socket and protocol layers. - In some cases, perform soisdisconnected() before sotryfree(), as this could result in frobbing of a non-present socket if sotryfree() actually frees the socket. - Note that sofree()/sotryfree() will release the socket lock even if they don't free the socket. Submitted by: sam Sponsored by: FreeBSD Foundation Obtained from: BSD/OS	2004-06-12 20:47:32 +00:00
Robert Watson	f6c0cce6d9	Introduce a mutex into struct sockbuf, sb_mtx, which will be used to protect fields in the socket buffer. Add accessor macros to use the mutex (SOCKBUF_()). Initialize the mutex in soalloc(), and destroy it in sodealloc(). Add addition, add SOCK_() access macros which will protect most remaining fields in the socket; for the time being, use the receive socket buffer mutex to implement socket level locking to reduce memory overhead. Submitted by: sam Sponosored by: FreeBSD Foundation Obtained from: BSD/OS	2004-06-12 16:08:41 +00:00
Poul-Henning Kamp	2653139fd2	Fix registration of loadable line disciplines. This should make watch(8)/snp(4) work again.	2004-06-12 12:31:42 +00:00
Bosko Milekic	96e124135b	Gah! Plug a mbuf leak I introduced in the last commit. I don the pointy-hat. Problem reported by: Peter Holm <pho@>	2004-06-11 18:17:25 +00:00
Julian Elischer	94e0a4cdf3	Shuffle some code around.	2004-06-11 17:48:20 +00:00
Poul-Henning Kamp	1930e303cf	Deorbit COMPAT_SUNOS. We inherited this from the sparc32 port of BSD4.4-Lite1. We have neither a sparc32 port nor a SunOS4.x compatibility desire these days.	2004-06-11 11:16:26 +00:00
Brian Feldman	b4adfcf2f4	Make sysctl_wire_old_buffer() respect ENOMEM from vslock() by marking the valid length as 0. This prevents vsunlock() from removing a system wire from memory that was not successfully wired (by us). Submitted by: tegge	2004-06-11 02:20:37 +00:00
Robert Watson	0d9ce3a1ac	Introduce a subsystem lock around UNIX domain sockets in order to protect global and allocated variables. This strategy is derived from work originally developed by BSDi for BSD/OS, and applied to FreeBSD by Sam Leffler: - Add unp_mtx, a global mutex which will protect all UNIX domain socket related variables, structures, etc. - Add UNP_LOCK(), UNP_UNLOCK(), UNP_LOCK_ASSERT() macros. - Acquire unp_mtx on entering most UNIX domain socket code, drop/re-acquire around calls into VFS, and release it on return. - Avoid performing sodupsockaddr() while holding the mutex, so in general move to allocating storage before acquiring the mutex to copy the data. - Make a stack copy of the xucred rather than copying out while holding unp_mtx. Copy the peer credential out after releasing the mutex. - Add additional assertions of vnode locks following VOP_CREATE(). A few notes: - Use of an sx lock for the file list mutex may cause problems with regard to unp_mtx when garbage collection passed file descriptors. - The locking in unp_pcblist() for sysctl monitoring is correct subject to the unpcb zone not returning memory for reuse by other subsystems (consistent with similar existing concerns). - Sam's version of this change, as with the BSD/OS version, made use of both a global lock and per-unpcb locks. However, in practice, the global lock covered all accesses, so I have simplified out the unpcb locks in the interest of getting this merged faster (reducing the overhead but not sacrificing granularity in most cases). We will want to explore possibilities for improving lock granularity in this code in the future. Submitted by: sam Sponsored by: FreeBSD Foundatiuon Obtained from: BSD/OS 5 snapshot provided by BSDi	2004-06-10 21:34:38 +00:00
Bosko Milekic	b5b2ea9a46	Plug a race where upon free this scenario could occur: (time grows downward) thread 1 thread 2 ------------\|------------ dec ref_cnt \| \| dec ref_cnt <-- ref_cnt now zero cmpset \| free all \| return \| \| alloc again,\| reuse prev \| ref_cnt \| \| cmpset, read \| already freed \| ref_cnt ------------\|------------ This should fix that by performing only a single atomic test-and-set that will serve to decrement the ref_cnt, only if it hasn't changed since the earlier read, otherwise it'll loop and re-read. This forces ordering of decrements so that truly the thread which did the LAST decrement is the one that frees. This is how atomic-instruction-based refcnting should probably be handled. Submitted by: Julian Elischer	2004-06-10 00:04:27 +00:00
Maxime Henrion	931f76ab48	Fix a panic happening when m_getm() is called with len < MCLBYTES. Reported by: ale Tested by: ale Reviewed by: bosko	2004-06-09 14:53:35 +00:00
Juli Mallett	6c27c6039b	Add a comment explaining td_critnest's initial state and its life from that point on, as it happens relatively indirectly, and in a codepath the casual reader may not be acquainted with or find obvious. Glanced at by: jhb	2004-06-09 14:06:44 +00:00
Poul-Henning Kamp	b7b4b455b5	Rename struct pt_ioctl to "ptsc" and pointers to it from "pti" to "pt"	2004-06-09 10:21:53 +00:00
Poul-Henning Kamp	b7ffba0afc	Ditch K&R function style	2004-06-09 10:16:14 +00:00
Poul-Henning Kamp	2195e4207a	Reference count struct tty. Add two new functions: ttyref() and ttyrel(). ttymalloc() creates a struct tty with a reference count of one. when ttyrel sees the count go to zero, struct tty is freed. Hold references for open ttys and for ttys which are controlling terminal for sessions. Until drivers start using ttyrel(), this commit will make no difference.	2004-06-09 09:41:30 +00:00
Poul-Henning Kamp	a59df4e1ee	Fix a race in destruction of sessions.	2004-06-09 09:29:08 +00:00
Poul-Henning Kamp	c0afc00670	Move PTY private defines into PTY private files.	2004-06-09 09:09:54 +00:00
Stefan Farfeleder	1a5ff9285a	Avoid assignments to cast expressions. Reviewed by: md5 Approved by: das (mentor)	2004-06-08 13:08:19 +00:00
Tim J. Robbins	f55530b436	Remove remnants of PGINPROF.	2004-06-08 10:37:30 +00:00
Robert Watson	aa57bb0424	Correct a resource leak introduced in recent accept locking changes: when I reordered events in accept1() to allocate a file descriptor earlier, I didn't properly update use of goto on exit to unwind for cases where the file descriptor is now held, but wasn't previously. The result was that, in the event of accept() on a non-blocking socket, or in the event of a socket error, a file descriptor would be leaked. This ended up being non-fatal in many cases, as the file descriptor would be properly GC'd on process exit, so only showed up for processes that do a lot of non-blocking accept() calls, and also live for a long time (such as qmail). This change updates the use of goto targets to do additional unwinding. Eyes provided by: Brian Feldman <green@freebsd.org> Feet, hands provided by: Stefan Ehmann <shoesoft@gmx.net>, Dimitry Andric <dimitry@andric.com> Arjan van Leeuwen <avleeuwen@piwebs.com>	2004-06-07 21:45:44 +00:00
Poul-Henning Kamp	5df76176f7	Make linesw[] an array of pointers to linedesc instead of an array of linedisc.	2004-06-07 20:45:45 +00:00
Julian Elischer	345ad86692	Split kern_thread.c into 2 parts. kern_kse.c and kern_thread.c Kern_kse has already been committed. This separates out the KSE threading ABI from generic thread support.	2004-06-07 19:00:57 +00:00
David Xu	36939a0a5c	According to SUSv3, sigwait is different with sigwaitinfo, sigwait returns error code in return value, not in errno.	2004-06-07 13:35:02 +00:00
Pawel Jakub Dawidek	79db0f1cbf	Remove unused code. Submitted by: Bjoern A. Zeeb	2004-06-07 12:19:55 +00:00
Hajimu UMEMOTO	7a1a900c65	allow more than MLEN bytes for ancillary data to meet the requirement of Section 20.1 of RFC3542. Obtained from: KAME MFC after: 1 week	2004-06-07 09:59:50 +00:00
Tim J. Robbins	be5318b2ca	Remove a stale and misleading comment.	2004-06-07 09:35:00 +00:00
Julian Elischer	30276dc9f8	Move the KSE ABI specific code here and separate it from code that is generic to any threading system. This commit does not link this file to the build yet, nor does it remove these functions from their current location in kern_thread.c. (that commit coming up after further review)	2004-06-07 07:25:03 +00:00
Poul-Henning Kamp	9a6dc4b647	Remove filename+line number from panic messages.	2004-06-06 21:26:49 +00:00
Bruce Evans	05b2c96fd3	Detect interrupt storms better. The storm detection didn't work at all with an ASUS A7N8X-E motherboard in APIC mode, since storming interrupts don't repeat immediately. Use DELAY(1) to wait a bit for them to repeat. This affects all systems. Only delay for the first (10 * intr_storm_threshold) interrupts (per interrupt handler) so that this is only a pessimization while warming up. Throttle after calling the sub-handlers instead of before so that the long delay given by throttling can be used instead of the DELAY(1) to detect storms after warming up. Reduced the throttling period from 1/10 second to 1/hz seconds so that throttling doesn't destroy performance so much. Interrupts that are detected as storming are effectively handled by polling at a frequency of hz Hz. On A7N8X-E's there is another hardware or configuration bug that makes the throttled frequency closer to 2*hz Hz.	2004-06-05 18:27:28 +00:00
Maxime Henrion	bd304417e1	When we don't have any meaningful value to print for the device sysctl tree, output an empty string instead of "?". This is already what happened with DEVICE_SYSCTL_LOCATION and DEVICE_SYSCTL_PNPINFO. This makes the output of "sysctl dev" much nicer (it won't display those empty sysctls). Reviewed by: des	2004-06-05 11:39:05 +00:00
Tim J. Robbins	f99619a0dc	Change the types of vn_rdwr_inchunks()'s len and aresid arguments to size_t and size_t *, respectively. Update callers for the new interface. This is a better fix for overflows that occurred when dumping segments larger than 2GB to core files.	2004-06-05 02:18:28 +00:00
Tim J. Robbins	2b471bc616	Back out workaround for vn_rdwr_inchunks()'s INT_MAX length limitation after discussions with bde; vn_rdwr_inchunks() itself should be fixed.	2004-06-05 02:00:12 +00:00
Poul-Henning Kamp	13e84a71e0	Centralize the line discipline optimization determination in a function called ttyldoptim(). Use this function from all the relevant drivers. I belive no drivers finger linesw[] directly anymore, paving the way for locking and refcounting.	2004-06-04 21:55:55 +00:00
Poul-Henning Kamp	fe3ec6224a	Manual edits to change linesw[]-frobbing to ttyld_*() calls.	2004-06-04 20:04:52 +00:00
Poul-Henning Kamp	2140d01b27	Machine generated patch which changes linedisc calls from accessing linesw[] directly to using the ttyld...() functions The ttyld...() functions ar inline so there is no performance hit.	2004-06-04 16:02:56 +00:00
Tim J. Robbins	c4d85674d5	Remove a stale comment.	2004-06-04 11:00:22 +00:00
Dag-Erling Smørgrav	35e32fd8a3	Add a devclass level to the dev sysctl tree, in order to support per- class variables in addition to per-device variables. In plain English, this means that dev.foo0.bar is now called dev.foo.0.bar, and it is possible to to have dev.foo.bar as well.	2004-06-04 10:23:00 +00:00
Poul-Henning Kamp	d1afdc6644	Get rid of ttyregister(). All drivers now use ttymalloc() for struct tty, so now we stand a chance of implementing refcounting and getting rid of the damn things again.	2004-06-04 07:17:03 +00:00
Poul-Henning Kamp	214ef22684	Use ttymalloc() instead of ttyregister(). Use ttyioctl() instead of direct calls to the linedisc.	2004-06-04 06:50:35 +00:00
Tim J. Robbins	16e6d16299	Write segments to core dump files in maximally-sized chunks that neither exceed vn_rdwr_inchunks()'s INT_MAX length limitation nor span a block boundary. This fixes dumping segments larger than 2GB. PR: 67546	2004-06-04 06:30:16 +00:00
Robert Watson	e7dd9a1001	Mark sun_noname as const since it's immutable. Update definitions of functions that potentially accept &sun_noname (sbappendaddr(), et al) to accept a const sockaddr pointer.	2004-06-04 04:07:08 +00:00
Alan Cox	62326de742	Move the definitions of SWAPBLK_NONE and SWAPBLK_MASK from vm_page.h to blist.h, enabling the removal of numerous #includes from subr_blist.c. (subr_blist.c and swap_pager.c are the only users of these definitions.)	2004-06-04 04:03:26 +00:00
John Baldwin	ba8b26f960	- Comment out NULL, NULL barrier for Unix domain sockets section as the double NULL entries signal Witness to stop processing the array of order entries meaning none of the spin locks are added resulting in panics on boot. - Add a missing NULL, NULL terminator to the Slip locks list to keep them separate from the spin locks.	2004-06-03 20:07:44 +00:00
Tim J. Robbins	cc05397ffc	Remove checks for curthread == NULL - it can't happen.	2004-06-03 10:22:47 +00:00
Tim J. Robbins	fa2a4d0595	Move TDF_DEADLKTREAT into td_pflags (and rename it accordingly) to avoid having to acquire sched_lock when manipulating it in lockmgr(), uiomove(), and uiomove_fromphys(). Reviewed by: jhb	2004-06-03 01:47:37 +00:00
Robert Watson	d97e0534fa	Expand the hard-coded WITNESS lock order to include the following relationships: Sockets: filedesc->accept->sellck Routing: radix node head->rtentry->ifaddr UDP: udp->udpinp TCP: tcp->tcpinp SLIP: slip_mtx->slip sc_mtx Drop in a place holder section for UNIX domain sockets. Various sections to be expanded over the next few days.	2004-06-02 23:28:06 +00:00
Maxime Henrion	2e34ae7a26	As discussed on arch@, flatten the device sysctl tree to make it more convenient to deal with. The notion of hierarchy is however preserved by adding a new %parent node.	2004-06-02 22:43:35 +00:00
Tim J. Robbins	e4e815db72	Remove a redundant "td = curthread" statement from profclock().	2004-06-02 12:05:06 +00:00
Tim J. Robbins	aa0aa7a113	Move TDF_SA from td_flags to td_pflags (and rename it accordingly) so that it is no longer necessary to hold sched_lock while manipulating it. Reviewed by: davidxu	2004-06-02 07:52:36 +00:00
Jeff Roberson	dc03363dd8	- Run sched_balance() and sched_balance_groups() from hardclock via sched_clock() rather than using callouts. This means we no longer have to take the load of the callout thread into consideration while balancing and should make the balancing decisions simpler and more accurate. Tested on: x86/UP, amd64/SMP	2004-06-02 05:46:48 +00:00
Robert Watson	2658b3bb8e	Integrate accept locking from rwatson_netperf, introducing a new global mutex, accept_mtx, which serializes access to the following fields across all sockets: so_qlen so_incqlen so_qstate so_comp so_incomp so_list so_head While providing only coarse granularity, this approach avoids lock order issues between sockets by avoiding ownership of the fields by a specific socket and its per-socket mutexes. While here, rewrite soclose(), sofree(), soaccept(), and sonewconn() to add assertions, close additional races and address lock order concerns. In particular: - Reorganize the optimistic concurrency behavior in accept1() to always allocate a file descriptor with falloc() so that if we do find a socket, we don't have to encounter the "Oh, there wasn't a socket" race that can occur if falloc() sleeps in the current code, which broke inbound accept() ordering, not to mention requiring backing out socket state changes in a way that raced with the protocol level. We may want to add a lockless read of the queue state if polling of empty queues proves to be important to optimize. - In accept1(), soref() the socket while holding the accept lock so that the socket cannot be free'd in a race with the protocol layer. Likewise in netgraph equivilents of the accept1() code. - In sonewconn(), loop waiting for the queue to be small enough to insert our new socket once we've committed to inserting it, or races can occur that cause the incomplete socket queue to overfill. In the previously implementation, it was sufficient to simply tested once since calling soabort() didn't release synchronization permitting another thread to insert a socket as we discard a previous one. - In soclose()/sofree()/et al, it is the responsibility of the caller to remove a socket from the incomplete connection queue before calling soabort(), which prevents soabort() from having to walk into the accept socket to release the socket from its queue, and avoids races when releasing the accept mutex to enter soabort(), permitting soabort() to avoid lock ordering issues with the caller. - Generally cluster accept queue related operations together throughout these functions in order to facilitate locking. Annotate new locking in socketvar.h.	2004-06-02 04:15:39 +00:00

1 2 3 4 5 ...

7268 commits