synapse

mirror of https://github.com/element-hq/synapse.git synced 2024-11-22 01:25:44 +03:00

Author	SHA1	Message	Date
Erik Johnston	8f35f8148e	Fix bug where a new writer advances their token too quickly (#16473 ) * Fix bug where a new writer advances their token too quickly When starting a new writer (e.g. for persisting events), the `MultiWriterIdGenerator` doesn't have a minimum token for it as there are no rows matching that new writer in the DB. This results in the first stream ID it acquired being announced as persisted before it actually finishes persisting, if another writer gets and persists a subsequent stream ID. This is due to the logic of setting the minimum persisted position to the minimum known position across all writers, and the new writer starts off not being considered. * Fix sending out POSITIONs when our token advances without update Broke in #14820 * For replication HTTP requests, only wait for minimal position	2023-10-23 16:57:30 +01:00
Patrick Cloke	6ad1f9eac2	Convert DeviceLastConnectionInfo to attrs. (#16507 ) To improve type safety & memory usage.	2023-10-17 12:47:42 +00:00
Patrick Cloke	a4904dcb04	Convert simple_select_many_batch, simple_select_many_txn to tuples. (#16444 )	2023-10-11 13:24:56 -04:00
Patrick Cloke	85bfd4735e	Return an immutable value from get_latest_event_ids_in_room. (#16326 )	2023-09-18 09:29:05 -04:00
Erik Johnston	954921736b	Refactor `get_user_by_id` (#16316 )	2023-09-14 12:46:30 +01:00
Erik Johnston	2b35626b6b	Refactor storing of server keys (#16261 )	2023-09-12 11:08:04 +01:00
Patrick Cloke	aa483cb4c9	Update ruff config (#16283 ) Enable additional checks & clean-up unneeded configuration.	2023-09-08 11:24:36 -04:00
Mathieu Velten	dcb2778341	Add last_seen_ts to the admin users API (#16218 )	2023-09-04 18:13:28 +02:00
David Robertson	6525fd65ee	Log the details of background update failures (#16212 )	2023-09-01 12:41:56 +01:00
Erik Johnston	a2e0d4cd60	Fix rare bug that broke looping calls (#16210 ) * Fix rare bug that broke looping calls We can't interact with the reactor from the main thread via looping call. Introduced in v1.90.0 / #15791. * Newsfile	2023-08-30 14:18:42 +01:00
Patrick Cloke	9ec3da06da	Bump mypy-zope & mypy. (#16188 )	2023-08-29 10:38:56 -04:00
V02460	84f441f88f	Prepare unit tests for Python 3.12 (#16099 )	2023-08-25 15:05:10 -04:00
Patrick Cloke	a8a46b1336	Replace simple_async_mock with AsyncMock (#16180 ) Python 3.8 has a native AsyncMock, use it instead of a custom implementation.	2023-08-25 09:27:21 -04:00
Patrick Cloke	daf11e26ef	Replace make_awaitable with AsyncMock (#16179 ) Python 3.8 provides a native AsyncMock, we can replace the homegrown version we have.	2023-08-24 19:38:46 -04:00
Neil Johnson	ec662bbe41	Filter out unwanted user_agents from udv. (#16124 )	2023-08-23 14:00:34 +01:00
Erik Johnston	bd558a6dc3	Speed up state res in rare case we don't have all events (#16116 ) If we don't have all the auth events in a room then not all state events will have a chain cover index. Even so, we can still use the chain cover index on the events that do have it, rather than bailing and using the slower functions. This situation should not arise for newly persisted rooms, as we check we have the full auth chain for each event, but can happen for existing rooms. c.f. #15245	2023-08-18 15:32:06 +01:00
Erik Johnston	eb0dbab15b	Fix database performance of read/write worker locks (#16061 ) We were seeing serialization errors when taking out multiple read locks. The transactions were retried, so this isn't causing any failures. Introduced in #15782.	2023-08-17 14:07:57 +01:00
Patrick Cloke	ad3f43be9a	Run pyupgrade for python 3.7 & 3.8. (#16110 )	2023-08-15 08:11:20 -04:00
Mathieu Velten	dac97642e4	Implements admin API to lock an user (MSC3939) (#15870 )	2023-08-10 09:10:55 +00:00
Mathieu Velten	f0a860908b	Allow config of the backoff algorithm for the federation client. (#15754 ) Adds three new configuration variables: * destination_min_retry_interval is identical to before (10mn). * destination_retry_multiplier is now 2 instead of 5, the maximum value will be reached slower. * destination_max_retry_interval is one day instead of (essentially) infinity. Capping this will cause destinations to continue to be retried sometimes instead of being lost forever. The previous value was 2 ^ 62 milliseconds.	2023-08-03 14:36:55 -04:00
Erik Johnston	ae55cc1e6b	Add ability to wait for locks and add locks to purge history / room deletion (#15791 ) c.f. #13476	2023-07-31 10:58:03 +01:00
Olivier Wilkinson (reivilibre)	8e8431bc6e	Merge branch 'master' into develop	2023-07-18 16:45:39 +01:00
Shay	e625c3dca0	Revert "Stop writing to column `user_id` of tables `profiles` and `user_filters`. (#15953 ) * Revert "Stop writing to column `user_id` of tables `profiles` and `user_filters` (#15787)" This reverts commit `f25b0f8808`. * newsfragment	2023-07-18 11:44:09 +01:00
Eric Eastwood	1c802de626	Re-introduce the outbound federation proxy (#15913 ) Allow configuring the set of workers to proxy outbound federation traffic through (`outbound_federation_restricted_to`). This is useful when you have a worker setup with `federation_sender` instances responsible for sending outbound federation requests and want to make sure all outbound federation traffic goes through those instances. Before this change, the generic workers would still contact federation themselves for things like profile lookups, backfill, etc. This PR allows you to set more strict access controls/firewall for all workers and only allow the `federation_sender`'s to contact the outside world.	2023-07-18 09:49:21 +01:00
Eric Eastwood	c9bf644fa0	Revert "Federation outbound proxy" (#15910 ) Revert "Federation outbound proxy (#15773)" This reverts commit `b07b14b494`.	2023-07-10 11:10:20 -05:00
Erik Johnston	e55a9b3e41	Fix downgrading to previous version of Synapse (#15907 ) We do this by marking the constraint as deferrable.	2023-07-10 16:24:42 +01:00
Shay	f25b0f8808	Stop writing to column `user_id` of tables `profiles` and `user_filters` (#15787 )	2023-07-07 09:23:27 -07:00
Eric Eastwood	b07b14b494	Federation outbound proxy (#15773 ) Allow configuring the set of workers to proxy outbound federation traffic through (`outbound_federation_restricted_to`). This is useful when you have a worker setup with `federation_sender` instances responsible for sending outbound federation requests and want to make sure all outbound federation traffic goes through those instances. Before this change, the generic workers would still contact federation themselves for things like profile lookups, backfill, etc. This PR allows you to set more strict access controls/firewall for all workers and only allow the `federation_sender`'s to contact the outside world. The original code is from @erikjohnston's branches which I've gotten in-shape to merge.	2023-07-05 18:53:55 -05:00
Erik Johnston	39d131b016	Add basic read/write lock (#15782 )	2023-07-05 17:25:00 +01:00
Erik Johnston	95a96b21eb	Add foreign key constraint to `event_forward_extremities`. (#15751 )	2023-07-05 09:43:19 +00:00
Eric Eastwood	0f02f0b4da	Remove experimental MSC2716 implementation to incrementally import history into existing rooms (#15748 ) Context for why we're removing the implementation: - https://github.com/matrix-org/matrix-spec-proposals/pull/2716#issuecomment-1487441010 - https://github.com/matrix-org/matrix-spec-proposals/pull/2716#issuecomment-1504262734 Anyone wanting to continue MSC2716 should also address these leftover tasks: https://github.com/matrix-org/synapse/issues/10737 Closes https://github.com/matrix-org/synapse/issues/10737 in that it is no longer necessary to track those things.	2023-06-16 14:12:24 -05:00
Jason Little	21fea6b749	Prefill events after invalidate not before when persisting events (#15758 ) Fixes #15757	2023-06-14 09:42:18 +01:00
Shay	553f2f53e7	Replace `EventContext` fields `prev_group` and `delta_ids` with field `state_group_deltas` (#15233 )	2023-06-13 13:22:06 -07:00
Erik Johnston	c485ed1c5a	Clear event caches when we purge history (#15609 ) This should help a little with #13476 --------- Co-authored-by: Patrick Cloke <patrickc@matrix.org>	2023-06-08 13:14:40 +01:00
Shay	d0c4257f14	`N + 3`: Read from column `full_user_id` rather than `user_id` of tables `profiles` and `user_filters` (#15649 )	2023-06-02 17:24:13 -07:00
Olivier Wilkinson (reivilibre)	a1154dfc20	Merge branch 'master' into develop	2023-05-26 17:16:15 +01:00
reivilibre	c775d80b73	Fix a bug introduced in Synapse v1.84.0 where workers do not start up when no `instance_map` was provided. (#15672 ) * Fix #15669: always populate instance map even if it was empty * Fix some tests * Fix more tests * Newsfile Signed-off-by: Olivier Wilkinson (reivilibre) <oliverw@matrix.org> * CI fix: don't forget to update apt repository sources before installing olddeps deps * Add test testing the backwards compatibility --------- Signed-off-by: Olivier Wilkinson (reivilibre) <oliverw@matrix.org>	2023-05-26 14:28:55 +00:00
Eric Eastwood	77156a4bc1	Process previously failed backfill events in the background (#15585 ) Process previously failed backfill events in the background because they are bound to fail again and we don't need to waste time holding up the request for something that is bound to fail again. Fix https://github.com/matrix-org/synapse/issues/13623 Follow-up to https://github.com/matrix-org/synapse/issues/13621 and https://github.com/matrix-org/synapse/issues/13622 Part of making `/messages` faster: https://github.com/matrix-org/synapse/issues/13356	2023-05-24 23:22:24 -05:00
Patrick Cloke	1f55c04cbc	Improve type hints for cached decorator. (#15658 ) The cached decorators always return a Deferred, which was not properly propagated. It was close enough when wrapping coroutines, but failed if a bare function was wrapped.	2023-05-24 12:59:31 +00:00
Shay	9f6ff6a0eb	Add not null constraint to column `full_user_id` of tables `profiles` and `user_filters` (#15537 )	2023-05-16 10:57:39 -07:00
Shay	301b4156d5	Add column `full_user_id` to tables `profiles` and `user_filters`. (#15458 )	2023-04-26 16:03:26 -07:00
Patrick Cloke	5e024a0645	Modify StoreKeyFetcher to read from server_keys_json. (#15417 ) Before this change: * `PerspectivesKeyFetcher` and `ServerKeyFetcher` write to `server_keys_json`. * `PerspectivesKeyFetcher` also writes to `server_signature_keys`. * `StoreKeyFetcher` reads from `server_signature_keys`. After this change: * `PerspectivesKeyFetcher` and `ServerKeyFetcher` write to `server_keys_json`. * `PerspectivesKeyFetcher` also writes to `server_signature_keys`. * `StoreKeyFetcher` reads from `server_keys_json`. This results in `StoreKeyFetcher` now using the results from `ServerKeyFetcher` in addition to those from `PerspectivesKeyFetcher`, i.e. keys which are directly fetched from a server will now be pulled from the database instead of refetched. An additional minor change is included to avoid creating a `PerspectivesKeyFetcher` (and checking it) if no `trusted_key_servers` are configured. The overall impact of this should be better usage of cached results: * If a server has no trusted key servers configured then it should reduce how often keys are fetched. * if a server's trusted key server does not have a requested server's keys cached then it should reduce how often keys are directly fetched.	2023-04-20 12:30:32 -04:00
reivilibre	edae20f926	Improve robustness when handling a perspective key response by deduplicating received server keys. (#15423 ) * Change `store_server_verify_keys` to take a `Mapping[(str, str), FKR]` This is because we already can't handle duplicate keys — leads to cardinality violation * Newsfile Signed-off-by: Olivier Wilkinson (reivilibre) <oliverw@matrix.org> --------- Signed-off-by: Olivier Wilkinson (reivilibre) <oliverw@matrix.org>	2023-04-13 15:35:03 +01:00
Erik Johnston	6204c3663e	Revert pruning of old devices (#15360 ) * Revert "Fix registering a device on an account with lots of devices (#15348)" This reverts commit `f0d8f66eaa`. * Revert "Delete stale non-e2e devices for users, take 3 (#15183)" This reverts commit `78cdb72cd6`.	2023-03-31 13:51:51 +01:00
Sean Quah	d9f694932c	Fix spinloop during partial state sync when a prev event is in backoff (#15351 ) Previously, we would spin in a tight loop until `update_state_for_partial_state_event` stopped raising `FederationPullAttemptBackoffError`s. Replace the spinloop with a wait until the backoff period has expired. Signed-off-by: Sean Quah <seanq@matrix.org>	2023-03-30 13:36:41 +01:00
Erik Johnston	78cdb72cd6	Delete stale non-e2e devices for users, take 3 (#15183 ) This should help reduce the number of devices racked up by e.g. simple bots that repeatedly log in. We only delete non-e2e devices as they should be safe to delete, whereas if we delete e2e devices for a user we may accidentally break their ability to receive e2e keys for a message.	2023-03-29 12:07:14 +01:00
David Robertson	3b0083c92a	Use immutabledict instead of frozendict (#15113 ) Additionally: * Consistently use `freeze()` in test --------- Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com> Co-authored-by: 6543 <6543@obermui.de>	2023-03-22 17:15:34 +00:00
6543	6b6e91e610	Fix ICU tests on alpine / macOS. (#15177 ) The word boundary behaviour is slightly different, consider it acceptable for the tests.	2023-03-03 14:22:06 +00:00
reivilibre	d62cd940cb	Fix a long-standing bug where an initial sync would not respond to changes to the list of ignored users if there was an initial sync cached. (#15163 )	2023-02-28 17:11:26 +00:00
Shay	1c95ddd09b	Batch up storing state groups when creating new room (#14918 )	2023-02-24 13:15:29 -08:00
Sean Quah	335f52d595	Improve handling of non-ASCII characters in user directory search (#15143 ) * Fix a long-standing bug where non-ASCII characters in search terms, including accented letters, would not match characters in a different case. * Fix a long-standing bug where search terms using combining accents would not match display names using precomposed accents and vice versa. To fully take effect, the user directory must be rebuilt after this change. Fixes #14630. Signed-off-by: Sean Quah <seanq@matrix.org>	2023-02-24 13:39:45 +00:00
dependabot[bot]	9bb2eac719	Bump black from 22.12.0 to 23.1.0 (#15103 )	2023-02-22 15:29:09 -05:00
David Robertson	647ff3ef65	Remove unused `room_alias` field from `/createRoom` response (#15093 ) * Change `create_room` return type * Don't return room alias from /createRoom * Update other callsites * Fix up mypy complaints It looks like new_room_user_id is None iff new_room_id is None. It's a shame we haven't expressed this in a way that mypy can understand. * Changelog	2023-02-22 11:07:28 +00:00
reivilibre	1cbc3f197c	Fix a bug introduced in Synapse v1.74.0 where searching with colons when using ICU for search term tokenisation would fail with an error. (#15079 ) Co-authored-by: David Robertson <davidr@element.io>	2023-02-20 12:00:18 +00:00
Patrick Cloke	42aea0d8af	Add final type hint to tests.unittest. (#15072 ) Adds a return type to HomeServerTestCase.make_homeserver and deal with any variables which are no longer Any.	2023-02-14 14:03:35 -05:00
Shay	03bccd542b	Add a class UnpersistedEventContext to allow for the batching up of storing state groups (#14675 ) * add class UnpersistedEventContext * modify create new client event to create unpersistedeventcontexts * persist event contexts after creation * fix tests to persist unpersisted event contexts * cleanup * misc lints + cleanup * changelog + fix comments * lints * fix batch insertion? * reduce redundant calculation * add unpersisted event classes * rework compute_event_context, split into function that returns unpersisted event context and then persists it * use calculate_context_info to create unpersisted event contexts * update typing * $%#^&* * black * fix comments and consolidate classes, use attr.s for class * requested changes * lint * requested changes * requested changes * refactor to be stupidly explicit * clearer renaming and flow * make partial state non-optional * update docstrings --------- Co-authored-by: Erik Johnston <erik@matrix.org>	2023-02-09 13:05:02 -08:00
Patrick Cloke	230a831c73	Attempt to delete more duplicate rows in receipts_linearized table. (#14915 ) The previous assumption was that the stream_id column was unique (for a room ID, receipt type, user ID tuple), but this turned out to be incorrect. Now find the max stream ID, then map this back to a database-specific row identifier and delete other rows which match the (room ID, receipt type, user ID) tuple, but not the row ID.	2023-02-01 15:45:10 -05:00
Sean Quah	6d14fdc271	Make sqlite database migrations transactional again, part two (#14926 ) #14910 fixed the regression introduced by #13873 where sqlite database migrations would no longer run inside a transaction. However, it committed the transaction before Synapse updated its bookkeeping of which migrations have been run, which means that migrations may be run again after they have completed successfully. Leave the transaction open at the end of `executescript`, to restore the old, correct behaviour. Also make the PostgreSQL behaviour consistent with SQLite. Fixes #14909. Signed-off-by: Sean Quah <seanq@matrix.org>	2023-01-31 11:03:55 +00:00
Andrew Morgan	871ff05add	Fix type hints in typing edu unit tests (#14886 )	2023-01-26 10:15:50 +00:00
Patrick Cloke	82d3efa312	Skip processing stats for broken rooms. (#14873 ) * Skip processing stats for broken rooms. * Newsfragment * Use a custom exception.	2023-01-23 11:36:20 +00:00
Erik Johnston	65d0386693	Always notify replication when a stream advances (#14877 ) This ensures that all other workers are told about stream updates in a timely manner, without having to remember to manually poke replication.	2023-01-20 18:02:18 +00:00
Erik Johnston	9187fd940e	Wait for streams to catch up when processing HTTP replication. (#14820 ) This should hopefully mitigate a class of races where data gets out of sync due to an HTTP replication request racing with the replication streams.	2023-01-18 19:35:29 +00:00
Erik Johnston	b50c008453	Re-enable some linting (#14821 ) * Re-enable some linting * Newsfile * Remove comment	2023-01-12 10:52:07 +00:00
David Robertson	e2a1adbf5d	Allow selecting "prejoin" events by state keys (#14642 ) * Declare new config * Parse new config * Read new config * Don't use trial/our TestCase where it's not needed Before: ``` $ time trial tests/events/test_utils.py > /dev/null real 0m2.277s user 0m2.186s sys 0m0.083s ``` After: ``` $ time trial tests/events/test_utils.py > /dev/null real 0m0.566s user 0m0.508s sys 0m0.056s ``` * Helper to upsert to event fields without exceeding size limits. * Use helper when adding invite/knock state Now that we allow admins to include events in prejoin room state with arbitrary state keys, be a good Matrix citizen and ensure they don't accidentally create an oversized event. * Changelog * Move StateFilter tests should have done this in #14668 * Add extra methods to StateFilter * Use StateFilter * Ensure test file enforces typed defs; alphabetise * Workaround surprising get_current_state_ids * Whoops, fix mypy	2022-12-13 00:54:46 +00:00
David Robertson	b5b5f66084	Move `StateFilter` to `synapse.types` (#14668 ) * Move `StateFilter` to `synapse.types` * Changelog	2022-12-12 16:19:30 +00:00
reivilibre	74b89c2761	Revert the deletion of stale devices due to performance issues. (#14662 )	2022-12-12 13:55:23 +00:00
Brendan Abolivier	2a3cd59dd0	Add optional ICU support for user search (#14464 ) Fixes #13655 This change uses ICU (International Components for Unicode) to improve boundary detection in user search. This change also adds a new dependency on libicu-dev and pkg-config for the Debian packages, which are available in all supported distros.	2022-12-12 13:21:17 +01:00
Patrick Cloke	3ac412b4e2	Require types in tests.storage. (#14646 ) Adds missing type hints to `tests.storage` package and does not allow untyped definitions.	2022-12-09 12:36:32 -05:00
Erik Johnston	c2de2ca630	Delete stale non-e2e devices for users, take 2 (#14595 ) This should help reduce the number of devices racked up by e.g. simple bots that repeatedly log in. We only delete non-e2e devices as they should be safe to delete, whereas if we delete e2e devices for a user we may accidentally break their ability to receive e2e keys for a message.	2022-12-09 09:37:07 +00:00
reivilibre	cf1059d045	Fix a long-standing bug where the user directory would return 1 more row than requested. (#14631 )	2022-12-07 11:19:43 +00:00
David Robertson	781b14ec69	Merge branch 'release-v1.73' into develop	2022-12-01 13:43:30 +00:00
Nick Mills-Barrett	e8bce8999f	Aggregate unread notif count query for badge count calculation (#14255 ) Fetch the unread notification counts used by the badge counts in push notifications for all rooms at once (instead of fetching them per room).	2022-11-30 08:45:06 -05:00
David Robertson	c29e2c6306	Revert "POC delete stale non-e2e devices for users (#14038 )" (#14582 )	2022-11-29 17:48:48 +00:00
Erik Johnston	c7e29ca277	POC delete stale non-e2e devices for users (#14038 ) This should help reduce the number of devices racked up by e.g. simple bots that repeatedly log in. We only delete non-e2e devices as they should be safe to delete, whereas if we delete e2e devices for a user we may accidentally break their ability to receive e2e keys for a message. Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com> Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com>	2022-11-29 10:36:41 +00:00
reivilibre	9af2be192a	Remove legacy Prometheus metrics names. They were deprecated in Synapse v1.69.0 and disabled by default in Synapse v1.71.0. (#14538 )	2022-11-24 09:09:17 +00:00
Sean Quah	9cae44f49e	Track unconverted device list outbound pokes using a position instead (#14516 ) When a local device list change is added to `device_lists_changes_in_room`, the `converted_to_destinations` flag is set to `FALSE` and the `_handle_new_device_update_async` background process is started. This background process looks for unconverted rows in `device_lists_changes_in_room`, copies them to `device_lists_outbound_pokes` and updates the flag. To update the `converted_to_destinations` flag, the database performs a `DELETE` and `INSERT` internally, which fragments the table. To avoid this, track unconverted rows using a `(stream ID, room ID)` position instead of the flag. From now on, the `converted_to_destinations` column indicates rows that need converting to outbound pokes, but does not indicate whether the conversion has already taken place. Closes #14037. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-22 16:46:52 +00:00
David Robertson	115f0eb233	Reintroduce #14376 , with bugfix for monoliths (#14468 ) * Add tests for StreamIdGenerator * Drive-by: annotate all defs * Revert "Revert "Remove slaved id tracker (#14376)" (#14463)" This reverts commit `d63814fd73`, which in turn reverted `36097e88c4`. This restores the latter. * Fix StreamIdGenerator not handling unpersisted IDs Spotted by @erikjohnston. Closes #14456. * Changelog Co-authored-by: Nick Mills-Barrett <nick@fizzadar.com> Co-authored-by: Erik Johnston <erik@matrix.org>	2022-11-16 22:16:46 +00:00
Sean Quah	882277008c	Fix background updates failing to add unique indexes on receipts (#14453 ) As part of the database migration to support threaded receipts, there is a possible window in between `73/08thread_receipts_non_null.sql.postgres` removing the original unique constraints on `receipts_linearized` and `receipts_graph` and the `receipts_linearized_unique_index` and `receipts_graph_unique_index` background updates from `72/08thread_receipts.sql` completing where the unique constraints on `receipts_linearized` and `receipts_graph` are missing. Any emulated upserts on these tables must therefore be performed with a lock held, otherwise duplicate rows can end up in the tables when there are concurrent emulated upserts. Fix the missing lock. Note that emulated upserts no longer happen by default on sqlite, since the minimum supported version of sqlite supports native upserts by default now. Finally, clean up any duplicate receipts that may have crept in before trying to create the `receipts_graph_unique_index` and `receipts_linearized_unique_index` unique indexes. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-16 15:01:22 +00:00
Patrick Cloke	e9a4343cb2	Drop support for Postgres 10 in full text search code. (#14397 )	2022-11-09 09:55:34 -05:00
Patrick Cloke	67583281e3	Fix tests for change in PostgreSQL 14 behavior. (#14310 ) PostgreSQL 14 changed the behavior of `websearch_to_tsquery` in some edge cases. The tests were hitting those edge cases around the handling of hanging double quotes. This fixes the tests to take into account the PostgreSQL version.	2022-10-27 13:58:12 +00:00
James Salter	d902181de9	Unified search query syntax using the full-text search capabilities of the underlying DB. (#11635 ) Support a unified search query syntax which leverages more of the full-text search of each database supported by Synapse. Supports, with the same syntax across Postgresql 11+ and Sqlite: - quoted "search terms" - `AND`, `OR`, `-` (negation) operators - Matching words based on their stem, e.g. searches for "dog" matches documents containing "dogs". This is achieved by - If on postgresql 11+, pass the user input to `websearch_to_tsquery` - If on sqlite, manually parse the query and transform it into the sqlite-specific query syntax. Note that postgresql 10, which is close to end-of-life, falls back to using `phraseto_tsquery`, which only supports a subset of the features. Multiple terms separated by a space are implicitly ANDed. Note that: 1. There is no escaping of full-text syntax that might be supported by the database; e.g. `NOT`, `NEAR`, `*` in sqlite. This runs the risk that people might discover this as accidental functionality and depend on something we don't guarantee. 2. English text is assumed for stemming. To support other languages, either the target language needs to be known at the time of indexing the message (via room metadata, or otherwise), or a separate index for each language supported could be created. Sqlite docs: https://www.sqlite.org/fts3.html#full_text_index_queries Postgres docs: https://www.postgresql.org/docs/11/textsearch-controls.html	2022-10-25 14:05:22 -04:00
Andrew Morgan	828b5502cf	Remove `_get_events_cache` check optimisation from `_have_seen_events_dict` (#14161 )	2022-10-18 10:33:21 +01:00
Patrick Cloke	4283bd1cf9	Support filtering the /messages API by relation type (MSC3874). (#14148 ) Gated behind an experimental configuration flag.	2022-10-17 11:32:11 -04:00
Eric Eastwood	40bb37eb27	Stop getting missing `prev_events` after we already know their signature is invalid (#13816 ) While https://github.com/matrix-org/synapse/pull/13635 stops us from doing the slow thing after we've already done it once, this PR stops us from doing one of the slow things in the first place. Related to - https://github.com/matrix-org/synapse/issues/13622 - https://github.com/matrix-org/synapse/pull/13635 - https://github.com/matrix-org/synapse/issues/13676 Part of https://github.com/matrix-org/synapse/issues/13356 Follow-up to https://github.com/matrix-org/synapse/pull/13815 which tracks event signature failures. With this PR, we avoid the call to the costly `_get_state_ids_after_missing_prev_event` because the signature failure will count as an attempt before and we filter events based on the backoff before calling `_get_state_ids_after_missing_prev_event` now. For example, this will save us 156s out of the 185s total that this `matrix.org` `/messages` request took. If you want to see the full Jaeger trace of this, you can drag and drop this `trace.json` into your own Jaeger, https://gist.github.com/MadLittleMods/4b12d0d0afe88c2f65ffcc907306b761 To explain this exact scenario around `/messages` -> backfill, we call `/backfill` and first check the signatures of the 100 events. We see bad signatures for `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` and `$zuOn2Rd2vsC7SUia3Hp3r6JSkSFKcc5j3QTTqW_0jDw` (both member events). Then we process the 98 events remaining that have valid signatures but one of the events references `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` as a `prev_event`. So we have to do the whole `_get_state_ids_after_missing_prev_event` rigmarole which pulls in those same events which fail again because the signatures are still invalid. 
- `backfill` - `outgoing-federation-request` `/backfill` - `_check_sigs_and_hash_and_fetch` - `_check_sigs_and_hash_and_fetch_one` for each event received over backfill - ❗ `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` fails with `Signature on retrieved event was invalid.`: `unable to verify signature for sender domain xxx: 401: Failed to find any key to satisfy: _FetchKeyRequest(...)` - ❗ `$zuOn2Rd2vsC7SUia3Hp3r6JSkSFKcc5j3QTTqW_0jDw` fails with `Signature on retrieved event was invalid.`: `unable to verify signature for sender domain xxx: 401: Failed to find any key to satisfy: _FetchKeyRequest(...)` - `_process_pulled_events` - `_process_pulled_event` for each validated event - ❗ Event `$Q0iMdqtz3IJYfZQU2Xk2WjB5NDF8Gg8cFSYYyKQgKJ0` references `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` as a `prev_event` which is missing so we try to get it - `_get_state_ids_after_missing_prev_event` - `outgoing-federation-request` `/state_ids` - ❗ `get_pdu` for `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` which fails the signature check again - ❗ `get_pdu` for `$zuOn2Rd2vsC7SUia3Hp3r6JSkSFKcc5j3QTTqW_0jDw` which fails the signature check	2022-10-15 00:36:49 -05:00
Patrick Cloke	d1bdeccb50	Accept threaded receipts for events related to the root event. (#14174 ) The root node of a thread (and events related to it) are considered "part of a thread" when validating receipts. This allows clients which show the root node in both the main timeline and the threaded timeline to easily send receipts in either. Note that threaded notifications are not created for these events, these events created notifications on the main timeline.	2022-10-14 18:05:25 +00:00
Patrick Cloke	dcced5a8d7	Use threaded receipts when fetching events for push. (#13878 ) Update the HTTP and email pushers to consider threaded read receipts when fetching unread events.	2022-10-04 12:07:02 -04:00
Patrick Cloke	2b6d41ebd6	Recursively fetch the thread for receipts & notifications. (#13824 ) Consider an event to be part of a thread if you can follow a chain of relations up to a thread root. Part of MSC3773 & MSC3771.	2022-10-04 11:36:16 -04:00
Patrick Cloke	a7ba457b2b	Mark events as read using threaded read receipts from MSC3771. (#13877 ) Applies the proper logic for unthreaded and threaded receipts to either apply to all events in the room or only events in the same thread, respectively.	2022-10-04 10:46:42 -04:00
Patrick Cloke	b4ec4f5e71	Track notification counts per thread (implement MSC3773). (#13776 ) When retrieving counts of notifications segment the results based on the thread ID, but choose whether to return them as individual threads or as a single summed field by letting the client opt-in via a sync flag. The summarization code is also updated to be per thread, instead of per room.	2022-10-04 09:47:04 -04:00
David Robertson	285d72556b	Update mypy and mypy-zope, attempt 3 (#13993 ) Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com>	2022-09-30 17:36:28 +01:00
David Robertson	8e52cb0bce	Revert "Update mypy and mypy-zope (#13925 )" This reverts commit `6d543d6d9f`.	2022-09-30 16:37:48 +01:00
David Robertson	6d543d6d9f	Update mypy and mypy-zope (#13925 ) * Update mypy and mypy-zope * Unignore assigning to LogRecord attributes Presumably https://github.com/python/typeshed/pull/8064 makes this ok Cherry-picked from #13521 * Remove unused ignores due to mypy ParamSpec fixes https://github.com/python/mypy/pull/12668 Cherry-picked from #13521 * Remove additional unused ignores * Fix new mypy complaints related to `assertGreater` Presumably due to https://github.com/python/typeshed/pull/8077 * Changelog * Reword changelog Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com> Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com>	2022-09-30 16:34:47 +01:00
David Robertson	e8f30a76ca	Fix overflows in /messages backfill calculation (#13936 ) * Reproduce bug * Compute `least_function` first * Substitute `least_function` with an f-string * Bugfix: avoid overflow Co-authored-by: Eric Eastwood <erice@element.io>	2022-09-30 11:54:53 +01:00
Brendan Abolivier	be76cd8200	Allow admins to require a manual approval process before new accounts can be used (using MSC3866) (#13556 )	2022-09-29 15:23:24 +02:00
Patrick Cloke	568016929f	Clarify that a method returns only unthreaded receipts. (#13937 ) By renaming it and updating the docstring. Additionally, refactors a method which is used only by tests.	2022-09-29 07:07:31 -04:00
Eric Eastwood	df8b91ed2b	Limit and filter the number of backfill points to get from the database (#13879 ) There is no need to grab thousands of backfill points when we only need 5 to make the `/backfill` request with. We need to grab a few extra in case the first few aren't visible in the history. Previously, we grabbed thousands of backfill points from the database, then sorted and filtered them in the app. Fetching the 4.6k backfill points for `#matrix:matrix.org` from the database takes ~50ms - ~570ms so it's not like this saves a lot of time 🤷. But it might save us more time now that `get_backfill_points_in_room`/`get_insertion_event_backward_extremities_in_room` are more complicated after https://github.com/matrix-org/synapse/pull/13635 This PR moves the filtering and limiting to the SQL query so we just have less data to work with in the first place. Part of https://github.com/matrix-org/synapse/issues/13356	2022-09-28 15:26:16 -05:00
Shay	8ab16a92ed	Persist CreateRoom events to DB in a batch (#13800 )	2022-09-28 10:11:48 +00:00
Eric Eastwood	29269d9d3f	Fix `have_seen_event` cache not being invalidated (#13863 ) Fix https://github.com/matrix-org/synapse/issues/13856 Fix https://github.com/matrix-org/synapse/issues/13865 > Discovered while trying to make Synapse fast enough for [this MSC2716 test for importing many batches](https://github.com/matrix-org/complement/pull/214#discussion_r741678240). As an example, disabling the `have_seen_event` cache saves 10 seconds for each `/messages` request in that MSC2716 Complement test because we're not making as many federation requests for `/state` (speeding up `have_seen_event` itself is related to https://github.com/matrix-org/synapse/issues/13625) > > But this will also make `/messages` faster in general so we can include it in the [faster `/messages` milestone](https://github.com/matrix-org/synapse/milestone/11). > > -- https://github.com/matrix-org/synapse/issues/13856 ### The problem `_invalidate_caches_for_event` doesn't run in monolith mode which means we never even tried to clear the `have_seen_event` and other caches. And even in worker mode, it only runs on the workers, not the master (AFAICT). Additionally there was a bug with the key being wrong so `_invalidate_caches_for_event` never invalidates the `have_seen_event` cache even when it does run. Because we were using the `@cachedList` wrong, it was putting items in the cache under keys like `((room_id, event_id),)` with a `set` in a `set` (ex. `(('!TnCIJPKzdQdUlIyXdQ:test', '$Iu0eqEBN7qcyF1S9B3oNB3I91v2o5YOgRNPwi_78s-k'),)`) and we were trying to invalidate with just `(room_id, event_id)` which did nothing.	2022-09-27 15:55:43 -05:00
Patrick Cloke	2fae1a3f78	Improve tests for get_unread_push_actions_for_user_in_range_. (#13893 ) * Adds a docstring. * Reduces a small amount of duplicated code. * Improves tests.	2022-09-26 18:28:12 +00:00
Eric Eastwood	ac1a31740b	Only try to backfill event if we haven't tried before recently (#13635 ) Only try to backfill event if we haven't tried before recently (exponential backoff). No need to keep trying the same backfill point that fails over and over. Fix https://github.com/matrix-org/synapse/issues/13622 Fix https://github.com/matrix-org/synapse/issues/8451 Follow-up to https://github.com/matrix-org/synapse/pull/13589 Part of https://github.com/matrix-org/synapse/issues/13356	2022-09-23 14:01:29 -05:00
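
The key mismatch described in 29269d9d3f (#13863) can be reproduced with a plain dictionary-backed cache. The sketch below is only an illustration of why invalidating with `(room_id, event_id)` did nothing when entries were stored under `((room_id, event_id),)`. `TinyCache` and the sample IDs are hypothetical stand-ins; this is not Synapse's `@cachedList` implementation or its actual cache classes.

```python
from typing import Dict, Hashable


class TinyCache:
    """Hypothetical minimal keyed cache, used only for this illustration."""

    def __init__(self) -> None:
        self._entries: Dict[Hashable, bool] = {}

    def set(self, key: Hashable, value: bool) -> None:
        self._entries[key] = value

    def invalidate(self, key: Hashable) -> bool:
        """Remove `key` if present; return whether anything was removed."""
        if key in self._entries:
            del self._entries[key]
            return True
        return False

    def __contains__(self, key: Hashable) -> bool:
        return key in self._entries


# Made-up identifiers for the example.
room_id = "!room:test"
event_id = "$event"

cache = TinyCache()

# Write path with the wrong key shape: the pair is wrapped in an extra tuple.
nested_key = ((room_id, event_id),)
cache.set(nested_key, True)

# Invalidation uses the flat pair, which is a different key, so it misses.
removed = cache.invalidate((room_id, event_id))
stale_entry_survives = nested_key in cache

# With the same key shape on both paths, invalidation takes effect.
cache.set((room_id, event_id), True)
fixed_removed = cache.invalidate((room_id, event_id))

print(removed, stale_entry_survives, fixed_removed)  # prints: False True True
```

The point of the sketch is that a lookup by a differently shaped key fails silently, which is consistent with the commit message's observation that the invalidation call ran without clearing anything.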

736 commits