[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <50C60496.5040800@ericsson.com>
Date: Mon, 10 Dec 2012 10:49:42 -0500
From: Jon Maloy <jon.maloy@...csson.com>
To: Ying Xue <ying.xue@...driver.com>
CC: <nhorman@...driver.com>, <Paul.Gortmaker@...driver.com>,
<erik.hugne@...csson.com>, <netdev@...r.kernel.org>,
<tipc-discussion@...ts.sourceforge.net>
Subject: Re: [PATCH net-next v3] tipc: sk_recv_queue size check only for connectionless
sockets
On 12/10/2012 05:13 AM, Jon Maloy wrote:
> On 12/10/2012 04:23 AM, Ying Xue wrote:
>> The sk_receive_queue limit control is currently performed for all
>> arriving messages, disregarding socket and message type. But for
>> connectionless sockets this check is redundant, since the protocol
>> flow already makes queue overflow impossible.
>>
>> We move the sk_receive_queue limit control so that it's only performed
>> for connectionless sockets, i.e. SOCK_RDM and SOCK_DGRAM type sockets.
>>
>> However, as Neil Horman specified, we cannot simply force the socket
>> receive queue limit against connectionless sockets as it may create a
>> DoS vulnerability. For example, if a sender floods a receiver with
>> messages containing an invalid set of message importance bits or
>> CRITICAL importance, we will queue messages indefinitely.
>>
>> To avoid DoS attack, socket receive queue will be marked as overflow
>> if we receive messages with invalid message importances, meanwhile,
>> we also set one new threshold for CRITICAL importance messages.
>>
>> Signed-off-by: Ying Xue <ying.xue@...driver.com>
>> Signed-off-by: Jon Maloy <jon.maloy@...csson.com>
>> Cc: Neil Horman <nhorman@...driver.com>
>> Signed-off-by: Paul Gortmaker <paul.gortmaker@...driver.com>
>> ---
>> v3 changes:
>> - set new threshold for CRITICAL message
>> - defined an importance factor table to avoid multiplication and
>> division operations in rx_queue_full().
>> - changed return value of rx_queue_full() from integer to boolean.
>>
>> net/tipc/socket.c | 44 +++++++++++++++++++-------------------------
>> 1 files changed, 19 insertions(+), 25 deletions(-)
>>
>> diff --git a/net/tipc/socket.c b/net/tipc/socket.c
>> index 9b4e483..a18a757 100644
>> --- a/net/tipc/socket.c
>> +++ b/net/tipc/socket.c
>> @@ -43,7 +43,7 @@
>> #define SS_LISTENING -1 /* socket is listening */
>> #define SS_READY -2 /* socket is connectionless */
>>
>> -#define OVERLOAD_LIMIT_BASE 10000
>> +#define OVERLOAD_LIMIT_BASE 5000
>> #define CONN_TIMEOUT_DEFAULT 8000 /* default connect timeout = 8s */
>>
>> struct tipc_sock {
>> @@ -73,6 +73,13 @@ static struct proto tipc_proto;
>>
>> static int sockets_enabled;
>>
>> +static const u32 msg_importance_factor[] = {
>> + OVERLOAD_LIMIT_BASE, /* TIPC_LOW_IMPORTANCE limit */
>> + OVERLOAD_LIMIT_BASE * 2, /* TIPC_MEDIUM_IMPORTANCE limit */
>> + OVERLOAD_LIMIT_BASE * 100, /* TIPC_HIGH_IMPORTANCE limit */
>> + OVERLOAD_LIMIT_BASE * 200 /* TIPC_CRITICAL_IMPORTANCE limit */
>> + };
>> +
>> /*
>> * Revised TIPC socket locking policy:
>> *
>> @@ -1158,28 +1165,17 @@ static void tipc_data_ready(struct sock *sk, int len)
>> * rx_queue_full - determine if receive queue can accept another message
>> * @msg: message to be added to queue
>> * @queue_size: current size of queue
>> - * @base: nominal maximum size of queue
>> *
>> - * Returns 1 if queue is unable to accept message, 0 otherwise
>> + * Returns true if queue is unable to accept message, false otherwise
>> */
>> -static int rx_queue_full(struct tipc_msg *msg, u32 queue_size, u32 base)
>> +static bool rx_queue_full(struct tipc_msg *msg, u32 queue_size)
>> {
>> - u32 threshold;
>> u32 imp = msg_importance(msg);
>>
>> - if (imp == TIPC_LOW_IMPORTANCE)
>> - threshold = base;
>> - else if (imp == TIPC_MEDIUM_IMPORTANCE)
>> - threshold = base * 2;
>> - else if (imp == TIPC_HIGH_IMPORTANCE)
>> - threshold = base * 100;
>> - else
>> - return 0;
>> + if (unlikely(imp > TIPC_CRITICAL_IMPORTANCE))
>> + return true;
>
> This test is not necessary. Such messages have already been filtered out
> in tipc_recv_msg() at link level.
> The test msg_isdata(), which determines if a message should be sent up to
> the port/socket level, is also an implicit test that
> importance < TIPC_CRITICAL_IMPORTANCE.
(importance <= TIPC_CRITICAL_IMPORTANCE), of course.
This is an effect of the co-location of the user and importance fields in the
TIPC header. I.e., the importance is in reality coded into the value of
the user field.
To clarify (and improve) my previous suggestion, what I had in mind
was something like this:
recv_q_len = skb_queue_len(&sk->sk_receive_queue);
- if (unlikely(recv_q_len >= (OVERLOAD_LIMIT_BASE / 2))) {
- if (rx_queue_full(msg, recv_q_len,
- OVERLOAD_LIMIT_BASE / 2))
- return TIPC_ERR_OVERLOAD;
- }
+ imp = msg_importance(msg);
+ if (unlikely(recv_q_len > (OVERLOAD_LIMIT_BASE << (imp << 1))))
+ return TIPC_ERR_OVERLOAD;
} else {
if (msg_mcast(msg))
return TIPC_ERR_NO_PORT;
No need for any separate function at all, and guaranteed inline.
This will probably translate to one single shift-instruction extra,
and no memor access, since the compiler merge (imp << 1) with the
shift operation used to read imp from the header.
The limits for message rejection become the following,
given an OVERLOAD_LIMIT_BASE of 10000:
TIP_LOW_IMPORTANCE: 10000 (previously 10000)
TIPC_MEDIUM_IMPORTANCE 40000 (previously 20000)
TIPC_MEDIUM_IMPORTANCE 1600000 (previously 1000000)
TIPC_CRITICAL_IMPORTANCE 25600000 (previously no limit)
///jon
>
>
>>
>> - if (msg_connected(msg))
>> - threshold *= 4;
>> -
>> - return queue_size >= threshold;
>> + return queue_size >= msg_importance_factor[imp];
>
> Ok. Less optimal than my suggestion, but also lower risk until we know
> the consequences of changing the multiplication factors.
>
>> }
>>
>> /**
>> @@ -1275,7 +1271,6 @@ static u32 filter_rcv(struct sock *sk, struct sk_buff *buf)
>> {
>> struct socket *sock = sk->sk_socket;
>> struct tipc_msg *msg = buf_msg(buf);
>> - u32 recv_q_len;
>> u32 res = TIPC_OK;
>>
>> /* Reject message if it is wrong sort of message for socket */
>> @@ -1285,19 +1280,18 @@ static u32 filter_rcv(struct sock *sk, struct sk_buff *buf)
>> if (sock->state == SS_READY) {
>> if (msg_connected(msg))
>> return TIPC_ERR_NO_PORT;
>> + /* Reject SOCK_DGRAM and SOCK_RDM message if there isn't room
>> + * to queue it
>> + */
>> + if (unlikely(rx_queue_full(msg,
>> + skb_queue_len(&sk->sk_receive_queue))))
>> + return TIPC_ERR_OVERLOAD;
>> } else {
>> res = filter_connect(tipc_sk(sk), &buf);
>> if (res != TIPC_OK || buf == NULL)
>> return res;
>> }
>>
>> - /* Reject message if there isn't room to queue it */
>> - recv_q_len = skb_queue_len(&sk->sk_receive_queue);
>> - if (unlikely(recv_q_len >= (OVERLOAD_LIMIT_BASE / 2))) {
>> - if (rx_queue_full(msg, recv_q_len, OVERLOAD_LIMIT_BASE / 2))
>> - return TIPC_ERR_OVERLOAD;
>> - }
>> -
>> /* Enqueue message (finally!) */
>> TIPC_SKB_CB(buf)->handle = 0;
>> __skb_queue_tail(&sk->sk_receive_queue, buf);
>>
>
> --
> To unsubscribe from this list: send the line "unsubscribe netdev" in
> the body of a message to majordomo@...r.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists