; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005360 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005360
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionbHLH-MYC_N domain-containing protein
Genome locationchr6:15230245..15231802
RNA-Seq ExpressionLag0005360
SyntenyLag0005360
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR025610 - Transcription factor MYC/MYB N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2707472.1 hypothetical protein I3760_05G149000 [Carya illinoinensis]9.1e-13371.31Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSS------DC-------------
        MEEHLS LAVTHLLQHTLRS+CIHENS+WVYAVFWRILPRNYPPPKWD QGAYDRSRGNRRNWILVWEDGFCNF ASS+      DC             
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSS------DC-------------

Query:  -QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVF
         QGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWI+KEPNDQ +NLLS+WHNSAD+ PRTWEAQFQ+GIKTI LIAVREGV+QLGA HKV+EDLS VV 
Subjt:  -QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVF

Query:  LRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQSPSQFLAA
        LRKKF+YIESIPGVLLPHPSSS+      +PFK+DG+   + W   G +A P    T LYD  N  L+ITPSMSSLEALLSKLPSVVPP++   SQF+++
Subjt:  LRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQSPSQFLAA

Query:  QQRPLEFISMEKLAKEEIEEEFYGPE----TSSSSMPAY-RYQNAHSNTTNN
         QRPLEFI MEK+AK+EI+EE Y PE     SSSS+ AY R QN H N   N
Subjt:  QQRPLEFISMEKLAKEEIEEEFYGPE----TSSSSMPAY-RYQNAHSNTTNN

OMO78151.1 hypothetical protein CCACVL1_14613 [Corchorus capsularis]4.8e-13472.24Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------
        MEEHLSPLAVTHLLQHTLRS+CIHENS+WVYAVFWRILPRNYPPPKWD QGAYDRSRGNRRNWILVWEDGFCNFAAS     S DC              
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------

Query:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL
        QGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWIYKEPN+Q IN LS+WHNSAD+ PRTWEAQFQ GIKTIALIAVREGVVQLGA HKV+EDLS VV L
Subjt:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL

Query:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPP------SSQSPS
        RKKF+YIESIPGVLLPHPSSS+      YP+KVDG+G  + W FP G   PP +    YDH N  ++ITPSMSSLEALLSKLPSVVPP       SQ  S
Subjt:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPP------SSQSPS

Query:  QFL-AAQQRPLEFIS-MEKLAKEEIEEEFYGPE-----TSSSSMPAYRYQNAH
        QFL ++ QRP+E+++ MEK+AKEEI+EE Y PE      SSSS+ AYR Q  H
Subjt:  QFL-AAQQRPLEFIS-MEKLAKEEIEEEFYGPE-----TSSSSMPAYRYQNAH

XP_007044586.2 PREDICTED: uncharacterized protein LOC18609421 isoform X2 [Theobroma cacao]4.1e-13373.07Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------
        MEEHLSPLAVTHLLQHTLRS+CIHENS+WVYAVFWRILPRNYPPPKWD Q AYDRSRGNRRNWILVWEDGFCNFAAS     S DC              
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------

Query:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL
        QGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWIYKEPNDQ IN LS+WHNSAD+ PRTWEAQFQ GIKTIALIAVREGVVQLGA +KV+EDLS VV L
Subjt:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL

Query:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQ--------S
        RKKF+YIESIPGVLLPHPSSS+      YPFKVDG+G  + W FP  +A P    T  YDH N  ++ITPSMSSLEALLSKLPSVVPP S         S
Subjt:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQ--------S

Query:  PSQFLAAQQRPLEFISMEKLAKEEIEEEFYGPE----TSSSSMPAYRYQ
         SQFL++ QRP+E+I MEK+AKEEI+EE Y PE     SSSS+ AYR Q
Subjt:  PSQFLAAQQRPLEFISMEKLAKEEIEEEFYGPE----TSSSSMPAYRYQ

XP_022159131.1 protein RICE SALT SENSITIVE 3 isoform X1 [Momordica charantia]8.2e-14279.65Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPP-KWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSD-------------------
        MEEHLSPLAVTHLLQHTLRS+CIHENS WVYAVFWRILPRNYPPP +WD Q AYDRSRGNRRNWIL WEDGFCNFAASS+                    
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPP-KWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSD-------------------

Query:  -----CQGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLL-SSWHNS-ADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVE
             CQGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWI KEPNDQ IN L SSWH+S ADTQPRTWEAQFQ+GIKTIALIAVREGVVQLGA HKVVE
Subjt:  -----CQGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLL-SSWHNS-ADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVE

Query:  DLSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQS
        DLS VVFLRKKFNYIESIPGVLLPHP  SSSSSSSLYPFKVDGFG SDIWQF GGV NPP     LYDH NHQ RITPSMSSLEALLSKLPSVVPPSSQS
Subjt:  DLSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQS

Query:  PSQFLAAQ-QRPLEFISMEKLAKEEIEEEFYGPETSSSS
        PSQFL  Q QRPLEFISMEK+AKEEI+    GP+   +S
Subjt:  PSQFLAAQ-QRPLEFISMEKLAKEEIEEEFYGPETSSSS

XP_022159132.1 protein RICE SALT SENSITIVE 3 isoform X2 [Momordica charantia]1.5e-14380.18Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSD--------------------
        MEEHLSPLAVTHLLQHTLRS+CIHENS WVYAVFWRILPRNYPPPKWD Q AYDRSRGNRRNWIL WEDGFCNFAASS+                     
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSD--------------------

Query:  ----CQGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLL-SSWHNS-ADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVED
            CQGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWI KEPNDQ IN L SSWH+S ADTQPRTWEAQFQ+GIKTIALIAVREGVVQLGA HKVVED
Subjt:  ----CQGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLL-SSWHNS-ADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVED

Query:  LSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQSP
        LS VVFLRKKFNYIESIPGVLLPHP  SSSSSSSLYPFKVDGFG SDIWQF GGV NPP     LYDH NHQ RITPSMSSLEALLSKLPSVVPPSSQSP
Subjt:  LSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQSP

Query:  SQFLAAQ-QRPLEFISMEKLAKEEIEEEFYGPETSSSS
        SQFL  Q QRPLEFISMEK+AKEEI+    GP+   +S
Subjt:  SQFLAAQ-QRPLEFISMEKLAKEEIEEEFYGPETSSSS

TrEMBL top hitse value%identityAlignment
A0A061E6Z5 Serine/threonine-protein kinase WNK-related2.2e-13272.41Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------
        MEEHLSPLAVTHLLQHTLRS+CIHENS+WVYAVFWRILPRNYPPPKWD Q AYDRSRGNRRNWILVWEDGFCNFAAS     S DC              
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------

Query:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL
        QGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWIYKEPNDQ IN LS+WHNSAD+ PRTWEAQFQ GIKTIALIAVREGVVQLGA +KV+EDLS VV L
Subjt:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL

Query:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQ--------S
        RKKF+YIESIPGVLLPHPSSS+      YPFKVDG+G  + W FP  +A P    T  YDH N  ++ITPSMSSLEALLSKLPSVVPP S         S
Subjt:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQ--------S

Query:  PSQFLAAQQRPLEFISMEKLAKEEIEEEFYGPE---TSSSSMPAYRYQ
         SQFL++ QRP+E+I MEK+AKEEI+EE    +    SSSS+ AYR Q
Subjt:  PSQFLAAQQRPLEFISMEKLAKEEIEEEFYGPE---TSSSSMPAYRYQ

A0A1R3I6L5 bHLH-MYC_N domain-containing protein2.3e-13472.24Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------
        MEEHLSPLAVTHLLQHTLRS+CIHENS+WVYAVFWRILPRNYPPPKWD QGAYDRSRGNRRNWILVWEDGFCNFAAS     S DC              
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------

Query:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL
        QGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWIYKEPN+Q IN LS+WHNSAD+ PRTWEAQFQ GIKTIALIAVREGVVQLGA HKV+EDLS VV L
Subjt:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL

Query:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPP------SSQSPS
        RKKF+YIESIPGVLLPHPSSS+      YP+KVDG+G  + W FP G   PP +    YDH N  ++ITPSMSSLEALLSKLPSVVPP       SQ  S
Subjt:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPP------SSQSPS

Query:  QFL-AAQQRPLEFIS-MEKLAKEEIEEEFYGPE-----TSSSSMPAYRYQNAH
        QFL ++ QRP+E+++ MEK+AKEEI+EE Y PE      SSSS+ AYR Q  H
Subjt:  QFL-AAQQRPLEFIS-MEKLAKEEIEEEFYGPE-----TSSSSMPAYRYQNAH

A0A6J1B125 uncharacterized protein LOC1104231862.2e-13272.41Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------
        MEEHLSPLAVTHLLQHTLRS+CIHENS+WVYAVFWRILPRNYPPPKWD Q AYDRSRGNRRNWILVWEDGFCNFAAS     S DC              
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAAS-----SSDC--------------

Query:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL
        QGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWIYKEPNDQ IN LS+WHNSAD+ PRTWEAQFQ GIKTIALIAVREGVVQLGA +KV+EDLS VV L
Subjt:  QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFL

Query:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQ--------S
        RKKF+YIESIPGVLLPHPSSS+      YPFKVDG+G  + W FP  +A P    T  YDH N  ++ITPSMSSLEALLSKLPSVVPP S         S
Subjt:  RKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQ--------S

Query:  PSQFLAAQQRPLEFISMEKLAKEEIEEEFYGPE---TSSSSMPAYRYQ
         SQFL++ QRP+E+I MEK+AKEEI+EE    +    SSSS+ AYR Q
Subjt:  PSQFLAAQQRPLEFISMEKLAKEEIEEEFYGPE---TSSSSMPAYRYQ

A0A6J1DYZ3 protein RICE SALT SENSITIVE 3 isoform X27.2e-14480.18Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSD--------------------
        MEEHLSPLAVTHLLQHTLRS+CIHENS WVYAVFWRILPRNYPPPKWD Q AYDRSRGNRRNWIL WEDGFCNFAASS+                     
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSD--------------------

Query:  ----CQGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLL-SSWHNS-ADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVED
            CQGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWI KEPNDQ IN L SSWH+S ADTQPRTWEAQFQ+GIKTIALIAVREGVVQLGA HKVVED
Subjt:  ----CQGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLL-SSWHNS-ADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVED

Query:  LSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQSP
        LS VVFLRKKFNYIESIPGVLLPHP  SSSSSSSLYPFKVDGFG SDIWQF GGV NPP     LYDH NHQ RITPSMSSLEALLSKLPSVVPPSSQSP
Subjt:  LSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQSP

Query:  SQFLAAQ-QRPLEFISMEKLAKEEIEEEFYGPETSSSS
        SQFL  Q QRPLEFISMEK+AKEEI+    GP+   +S
Subjt:  SQFLAAQ-QRPLEFISMEKLAKEEIEEEFYGPETSSSS

A0A6J1E2Z9 protein RICE SALT SENSITIVE 3 isoform X14.0e-14279.65Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPP-KWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSD-------------------
        MEEHLSPLAVTHLLQHTLRS+CIHENS WVYAVFWRILPRNYPPP +WD Q AYDRSRGNRRNWIL WEDGFCNFAASS+                    
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPP-KWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSD-------------------

Query:  -----CQGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLL-SSWHNS-ADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVE
             CQGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWI KEPNDQ IN L SSWH+S ADTQPRTWEAQFQ+GIKTIALIAVREGVVQLGA HKVVE
Subjt:  -----CQGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLL-SSWHNS-ADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVE

Query:  DLSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQS
        DLS VVFLRKKFNYIESIPGVLLPHP  SSSSSSSLYPFKVDGFG SDIWQF GGV NPP     LYDH NHQ RITPSMSSLEALLSKLPSVVPPSSQS
Subjt:  DLSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQS

Query:  PSQFLAAQ-QRPLEFISMEKLAKEEIEEEFYGPETSSSS
        PSQFL  Q QRPLEFISMEK+AKEEI+    GP+   +S
Subjt:  PSQFLAAQ-QRPLEFISMEKLAKEEIEEEFYGPETSSSS

SwissProt top hitse value%identityAlignment
E3SXU5 Truncated basic helix-loop-helix protein A3.4e-0524.78Show/hide
Query:  ENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASS---------------------------------------SDCQGLQ
        ++ +W Y++FW+I P+                       ILVW DG+ N A  +                                         C  L 
Subjt:  ENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASS---------------------------------------SDCQGLQ

Query:  P------ELFFKMSHEI-YNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSV
        P      E F+ M     +  G GL GK  A R H W           L  +    + T  R   A+  N I+T+  I V +GVV++G T KV EDL+ +
Subjt:  P------ELFFKMSHEI-YNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSV

Query:  VFLRKKFNYIESIPGVLLPHPSSSSSSSSS
          +R  F    S+P    P P+ S  S+S+
Subjt:  VFLRKKFNYIESIPGVLLPHPSSSSSSSSS

K4PW38 Protein RICE SALT SENSITIVE 32.9e-3338.46Show/hide
Query:  LQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFC--NFAASSSDCQGLQP--ELFFKMSHEIYNYGEGLIGKVAA
        L   LR++C+  NS+W Y+VFW I PR   P      G   +   +  + +L+WEDGFC    A    D  G  P  + F KMS ++YNYGEGL+GKVA+
Subjt:  LQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFC--NFAASSSDCQGLQP--ELFFKMSHEIYNYGEGLIGKVAA

Query:  DRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFLRKKFNYIESIPGVLLPHPSSSS---SSS
        D+ HKW++KEP++   N+ + W +S D  P  W  QF +GI+TIA+I    G++QLG+   + EDL  V+ +R  F  +    G  L    SSS   S S
Subjt:  DRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFLRKKFNYIESIPGVLLPHPSSSS---SSS

Query:  SSLYPFKVDGFGGSDIWQFPG
         S +P K        ++ +PG
Subjt:  SSLYPFKVDGFGGSDIWQFPG

P0C7P8 Transcription factor EMB14442.7e-1532.28Show/hide
Query:  HLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDCQGLQPELFF--------------KMSHEIY
        + LQ  LRS+C   N++W YAVFW++   N+  P                  +L  ED +C      +  +GL PE                 KMS+ ++
Subjt:  HLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDCQGLQPELFF--------------KMSHEIY

Query:  NYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVRE-GVVQLGATHKVVEDLSSVVFLRKKF
        + GEG++G+VA    H+WI+ E        L+  H++       WE+Q   GIKTI ++AV   GVVQLG+  KV ED + V  +R  F
Subjt:  NYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVRE-GVVQLGATHKVVEDLSSVVFLRKKF

Q58G01 Transcription factor bHLH1551.2e-1533.71Show/hide
Query:  QHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDCQGLQPEL---FFKMSHEIYNYGEGLIGKVAADR
        Q  L+S C   N++W YAVFW++                   RG+R   +L  ED +  +    ++  G    L     KMS+ +Y+ GEG++G+VA   
Subjt:  QHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDCQGLQPEL---FFKMSHEIYNYGEGLIGKVAADR

Query:  SHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVRE-GVVQLGATHKVVEDLSSVVFLRKKF
         H+W++ E N    N    +HN        WE+Q   GIKTI ++AV   GVVQLG+  KV ED++ V  +R  F
Subjt:  SHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVRE-GVVQLGATHKVVEDLSSVVFLRKKF

Q9XIN0 Transcription factor LHW7.0e-1128.71Show/hide
Query:  LLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSS---------DCQGLQPELF----FKMSHEIYNY
        LL+  LRSMC+  N++W YAVFW+I                    G + + +L+WE+ +    +SS+         D QG +          +++ I   
Subjt:  LLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSS---------DCQGLQPELF----FKMSHEIYNY

Query:  GEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQP----RTWEAQFQNGIKTIALI-AVREGVVQLGATHKVVEDLSSVVFLRKKFNYIESIPGV
        GEGL+G+ A    H+WI          L +S++   D  P         QF  GI+T+A+   V  GVVQLG++  ++E+L  V  ++     +  +PG 
Subjt:  GEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQP----RTWEAQFQNGIKTIALI-AVREGVVQLGATHKVVEDLSSVVFLRKKFNYIESIPGV

Query:  LL
        LL
Subjt:  LL

Arabidopsis top hitse value%identityAlignment
AT1G60060.1 Serine/threonine-protein kinase WNK (With No Lysine)-related3.3e-11760.27Show/hide
Query:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDC-------------------
        MEEHL+PLAVTHLLQHTLRS+CIHENS+WVYAVFWRILPRNYPPPKWD QGAYDRSRGNRRNWILVWEDGFCNFAAS+++                    
Subjt:  MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDC-------------------

Query:  ----QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSS
            QGLQPELFFKMSHEIYNYGEGLIGKVAAD SHKWIYKEPNDQ IN LS+WHNSAD+ PRTWEAQFQ+GIKTIALI+VREGVVQLGA HKV+EDLS 
Subjt:  ----QGLQPELFFKMSHEIYNYGEGLIGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSS

Query:  VVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNH------------------------QLRITPSM
        VV LRKK +YIESIPGVLLPHPSSS       YPF       SD W FP GVA P       + HS+H                         ++ITPSM
Subjt:  VVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGSDIWQFPGGVANPPADTTLLYDHSNH------------------------QLRITPSM

Query:  SSLEALLSKLPSVVPPSSQSPSQFLAAQQRPLEFISMEKLAKEEIEEEFYGPETSSSSMPAYRYQNAHSNTTNND
        SSLEALLSKLPSVVPP++Q P  +      P    + E++++EE  + F           +  + + ++  +NND
Subjt:  SSLEALLSKLPSVVPPSSQSPSQFLAAQQRPLEFISMEKLAKEEIEEEFYGPETSSSSMPAYRYQNAHSNTTNND

AT2G31280.1 conserved peptide upstream open reading frame 78.8e-1733.71Show/hide
Query:  QHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDCQGLQPEL---FFKMSHEIYNYGEGLIGKVAADR
        Q  L+S C   N++W YAVFW++                   RG+R   +L  ED +  +    ++  G    L     KMS+ +Y+ GEG++G+VA   
Subjt:  QHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDCQGLQPEL---FFKMSHEIYNYGEGLIGKVAADR

Query:  SHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVRE-GVVQLGATHKVVEDLSSVVFLRKKF
         H+W++ E N    N    +HN        WE+Q   GIKTI ++AV   GVVQLG+  KV ED++ V  +R  F
Subjt:  SHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVRE-GVVQLGATHKVVEDLSSVVFLRKKF

AT2G31280.3 conserved peptide upstream open reading frame 78.8e-1733.71Show/hide
Query:  QHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDCQGLQPEL---FFKMSHEIYNYGEGLIGKVAADR
        Q  L+S C   N++W YAVFW++                   RG+R   +L  ED +  +    ++  G    L     KMS+ +Y+ GEG++G+VA   
Subjt:  QHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDCQGLQPEL---FFKMSHEIYNYGEGLIGKVAADR

Query:  SHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVRE-GVVQLGATHKVVEDLSSVVFLRKKF
         H+W++ E N    N    +HN        WE+Q   GIKTI ++AV   GVVQLG+  KV ED++ V  +R  F
Subjt:  SHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVRE-GVVQLGATHKVVEDLSSVVFLRKKF

AT3G15240.2 Serine/threonine-protein kinase WNK (With No Lysine)-related1.3e-3337.81Show/hide
Query:  LQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDC----QGLQP--ELFFKMSHEIYNYGEGLIGKV
        L   LR++C+  N++W Y+VFW I PR    P+    G   +   +  + +L+WEDG+C     +  C    +G  P  + F KMS ++YNYGEGL+GKV
Subjt:  LQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDC----QGLQP--ELFFKMSHEIYNYGEGLIGKV

Query:  AADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSS
        A+D+ HKW++KE  +   N  S W +S D  P  W  QF++GI+TIA+I    G++QLG+   + EDL  V+ +R  F  +    G  L    SS+ ++S
Subjt:  AADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSS

Query:  S
        S
Subjt:  S

AT5G53900.2 Serine/threonine-protein kinase WNK (With No Lysine)-related5.5e-3539.72Show/hide
Query:  LQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRR-NWILVWEDGFCNFAAS-----SSDCQGLQPEL----FFKMSHEIYNYGEGL
        L   LRS+C   NS+W+Y+VFW I PR    P+   +G      G+   + +L+WEDGFC    S      +D +G + +L    F KMS ++YNYGEGL
Subjt:  LQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRR-NWILVWEDGFCNFAAS-----SSDCQGLQPEL----FFKMSHEIYNYGEGL

Query:  IGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFLRKKFNYIESIPGVLLPHPSSS-
        +GKVA+D+ HKW++KEP++   NL + W +S D  P  W  QF++GI+TIA+I    G++QLG+   + EDL  V+ +R+ F  I    G  L    SS 
Subjt:  IGKVAADRSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFLRKKFNYIESIPGVLLPHPSSS-

Query:  --SSSSSSLYPFKV
          ++ SSS  P ++
Subjt:  --SSSSSSLYPFKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAACATCTAAGCCCATTGGCCGTGACTCATCTTCTTCAACATACGCTGAGAAGTATGTGCATCCATGAAAACTCCGAGTGGGTTTATGCAGTGTTTTGGAGGAT
ACTTCCAAGAAACTACCCTCCACCCAAATGGGATGCTCAAGGTGCTTATGACAGATCCAGAGGGAACAGAAGAAACTGGATACTAGTCTGGGAAGATGGTTTCTGCAACT
TCGCCGCCTCATCCAGCGATTGTCAGGGACTTCAACCGGAGCTCTTCTTCAAGATGTCGCACGAGATCTACAATTATGGAGAAGGTTTAATCGGAAAAGTCGCGGCCGAC
CGTAGTCATAAGTGGATTTACAAAGAACCAAATGATCAAGTAATAAACCTGTTGTCGTCTTGGCACAACTCAGCTGACACTCAACCTAGAACTTGGGAAGCACAATTTCA
GAACGGCATAAAGACCATAGCTCTTATAGCGGTTCGGGAAGGAGTTGTTCAATTAGGAGCTACTCACAAGGTGGTTGAAGATTTGAGCTCTGTGGTGTTTTTAAGAAAGA
AATTCAACTACATAGAAAGCATTCCGGGCGTGCTTTTGCCACACCCGTCATCGTCATCGTCATCGTCGTCGTCGTTATATCCTTTCAAGGTGGATGGGTTTGGCGGCTCA
GATATATGGCAATTCCCTGGAGGAGTAGCAAACCCACCAGCTGATACAACGTTGTTGTACGACCACTCGAACCATCAATTGAGAATAACTCCCTCCATGAGCAGCCTTGA
AGCTCTCCTCTCAAAGCTACCTTCGGTGGTGCCGCCCAGCTCACAGTCACCATCTCAGTTTCTGGCAGCCCAGCAGAGGCCATTAGAATTCATATCCATGGAAAAGCTGG
CTAAGGAAGAGATTGAAGAAGAGTTTTATGGGCCTGAGACCAGCAGTAGTTCAATGCCGGCTTATCGCTATCAAAATGCACATAGCAACACAACCAACAATGATGAAGAA
TTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAACATCTAAGCCCATTGGCCGTGACTCATCTTCTTCAACATACGCTGAGAAGTATGTGCATCCATGAAAACTCCGAGTGGGTTTATGCAGTGTTTTGGAGGAT
ACTTCCAAGAAACTACCCTCCACCCAAATGGGATGCTCAAGGTGCTTATGACAGATCCAGAGGGAACAGAAGAAACTGGATACTAGTCTGGGAAGATGGTTTCTGCAACT
TCGCCGCCTCATCCAGCGATTGTCAGGGACTTCAACCGGAGCTCTTCTTCAAGATGTCGCACGAGATCTACAATTATGGAGAAGGTTTAATCGGAAAAGTCGCGGCCGAC
CGTAGTCATAAGTGGATTTACAAAGAACCAAATGATCAAGTAATAAACCTGTTGTCGTCTTGGCACAACTCAGCTGACACTCAACCTAGAACTTGGGAAGCACAATTTCA
GAACGGCATAAAGACCATAGCTCTTATAGCGGTTCGGGAAGGAGTTGTTCAATTAGGAGCTACTCACAAGGTGGTTGAAGATTTGAGCTCTGTGGTGTTTTTAAGAAAGA
AATTCAACTACATAGAAAGCATTCCGGGCGTGCTTTTGCCACACCCGTCATCGTCATCGTCATCGTCGTCGTCGTTATATCCTTTCAAGGTGGATGGGTTTGGCGGCTCA
GATATATGGCAATTCCCTGGAGGAGTAGCAAACCCACCAGCTGATACAACGTTGTTGTACGACCACTCGAACCATCAATTGAGAATAACTCCCTCCATGAGCAGCCTTGA
AGCTCTCCTCTCAAAGCTACCTTCGGTGGTGCCGCCCAGCTCACAGTCACCATCTCAGTTTCTGGCAGCCCAGCAGAGGCCATTAGAATTCATATCCATGGAAAAGCTGG
CTAAGGAAGAGATTGAAGAAGAGTTTTATGGGCCTGAGACCAGCAGTAGTTCAATGCCGGCTTATCGCTATCAAAATGCACATAGCAACACAACCAACAATGATGAAGAA
TTTTAG
Protein sequenceShow/hide protein sequence
MEEHLSPLAVTHLLQHTLRSMCIHENSEWVYAVFWRILPRNYPPPKWDAQGAYDRSRGNRRNWILVWEDGFCNFAASSSDCQGLQPELFFKMSHEIYNYGEGLIGKVAAD
RSHKWIYKEPNDQVINLLSSWHNSADTQPRTWEAQFQNGIKTIALIAVREGVVQLGATHKVVEDLSSVVFLRKKFNYIESIPGVLLPHPSSSSSSSSSLYPFKVDGFGGS
DIWQFPGGVANPPADTTLLYDHSNHQLRITPSMSSLEALLSKLPSVVPPSSQSPSQFLAAQQRPLEFISMEKLAKEEIEEEFYGPETSSSSMPAYRYQNAHSNTTNNDEE
F