; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G017750 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G017750
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationchr09:26497321..26499757
RNA-Seq ExpressionLsi09G017750
SyntenyLsi09G017750
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039868.1 UPF0481 protein [Cucumis melo var. makuwa]8.4e-9750.23Show/hide
Query:  MGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVES
        MG VVNEV ELL+K+E+++  EE    Q +     K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV+S
Subjt:  MGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVES

Query:  VSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWM
        V S L+ LLGSY DNL+D WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L       +   L L  + W 
Subjt:  VSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWM

Query:  GIPKGGVITG--KYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLD
         +   G I      LHILDM+R+ +L P+I K D+ +            N+  VEC    SAT+L K GI+F++  T+NLMDVSF+FK  VL +P L +D
Subjt:  GIPKGGVITG--KYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLD

Query:  H-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWW
           T P L+N+M FEK H  +  +V SFVV M NLI  ++DVSLL SK IL + +  D   V+QLFN L+KG TK L+ H+CE+  MLN + +N W++W 
Subjt:  H-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWW

Query:  VN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
         N  LK P+T++PW IIS+A   FGF   ++Q +
Subjt:  VN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

XP_004140657.1 UPF0481 protein At3g47200 [Cucumis sativus]1.0e-9449.2Show/hide
Query:  KDMGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV
        ++MG VVNEV  LL+K+E+++  EE      LS Q  K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV
Subjt:  KDMGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV

Query:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDL-VNNNGHKDKGSLSLLIL
        +SV S L+ LLGSY DNL+D+WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L   N   K+   L  +I 
Subjt:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDL-VNNNGHKDKGSLSLLIL

Query:  YWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDD------------YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVP
         WMG+    +     LHILD++R+ +L P+  K D+              +     Y Q+ +   SAT+L K GI+F++  T+NLMDVSF+FK  VL +P
Subjt:  YWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDD------------YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVP

Query:  QLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNP
         L +D   T P L+N+M FEK H  +  +V SFVV M NLI  E+DVSLL SK IL + +  D   V+QLFN L+KG TK L++H+ E+  MLN + +N 
Subjt:  QLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNP

Query:  WTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
        W++W  N  LK P+T++ W IIS+A   FGF    +Q +
Subjt:  WTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

XP_016902479.1 PREDICTED: uncharacterized protein LOC103499077 [Cucumis melo]5.5e-9649.89Show/hide
Query:  KDMGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV
        ++MG VVNEV ELL+K+E+++  EE    Q +     K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV
Subjt:  KDMGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV

Query:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLV-----NNNGHKDKGSLS
        +SV S L+ LLGSY DNL+D WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L      N N H D   L 
Subjt:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLV-----NNNGHKDKGSLS

Query:  LLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVP
         +I  WM +    +     LHILDM+R+ +L P+I K D+ +            N+  VEC    SAT+L K GI+F++  T+NLMDVSF+FK  VL +P
Subjt:  LLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVP

Query:  QLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNP
         L +D   T P L+N+M FEK H  +  +V SFVV M NLI  +++VSLL SK IL + +  D   V+QLFN L+KG TK L+ H+CE+  MLN + +N 
Subjt:  QLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNP

Query:  WTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
        W++W  N  LK P+T++PW  IS+A   FGF   ++Q +
Subjt:  WTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

XP_023513986.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita pepo subsp. pepo]4.5e-8242.92Show/hide
Query:  GDDDEFIIKDMGKVVNEVKELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK
        G DD   I D   +V +VK  L+++ D+  +L  E  +  EL       SIYKIP  + + +PKA+EP++VS GPY++GK +L  MELEK   +  FK +
Subjt:  GDDDEFIIKDMGKVVNEVKELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK

Query:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKG
          L+ ES IV+ VS+ L +L+ SYD LE++W ++P KFL+LMIVDGCFML  +IN CP SL ++  +IK+DMLLLENQLP  LL+ LY +   N    + 
Subjt:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKG

Query:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLD
         L  L+  W+ IP+  V+    LHIL+M++  +L+P I++T D++        +++V    AT+L + GI+FK   TD+L DV F+ K GVL +PQL +D
Subjt:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLD

Query:  HTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWV
          T   LLN+M FEK H + G  V SFV+ M+NLI+ E DV++L  +K+L N +  D++    LFN+L  G    LDTH+  V   +N+HCN PW     
Subjt:  HTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWV

Query:  NLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY
         LK    Q PW IIS+ A +FGF++ I+Q +Y     Y
Subjt:  NLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY

XP_038875622.1 UPF0481 protein At3g47200-like [Benincasa hispida]6.7e-9448.25Show/hide
Query:  VVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSS
        VV +VK  LNK+ D+   +E+  + E +    + SIYKIPE +R++  KAFEPQLVS GPYH+GK +L+ ME EKQ  + RF     L  ES IVES+S+
Subjt:  VVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSS

Query:  TLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWMGIPKG
         L+ L G+YD L++KWKQ+ AKFL++MIVDGCFMLD     CP SL++M  +IKRDMLLLENQLP  LLQ LYD + N   +D+ SL  LI   M I + 
Subjt:  TLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWMGIPKG

Query:  GVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTLPNLLNMMT
         V+ G+ LHILDM+RA +LYP +++  D + +++  Y Q++ EC     AT+L   GI+FK   T NL DVSFN K GVL +P + +D  T   LLN+M 
Subjt:  GVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTLPNLLNMMT

Query:  FEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNLKDPRTQDPWV
        FEK + + GS+V SFV+ M NLI+ + DV+LL S KIL N L  D      LF+ L KG    LD+HI +V   +NKHC+  W  W  +LK    Q+PW 
Subjt:  FEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNLKDPRTQDPWV

Query:  IISVAAVLFGFLLQIIQTVYGIYGIYRPN
        IIS+ A +FGF + I+Q +Y I   +R N
Subjt:  IISVAAVLFGFLLQIIQTVYGIYGIYRPN

TrEMBL top hitse value%identityAlignment
A0A0A0KAA3 Uncharacterized protein5.3e-8948.05Show/hide
Query:  KDMGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV
        ++MG VVNEV  LL+K+E+++  EE      LS Q  K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV
Subjt:  KDMGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV

Query:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDL-VNNNGHKDKGSLSLLIL
        +SV S L+ LLGSY DNL+D+WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L   N   K+   L  +I 
Subjt:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDL-VNNNGHKDKGSLSLLIL

Query:  YWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDD------------YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVP
         WMG+    +     LHILD++R+ +L P+  K D+              +     Y Q+ +   SAT+L K GI+F++  T+NLMDVSF+FK  VL +P
Subjt:  YWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDD------------YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVP

Query:  QLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNP
         L +D   T P L+N+M FEK H  +  +V SFVV M NLI  E+DVSLL SK IL + +  D   V+QLFN L+KG TK L++HI  + N+        
Subjt:  QLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNP

Query:  WTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTV
               LK P+T++ W IIS+A   FGF    +Q +
Subjt:  WTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTV

A0A1S4E2L7 uncharacterized protein LOC1034990772.6e-9649.89Show/hide
Query:  KDMGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV
        ++MG VVNEV ELL+K+E+++  EE    Q +     K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV
Subjt:  KDMGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV

Query:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLV-----NNNGHKDKGSLS
        +SV S L+ LLGSY DNL+D WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L      N N H D   L 
Subjt:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLV-----NNNGHKDKGSLS

Query:  LLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVP
         +I  WM +    +     LHILDM+R+ +L P+I K D+ +            N+  VEC    SAT+L K GI+F++  T+NLMDVSF+FK  VL +P
Subjt:  LLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVP

Query:  QLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNP
         L +D   T P L+N+M FEK H  +  +V SFVV M NLI  +++VSLL SK IL + +  D   V+QLFN L+KG TK L+ H+CE+  MLN + +N 
Subjt:  QLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNP

Query:  WTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
        W++W  N  LK P+T++PW  IS+A   FGF   ++Q +
Subjt:  WTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

A0A5A7TA99 UPF0481 protein4.1e-9750.23Show/hide
Query:  MGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVES
        MG VVNEV ELL+K+E+++  EE    Q +     K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV+S
Subjt:  MGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVES

Query:  VSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWM
        V S L+ LLGSY DNL+D WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L       +   L L  + W 
Subjt:  VSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWM

Query:  GIPKGGVITG--KYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLD
         +   G I      LHILDM+R+ +L P+I K D+ +            N+  VEC    SAT+L K GI+F++  T+NLMDVSF+FK  VL +P L +D
Subjt:  GIPKGGVITG--KYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLD

Query:  H-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWW
           T P L+N+M FEK H  +  +V SFVV M NLI  ++DVSLL SK IL + +  D   V+QLFN L+KG TK L+ H+CE+  MLN + +N W++W 
Subjt:  H-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWW

Query:  VN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
         N  LK P+T++PW IIS+A   FGF   ++Q +
Subjt:  VN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

A0A6J1HB25 UPF0481 protein At3g47200-like3.7e-8242.86Show/hide
Query:  GDDDEFIIKDMGKVVNEVKELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK
        G DD   I D   +V  VK  L+++ D+  +L  E  +  EL       SIYKIP  + + +PKA+EPQ+VS GPY++GK +L  MELEK   +  FKA+
Subjt:  GDDDEFIIKDMGKVVNEVKELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK

Query:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKG
          L+ ES IV  VS+ L +L+ SYD LE+ WK++P KFL+LMIVDGCFML  +IN CP SL ++  +IK+DMLLLENQLP  LL+ LY + + N    + 
Subjt:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKG

Query:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQL
         L  L+  W+ IP+  V+    LHIL+M++  +L+P I++TD            ++ EC     AT+L + GI+FK   T +L DV F+ K GVL +P+L
Subjt:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQL

Query:  KLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTM
         +D  T   LLN+M FEK H + G +V SFV+ M+NLI+ E DV++L  +++L N +  D++    LFN+L  G    LDTH+  V   +N+HCN PW  
Subjt:  KLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTM

Query:  WWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY
            LK    Q PW IIS+ A +FGF++ I+Q +Y     Y
Subjt:  WWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY

A0A6J1KYV8 UPF0481 protein At3g47200-like isoform X15.4e-8142.4Show/hide
Query:  GDDDEFIIKDMGKVVNEVKELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK
        G DD   I D   +V  VK  L+++ D+  +L  E  +  EL       SIYKIP  + + +PKA+EPQ+VS GPY++GK +L  MELEK   +  FK +
Subjt:  GDDDEFIIKDMGKVVNEVKELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK

Query:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKG
          L+ ES IV  VS+ L +L+ SYD LE++W Q+P KFL+LMIVDGCFML  +I+ CP SL ++  +IK+DMLLLENQLP  LL+ LY +   N    + 
Subjt:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKG

Query:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQL
            L+  W+ IP+  V+    LHIL+M++  +LYP I++ D          + ++ EC     AT+L + GI+FK   T++L DV F+ K GVL +PQL
Subjt:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQL

Query:  KLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTM
         +D  T   +LN+M FEK H + G +V SFV+ M+NLI+ E DV++L  +KIL N +  D++    LF++L  G    LD+H+  V  M+N HCN PW  
Subjt:  KLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTM

Query:  WWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY
            LK    Q PW IIS+ A +FGF++ I+Q +Y     Y
Subjt:  WWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.2e-0525Show/hide
Query:  NVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK-NNLEFESII--VESVSSTLKQLLGSYDN
        NV K    +++E   ++   SI+ +P+ +   +P ++ P  VS GPYH  KP L EME  K     + + + N+  F  ++  ++S+   ++     Y  
Subjt:  NVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK-NNLEFESII--VESVSSTLKQLLGSYDN

Query:  LEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMI-----GNIKRDMLLLENQLPFNLLQ
               N    L +M VD  F+++ +     + + ++I       I RD++++ENQ+P  +L+
Subjt:  LEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMI-----GNIKRDMLLLENQLPFNLLQ

Q9SD53 UPF0481 protein At3g472005.8e-3228.18Show/hide
Query:  SRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRF--KAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLK
        S  K    I+++PE    +NPKA++P++VS GPYHYG+ +L  ++  K      F  +AK     E+++V++V            +LEDK +++ ++ LK
Subjt:  SRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRF--KAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLK

Query:  -------LMIVDGCF--MLDLIINGCPQ-------SLASMIGNIKRDMLLLENQLPFNLLQHLY---------DLVNNNGHKDKGSLSLLILYWMGIPKG
               +M++DGCF  M+ LI++G  +       S+  ++ +I+ D+LLLENQ+PF +LQ LY         DL     H  K  +     YW      
Subjt:  -------LMIVDGCF--MLDLIINGCPQ-------SLASMIGNIKRDMLLLENQLPFNLLQHLY---------DLVNNNGHKDKGSLSLLILYWMGIPKG

Query:  GVITGKYLHILDMHRALMLYPSIEKTDDYNS-------RSRRRYNQTEVECG------SATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHT
             K  H+LD+ R   L P+  ++D  +S          +  N   V+        SA RL   GI+F+   +     ++   K   L +PQL+ D  
Subjt:  GVITGKYLHILDMHRALMLYPSIEKTDDYNS-------RSRRRYNQTEVECG------SATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHT

Query:  TLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDT-HICEVLNMLNKHCNNPWTMWWVN
             LN + FE+F+    +E+ +++VFM  L+  EEDV+ L + K++          V + F  +SK     +DT ++  V   +N++    +   W  
Subjt:  TLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDT-HICEVLNMLNKHCNNPWTMWWVN

Query:  LKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGI
         +    + PW  +S  AVLF  LL ++Q+   I
Subjt:  LKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGI

Arabidopsis top hitse value%identityAlignment
AT3G47250.1 Plant protein of unknown function (DUF247)6.3e-3427.06Show/hide
Query:  LLNKVEDNVLKEELRKMQE------LSRQKGKAS--IYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRF---KAKNNLEFESIIVE
        L  + E +V+    R +Q       L    GKAS  I++IP+ + EVNPKA++P++VS GPYHYG+ N L+M  + +  +L     +A      E+++  
Subjt:  LLNKVEDNVLKEELRKMQE------LSRQKGKAS--IYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRF---KAKNNLEFESIIVE

Query:  SVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFML----------DLIINGCP-QSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKG
        +V +   ++  SY    ++ +   ++ + +MI+DGCF+L          DL +N  P  ++  ++ +I+ D+LLLENQ+PF +L+ ++D    +G    G
Subjt:  SVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFML----------DLIINGCP-QSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKG

Query:  SLSLLI--LYWMGIPKGGVITGKY-----LHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVECG-------------SATRLGKLGIEFKEIITDNL
         L+ +    + + I K      K+      H+LD++R   + PS+    +  +    ++  ++ + G             SA RL   GI+F+ + +D  
Subjt:  SLSLLI--LYWMGIPKGGVITGKY-----LHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVECG-------------SATRLGKLGIEFKEIITDNL

Query:  MDVSFNFKTGVLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLIS-KKILRNELDQDRKVVLQLFNQLSKGPTKCLDT-
          +    K   L +P L+LD       LN + FE+F+ +  +++ S+VVFM  L+  +ED + L + K+I+ N    + + V Q F  + K      DT 
Subjt:  MDVSFNFKTGVLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLIS-KKILRNELDQDRKVVLQLFNQLSKGPTKCLDT-

Query:  --HICEVLNMLNKHCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY
          ++  V   +N++ +  +   W   +    + PW  +S  AV+F  LL + Q  Y I   Y
Subjt:  --HICEVLNMLNKHCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY

AT3G50120.1 Plant protein of unknown function (DUF247)1.6e-3726.71Show/hide
Query:  GKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDG
        GK  IY++P  ++E + K++ PQ VS GPYH+GK  L  M+  K     R   + N   + + ++++    ++    Y   E     +  +F++++++DG
Subjt:  GKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDG

Query:  CFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWMGI-PKGGVITGK-----------
        CF+L+L    + G  +          ++   + +I+RDM++LENQLP  +L  L +L     ++      L I ++  + P    +T             
Subjt:  CFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWMGI-PKGGVITGK-----------

Query:  -----------YLHILDMHRALMLYPS-------IEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTT
                    LH LD+ R  +L  S         K    N+R   +  Q  + C   T L + GI+F+   TD   D+   FK G L +P+L +   T
Subjt:  -----------YLHILDMHRALMLYPS-------IEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTT

Query:  LPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNKHCNNPWTMWWVNL
            LN++ FE+ H    +++ S+++FM NLI++ EDVS L    I+ + L  D +V   LFN+L +       D+++  +   +N++ ++ W  W   L
Subjt:  LPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNKHCNNPWTMWWVNL

Query:  KDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRP
        K     +PW I+S  A +   +L   Q+ Y +Y  Y+P
Subjt:  KDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRP

AT3G50150.1 Plant protein of unknown function (DUF247)2.6e-3526.91Show/hide
Query:  KASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQN--GYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVD
        K  IY++P  ++E + K++ PQ VS GPYH+GK +L  ME  K      +  + K+N+E   + ++++    ++    Y    D   +N  +F +++++D
Subjt:  KASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQN--GYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVD

Query:  GCFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWMGI-PKGGVITGKY---------
        GCF+L+L    I G  +          +   ++ +I+RDM++LENQLP  +L  L  L     ++      + + ++  + P   V+T            
Subjt:  GCFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWMGI-PKGGVITGKY---------

Query:  --------LHILDM-HRALMLYPSIEKTDD---YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTLPNLLNM
                LH LD+ HR+L+   S E T+    Y   S     Q  + C   T L   G+ F    T  L D+   FK G L +P+L +   T     N+
Subjt:  --------LHILDM-HRALMLYPSIEKTDD---YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTLPNLLNM

Query:  MTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNKHCNNPWTMWWVNLKDPRTQD
        + FE+ H Q  + + S+++FM NLI + +DVS L    I+ + L  D +V   LFN+L K       D ++ ++   +N++ +  W      L+     +
Subjt:  MTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNKHCNNPWTMWWVNLKDPRTQD

Query:  PWVIISVAAVLFGFLLQIIQTVYGIYGIYRP
        PW   S +A +    L   Q+ + +Y  Y+P
Subjt:  PWVIISVAAVLFGFLLQIIQTVYGIYGIYRP

AT3G50170.1 Plant protein of unknown function (DUF247)9.5e-3827.84Show/hide
Query:  GKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDN----LEDKWK---QNPA---
        GK  IY++P  ++E + K++ PQ VS GPYH+GK  L  ME  K      ++A N           V   LKQ +  Y N    LE+K +   + P    
Subjt:  GKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDN----LEDKWK---QNPA---

Query:  --KFLKLMIVDGCFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWMGI-PKGGVITG
          +F +++++DGCF+L+L    + G  +          ++  ++ +I+RDM++LENQLP  +L  L +L     ++      + + ++  + P G  +T 
Subjt:  --KFLKLMIVDGCFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILYWMGI-PKGGVITG

Query:  K--------------------YLHILDMHRALMLYP-------SIEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLS
                              LH LD+ R  +L         S+ K    N+R   +  Q  V C   T L + G++F++  TD   D+   FK G L 
Subjt:  K--------------------YLHILDMHRALMLYP-------SIEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLS

Query:  VPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNKHCN
        +P+L +   T     N++ FE+ H +  + + S+++FM NLI + EDVS L    I+ + L  D +V   LFN+L +       D+H+  +   +N++ N
Subjt:  VPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNKHCN

Query:  NPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRPN
          W +    L      +PW   S +A +   LL + Q+ Y +Y  Y+PN
Subjt:  NPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRPN

AT4G31980.1 unknown protein2.2e-3428.05Show/hide
Query:  ELLNKVEDNVLKEELR-KMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQL
        E +N+ E + L + ++ K+  LS    K  IYK+P ++R +NP A+ P+LVSFGP H GK  L  ME +K    L F  + N   E ++   ++ T +Q 
Subjt:  ELLNKVEDNVLKEELR-KMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQL

Query:  LGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSL----------ASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLIL---
          +     +  K +  +F+++++VDG F+++L++      L          + MI ++ RDM+L+ENQLPF +++ ++ L+ N  +  +G+ S++ L   
Subjt:  LGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSL----------ASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLIL---

Query:  ---YWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVE-CGSATRLGKLGIEFKEIITDN-LMDVSFNFKTGVLSVPQLKLDHT
           Y++          +  H +D+ R+  L     K +         Y   +V+    AT L   G+ FK   T + L+D+S  F  GVL +P + +D  
Subjt:  ---YWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVE-CGSATRLGKLGIEFKEIITDN-LMDVSFNFKTGVLSVPQLKLDHT

Query:  TLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNL
        T     N++ FE+         + +++ +   I++  D  LLI   I+ N L      V  LFN +SK        +   +   L  +CN PW  W   L
Subjt:  TLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNL

Query:  KDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGI
        +     +PW + SV A L   LL  IQ+V  I  +
Subjt:  KDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATATGATGCAGTCAACAGTAGGCAGCGAATTCTTTGGTGGAGACGATGATGAGTTTATTATTAAAGATATGGGTAAGGTTGTAAATGAGGTGAAAGAGCTTCTGAA
TAAAGTTGAGGATAATGTTTTGAAGGAGGAGTTACGTAAGATGCAAGAATTATCAAGGCAAAAGGGGAAAGCTTCAATCTACAAAATACCAGAAGAAATAAGAGAAGTTA
ATCCCAAAGCTTTCGAGCCACAGTTGGTGTCTTTTGGGCCATACCATTATGGGAAACCCAATTTGCTTGAAATGGAATTGGAAAAACAGAATGGGTATCTCCGTTTTAAA
GCTAAAAACAATTTAGAATTTGAGTCCATTATTGTGGAGAGTGTGTCGAGCACTTTGAAACAGCTGTTGGGATCTTATGATAATCTTGAGGATAAATGGAAGCAAAATCC
TGCTAAGTTCTTGAAGCTCATGATTGTGGATGGCTGTTTCATGCTGGATTTGATAATCAATGGATGTCCTCAATCGCTAGCTTCTATGATTGGGAATATAAAGCGGGACA
TGCTGTTGCTTGAGAATCAACTACCCTTCAACCTACTTCAACACCTCTATGATTTAGTGAACAACAACGGCCATAAGGATAAAGGGAGTCTGAGCTTGCTGATTTTGTAC
TGGATGGGGATACCGAAGGGAGGGGTAATAACGGGAAAATACTTGCACATTTTGGATATGCACAGAGCTTTAATGTTGTATCCTTCAATAGAAAAGACAGATGATTACAA
TAGTAGAAGCCGAAGAAGGTACAACCAGACGGAGGTGGAGTGTGGATCGGCAACACGACTTGGTAAATTAGGGATAGAATTTAAGGAAATTATTACAGATAACCTTATGG
ACGTGTCCTTCAACTTCAAAACAGGAGTGCTGAGTGTGCCACAGCTGAAGCTGGATCACACCACTCTACCAAATTTATTAAATATGATGACATTTGAGAAATTCCATGAC
CAGATTGGAAGTGAAGTAATTTCATTTGTGGTGTTTATGACAAATCTAATTGAAGCCGAAGAAGATGTGTCGCTGCTAATCTCAAAGAAAATATTAAGGAATGAACTTGA
TCAAGACCGTAAAGTTGTGTTACAATTGTTCAATCAGCTGTCAAAAGGACCAACCAAATGTTTGGATACACACATTTGTGAAGTATTAAACATGTTGAATAAGCATTGTA
ATAATCCTTGGACTATGTGGTGGGTAAATCTCAAAGACCCTCGAACTCAAGATCCATGGGTTATTATCTCCGTTGCTGCCGTTCTCTTTGGTTTTCTGCTCCAAATTATC
CAAACCGTCTACGGAATTTATGGAATTTACCGCCCAAATCATTAA
mRNA sequenceShow/hide mRNA sequence
GCATTATAAGGCATTGTGGAGATGAATATGATGCAGTCAACAGTAGGCAGCGAATTCTTTGGTGGAGACGATGATGAGTTTATTATTAAAGATATGGGTAAGGTTGTAAA
TGAGGTGAAAGAGCTTCTGAATAAAGTTGAGGATAATGTTTTGAAGGAGGAGTTACGTAAGATGCAAGAATTATCAAGGCAAAAGGGGAAAGCTTCAATCTACAAAATAC
CAGAAGAAATAAGAGAAGTTAATCCCAAAGCTTTCGAGCCACAGTTGGTGTCTTTTGGGCCATACCATTATGGGAAACCCAATTTGCTTGAAATGGAATTGGAAAAACAG
AATGGGTATCTCCGTTTTAAAGCTAAAAACAATTTAGAATTTGAGTCCATTATTGTGGAGAGTGTGTCGAGCACTTTGAAACAGCTGTTGGGATCTTATGATAATCTTGA
GGATAAATGGAAGCAAAATCCTGCTAAGTTCTTGAAGCTCATGATTGTGGATGGCTGTTTCATGCTGGATTTGATAATCAATGGATGTCCTCAATCGCTAGCTTCTATGA
TTGGGAATATAAAGCGGGACATGCTGTTGCTTGAGAATCAACTACCCTTCAACCTACTTCAACACCTCTATGATTTAGTGAACAACAACGGCCATAAGGATAAAGGGAGT
CTGAGCTTGCTGATTTTGTACTGGATGGGGATACCGAAGGGAGGGGTAATAACGGGAAAATACTTGCACATTTTGGATATGCACAGAGCTTTAATGTTGTATCCTTCAAT
AGAAAAGACAGATGATTACAATAGTAGAAGCCGAAGAAGGTACAACCAGACGGAGGTGGAGTGTGGATCGGCAACACGACTTGGTAAATTAGGGATAGAATTTAAGGAAA
TTATTACAGATAACCTTATGGACGTGTCCTTCAACTTCAAAACAGGAGTGCTGAGTGTGCCACAGCTGAAGCTGGATCACACCACTCTACCAAATTTATTAAATATGATG
ACATTTGAGAAATTCCATGACCAGATTGGAAGTGAAGTAATTTCATTTGTGGTGTTTATGACAAATCTAATTGAAGCCGAAGAAGATGTGTCGCTGCTAATCTCAAAGAA
AATATTAAGGAATGAACTTGATCAAGACCGTAAAGTTGTGTTACAATTGTTCAATCAGCTGTCAAAAGGACCAACCAAATGTTTGGATACACACATTTGTGAAGTATTAA
ACATGTTGAATAAGCATTGTAATAATCCTTGGACTATGTGGTGGGTAAATCTCAAAGACCCTCGAACTCAAGATCCATGGGTTATTATCTCCGTTGCTGCCGTTCTCTTT
GGTTTTCTGCTCCAAATTATCCAAACCGTCTACGGAATTTATGGAATTTACCGCCCAAATCATTAACCACATTGTTTTTTTTCTTTTTTTTTTTTT
Protein sequenceShow/hide protein sequence
MNMMQSTVGSEFFGGDDDEFIIKDMGKVVNEVKELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFK
AKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKDKGSLSLLILY
WMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTLPNLLNMMTFEKFHD
QIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQII
QTVYGIYGIYRPNH