; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010724 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010724
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationChr06:25227543..25229928
RNA-Seq ExpressionHG10010724
SyntenyHG10010724
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039868.1 UPF0481 protein [Cucumis melo var. makuwa]4.7e-9549.32Show/hide
Query:  MGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVES
        MG VVNEV ELL+K+E+++  EE    Q +     K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV+S
Subjt:  MGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVES

Query:  VSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSL
        V S L+ LLGSY DNL+D WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L         + +       L
Subjt:  VSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSL

Query:  SLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSV
          +I  WM +    +     LHILDM+R+ +L P+I K D+ +            N+  VEC    SAT+L K GI+F++  T+NLMDVSF+FK  VL +
Subjt:  SLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSV

Query:  PQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNN
        P L +D   T P L+N+M FEK H  +  +V SFVV M NLI  ++DVSLL SK IL + +  D   V+QLFN L+KG TK L+ H+CE+  MLN + +N
Subjt:  PQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNN

Query:  PWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
         W++W  N  LK P+T++PW IIS+A   FGF   ++Q +
Subjt:  PWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

KAG7025238.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]4.3e-8042.45Show/hide
Query:  DNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLE
        D +L        E  +     SIYKIP  + + +PKA+EPQ+VS GPY++GK +L  MELEK   +  FK +  L+ ES IV  VS+ L +L+ SYD LE
Subjt:  DNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLE

Query:  DKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSLSLLILYWMGIPKGGVITG
        + WK++PAKFL+LMIVDGCFML  +IN CP SL ++  +IK+DMLLLENQLP  LLQ LY +               D+  L  L+  W+ IP+  V+  
Subjt:  DKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSLSLLILYWMGIPKGGVITG

Query:  KYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTLPNLLNMMTFEKFH
          LHIL+M++  +L+P I++TD          + ++ EC     AT+L + GI+FK   T +L DV F+ K GVL +P+L +D  T   LLN+M FEK H
Subjt:  KYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTLPNLLNMMTFEKFH

Query:  DQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNLKDPRTQDPWVIISVA
         + G +V SFV+ M+NLI+ E DV++L  +++L N +  D++    LFN+L  G    LDTH+  V   +N+HCN PW      LK    Q PW IIS+ 
Subjt:  DQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNLKDPRTQDPWVIISVA

Query:  AVLFGFLLQIIQTVYGIYGIYRPN
        A +FGF++ I+Q +Y     Y  N
Subjt:  AVLFGFLLQIIQTVYGIYGIYRPN

XP_004140657.1 UPF0481 protein At3g47200 [Cucumis sativus]3.4e-9348.21Show/hide
Query:  KDMGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV
        ++MG VVNEV  LL+K+E+++  EE      LS Q  K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV
Subjt:  KDMGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV

Query:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKG
        +SV S L+ LLGSY DNL+D+WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L   N ++       ++  
Subjt:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKG

Query:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDD------------YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFK
         L  +I  WMG+    +     LHILD++R+ +L P+  K D+              +     Y Q+ +   SAT+L K GI+F++  T+NLMDVSF+FK
Subjt:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDD------------YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFK

Query:  TGVLSVPQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNML
          VL +P L +D   T P L+N+M FEK H  +  +V SFVV M NLI  E+DVSLL SK IL + +  D   V+QLFN L+KG TK L++H+ E+  ML
Subjt:  TGVLSVPQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNML

Query:  NKHCNNPWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
        N + +N W++W  N  LK P+T++ W IIS+A   FGF    +Q +
Subjt:  NKHCNNPWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

XP_016902479.1 PREDICTED: uncharacterized protein LOC103499077 [Cucumis melo]1.8e-9448.64Show/hide
Query:  KDMGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV
        ++MG VVNEV ELL+K+E+++  EE    Q +     K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV
Subjt:  KDMGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV

Query:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKG
        +SV S L+ LLGSY DNL+D WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L         + +      
Subjt:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKG

Query:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVL
         L  +I  WM +    +     LHILDM+R+ +L P+I K D+ +            N+  VEC    SAT+L K GI+F++  T+NLMDVSF+FK  VL
Subjt:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVL

Query:  SVPQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHC
         +P L +D   T P L+N+M FEK H  +  +V SFVV M NLI  +++VSLL SK IL + +  D   V+QLFN L+KG TK L+ H+CE+  MLN + 
Subjt:  SVPQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHC

Query:  NNPWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
        +N W++W  N  LK P+T++PW  IS+A   FGF   ++Q +
Subjt:  NNPWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

XP_038875622.1 UPF0481 protein At3g47200-like [Benincasa hispida]1.1e-9147.14Show/hide
Query:  VVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSS
        VV +V   LNK+ D+   +E+  + E +    + SIYKIPE +R++  KAFEPQLVS GPYH+GK +L+ ME EKQ  + RF     L  ES IVES+S+
Subjt:  VVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSS

Query:  TLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSLSLLIL
         L+ L G+YD L++KWKQ+ AKFL++MIVDGCFMLD     CP SL++M  +IKRDMLLLENQLP  LLQ LYD + N           Q+  SL  LI 
Subjt:  TLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSLSLLIL

Query:  YWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTL
          M I +  V+ G+ LHILDM+RA +LYP +++  D + +++  Y Q++ EC     AT+L   GI+FK   T NL DVSFN K GVL +P + +D  T 
Subjt:  YWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTL

Query:  PNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNLKD
          LLN+M FEK + + GS+V SFV+ M NLI+ + DV+LL S KIL N L  D      LF+ L KG    LD+HI +V   +NKHC+  W  W  +LK 
Subjt:  PNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNLKD

Query:  PRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRPN
           Q+PW IIS+ A +FGF + I+Q +Y I   +R N
Subjt:  PRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRPN

TrEMBL top hitse value%identityAlignment
A0A0A0KAA3 Uncharacterized protein1.3e-8747.07Show/hide
Query:  KDMGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV
        ++MG VVNEV  LL+K+E+++  EE      LS Q  K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV
Subjt:  KDMGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV

Query:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKG
        +SV S L+ LLGSY DNL+D+WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L   N ++       ++  
Subjt:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKG

Query:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDD------------YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFK
         L  +I  WMG+    +     LHILD++R+ +L P+  K D+              +     Y Q+ +   SAT+L K GI+F++  T+NLMDVSF+FK
Subjt:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDD------------YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFK

Query:  TGVLSVPQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNML
          VL +P L +D   T P L+N+M FEK H  +  +V SFVV M NLI  E+DVSLL SK IL + +  D   V+QLFN L+KG TK L++HI  + N+ 
Subjt:  TGVLSVPQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNML

Query:  NKHCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTV
                      LK P+T++ W IIS+A   FGF    +Q +
Subjt:  NKHCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTV

A0A1S4E2L7 uncharacterized protein LOC1034990778.7e-9548.64Show/hide
Query:  KDMGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV
        ++MG VVNEV ELL+K+E+++  EE    Q +     K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV
Subjt:  KDMGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIV

Query:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKG
        +SV S L+ LLGSY DNL+D WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L         + +      
Subjt:  ESVSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKG

Query:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVL
         L  +I  WM +    +     LHILDM+R+ +L P+I K D+ +            N+  VEC    SAT+L K GI+F++  T+NLMDVSF+FK  VL
Subjt:  SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVL

Query:  SVPQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHC
         +P L +D   T P L+N+M FEK H  +  +V SFVV M NLI  +++VSLL SK IL + +  D   V+QLFN L+KG TK L+ H+CE+  MLN + 
Subjt:  SVPQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHC

Query:  NNPWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
        +N W++W  N  LK P+T++PW  IS+A   FGF   ++Q +
Subjt:  NNPWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

A0A5A7TA99 UPF0481 protein2.3e-9549.32Show/hide
Query:  MGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVES
        MG VVNEV ELL+K+E+++  EE    Q +     K SIYKIP+ IREVNPKA+EP+LVS GPYHYGKP+LL ME EK   +L FKA+NNL  ES IV+S
Subjt:  MGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVES

Query:  VSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSL
        V S L+ LLGSY DNL+D WK++ A FLKLMIVDGCF+LDLI     +SL   MI +IKRDM+LLENQLP  LL+ L+ L         + +       L
Subjt:  VSSTLKQLLGSY-DNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLA-SMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSL

Query:  SLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSV
          +I  WM +    +     LHILDM+R+ +L P+I K D+ +            N+  VEC    SAT+L K GI+F++  T+NLMDVSF+FK  VL +
Subjt:  SLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSR-----SRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSV

Query:  PQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNN
        P L +D   T P L+N+M FEK H  +  +V SFVV M NLI  ++DVSLL SK IL + +  D   V+QLFN L+KG TK L+ H+CE+  MLN + +N
Subjt:  PQLKLDH-TTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNN

Query:  PWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV
         W++W  N  LK P+T++PW IIS+A   FGF   ++Q +
Subjt:  PWTMWWVN--LKDPRTQDPWVIISVAAVLFGFLLQIIQTV

A0A6J1HB25 UPF0481 protein At3g47200-like1.3e-7941.87Show/hide
Query:  GDDDEFIIKDMGKVVNEVIELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK
        G DD   I D   +V  V   L+++ D+  +L  E  +  EL       SIYKIP  + + +PKA+EPQ+VS GPY++GK +L  MELEK   +  FKA+
Subjt:  GDDDEFIIKDMGKVVNEVIELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK

Query:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSID
          L+ ES IV  VS+ L +L+ SYD LE+ WK++P KFL+LMIVDGCFML  +IN CP SL ++  +IK+DMLLLENQLP  LL+ LY + + N      
Subjt:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSID

Query:  DVITQDKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKT
          +   +  L  L+  W+ IP+  V+    LHIL+M++  +L+P I++TD            ++ EC     AT+L + GI+FK   T +L DV F+ K 
Subjt:  DVITQDKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKT

Query:  GVLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNK
        GVL +P+L +D  T   LLN+M FEK H + G +V SFV+ M+NLI+ E DV++L  +++L N +  D++    LFN+L  G    LDTH+  V   +N+
Subjt:  GVLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNK

Query:  HCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY
        HCN PW      LK    Q PW IIS+ A +FGF++ I+Q +Y     Y
Subjt:  HCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY

A0A6J1KVQ6 UPF0481 protein At3g47200-like isoform X21.1e-7841.87Show/hide
Query:  GDDDEFIIKDMGKVVNEVIELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK
        G DD   I D   +V  V   L+++ D+  +L  E  +  EL       SIYKIP  + + +PKA+EPQ+VS GPY++GK +L  MELEK   +  FK +
Subjt:  GDDDEFIIKDMGKVVNEVIELLNKVEDN--VLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK

Query:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSID
          L+ ES IV  VS+ L +L+ SYD LE++W Q+P KFL+LMIVDGCFML  +I+ CP SL ++  +IK+DMLLLENQLP  LL+ LY +   N      
Subjt:  NNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSID

Query:  DVITQDKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKT
          + QD      L+  W+ IP+  V+    LHIL+M++  +LYP I++ D          + ++ EC     AT+L + GI+FK   T++L DV F+ K 
Subjt:  DVITQDKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVEC---GSATRLGKLGIEFKEIITDNLMDVSFNFKT

Query:  GVLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNK
        GVL +PQL +D  T   +LN+M FEK H + G +V SFV+ M+NLI+ E DV++L  +KIL N +  D++    LF++L  G    LD+H+  V  M+N 
Subjt:  GVLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNK

Query:  HCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY
        HCN PW      LK    Q PW IIS+ A +FGF++ I+Q +Y     Y
Subjt:  HCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIY

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.2e-0524.18Show/hide
Query:  NVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK-NNLEFESII--VESVSSTLKQLLGSYDN
        NV K    +++E   ++   SI+ +P+ +   +P ++ P  VS GPYH  KP L EME  K     + + + N+  F  ++  ++S+   ++     Y  
Subjt:  NVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAK-NNLEFESII--VESVSSTLKQLLGSYDN

Query:  LEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMI-----GNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVI
               N    L +M VD  F+++ +     + + ++I       I RD++++ENQ+P  +L+   +       +S DD++
Subjt:  LEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMI-----GNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVI

Q9SD53 UPF0481 protein At3g472006.5e-3128.02Show/hide
Query:  SRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRF--KAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLK
        S  K    I+++PE    +NPKA++P++VS GPYHYG+ +L  ++  K      F  +AK     E+++V++V            +LEDK +++ ++ LK
Subjt:  SRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRF--KAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLK

Query:  -------LMIVDGCF--MLDLIINGCPQ-------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSLSLLILYWMGIP--K
               +M++DGCF  M+ LI++G  +       S+  ++ +I+ D+LLLENQ+PF +LQ LY  V +    S D         L+ +  ++   P  K
Subjt:  -------LMIVDGCF--MLDLIINGCPQ-------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSLSLLILYWMGIP--K

Query:  GGVITGKY-----LHILDMHRALMLYPSIEKTDDYNS-------RSRRRYNQTEVECG------SATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQ
         G    K+      H+LD+ R   L P+  ++D  +S          +  N   V+        SA RL   GI+F+   +     ++   K   L +PQ
Subjt:  GGVITGKY-----LHILDMHRALMLYPSIEKTDDYNS-------RSRRRYNQTEVECG------SATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQ

Query:  LKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDT-HICEVLNMLNKHCNNPW
        L+ D       LN + FE+F+    +E+ +++VFM  L+  EEDV+ L + K++          V + F  +SK     +DT ++  V   +N++    +
Subjt:  LKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDT-HICEVLNMLNKHCNNPW

Query:  TMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGI
           W   +    + PW  +S  AVLF  LL ++Q+   I
Subjt:  TMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGI

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)2.1e-3726.39Show/hide
Query:  GKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDG
        GK  IY++P  ++E + K++ PQ VS GPYH+GK  L  M+  K     R   + N   + + ++++    ++    Y   E     +  +F++++++DG
Subjt:  GKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDG

Query:  CFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDL----------------------------VNNNGHKSIDDVITQ
        CF+L+L    + G  +          ++   + +I+RDM++LENQLP  +L  L +L                            +  +G   +++ + +
Subjt:  CFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDL----------------------------VNNNGHKSIDDVITQ

Query:  DKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPS-------IEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTG
        DK   S      MG           LH LD+ R  +L  S         K    N+R   +  Q  + C   T L + GI+F+   TD   D+   FK G
Subjt:  DKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPS-------IEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTG

Query:  VLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNK
         L +P+L +   T    LN++ FE+ H    +++ S+++FM NLI++ EDVS L    I+ + L  D +V   LFN+L +       D+++  +   +N+
Subjt:  VLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNK

Query:  HCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRP
        + ++ W  W   LK     +PW I+S  A +   +L   Q+ Y +Y  Y+P
Subjt:  HCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRP

AT3G50130.1 Plant protein of unknown function (DUF247)2.1e-3224.89Show/hide
Query:  KEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWK
        K E    ++ +    K  IY++P+ ++E N K++ PQ VS GP+H+G  +LL M+  K        A+   + E + ++++     +    Y+   D   
Subjt:  KEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWK

Query:  QNPAKFLKLMIVDGCFMLDLIINGCPQSLASM--------------IGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKS----------------IDDV
         +  KF +++++DGCF+L+L   G  +  + +              + +I+RDM++LENQLP  +L  L ++     H++                 D+ 
Subjt:  QNPAKFLKLMIVDGCFMLDLIINGCPQSLASM--------------IGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKS----------------IDDV

Query:  ITQDKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTD-------DYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNF
        +T+   SL     +    P      G+ LH LD+ R  +L P              + +R   +  Q  + C   T L + GI+F+   TD   D+   F
Subjt:  ITQDKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTD-------DYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNF

Query:  KTGVLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNM
        K G L +P+L +   T     N++ FE+ H    +++ S+++FM NLI++ EDV  L    I+ + L  D +V   LFN+L +       ++++ ++ N 
Subjt:  KTGVLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNM

Query:  LNKHCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRP
        ++++ +  W +    LK     +PW   S  A L   +L + Q+ +  Y  + P
Subjt:  LNKHCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRP

AT3G50150.1 Plant protein of unknown function (DUF247)4.5e-3527.23Show/hide
Query:  KASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQN--GYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVD
        K  IY++P  ++E + K++ PQ VS GPYH+GK +L  ME  K      +  + K+N+E   + ++++    ++    Y    D   +N  +F +++++D
Subjt:  KASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQN--GYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVD

Query:  GCFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKS----------------IDDVITQDKGSLSLLILY
        GCF+L+L    I G  +          +   ++ +I+RDM++LENQLP  +L  L  L     +++                  +V+T+ + SL      
Subjt:  GCFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKS----------------IDDVITQDKGSLSLLILY

Query:  WMGIPKGGVITGKYLHILDM-HRALMLYPSIEKTDD---YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTL
              GG      LH LD+ HR+L+   S E T+    Y   S     Q  + C   T L   G+ F    T  L D+   FK G L +P+L +   T 
Subjt:  WMGIPKGGVITGKYLHILDM-HRALMLYPSIEKTDD---YNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTL

Query:  PNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNKHCNNPWTMWWVNLK
            N++ FE+ H Q  + + S+++FM NLI + +DVS L    I+ + L  D +V   LFN+L K       D ++ ++   +N++ +  W      L+
Subjt:  PNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNKHCNNPWTMWWVNLK

Query:  DPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRP
             +PW   S +A +    L   Q+ + +Y  Y+P
Subjt:  DPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRP

AT3G50170.1 Plant protein of unknown function (DUF247)4.3e-3828.32Show/hide
Query:  GKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDN----LEDKWK---QNPA---
        GK  IY++P  ++E + K++ PQ VS GPYH+GK  L  ME  K      ++A N           V   LKQ +  Y N    LE+K +   + P    
Subjt:  GKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQLLGSYDN----LEDKWK---QNPA---

Query:  --KFLKLMIVDGCFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDL----VNNNG---HKSI---------DDVITQ
          +F +++++DGCF+L+L    + G  +          ++  ++ +I+RDM++LENQLP  +L  L +L     N  G   H ++          + +T+
Subjt:  --KFLKLMIVDGCFMLDLI---INGCPQ----------SLASMIGNIKRDMLLLENQLPFNLLQHLYDL----VNNNG---HKSI---------DDVITQ

Query:  DKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYP-------SIEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTG
           S    ++ W+      +     LH LD+ R  +L         S+ K    N+R   +  Q  V C   T L + G++F++  TD   D+   FK G
Subjt:  DKGSLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYP-------SIEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTG

Query:  VLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNK
         L +P+L +   T     N++ FE+ H +  + + S+++FM NLI + EDVS L    I+ + L  D +V   LFN+L +       D+H+  +   +N+
Subjt:  VLSVPQLKLDHTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPT-KCLDTHICEVLNMLNK

Query:  HCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRPN
        + N  W +    L      +PW   S +A +   LL + Q+ Y +Y  Y+PN
Subjt:  HCNNPWTMWWVNLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGIYRPN

AT4G31980.1 unknown protein1.7e-3427.69Show/hide
Query:  ELLNKVEDNVLKEELR-KMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQL
        E +N+ E + L + ++ K+  LS    K  IYK+P ++R +NP A+ P+LVSFGP H GK  L  ME +K    L F  + N   E ++   ++ T +Q 
Subjt:  ELLNKVEDNVLKEELR-KMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFKAKNNLEFESIIVESVSSTLKQL

Query:  LGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSL----------ASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSL
          +     +  K +  +F+++++VDG F+++L++      L          + MI ++ RDM+L+ENQLPF +++ ++ L+ N   +    +I   +   
Subjt:  LGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSL----------ASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKGSL

Query:  SLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVE-CGSATRLGKLGIEFKEIITDN-LMDVSFNFKTGVLSVPQLKLD
        S    Y++          +  H +D+ R+  L     K +         Y   +V+    AT L   G+ FK   T + L+D+S  F  GVL +P + +D
Subjt:  SLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVE-CGSATRLGKLGIEFKEIITDN-LMDVSFNFKTGVLSVPQLKLD

Query:  HTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWV
          T     N++ FE+         + +++ +   I++  D  LLI   I+ N L      V  LFN +SK        +   +   L  +CN PW  W  
Subjt:  HTTLPNLLNMMTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWV

Query:  NLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGI
         L+     +PW + SV A L   LL  IQ+V  I  +
Subjt:  NLKDPRTQDPWVIISVAAVLFGFLLQIIQTVYGIYGI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATATGATGCAGTCAACAGTAGGCAGCGAATTCTTTGGTGGAGACGATGATGAGTTTATTATTAAAGATATGGGTAAGGTTGTAAATGAGGTGATAGAGCTTCTGAA
TAAAGTTGAGGATAATGTTTTGAAGGAGGAGTTACGTAAGATGCAAGAATTATCAAGGCAAAAGGGGAAAGCTTCAATCTACAAAATACCAGAAGAAATAAGAGAAGTTA
ATCCCAAAGCTTTCGAGCCACAGTTGGTGTCTTTTGGGCCATACCATTATGGGAAACCCAATTTGCTTGAAATGGAATTGGAAAAACAGAATGGGTATCTCCGTTTTAAA
GCTAAAAACAATTTAGAATTTGAGTCCATTATTGTGGAGAGTGTGTCGAGCACTTTGAAACAGCTGTTGGGATCTTATGATAATCTTGAGGATAAATGGAAGCAAAATCC
TGCTAAGTTCTTGAAGCTCATGATTGTGGATGGCTGTTTCATGCTGGATTTGATAATCAATGGATGTCCTCAATCGCTAGCTTCTATGATTGGGAATATAAAGCGGGACA
TGCTGTTGCTTGAGAATCAACTACCCTTCAACCTACTTCAACACCTCTATGATTTAGTGAACAACAACGGCCATAAGAGTATAGATGATGTGATTACACAGGATAAAGGG
AGTCTGAGCTTGCTGATTTTGTACTGGATGGGGATACCGAAGGGAGGGGTAATAACGGGAAAATACTTGCACATTTTGGATATGCACAGAGCTTTAATGTTGTATCCTTC
AATAGAAAAGACAGATGATTACAATAGTAGAAGCCGAAGAAGGTACAACCAGACGGAGGTGGAGTGTGGATCGGCAACACGACTTGGTAAATTAGGGATAGAATTTAAGG
AAATTATTACAGATAACCTTATGGACGTGTCCTTCAACTTCAAAACAGGAGTGCTGAGTGTGCCACAGCTGAAGCTGGATCACACCACTCTACCAAATTTATTAAATATG
ATGACATTTGAGAAATTCCATGACCAGATTGGAAGTGAAGTAATTTCATTTGTGGTGTTTATGACAAATCTAATTGAAGCCGAAGAAGATGTGTCGCTGCTAATCTCAAA
GAAAATATTAAGGAATGAACTTGATCAAGACCGTAAAGTTGTGTTACAATTGTTCAATCAGCTGTCAAAAGGACCAACCAAATGTTTGGATACACACATTTGTGAAGTAT
TAAACATGTTGAATAAGCATTGTAATAATCCTTGGACTATGTGGTGGGTAAATCTCAAAGACCCTCGAACTCAAGATCCATGGGTTATTATCTCCGTTGCTGCCGTTCTC
TTTGGTTTTCTGCTCCAAATTATCCAAACCGTCTACGGAATTTATGGAATTTACCGCCCAAATCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATATGATGCAGTCAACAGTAGGCAGCGAATTCTTTGGTGGAGACGATGATGAGTTTATTATTAAAGATATGGGTAAGGTTGTAAATGAGGTGATAGAGCTTCTGAA
TAAAGTTGAGGATAATGTTTTGAAGGAGGAGTTACGTAAGATGCAAGAATTATCAAGGCAAAAGGGGAAAGCTTCAATCTACAAAATACCAGAAGAAATAAGAGAAGTTA
ATCCCAAAGCTTTCGAGCCACAGTTGGTGTCTTTTGGGCCATACCATTATGGGAAACCCAATTTGCTTGAAATGGAATTGGAAAAACAGAATGGGTATCTCCGTTTTAAA
GCTAAAAACAATTTAGAATTTGAGTCCATTATTGTGGAGAGTGTGTCGAGCACTTTGAAACAGCTGTTGGGATCTTATGATAATCTTGAGGATAAATGGAAGCAAAATCC
TGCTAAGTTCTTGAAGCTCATGATTGTGGATGGCTGTTTCATGCTGGATTTGATAATCAATGGATGTCCTCAATCGCTAGCTTCTATGATTGGGAATATAAAGCGGGACA
TGCTGTTGCTTGAGAATCAACTACCCTTCAACCTACTTCAACACCTCTATGATTTAGTGAACAACAACGGCCATAAGAGTATAGATGATGTGATTACACAGGATAAAGGG
AGTCTGAGCTTGCTGATTTTGTACTGGATGGGGATACCGAAGGGAGGGGTAATAACGGGAAAATACTTGCACATTTTGGATATGCACAGAGCTTTAATGTTGTATCCTTC
AATAGAAAAGACAGATGATTACAATAGTAGAAGCCGAAGAAGGTACAACCAGACGGAGGTGGAGTGTGGATCGGCAACACGACTTGGTAAATTAGGGATAGAATTTAAGG
AAATTATTACAGATAACCTTATGGACGTGTCCTTCAACTTCAAAACAGGAGTGCTGAGTGTGCCACAGCTGAAGCTGGATCACACCACTCTACCAAATTTATTAAATATG
ATGACATTTGAGAAATTCCATGACCAGATTGGAAGTGAAGTAATTTCATTTGTGGTGTTTATGACAAATCTAATTGAAGCCGAAGAAGATGTGTCGCTGCTAATCTCAAA
GAAAATATTAAGGAATGAACTTGATCAAGACCGTAAAGTTGTGTTACAATTGTTCAATCAGCTGTCAAAAGGACCAACCAAATGTTTGGATACACACATTTGTGAAGTAT
TAAACATGTTGAATAAGCATTGTAATAATCCTTGGACTATGTGGTGGGTAAATCTCAAAGACCCTCGAACTCAAGATCCATGGGTTATTATCTCCGTTGCTGCCGTTCTC
TTTGGTTTTCTGCTCCAAATTATCCAAACCGTCTACGGAATTTATGGAATTTACCGCCCAAATCATTAA
Protein sequenceShow/hide protein sequence
MNMMQSTVGSEFFGGDDDEFIIKDMGKVVNEVIELLNKVEDNVLKEELRKMQELSRQKGKASIYKIPEEIREVNPKAFEPQLVSFGPYHYGKPNLLEMELEKQNGYLRFK
AKNNLEFESIIVESVSSTLKQLLGSYDNLEDKWKQNPAKFLKLMIVDGCFMLDLIINGCPQSLASMIGNIKRDMLLLENQLPFNLLQHLYDLVNNNGHKSIDDVITQDKG
SLSLLILYWMGIPKGGVITGKYLHILDMHRALMLYPSIEKTDDYNSRSRRRYNQTEVECGSATRLGKLGIEFKEIITDNLMDVSFNFKTGVLSVPQLKLDHTTLPNLLNM
MTFEKFHDQIGSEVISFVVFMTNLIEAEEDVSLLISKKILRNELDQDRKVVLQLFNQLSKGPTKCLDTHICEVLNMLNKHCNNPWTMWWVNLKDPRTQDPWVIISVAAVL
FGFLLQIIQTVYGIYGIYRPNH