; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024880 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024880
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationtig00002486:3704146..3705452
RNA-Seq ExpressionSgr024880
SyntenySgr024880
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018572.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-16870.51Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        MD S A SHSID+PA S+  SEEESLLSSIEGKLEAFCSS  IF+AP+EISI+  DR+VFVP+KVSIGP H GA HLESME+ KW YL AFLK NPSV L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
          L++L+VKSESR+RKCYE E +  D DKF+Q+MLLDCCF+LELLLR+S KRLRRRND VFTTPGLLFDLRCD+MLLENQIPYFLLKDVY++VQD  +E 
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ
         SLN+L FRFF+TMVAG+RQ VYDNFM +ADHLL+M                +S+++ELP+ASKL+ AGIK K+AR+ KS+LDI+FQ GVL+IPPLKVYQ
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ

Query:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK
        +TE ILRNL+AYEI QSGS +QV+SY+NFMSHLL SD+DVK+LY R+IL+D EDDE QIIRNLKW+S+++ +LSGT+FAG+VQKLNE+PDRC+ RWR+L+
Subjt:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK

Query:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK
        RNP AIG+ A  VVVVIFVAAFFSA S LQ RYK
Subjt:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK

XP_022138112.1 UPF0481 protein At3g47200-like [Momordica charantia]6.9e-17071.95Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        M+PS A SH+IDIPAISR+RS+EESLL S+E K+EAFCSS IIFK PDEISID  +R+VFVPAKVSIGP H GA HLESMED KW YLCAFLK NPSV L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
        DDLL+ + KSESRVRKCYEVE  DLD  KFA+MM+LDCCFVLELLLR+S KRL+RRNDPVFTTPGLL DL+ D++LLENQIPYFLL++VY+ VQD  +EN
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM---------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVY
          LN+LAFRFFRT+VAGERQ VYDNF QDADHLLD+                 +S+  ELP ASKL++AGIKFK+A TPKSVLDI+FQ G L+IP L+V 
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM---------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVY

Query:  QRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRL
        + TE IL+NLIAYEICQ GSAQQV+SYV+FMSHLL SDED+KLL  R+IL + E DETQII NLKW+ QQKANLSGT+FAGVVQKLNE PDR +  WRRL
Subjt:  QRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRL

Query:  KRNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK
        +RNP AIGV A W +VVIFVAAFFSALS LQ RY+
Subjt:  KRNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK

XP_022955709.1 UPF0481 protein At3g47200-like [Cucurbita moschata]3.4e-16970.74Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        MD S A SHSID+PA S+  SEEESLLSSIEGKLEAFCSS  IF+AP+EISI+  DR+VFVPAKVSIGP H GA HLESME+ KW YL AFLK NPSV L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
          L++L+VKSESR+RKCYE E +  D DKF+Q+MLLDCCF+LELLLR+S KRLRRRND VFTTPGLLFDLRCD+MLLENQIPYFLLKDVY++VQD  +EN
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ
         SLN+L FRFF+TMVAG+RQ VYDNFM +ADHLL+M                +S+++ELP+ASKL+ AGIK K+AR+ KS+LDI+FQ GVL+IPPLKVYQ
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ

Query:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK
        +TE ILRNL+AYEI QSGS +QV+SY+NFMSHLL SD+DVK+LY R+IL+D EDDE QIIRNLKW+S+++ +LSGT+FAG+VQKLNE+PDRC+ RWR+L+
Subjt:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK

Query:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK
        R P AIG+ A  VVVVIFVAAFFSA S LQ RYK
Subjt:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK

XP_023526431.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]1.3e-16870.74Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        MD S A SHSID+PA S+  SEEESLLSSIEGKLEAFCSS  IF+AP+EISI+  DR+VFVPAKVSIGP H GA HLESME+ KW YL AFLK NPSV L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
          L++L+VKSESR+RKCYE E +  D DKF+Q+MLLDCCF+LELLLR+S KRLRRRND VFTTPGLLFDLRCD+MLLENQIPYFLLKDVY++VQD  +EN
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ
         SLN+L FRFF+T+V G+RQ VYDNF  +ADHLL+M                +S++ ELPSASKL+ AGIK K+AR+ KS+LDI+FQ GVL+IPPLKVYQ
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ

Query:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK
        +TE ILRNL+AYEI QSGS +QV+SY+NFMSHLL SD+DVK+LY R+IL D EDDE QIIRNLKW+S +K +LSGT+FAG+VQKLNE+PDRC+ RWR+L+
Subjt:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK

Query:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK
        RNP AIG+ A  VVVVIFVAAFFSA S LQ RYK
Subjt:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK

XP_038880915.1 UPF0481 protein At3g47200-like [Benincasa hispida]2.5e-17271.2Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        M+ S  FSHSIDI AI++  S+EESLLSS+EGKLEAFCSS  IF+AP++ISI+  D++VFVPAKVSIGP H GA HLE ME+ KWRYL  FLK NPS++L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
        DDL++L+VKSESR+RKCYE E +DLD DKF+QMMLLDCCF+LELLLRYS KR RR NDPVF TPGLLFDLRCD+MLLENQIPYFLL +VY++VQD L+EN
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ
         SLN+L FRFF+TMVAG+R+FVYDNFM +ADHLL+M                +S++ ELPSASKL+ AGIKFK+AR+PKS+LDI+FQKGVL+IPPL+VYQ
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ

Query:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK
        +TEAILRNL AYEI Q GS  QV+SY+NFMSHLL SDEDVK+L  R+IL D EDDE QII+NLKW+ ++K +LSGT+FAG+VQKLNE+PDRCL +WR L+
Subjt:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK

Query:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK
        RNP AIGVAA WVVVVIFVAAFFSA+S LQ RYK
Subjt:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK

TrEMBL top hitse value%identityAlignment
A0A0A0L821 Uncharacterized protein3.4e-14662.21Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        MDPS+  SH+I+I  IS++  +EESLLS IE KLEA CSS  I+KAP EI+I+  DR+VF+PAKVSIGP H GA HLES+E  KW YL  FL   PS++L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
         DL++L+VKSESR RKCYE E +  D D+F+Q+MLLDCCF+LELLLRY+ +R RR NDPVFTTPGLL+DLRCD++LLENQIPYFLL+++Y  V D L+EN
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ
          L++L  RFFRTMV G+R+F+ DNF+ +A+HLL+M                + +++ELPSASKL+AAGIKFK+AR+ KS+LDI+FQ GVL+IPPL+VYQ
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ

Query:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK
        +TE ILRNL AYEICQ G+  QV+SY+NFMSHLL SDEDVK+L  ++IL+  +D+E QII  LKW+ +QK +LSGTFFAG+VQKL E+PDR + RWRRL+
Subjt:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK

Query:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK
         N  AI VA   +VVVIF AAFF+A S LQ RYK
Subjt:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK

A0A1S3AY98 UPF0481 protein At3g47200-like5.4e-15264.29Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        MD S+  SH+I+IP IS++ S EESLLSSIEGKLEA CSS  IFKAP EI+I+   R+VFVPAKVSIGP H GA HL+S+E+ KWRYL  FLK N S++L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
         DL++++VKSESR++KCYE +   LD D+F+ +MLLDCCF+LELLLRYS +R +RRNDPVFTTPGLLFD++CD+MLLENQIPYFLL ++Y+ V D  +EN
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ
          L++L FRFFRTMV G+R+F+ DNF+ +ADHLL+M                + +++ELPSASKL+ AGIKFK+AR+ KS+LDI+FQ GVL+IPPL+VYQ
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ

Query:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK
        +TEAILRNL AYEI QSG+ QQV+SY+ FMSHLL SD DVK+L  ++IL   EDDE QII NLKW+ +QK +LSGT+FAG+VQKLNE+PDR + RWRRL+
Subjt:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK

Query:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK
        R P AIGVAA  +VVVIF AAFF+A S LQ RYK
Subjt:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK

A0A6J1CA62 UPF0481 protein At3g47200-like3.3e-17071.95Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        M+PS A SH+IDIPAISR+RS+EESLL S+E K+EAFCSS IIFK PDEISID  +R+VFVPAKVSIGP H GA HLESMED KW YLCAFLK NPSV L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
        DDLL+ + KSESRVRKCYEVE  DLD  KFA+MM+LDCCFVLELLLR+S KRL+RRNDPVFTTPGLL DL+ D++LLENQIPYFLL++VY+ VQD  +EN
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM---------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVY
          LN+LAFRFFRT+VAGERQ VYDNF QDADHLLD+                 +S+  ELP ASKL++AGIKFK+A TPKSVLDI+FQ G L+IP L+V 
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM---------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVY

Query:  QRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRL
        + TE IL+NLIAYEICQ GSAQQV+SYV+FMSHLL SDED+KLL  R+IL + E DETQII NLKW+ QQKANLSGT+FAGVVQKLNE PDR +  WRRL
Subjt:  QRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRL

Query:  KRNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK
        +RNP AIGV A W +VVIFVAAFFSALS LQ RY+
Subjt:  KRNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK

A0A6J1GVU1 UPF0481 protein At3g47200-like1.7e-16970.74Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        MD S A SHSID+PA S+  SEEESLLSSIEGKLEAFCSS  IF+AP+EISI+  DR+VFVPAKVSIGP H GA HLESME+ KW YL AFLK NPSV L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
          L++L+VKSESR+RKCYE E +  D DKF+Q+MLLDCCF+LELLLR+S KRLRRRND VFTTPGLLFDLRCD+MLLENQIPYFLLKDVY++VQD  +EN
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ
         SLN+L FRFF+TMVAG+RQ VYDNFM +ADHLL+M                +S+++ELP+ASKL+ AGIK K+AR+ KS+LDI+FQ GVL+IPPLKVYQ
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ

Query:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK
        +TE ILRNL+AYEI QSGS +QV+SY+NFMSHLL SD+DVK+LY R+IL+D EDDE QIIRNLKW+S+++ +LSGT+FAG+VQKLNE+PDRC+ RWR+L+
Subjt:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLK

Query:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK
        R P AIG+ A  VVVVIFVAAFFSA S LQ RYK
Subjt:  RNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK

A0A6J1IWZ4 UPF0481 protein At3g47200-like7.3e-14969.47Show/hide
Query:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL
        MD S A SHSID+PA S+  SEEESLLSSIE KLEAFCSS  IF+A +EISI+  DR+VFVPAKVSIGP H GA HLESME+ KW YL AFLK NPSV L
Subjt:  MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
          L++L+VKSESR+RKCYE E +  D +KF+Q+MLLDCCF+LELLLRYS KRLRRRND VFTTPGLLFDLRCD+MLLENQIPYFLLKDVY +VQD  +EN
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ
         SLN+L FRFF+TMVAG+RQFVYDNFM +ADHLL+M                 S++ ELPSASKL+ AGIK K+ ++ KS+LDI+FQ GVL+IPPLKVYQ
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQ

Query:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCL
        +TE ILRNL+AYEI QSGS +QV+SY+NFMSHLL SD+DVK+LY R+IL+D E+DE QIIRNLKW+ ++K +LSGT+FAG+VQKLN++ DRC+
Subjt:  RTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCL

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026455.0e-0624.48Show/hide
Query:  QDADHLLDMKQQSEAEEL--PSASKLQAAGIKFK-DARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDE
        Q++  +LD+++    EEL  PS S L  AG++FK  A    S +      G   +P + +   TE +LRNL+AYE   +        Y   ++ ++ S+E
Subjt:  QDADHLLDMKQQSEAEEL--PSASKLQAAGIKFK-DARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDE

Query:  DVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLKRNPAAIGVAAFWVVVVIFVAAFFSALSFLQ
        DV+LL E+ +L      + +       +S+        F    ++ +N        RW+        + V   W ++    A     L  LQ
Subjt:  DVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLKRNPAAIGVAAFWVVVVIFVAAFFSALSFLQ

Q9SD53 UPF0481 protein At3g472004.8e-3328.78Show/hide
Query:  PSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFL--KRNPSVSL
        P SAF + +         S+E  LL    GK      S  IF+ P+  S    +   + P  VSIGP H G +HL+ ++  K R L  FL   +   V  
Subjt:  PSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFL--KRNPSVSL

Query:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN
        + L++ +V  E ++RK Y  EL    G     MM+LD CF+L + L  S   +    DP+F+ P LL  ++ D++LLENQ+P+F+L+ +Y  V  ++  +
Subjt:  DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQEN

Query:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDMKQQ------SEAEE----------------------------LPSASKLQAAGIKFKDARTPK-
        S LN +AF FF+  +  E  +   +    A HLLD+ ++      SE+++                            + SA +L+  GIKF+  R+ + 
Subjt:  SSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDMKQQ------SEAEE----------------------------LPSASKLQAAGIKFKDARTPK-

Query:  SVLDIRFQKGVLQIPPLKVYQRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLL-YERRILSDHEDDETQIIRNLKWLSQQKA-NLSGTF
        S+L++R +K  LQIP L+      +   N +A+E   + S+ ++ +Y+ FM  LL+++EDV  L  ++ I+ +H     ++    K +S+     +  ++
Subjt:  SVLDIRFQKGVLQIPPLKVYQRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLL-YERRILSDHEDDETQIIRNLKWLSQQKA-NLSGTF

Query:  FAGVVQKLNE
           V + +NE
Subjt:  FAGVVQKLNE

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)1.0e-4129.27Show/hide
Query:  IDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSLDDLLQLIVKS
        I I  + +K  E   LLSS  GK    CS   IF+ P   S+ D +   + P  VSIGP HRG   L+ +E+ KWRYL   L R  +++L+D ++ +   
Subjt:  IDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSLDDLLQLIVKS

Query:  ESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQ--DQLQENSSLNELAF
        E   R+CY  E   +D ++F +MM+LD CF+LEL  + +       NDP+     +L     D + LENQIP+F+L+ +++  +  ++ + N+SL  LAF
Subjt:  ESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQ--DQLQENSSLNELAF

Query:  RFFRTMVAGERQFVYDNFMQDADHLLDM--------------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRT
         FF  M+    + +       A HLLD+                    K++  +  + S SKL+ AGIK ++ +  +S L +RF+ G +++P + V    
Subjt:  RFFRTMVAGERQFVYDNFMQDADHLLDM--------------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRT

Query:  EAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKA-NLSGTFFAGVVQKLNE
         + L N +AYE C    +    +Y   +  L ++ +DV+ L ++ I+ ++   +T++ + +  L +  A +++  +   + +++NE
Subjt:  EAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKA-NLSGTFFAGVVQKLNE

AT3G47210.1 Plant protein of unknown function (DUF247)6.5e-4133.25Show/hide
Query:  RSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSLDDLLQLIVKSESRVRKCYE
        RSE+  LL    GK      S  IF+ P   S  + + + + P  VSIGP H G +HLE ++  K R+L  FL R  SV  D L   +V  E  +RK Y 
Subjt:  RSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSLDDLLQLIVKSESRVRKCYE

Query:  VELFDLDGDKFAQMMLLDCCFVLELLLRYSTK-RLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQENSSLNELAFRFFR-TMVAG
         E  +    +   MM+LD CF+L LLL  S K  L    DP+ T P +L  ++ D++LLENQ+P+F+L+ ++D  + ++     LN +AF FF  +M   
Subjt:  VELFDLDGDKFAQMMLLDCCFVLELLLRYSTK-RLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQENSSLNELAFRFFR-TMVAG

Query:  ERQFV-YDNFMQDADHLLDM-----------------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILR
        ER +V + NF  +A HLLD+                       K  S    L SA++L   GI F       S+LDIR +K  LQIP L++     +IL 
Subjt:  ERQFV-YDNFMQDADHLLDM-----------------------KQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILR

Query:  NLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKA-NLSGTFFAGVVQKLNERPDR
        N +A+E   + S   + SYV FM  LL+  ED   L  RRI+ ++   E ++ +  K + +    ++  ++   V  ++NE   +
Subjt:  NLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKA-NLSGTFFAGVVQKLNERPDR

AT3G50140.1 Plant protein of unknown function (DUF247)1.9e-3728.5Show/hide
Query:  IFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSLDDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVL
        I++ P  +S+  SD++ + P  VS+GP H G  HL  M+  KWR +   +KR     ++  +  + + E R R CYE  +  L  +KF QM++LD CFVL
Subjt:  IFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSLDDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVL

Query:  ELL---LRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQENSSLNELAFRFFRTMV-----------AGERQFVYDNFMQ
        +L        +K    RNDPVF   G +  +R DM++LENQ+P F+L  + +       +   + +LA RFF  ++           + E    + N + 
Subjt:  ELL---LRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQENSSLNELAFRFFRTMV-----------AGERQFVYDNFMQ

Query:  DAD----HLLDMKQQS---------------------------EAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILRNLIA
        D +    H LD+ ++S                           + + L   ++L+ AGIKFK  ++ +   DI+F+ G L+IP L ++  T+++  NLIA
Subjt:  DAD----HLLDMKQQS---------------------------EAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILRNLIA

Query:  YEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKA-NLSGTFFAGVVQKLNERPDRCL-RRWRRLKRNPAAIGVA
        YE C   S   + SY+ FM +L+ S ED++ L+   I+     +++++      L Q+ A +L  T+    + +L+ + DR   R+W  LK        +
Subjt:  YEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKA-NLSGTFFAGVVQKLNERPDRCL-RRWRRLKRNPAAIGVA

Query:  AFWVVVVIFVAAFFSALSFLQ
          W     F A     L+  Q
Subjt:  AFWVVVVIFVAAFFSALSFLQ

AT4G31980.1 unknown protein6.7e-4630.4Show/hide
Query:  RSEEESLLSSIEGKLEAFCSS----TIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSLDDLLQLIVKSESRVR
        ++E ++L+ SI+ KL AF SS      I+K P+++     + D + P  VS GPLHRG   L++MEDQK+RYL +F+ R  S SL+DL++L    E   R
Subjt:  RSEEESLLSSIEGKLEAFCSS----TIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSLDDLLQLIVKSESRVR

Query:  KCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQENS-SLNELAFRFFRTM
         CY  E   L  D+F +M+++D  F++ELLLR    RLR  ND +F    ++ D+  DM+L+ENQ+P+F++K+++  + +  Q+ + S+ +LA R F   
Subjt:  KCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQENS-SLNELAFRFFRTM

Query:  VAGERQFVYDNFMQDADHLLDMKQQS--------------EAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILRNLIAYEI
        ++   +   + F+ + +H +D+ +                + +  P A++L  AG++FK A T   +LDI F  GVL+IP + V   TE++ +N+I +E 
Subjt:  VAGERQFVYDNFMQDADHLLDMKQQS--------------EAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILRNLIAYEI

Query:  CQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLKR-----NPAAIGVA
        C+  S +    Y+  +   + S  D  LL    I+ ++  +   +      +S++       +F+ + + L    +    RW+ + R     NP      
Subjt:  CQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLKR-----NPAAIGVA

Query:  AFWVVVVIFVAAFFSALSFLQ
          W V  +F A     L+F+Q
Subjt:  AFWVVVVIFVAAFFSALSFLQ

AT5G22540.1 Plant protein of unknown function (DUF247)1.7e-4132.84Show/hide
Query:  FVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL--DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTK-RLRRR
        + P  VSIGP H G  HL+  +  K R+L  F+ +         +L++ +   E  +R  Y  +L  LD +   QMM+LD CF+L L    S K      
Subjt:  FVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSL--DDLLQLIVKSESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTK-RLRRR

Query:  NDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQENSSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEA
        +DP+F  P +L  +R D++LLENQ+PY LL+ ++++   +L   S LNE+AF FF   +     F   ++  +A HLLD+              K  S  
Subjt:  NDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQENSSLNELAFRFFRTMVAGERQFVYDNFMQDADHLLDM--------------KQQSEA

Query:  EE---------LPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERR
                   + SA KL   GIKFK  +   S+LDI +  GVL IPP+ +   T +I  N +A+E   + S+  + SYV FM+ L++ + D   L ERR
Subjt:  EE---------LPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLYERR

Query:  ILSDHEDDETQIIRNLKWLSQQKA-NLSGTFFAGVVQKLNE
        IL ++   E ++ R  K + +  A +L  ++ A V + +NE
Subjt:  ILSDHEDDETQIIRNLKWLSQQKA-NLSGTFFAGVVQKLNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCGTCAAGTGCATTTTCTCATTCGATTGATATTCCGGCAATTTCCCGAAAAAGATCCGAGGAAGAATCCCTCTTATCTTCCATTGAAGGAAAGTTGGAAGCCTT
CTGTTCATCCACTATTATCTTCAAAGCTCCTGACGAAATCAGTATCGACGACAGCGACAGAGACGTCTTCGTCCCCGCCAAAGTCTCAATCGGCCCTCTTCACCGCGGCG
CTCGACATCTGGAATCCATGGAAGATCAGAAGTGGCGCTACTTGTGCGCTTTCTTGAAGCGCAATCCGTCCGTCAGTTTAGATGATCTTCTGCAACTTATTGTAAAATCT
GAGAGCCGAGTGCGAAAATGCTATGAGGTAGAGTTGTTCGATCTCGACGGGGACAAGTTCGCGCAGATGATGTTGCTCGATTGCTGCTTCGTTCTTGAGCTTCTTTTGCG
ATACTCGACGAAGAGGCTGCGACGCCGGAACGATCCTGTTTTCACTACTCCTGGTTTGCTCTTCGATTTGAGGTGCGACATGATGCTACTCGAAAATCAGATTCCCTATT
TCCTTCTGAAAGACGTTTATGACAGTGTGCAAGATCAACTCCAGGAAAATTCGTCTCTCAATGAGCTCGCCTTCCGATTCTTCAGAACTATGGTTGCCGGAGAACGGCAA
TTTGTTTACGACAATTTCATGCAAGATGCAGATCATCTGCTTGATATGAAACAACAAAGTGAAGCCGAAGAATTGCCCTCTGCGTCGAAGCTTCAAGCCGCGGGAATCAA
ATTCAAGGATGCGAGAACTCCAAAGAGCGTTTTGGACATCAGATTTCAAAAAGGCGTCCTCCAAATTCCCCCTCTCAAAGTGTACCAGCGCACGGAAGCGATTCTGAGGA
ATCTCATCGCATATGAGATCTGTCAATCCGGAAGCGCTCAGCAAGTAAGATCGTATGTCAATTTCATGAGCCACCTTCTCCATTCCGACGAAGACGTGAAGCTGCTCTAT
GAACGTAGAATCCTGAGCGATCATGAGGACGATGAGACGCAGATTATTCGGAATCTGAAATGGTTAAGCCAGCAGAAGGCGAACTTATCGGGAACGTTCTTCGCCGGCGT
TGTTCAAAAACTAAACGAGCGGCCGGACCGATGCCTCCGGCGGTGGCGGCGGCTGAAAAGAAATCCGGCGGCAATCGGCGTTGCCGCATTTTGGGTGGTGGTTGTGATCT
TCGTGGCGGCCTTCTTTTCTGCACTTTCCTTCCTTCAGCACCGTTATAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCGTCAAGTGCATTTTCTCATTCGATTGATATTCCGGCAATTTCCCGAAAAAGATCCGAGGAAGAATCCCTCTTATCTTCCATTGAAGGAAAGTTGGAAGCCTT
CTGTTCATCCACTATTATCTTCAAAGCTCCTGACGAAATCAGTATCGACGACAGCGACAGAGACGTCTTCGTCCCCGCCAAAGTCTCAATCGGCCCTCTTCACCGCGGCG
CTCGACATCTGGAATCCATGGAAGATCAGAAGTGGCGCTACTTGTGCGCTTTCTTGAAGCGCAATCCGTCCGTCAGTTTAGATGATCTTCTGCAACTTATTGTAAAATCT
GAGAGCCGAGTGCGAAAATGCTATGAGGTAGAGTTGTTCGATCTCGACGGGGACAAGTTCGCGCAGATGATGTTGCTCGATTGCTGCTTCGTTCTTGAGCTTCTTTTGCG
ATACTCGACGAAGAGGCTGCGACGCCGGAACGATCCTGTTTTCACTACTCCTGGTTTGCTCTTCGATTTGAGGTGCGACATGATGCTACTCGAAAATCAGATTCCCTATT
TCCTTCTGAAAGACGTTTATGACAGTGTGCAAGATCAACTCCAGGAAAATTCGTCTCTCAATGAGCTCGCCTTCCGATTCTTCAGAACTATGGTTGCCGGAGAACGGCAA
TTTGTTTACGACAATTTCATGCAAGATGCAGATCATCTGCTTGATATGAAACAACAAAGTGAAGCCGAAGAATTGCCCTCTGCGTCGAAGCTTCAAGCCGCGGGAATCAA
ATTCAAGGATGCGAGAACTCCAAAGAGCGTTTTGGACATCAGATTTCAAAAAGGCGTCCTCCAAATTCCCCCTCTCAAAGTGTACCAGCGCACGGAAGCGATTCTGAGGA
ATCTCATCGCATATGAGATCTGTCAATCCGGAAGCGCTCAGCAAGTAAGATCGTATGTCAATTTCATGAGCCACCTTCTCCATTCCGACGAAGACGTGAAGCTGCTCTAT
GAACGTAGAATCCTGAGCGATCATGAGGACGATGAGACGCAGATTATTCGGAATCTGAAATGGTTAAGCCAGCAGAAGGCGAACTTATCGGGAACGTTCTTCGCCGGCGT
TGTTCAAAAACTAAACGAGCGGCCGGACCGATGCCTCCGGCGGTGGCGGCGGCTGAAAAGAAATCCGGCGGCAATCGGCGTTGCCGCATTTTGGGTGGTGGTTGTGATCT
TCGTGGCGGCCTTCTTTTCTGCACTTTCCTTCCTTCAGCACCGTTATAAATGA
Protein sequenceShow/hide protein sequence
MDPSSAFSHSIDIPAISRKRSEEESLLSSIEGKLEAFCSSTIIFKAPDEISIDDSDRDVFVPAKVSIGPLHRGARHLESMEDQKWRYLCAFLKRNPSVSLDDLLQLIVKS
ESRVRKCYEVELFDLDGDKFAQMMLLDCCFVLELLLRYSTKRLRRRNDPVFTTPGLLFDLRCDMMLLENQIPYFLLKDVYDSVQDQLQENSSLNELAFRFFRTMVAGERQ
FVYDNFMQDADHLLDMKQQSEAEELPSASKLQAAGIKFKDARTPKSVLDIRFQKGVLQIPPLKVYQRTEAILRNLIAYEICQSGSAQQVRSYVNFMSHLLHSDEDVKLLY
ERRILSDHEDDETQIIRNLKWLSQQKANLSGTFFAGVVQKLNERPDRCLRRWRRLKRNPAAIGVAAFWVVVVIFVAAFFSALSFLQHRYK