; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC11G210370 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC11G210370
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionDUF4149 domain-containing protein
Genome locationCiama_Chr11:21613171..21615452
RNA-Seq ExpressionCaUC11G210370
SyntenyCaUC11G210370
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR025423 - Domain of unknown function DUF4149


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578300.1 Transmembrane protein 205, partial [Cucurbita argyrosperma subsp. sororia]1.4e-15470.48Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF
        MTNLFALCL+IT+LTAAGLW SPSP+      QDVIVKEGHRVVVVEY +QGQHNTKVS SSEP                    +DSERH T+DLICDA 
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF

Query:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE
        GKCKHKIA+ V KAK  VSETAQEAHD       AFDEAKETVSDKSHH+GT F+EK H                             +LKEGAK+ LKE
Subjt:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE

Query:  AKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV
         KAR+ +  AME GREAR+TAEKIKTGGNK+KENLM I D G+ L+ DSFRYL S++S   AMDVL LLGF MALGMGVW TFISSYVLAS LPRQQLAV
Subjt:  AKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV

Query:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM
        VQSKIYP+YFRAMASSIGMAL GHLFS T WMFPIPK A+V+QGYVLVAALLMIFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA EPR+ ND PPA+
Subjt:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM

Query:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRF
        TT+T TQ V+ E VKSRI+GL+KRLKKLNSYSSLLNLLTLMALTWHLVYLSQRF
Subjt:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRF

KAG7015871.1 Transmembrane protein, partial [Cucurbita argyrosperma subsp. argyrosperma]7.4e-15369.43Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF
        M NLFALCL+IT+LTAAGLW SPSP+      QDVIVKEGHRVVVVEY +QGQHNTKVS SSEP                    +DSERH T+DLICDA 
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF

Query:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE
        GKCKHKIA+ V KAK  VSETAQEAHD       AFDEAKETVSDKSHH+GT F+EK H                             +LKEGAK+ LKE
Subjt:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE

Query:  AKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV
         KAR+ +  AME GREAR+TAEKIKTGGNK+KENLM I D G+ L+ DSFRYL S++S   AMDVL LLGF MALGMGVW TFISSYVLAS LPRQQLAV
Subjt:  AKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV

Query:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM
        VQSKIYP+YFRAMASSIGMAL GHLFS T WMFPIPK A+V+QGYVLVAALLMIFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA EPR+ +D PPA+
Subjt:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM

Query:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC
        TT+T TQ V+ E VKSRI+GL+KRLKKLNS SSLLNLLTLMALTWHLVYLSQR   PC
Subjt:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC

XP_022938890.1 uncharacterized protein LOC111444965 [Cucurbita moschata]1.8e-15469.87Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF
        M NLFALCL+IT+LTAAGLW SPSP+      QDVIVKEGHRVVVVEY +QGQHNTKVS SSEP                    +DSERH T+DLICDA 
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF

Query:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE
        GKCKHKIA+ V KAK  VSETAQEAHD       AFDEAKETVSDKSHH+GT F+EK H                             +LKEGAK+ LKE
Subjt:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE

Query:  AKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV
         KAR+ +  AME GREAR+TAEKIKTGGNK+KENLM I D G+ L+ DSFRYL S++S   AMDVL LLGF MALGMGVW TFISSYVLAS LPRQQLAV
Subjt:  AKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV

Query:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM
        VQSKIYP+YFRAMASSIGMAL GHLFS T WMFPIPK A+V+QGYVLVAALLMIFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA EPR+ ND PPA+
Subjt:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM

Query:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC
        TT+T TQ V+ E VKSRI+GL+KRLKKLNSYSSLLNLLTLMALTWHLVYLSQR   PC
Subjt:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC

XP_022992882.1 uncharacterized protein LOC111489083 [Cucurbita maxima]1.4e-15470.09Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF
        M NLFALCL+IT+LTAAGLW SPSP+      QDVIVKEGHRVVVVEY +QGQHNTKVS SSEP                    +DSERH T+DLICDA 
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF

Query:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE
        GKCKHKIA+ V KAK  VSETAQEAHD       AFDEAKETVSDKSHH+GT F+EK H                             +LKEGAK+ LKE
Subjt:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE

Query:  AKAREERL-AMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV
         KARE R  AME GREAR+TAEKIKTGGNK+KENL+ I   G+ L+ DSFRYLGS++S   AMDVL LLGF+MALGMGVW TFISSYVLAS LPRQQLAV
Subjt:  AKAREERL-AMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV

Query:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM
        VQSKIYP+YFRAMASSIGMAL GHLFS T WMFPIPK A+V+QGYVLVAALL IFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA EPR+ ND PPA+
Subjt:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM

Query:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC
        TT+T TQ VD E VKSRI+GL+KRLKKLNSYSSLLNLLTLMALTWHLVYLSQR   PC
Subjt:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC

XP_023550065.1 uncharacterized protein LOC111808367 isoform X1 [Cucurbita pepo subsp. pepo]1.3e-15267.51Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF
        M NLFALCL+IT+LTAAGLW SPSP+      QDVIVKEGHRVVVVEY +QGQHNTKVS SSEP                    +DSERH T+DLICDA 
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF

Query:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------------------
        GKCKHKIA+ V KAK  VSETAQEAHD       AFDEAKETVSDKSHH+GT F+EK H                                         
Subjt:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------------------

Query:  ----ELKEGAKETLKEAKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFI
            +LKEGAK+ LKE KAR+ +  AME GREAR+TAEKIKTGGNK+KENLM I D G+ L+ DSFRYL S++S   AMDVL LLGF MALGMGVW TFI
Subjt:  ----ELKEGAKETLKEAKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFI

Query:  SSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIE
        SSYVLAS LPRQQLAVVQSKIYP+YFRAMASSIGMAL GHLFS T WMFPIPK A+V+QGYVLVAALLMIFANSLYMEP+ATKVMFERLK+EKEEG+GIE
Subjt:  SSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIE

Query:  DIATEPRNVNDTPPAMTTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC
        DIA EPR+ ND PPA+TT+T TQ V+ E VKSRI+GL+KRLKKLNSYSSLLNLLTLMALTWHLVYLSQR   PC
Subjt:  DIATEPRNVNDTPPAMTTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC

TrEMBL top hitse value%identityAlignment
A0A0A0KSX1 DUF4149 domain-containing protein8.6e-13973.21Show/hide
Query:  LIITTLTAAGLWSSPSPSPSQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPVQD---SERHSTKDLICDAFGKCKHKIANVVEKAKATVSETAQEAHDA
        LI+TT TAAGLWS   P P Q+VIVKEGHR+VVVEYD+QGQHNTKVS SSEP QD   SERH TKDLICD +GKCKHK+A+ VEKAK  V+ETAQEAHD 
Subjt:  LIITTLTAAGLWSSPSPSPSQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPVQD---SERHSTKDLICDAFGKCKHKIANVVEKAKATVSETAQEAHDA

Query:  FDEAKETVSDKSHHMGTLFTEKAHELKEGAKETLKEAKAREERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQSRAMDVLG
             E+V+D        F     +LKEGAKETL+ AK+REE++     R A+ET EKIKTG NKLKENLM +VDRG  +I   FR+LG      MD LG
Subjt:  FDEAKETVSDKSHHMGTLFTEKAHELKEGAKETLKEAKAREERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQSRAMDVLG

Query:  LLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATK
        LLGF MALGMGVW TFISSYVLASVLPRQQL VVQSKIYPVYF+AMAS IGMALLGHLFS T W FPIPK ++V+QGYVLVAALLMIFANSLYMEPRATK
Subjt:  LLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATK

Query:  VMFERLKLEKEEGRGIEDIATEPR-NVNDTPPAMTTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC
        VMFERLK+EKEEGRGIEDIA E   NV D  PA+T++TPTQ VD EVVKSRI+GL+KRLKKLNSYSSLLNLLTLMALTWHLVYLSQR  NPC
Subjt:  VMFERLKLEKEEGRGIEDIATEPR-NVNDTPPAMTTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC

A0A1S3CA56 uncharacterized protein LOC1034986956.2e-14569.41Show/hide
Query:  MYFGVHPH----LNLKMTNLFALCLIITTLTAAGLWSSPSPSPSQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEP---VQDSERHSTKDLICDAFGKCK
        MYF +HP+    L+  MTNLFA+ LIITTLTAAGLWS   P P Q+VIVKEGHRVVVVEYD+QGQHNTKVS SSEP    ++SERH TKDLICD +GKCK
Subjt:  MYFGVHPH----LNLKMTNLFALCLIITTLTAAGLWSSPSPSPSQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEP---VQDSERHSTKDLICDAFGKCK

Query:  HKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSD-------------KSHHMGT--LFTEKAHELKEGAKETLKEAKAREERLAMEIGREARE
        HK+A+ VEKAK  V+ETAQEAHD       AFDEAK+ + +             K    G    F E   +LKEGAKETL+ AK+REE++     R A+E
Subjt:  HKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSD-------------KSHHMGT--LFTEKAHELKEGAKETLKEAKAREERLAMEIGREARE

Query:  TAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQSRAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMAL
        T EKI+TG NKLKENLM +VDRG  ++   FR+LG      MD LGLLGFAMALGMGVW TFISSYVLASVLPRQQL VVQSKIYPVYF+AMAS IGMAL
Subjt:  TAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQSRAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMAL

Query:  LGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPR-NVNDTPPAMTTTTPTQ-VDLEVVKSRIMG
        LGHLFS T W FPIPK ++V+QGYVLVAALLMIFANSLYMEPRATKVMFERLK+EKEEGRGIEDI  E   NV D  PA+T++TPTQ VD EVVKSRI+G
Subjt:  LGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPR-NVNDTPPAMTTTTPTQ-VDLEVVKSRIMG

Query:  LSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC
        L+KRLKKLNSYSSLLNLLTLMALTWHLVYLSQR  NPC
Subjt:  LSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC

A0A5D3BMZ0 DUF4149 domain-containing protein2.2e-14270.64Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSPSQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPVQD---SERHSTKDLICDAFGKCKHKIANVVEKAKATVSE
        MTNLFA+ LI+TTLTAAGLWS   P P Q+VIVKEGHRVVVVEYD+QGQHNTKVS SSEP  D   SERH TKDLICD +GKCKHK+A+ VEKAK  V+E
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSPSQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPVQD---SERHSTKDLICDAFGKCKHKIANVVEKAKATVSE

Query:  TAQEAHD-------AFDEAKETVSDKSHH------------MGTLFTEKAHELKEGAKETLKEAKAREERLAMEIGREARETAEKIKTGGNKLKENLMNI
        TAQEAHD       AFDEAK+ + + +                  F E   +LKEGAKETL+ AK+REE++     R A+ET EKI+TG NKLKENLM +
Subjt:  TAQEAHD-------AFDEAKETVSDKSHH------------MGTLFTEKAHELKEGAKETLKEAKAREERLAMEIGREARETAEKIKTGGNKLKENLMNI

Query:  VDRGVMLIKDSFRYLGSVQSRAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQ
        VDRG  ++   FR+LG      MD LGLLGFAMALGMGVW TFISSYVLASVLPRQQL VVQSKIYPVYF+AMAS IGMALLGHLFS T W FPIPK ++
Subjt:  VDRGVMLIKDSFRYLGSVQSRAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQ

Query:  VLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPR-NVNDTPPAMTTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLT
        V+QGYVLVAALLMIFANSLYMEPRATKVMFERLK+EKEEGRGIEDI  E   NV D  PA+T++TPTQ VD EVVKSRI+GL+KRLKKLNSYSSLLNLLT
Subjt:  VLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPR-NVNDTPPAMTTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLT

Query:  LMALTWHLVYLSQRFHNPC
        LMALTWHLVYLSQR  NPC
Subjt:  LMALTWHLVYLSQRFHNPC

A0A6J1FL32 uncharacterized protein LOC1114449658.6e-15569.87Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF
        M NLFALCL+IT+LTAAGLW SPSP+      QDVIVKEGHRVVVVEY +QGQHNTKVS SSEP                    +DSERH T+DLICDA 
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF

Query:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE
        GKCKHKIA+ V KAK  VSETAQEAHD       AFDEAKETVSDKSHH+GT F+EK H                             +LKEGAK+ LKE
Subjt:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE

Query:  AKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV
         KAR+ +  AME GREAR+TAEKIKTGGNK+KENLM I D G+ L+ DSFRYL S++S   AMDVL LLGF MALGMGVW TFISSYVLAS LPRQQLAV
Subjt:  AKARE-ERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV

Query:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM
        VQSKIYP+YFRAMASSIGMAL GHLFS T WMFPIPK A+V+QGYVLVAALLMIFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA EPR+ ND PPA+
Subjt:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM

Query:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC
        TT+T TQ V+ E VKSRI+GL+KRLKKLNSYSSLLNLLTLMALTWHLVYLSQR   PC
Subjt:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC

A0A6J1JR64 uncharacterized protein LOC1114890836.6e-15570.09Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF
        M NLFALCL+IT+LTAAGLW SPSP+      QDVIVKEGHRVVVVEY +QGQHNTKVS SSEP                    +DSERH T+DLICDA 
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSP----SQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPV-------------------QDSERHSTKDLICDAF

Query:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE
        GKCKHKIA+ V KAK  VSETAQEAHD       AFDEAKETVSDKSHH+GT F+EK H                             +LKEGAK+ LKE
Subjt:  GKCKHKIANVVEKAKATVSETAQEAHD-------AFDEAKETVSDKSHHMGTLFTEKAH-----------------------------ELKEGAKETLKE

Query:  AKAREERL-AMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV
         KARE R  AME GREAR+TAEKIKTGGNK+KENL+ I   G+ L+ DSFRYLGS++S   AMDVL LLGF+MALGMGVW TFISSYVLAS LPRQQLAV
Subjt:  AKAREERL-AMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQS--RAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAV

Query:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM
        VQSKIYP+YFRAMASSIGMAL GHLFS T WMFPIPK A+V+QGYVLVAALL IFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA EPR+ ND PPA+
Subjt:  VQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAM

Query:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC
        TT+T TQ VD E VKSRI+GL+KRLKKLNSYSSLLNLLTLMALTWHLVYLSQR   PC
Subjt:  TTTTPTQ-VDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC

SwissProt top hitse value%identityAlignment
A1L2F6 Transmembrane protein 2055.4e-0525.27Show/hide
Query:  VLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPR
        VL LL  +   GM VW +FI+ +VL S +      +VQSK++PVYF  +     ++L  +   H   +    +  Q+   +V   A++M   N+ +  P 
Subjt:  VLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPR

Query:  ATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLS
        AT+ M    ++EKE G G                 M++       L     +         + +  S+L NL+    +T +L+YL+
Subjt:  ATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLS

Q32L10 Transmembrane protein 2051.7e-0628.27Show/hide
Query:  VLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPR
        V+ LL  + A GM +W TFIS +VL   LPR    +VQSK++P YF     S+G A +      +   +      +  Q ++L+ +L +   N+ ++E R
Subjt:  VLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPR

Query:  ATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHN
         T  M+    +EKE G G            + P +   + P    L     +   L ++  + +  SSL NL  L++   HL  L+   HN
Subjt:  ATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHN

Q5REM8 Transmembrane protein 2052.6e-0733.33Show/hide
Query:  LLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATK
        LL  + A GM +W TF+S ++L   LPR    +VQSK++P YF     S+G A +      +   +      +  Q Y+L  +L +   N+ ++EPR T 
Subjt:  LLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATK

Query:  VMFERLKLEKEEGRGIE
         M+    +EKE G G E
Subjt:  VMFERLKLEKEEGRGIE

Q6UW68 Transmembrane protein 2052.0e-0733.33Show/hide
Query:  LLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATK
        LL  + A GM +W TF+S ++L   LPR    +VQSK++P YF     S+G A +      +   +      +  Q Y+L  +L +   N+ ++EPR T 
Subjt:  LLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATK

Query:  VMFERLKLEKEEGRGIE
         M+    +EKE G G E
Subjt:  VMFERLKLEKEEGRGIE

Q91XE8 Transmembrane protein 2056.8e-0835.83Show/hide
Query:  VLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPR
        V+ LL  + A GM VW TFIS ++L   LPR    +VQSK++PVYF     S+G A +          +      +V Q  +L+ +L +   N+ ++E R
Subjt:  VLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPR

Query:  ATKVMFERLKLEKEEGRGIE
         T VM     +EKE G G E
Subjt:  ATKVMFERLKLEKEEGRGIE

Arabidopsis top hitse value%identityAlignment
AT1G22600.1 Late embryogenesis abundant protein (LEA) family protein3.9e-5136Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSPSQDVIVKEGHRVVVVEYDNQGQHNTKV--------------------------SFSSEPVQDS-ERHSTK-DL
        MT L AL L+++ L   G           D+IV++GHRVVVVEYD  G+ NT+V                          + SS P ++  E H+T  +L
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSPSQDVIVKEGHRVVVVEYDNQGQHNTKV--------------------------SFSSEPVQDS-ERHSTK-DL

Query:  ICDAFGKCKHKIANVVEKAKATVSETAQEAHDAFDEAKETVSDKSHHMGTLFTEKAHELKEGAKETLKEAKAREERLAMEIGREARETAEKIKTGGNKLK
        ICDA GKCKHK+  V+ + K              D     +SD++  M      +A E++E      +EA+ +    A +     ++  EK++       
Subjt:  ICDAFGKCKHKIANVVEKAKATVSETAQEAHDAFDEAKETVSDKSHHMGTLFTEKAHELKEGAKETLKEAKAREERLAMEIGREARETAEKIKTGGNKLK

Query:  ENLMNIVDRGVMLIKDSFRYLGSVQSRAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFP
             I  RG+  +  +   L  + S    V+G++G A A GM VW TF+S YVLASVL  QQ  VVQSK+YPVYF+A++  I + LLGH+      +F 
Subjt:  ENLMNIVDRGVMLIKDSFRYLGSVQSRAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFP

Query:  IPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIE--DIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSS
              + Q   L++++LM+ AN+ ++  RATK MFE +K EKE+GRG +  D +    +   T      T  T  D +VVK R+  LS+R++KLN+YSS
Subjt:  IPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIE--DIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSS

Query:  LLNLLTLMALTWHLVYLSQRFHNPC
         LNLLTLM+LTWH VYL  R    C
Subjt:  LLNLLTLMALTWHLVYLSQRFHNPC

AT1G72100.1 late embryogenesis abundant domain-containing protein / LEA domain-containing protein1.2e-6036.44Show/hide
Query:  MTNLFALCLIITTLTAAGLWSSPSPSPSQ---------DVIVKEGHRVVVVEYDNQGQHNTKVSFS----------------------------------
        MTNL ALCL+++TL AA +W SPSP+ +          +VIVK+GH VVVVEYD  G+ NT+VS S                                  
Subjt:  MTNLFALCLIITTLTAAGLWSSPSPSPSQ---------DVIVKEGHRVVVVEYDNQGQHNTKVSFS----------------------------------

Query:  --------SEPVQDSE---RHSTK-DLICDAFGKCKHKIANVVEKAK----ATVSETAQE-----AHDAFD----------EAKETVSDKSHHMGTLFTE
                S+PV   E    H+T  ++ICDAFGKC+ KIA+VV +AK     +V ETA +     AH A D          + ++TV+D++ +     TE
Subjt:  --------SEPVQDSE---RHSTK-DLICDAFGKCKHKIANVVEKAK----ATVSETAQE-----AHDAFD----------EAKETVSDKSHHMGTLFTE

Query:  KAHELKEG-------AKETL-------KEAKAREERLAME------------IGREARETAE----KIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSV
        KAH+ KEG       AKE++       KE+ A++   A E            + ++A E+ E    +++    +LKE   +        +K+  R  GS 
Subjt:  KAHELKEG-------AKETL-------KEAKAREERLAME------------IGREARETAE----KIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSV

Query:  ------QSRAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALL
               ++   ++GL G A A G  VW TF+SSYVLASVL RQQ  VVQSK+YPVYF+A +  I + L GH+ S    +  +    ++ QG  L+++  
Subjt:  ------QSRAMDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALL

Query:  MIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQR
        MI AN  ++EPRATK MFER+K EKEEGRG E                      +   + ++ ++  LS+RL KLN+YSS LN+LTLM+LTWH VYL QR
Subjt:  MIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQR

Query:  FHNPC
            C
Subjt:  FHNPC

AT3G62580.1 Late embryogenesis abundant protein (LEA) family protein5.4e-0824.07Show/hide
Query:  GVMLIKDSFRYLGSVQSRA-----MDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKT
        GV+   ++F  L + ++ A     + +  LL FA A G  +W TFI   ++   LPR Q   +QSK++P YF  + S   ++L    + H    +    T
Subjt:  GVMLIKDSFRYLGSVQSRA-----MDVLGLLGFAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKT

Query:  AQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSSLLNLLT
         +  Q   L++A      N     P    +M +R K+E+E   G E   ++ R    + P                 ++  ++K+   ++  SSL N+ +
Subjt:  AQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEGRGIEDIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSSLLNLLT

Query:  LMALTWHLVYLSQRFH
          +L  H  YL+ + +
Subjt:  LMALTWHLVYLSQRFH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTTTGGTGTTCATCCACATCTTAATTTGAAGATGACCAACCTATTCGCTTTGTGTCTCATCATCACTACTTTAACTGCGGCGGGACTGTGGTCTTCTCCTTCTCC
TTCTCCTTCGCAAGATGTTATTGTTAAAGAAGGCCACCGAGTGGTTGTGGTCGAGTACGACAACCAAGGTCAACACAATACTAAGGTTTCCTTCTCTTCCGAACCCGTCC
AAGATTCGGAAAGGCACAGCACCAAAGATCTTATTTGCGATGCCTTCGGAAAATGTAAGCATAAGATAGCCAATGTTGTAGAGAAAGCTAAAGCAACGGTTTCGGAGACG
GCGCAGGAGGCCCACGACGCATTCGATGAAGCCAAAGAGACGGTTTCAGACAAATCCCACCACATGGGAACGTTGTTCACAGAGAAAGCACATGAATTGAAAGAGGGTGC
AAAGGAAACATTGAAAGAAGCCAAAGCAAGAGAAGAACGGCTGGCAATGGAGATAGGGAGAGAAGCAAGAGAAACTGCAGAGAAAATTAAAACTGGGGGAAACAAGCTTA
AGGAGAATCTGATGAATATAGTTGATAGAGGAGTAATGCTAATAAAGGATTCATTTAGGTACTTGGGTTCGGTACAGTCGCGGGCGATGGATGTGTTGGGTCTGTTGGGA
TTTGCTATGGCTTTGGGAATGGGTGTGTGGAATACCTTCATCTCTAGCTATGTGCTGGCGAGTGTGCTGCCAAGGCAGCAGTTGGCAGTGGTACAGAGCAAGATATATCC
TGTGTATTTTAGGGCCATGGCTTCCTCCATTGGGATGGCCTTATTAGGCCATCTTTTCAGCCACACACACTGGATGTTTCCAATTCCAAAAACTGCTCAAGTGCTACAAG
GATATGTACTTGTGGCTGCACTTCTGATGATCTTTGCCAATTCTCTCTACATGGAGCCTAGAGCCACCAAGGTAATGTTTGAGAGATTAAAGCTTGAAAAGGAAGAAGGA
AGAGGAATTGAGGACATAGCCACTGAACCTCGGAATGTGAATGATACTCCCCCAGCAATGACAACCACCACACCCACACAAGTCGACTTAGAGGTGGTGAAATCCAGAAT
CATGGGGTTGAGTAAAAGGCTAAAGAAACTCAATTCATATTCCTCCTTGTTAAACCTCCTCACTCTCATGGCTCTCACTTGGCATCTTGTGTACCTAAGCCAGCGTTTTC
ACAATCCCTGCTAA
mRNA sequenceShow/hide mRNA sequence
TCCAGCCCTAAACTAAATATCCATTGCTACTTTTCTATTCTTTAATCCTTTCGTTGCCGCCTTAGCCACGACGATCAGTTTGTTATTTTCAGTATAAAGAAATAGCAACG
TGTCAAAGGTTTGCTGAGACACGTGAGGGGGAGCAGAAAGCAAAAGTCTTATTGTTCTTACTTAATGCAAACGTTGAAATGTAATATGCAAACATTTGTCTCTCTATACA
CACTACGCGTCATTTGTGTCTTTATAAATGTATTTTGGTGTTCATCCACATCTTAATTTGAAGATGACCAACCTATTCGCTTTGTGTCTCATCATCACTACTTTAACTGC
GGCGGGACTGTGGTCTTCTCCTTCTCCTTCTCCTTCGCAAGATGTTATTGTTAAAGAAGGCCACCGAGTGGTTGTGGTCGAGTACGACAACCAAGGTCAACACAATACTA
AGGTTTCCTTCTCTTCCGAACCCGTCCAAGATTCGGAAAGGCACAGCACCAAAGATCTTATTTGCGATGCCTTCGGAAAATGTAAGCATAAGATAGCCAATGTTGTAGAG
AAAGCTAAAGCAACGGTTTCGGAGACGGCGCAGGAGGCCCACGACGCATTCGATGAAGCCAAAGAGACGGTTTCAGACAAATCCCACCACATGGGAACGTTGTTCACAGA
GAAAGCACATGAATTGAAAGAGGGTGCAAAGGAAACATTGAAAGAAGCCAAAGCAAGAGAAGAACGGCTGGCAATGGAGATAGGGAGAGAAGCAAGAGAAACTGCAGAGA
AAATTAAAACTGGGGGAAACAAGCTTAAGGAGAATCTGATGAATATAGTTGATAGAGGAGTAATGCTAATAAAGGATTCATTTAGGTACTTGGGTTCGGTACAGTCGCGG
GCGATGGATGTGTTGGGTCTGTTGGGATTTGCTATGGCTTTGGGAATGGGTGTGTGGAATACCTTCATCTCTAGCTATGTGCTGGCGAGTGTGCTGCCAAGGCAGCAGTT
GGCAGTGGTACAGAGCAAGATATATCCTGTGTATTTTAGGGCCATGGCTTCCTCCATTGGGATGGCCTTATTAGGCCATCTTTTCAGCCACACACACTGGATGTTTCCAA
TTCCAAAAACTGCTCAAGTGCTACAAGGATATGTACTTGTGGCTGCACTTCTGATGATCTTTGCCAATTCTCTCTACATGGAGCCTAGAGCCACCAAGGTAATGTTTGAG
AGATTAAAGCTTGAAAAGGAAGAAGGAAGAGGAATTGAGGACATAGCCACTGAACCTCGGAATGTGAATGATACTCCCCCAGCAATGACAACCACCACACCCACACAAGT
CGACTTAGAGGTGGTGAAATCCAGAATCATGGGGTTGAGTAAAAGGCTAAAGAAACTCAATTCATATTCCTCCTTGTTAAACCTCCTCACTCTCATGGCTCTCACTTGGC
ATCTTGTGTACCTAAGCCAGCGTTTTCACAATCCCTGCTAATACAGCAATATTTTCCTTTCCTACTGTTCTTAGGTGTCAAACTTGTACTAGGAGTGTTATCTTTCTAAT
GTATTTCAGATTGTTTATTTTCTGTTTAATAGTTTGTTCATTAGTATCTACAAGTTAGTTGTTCTGTTATTTGAGAGTTCAGCCTATAAATAACTAACAACAAAAACTAA
AAATACAACGGTGAATTGATATCAACCTAATAGTGGAGTCAAATTACTATTTTTAATGTTACAAAATGGAGGATCCCCATTTAAGTGGTCGCTACAATCAGCACATTCAT
AAACAGAACGAACGAGCAGTAGCTGATCATCCAAGTACCTCTGTAATAAGCCGATCTCCTCCCCACTGTCACCGCCAATCTGCTCCGGGAGCGGTGGTCTGCAGCCGCCT
TCCTCACATTCCTGCAAGCGTTCGATTTCTGGTGAATCGCGAAGATATATCCTTCTCCTGTGACGAGCAAGGCGGATGCAGAGATCTATGGCAAAGATGAAGGGGAAAGT
AGCGCAGAAAATGGGAATGAAGAGAGGGGAACAAACCAAGAGAAAGAGAAAGCGGCTACGGCTCCGGCTCCGGCTGGAATGAAGCCAGGAAGAGGCGGCGGCTAACATGA
TGAGTGGCGGATCTGTGGGGAAGAAGGAGAGGTAGGGGATCGAAGACATCCAGCTTGGAAGATGAATTTCATGGTGCGAGGAGCGGGGAGAAGAAGAGAATTAAGGGGAT
TGGGGTTCTCGATGAACATGCCTTTTCCTTTTCAAATCCAAACCTAAATGCTGTTTCATTTTATATAATTAACTCATTAATC
Protein sequenceShow/hide protein sequence
MYFGVHPHLNLKMTNLFALCLIITTLTAAGLWSSPSPSPSQDVIVKEGHRVVVVEYDNQGQHNTKVSFSSEPVQDSERHSTKDLICDAFGKCKHKIANVVEKAKATVSET
AQEAHDAFDEAKETVSDKSHHMGTLFTEKAHELKEGAKETLKEAKAREERLAMEIGREARETAEKIKTGGNKLKENLMNIVDRGVMLIKDSFRYLGSVQSRAMDVLGLLG
FAMALGMGVWNTFISSYVLASVLPRQQLAVVQSKIYPVYFRAMASSIGMALLGHLFSHTHWMFPIPKTAQVLQGYVLVAALLMIFANSLYMEPRATKVMFERLKLEKEEG
RGIEDIATEPRNVNDTPPAMTTTTPTQVDLEVVKSRIMGLSKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRFHNPC