; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010221 (gene) of Chayote v1 genome

Gene IDSed0010221
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF1005)
Genome locationLG07:8641055..8642884
RNA-Seq ExpressionSed0010221
SyntenySed0010221
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014924.1 hypothetical protein SDJN02_22555, partial [Cucurbita argyrosperma subsp. argyrosperma]7.9e-20983.78Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG
        MDPCPFLR+LVGNLA+KFP+AA+PSFSGVHPS+SPCFCKIKL+ FPTQFATVPL++ G+T     SSS LAACF+LNKS I+ L+SKRKD  +KIEV+TG
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG

Query:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF
        R     CG  + G+SA+LLGRIV+P+T  S +ETKP VFQNGWT I GGRKGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVFTCKF
Subjt:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF

Query:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW
        GFRNERDWDRSRSSI+EQSSTS+SWLPKI SE+DQ AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW

Query:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE
        RESGGSDSIG RFELLPATSAAATLA S ISSS+GGKFTID+TGSASP ISPNGSFDLGS SGSRPGSGDFGYL++ YQYKGFVMS  V+GMKKKNRR E
Subjt:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE

Query:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        VEVAVQHVTCTEDAA FVALAAA++LSMDACRLFSQKLRKELRQ
Subjt:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

XP_022922983.1 uncharacterized protein LOC111430804 [Cucurbita moschata]2.1e-20984.01Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG
        MDPCPFLR+LVGNLA+KFP+AA+PSFSGVHPS+SPCFCKIKL+ FPTQFATVPL++ G+T     SSS LAACF+LNKS I+ L+SKRKD  +KIEV+TG
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG

Query:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF
        R     CG  + G+SA+LLGRIV+P+T  S +ETKP VFQNGWT I GGRKGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVFTCKF
Subjt:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF

Query:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW
        GFRNERDWDRSRSSI+EQSSTS+SWLPKI SE+DQ AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW

Query:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE
        RESGGSDSIG RFELLPATSAAATLA S ISSS+GGKFTID+TGSASP ISPNGSFDLGS SGSRPGSGDFGYL++ YQYKGFVMS  V+GMKKKNRR E
Subjt:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE

Query:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        VEVAVQHVTCTEDAA FVALAAA+DLSMDACRLFSQKLRKELRQ
Subjt:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

XP_022989454.1 uncharacterized protein LOC111486504 [Cucurbita maxima]5.1e-20883.78Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGD-----------TSSSSLAACFTLNKSHIQSLLSKRKDPSLK
        MDPC FLR+LVGNLA+KFPVAA+PSFSGVHPS+SPCFCKIKL  FPTQF TVPLL+ G+           +S SSLAACF+LNKS I+ L+SKRKD S+K
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGD-----------TSSSSLAACFTLNKSHIQSLLSKRKDPSLK

Query:  IEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQP
        IEVFTG R  A+CG  +L +SA+LLGRIV+PITA S AETKP +FQNGWT IG G++G SSAQLHLTVRAEPDPRF+FRFDGEPECSPQVFQVQG+V QP
Subjt:  IEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQP

Query:  VFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPW
        VFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQ A ERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW+PW
Subjt:  VFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPW

Query:  GRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKK
        GRLEAWRESGGSDSIG RFELLP  SAAA LATS ISSSAGGKFTID TGSASP ISPNGSFDL S SGSRPGSGDFGYLSS YQYKGFVMS TV+GMKK
Subjt:  GRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKK

Query:  KNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        ++RRPEVEVAVQHVTCTEDAA FVALAAA+DLSMDACRLFSQKLRKELRQ
Subjt:  KNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

XP_023516135.1 uncharacterized protein LOC111780086 [Cucurbita pepo subsp. pepo]1.3e-20883.81Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS------------SSLAACFTLNKSHIQSLLSKRKDPSL
        MDPC FLR+LVGNLA+KFPVAA+PSFSGVHPS+SPCFCKIKL  FPTQF TVPLL+ G+TS             SSLA+CF+LNKS I+ L+SKRKD S+
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS------------SSLAACFTLNKSHIQSLLSKRKDPSL

Query:  KIEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQ
        KIEVFTG R  A+CG  +L +SA+LLGRIV+PITA S AETKP +FQNGWT IG G++G SSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQG+V Q
Subjt:  KIEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQ

Query:  PVFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQP
        PVFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQ A ERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW+P
Subjt:  PVFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQP

Query:  WGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMK
        WGRLEAWRESGGSDSIG RFELLP  SAAA LATS ISSSAGGKFTID TGSASP ISPNGSFDL S SGSRPGSGDFGYLSS YQYKGFVMS TV+GMK
Subjt:  WGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMK

Query:  KKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        K++RRPEVEVAVQHVTCTEDAA FVALAAA+DLSMDACRLFSQKLRKELRQ
Subjt:  KKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

XP_023553155.1 uncharacterized protein LOC111810646 [Cucurbita pepo subsp. pepo]1.8e-20883.78Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG
        MDPCPFLR+LVGNLA+KFPVAA+PSFSGVHPS+SPCFCKIKL+ FPTQFATVPL++ G+T     SSS LAACF+LNKS I+ L+SKRKD  +KIEV+TG
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG

Query:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF
        R     CG  + G+SA+LLGRIV+P+T  S +ETKP VFQNGWT I GG KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVFTCKF
Subjt:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF

Query:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW
        GFRNERDWDRSRSSI+EQSSTS+SWLPKI SE+DQ AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW

Query:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE
        RESGGSDSIG RFELLPATSAAATLA S ISSS+GGKFTID+TGSASP ISPNGSFDLGS SGSRPGSGDFGYL++ YQYKGFVMS  V+GMKKKNR+ E
Subjt:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE

Query:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        VEVAVQHVTCTEDAA FVALAAA+DLSMDACRLFSQKLRKELRQ
Subjt:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

TrEMBL top hitse value%identityAlignment
A0A1S3BYK8 uncharacterized protein LOC1034948384.0e-20681.28Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS---------------SSLAACFTLNKSHIQSLLSKRKD
        MDPCPFLR+LVGNLA+KFPVAA+PSFSGVHPS SPCFCKIKL  FPTQF T+PLL+ G+ S                SSLAACF+LNKS I+ L+ KRKD
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS---------------SSLAACFTLNKSHIQSLLSKRKD

Query:  PSLKIEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN
         S+KIEV+TGR   A C   V G+SA+LLGRI +P+T    +ETKP VFQNGWT IG G+KGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQVQG+
Subjt:  PSLKIEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN

Query:  VRQPVFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
        V+QPVFTCKFGFRNERDWDRSRSSI+EQSSTSKSWLPKIRSE+DQ AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Subjt:  VRQPVFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS

Query:  WQPWGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVD
        W+PWGRLEAWRESGGSDSIG RFELLPATSAAATLA S ISS +GG+FTID TGSASPAISPNGSFDLGS +GSRPGSGDFGYL+  YQYKGFVMS  V+
Subjt:  WQPWGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVD

Query:  GMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        GMKKK+RRPEVEV VQHVTCTEDAA FVALAAA+DLSMDACRLFSQKLRKELRQ
Subjt:  GMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

A0A5D3E122 Formin-like protein 184.0e-20681.28Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS---------------SSLAACFTLNKSHIQSLLSKRKD
        MDPCPFLR+LVGNLA+KFPVAA+PSFSGVHPS SPCFCKIKL  FPTQF T+PLL+ G+ S                SSLAACF+LNKS I+ L+ KRKD
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS---------------SSLAACFTLNKSHIQSLLSKRKD

Query:  PSLKIEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN
         S+KIEV+TGR   A C   V G+SA+LLGRI +P+T    +ETKP VFQNGWT IG G+KGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQVQG+
Subjt:  PSLKIEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN

Query:  VRQPVFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
        V+QPVFTCKFGFRNERDWDRSRSSI+EQSSTSKSWLPKIRSE+DQ AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Subjt:  VRQPVFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS

Query:  WQPWGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVD
        W+PWGRLEAWRESGGSDSIG RFELLPATSAAATLA S ISS +GG+FTID TGSASPAISPNGSFDLGS +GSRPGSGDFGYL+  YQYKGFVMS  V+
Subjt:  WQPWGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVD

Query:  GMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        GMKKK+RRPEVEV VQHVTCTEDAA FVALAAA+DLSMDACRLFSQKLRKELRQ
Subjt:  GMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

A0A6J1E8C1 uncharacterized protein LOC1114308041.0e-20984.01Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG
        MDPCPFLR+LVGNLA+KFP+AA+PSFSGVHPS+SPCFCKIKL+ FPTQFATVPL++ G+T     SSS LAACF+LNKS I+ L+SKRKD  +KIEV+TG
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG

Query:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF
        R     CG  + G+SA+LLGRIV+P+T  S +ETKP VFQNGWT I GGRKGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVFTCKF
Subjt:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF

Query:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW
        GFRNERDWDRSRSSI+EQSSTS+SWLPKI SE+DQ AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW

Query:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE
        RESGGSDSIG RFELLPATSAAATLA S ISSS+GGKFTID+TGSASP ISPNGSFDLGS SGSRPGSGDFGYL++ YQYKGFVMS  V+GMKKKNRR E
Subjt:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE

Query:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        VEVAVQHVTCTEDAA FVALAAA+DLSMDACRLFSQKLRKELRQ
Subjt:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

A0A6J1JBG6 uncharacterized protein LOC1114829431.2e-20783.56Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG
        MDPCPFLR+LVGNLA+KFPVAA+PSFSGVHPS+SPCFCKIKL+ FPTQFATVPL++ G+T     SSS LAACF+LNKS I+ L+SKRKD  +KIEV+TG
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT-----SSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTG

Query:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF
        R     CG  + G+SA+LLGRIV+P+T  S +ETKP VF NGWT I GG KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVFTCKF
Subjt:  RRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKF

Query:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW
        GFRNERDWDRSRSSI+EQSSTS+SWLPKI SE+DQ AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAW

Query:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE
        RESGGSDSIG RFELLPATSAAATLA S ISSS+GGKFTID+TGSASP ISPN SFDLGS SGSRPGSGDFGYL++ YQYKGFVMS  V+GMKKKNRR E
Subjt:  RESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPE

Query:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        VEVAVQHVTCTEDAA FVALAAA+DLSMDACRLFSQKLRKELRQ
Subjt:  VEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

A0A6J1JPD8 uncharacterized protein LOC1114865042.5e-20883.78Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGD-----------TSSSSLAACFTLNKSHIQSLLSKRKDPSLK
        MDPC FLR+LVGNLA+KFPVAA+PSFSGVHPS+SPCFCKIKL  FPTQF TVPLL+ G+           +S SSLAACF+LNKS I+ L+SKRKD S+K
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGD-----------TSSSSLAACFTLNKSHIQSLLSKRKDPSLK

Query:  IEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQP
        IEVFTG R  A+CG  +L +SA+LLGRIV+PITA S AETKP +FQNGWT IG G++G SSAQLHLTVRAEPDPRF+FRFDGEPECSPQVFQVQG+V QP
Subjt:  IEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQP

Query:  VFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPW
        VFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQ A ERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW+PW
Subjt:  VFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPW

Query:  GRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKK
        GRLEAWRESGGSDSIG RFELLP  SAAA LATS ISSSAGGKFTID TGSASP ISPNGSFDL S SGSRPGSGDFGYLSS YQYKGFVMS TV+GMKK
Subjt:  GRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKK

Query:  KNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        ++RRPEVEVAVQHVTCTEDAA FVALAAA+DLSMDACRLFSQKLRKELRQ
Subjt:  KNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)1.1e-12856.37Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT---SSSSLAACFTLNKSHIQSLLSK---RKDPSLKIEVFT
        MDPCPF+RL +GNLA+K P+AA+ + S VHPS+SPCFCKIKL  FP Q A +P +    T      +LAA F L+ S IQ L S+      P LKI ++T
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDT---SSSSLAACFTLNKSHIQSLLSK---RKDPSLKIEVFT

Query:  GRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIG-GGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTC
        GR   AACG      S RLL ++ +P+  LS  ++KP VF NGW  +G G  K  SSAQ HL V+AEPDPRFVF+FDGEPECSPQV Q+QGN+RQPVFTC
Subjt:  GRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIG-GGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTC

Query:  KFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLE
        KF  R+  D  +   S+  ++S S+SWL    SE+++P KERKGWSIT+HDLSGSPVA AS+VTPFV SPG+ RVSRSNPG+WLILRP D +W+PWGRLE
Subjt:  KFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLE

Query:  AWRESGG-SDSIGSRFELLP--ATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFD-----LGSRSGS----------RPGSGDFGYLSSPYQ-
        AWRE GG +D +G RFEL+P  ++ A   LA S ISS  GGKF+I+  GS+  + SP    +      G  SGS          R GSGD+GY   P+  
Subjt:  AWRESGG-SDSIGSRFELLP--ATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFD-----LGSRSGS----------RPGSGDFGYLSSPYQ-

Query:  YKGFVMSATVDGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKEL
        YKGFVMSA+V+G + K  +P VEV+VQHV+C EDAAA+VAL+AA+DLSMDACRLF+Q++RKEL
Subjt:  YKGFVMSATVDGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKEL

AT1G50040.1 Protein of unknown function (DUF1005)1.1e-11553.18Show/hide
Query:  MDPCPFLRLLVGNLAIKFP----------VAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS---------SSLAACFTLNKSHIQSLLS
        MDPC F+R++VGNLA++FP           ++ PS S V  S+  C+CKIK   FP Q  +VP+LL  ++ S         S++AACF+L+KS I++ L 
Subjt:  MDPCPFLRLLVGNLAIKFP----------VAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS---------SSLAACFTLNKSHIQSLLS

Query:  KRKDPSLKIEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIG----GGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSP
        K K   L +EV++  R SA+CG  V  +  +L+GR  + +  L  AE+K  +  NGW  +G      +K  S  +LH++VR EPD RFVF+FDGEPECSP
Subjt:  KRKDPSLKIEVFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIG----GGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSP

Query:  QVFQVQGNVRQPVFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWL
        QVFQVQGN +Q VFTCKFGFRN  D + S S            L  + S K+Q +KERKGWSITIHDLSGSPVA ASMVTPFVPSPGS+RVSRS+PGAWL
Subjt:  QVFQVQGNVRQPVFTCKFGFRNERDWDRSRSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWL

Query:  ILRPVDGSWQPWGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAI---SPNGSFDLGSRSGSR-----PGSG-DFGY
        ILRP   +W+PW RL+AWRE G SD +G RFEL     A A  A+S+IS+  GG F ID + S +      S  GSFDL S S  R      GSG DF +
Subjt:  ILRPVDGSWQPWGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSASPAI---SPNGSFDLGSRSGSR-----PGSG-DFGY

Query:  -LSSPYQYKGFVMSATVDGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
         LS   Q  GFVMS  V G++K++ +P+VEV V+HVTCTEDAAA VALAAA+DLSMDACRLFSQKLR ELRQ
Subjt:  -LSSPYQYKGFVMSATVDGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

AT3G19680.1 Protein of unknown function (DUF1005)4.3e-12853.48Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAAR-------PSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS----------SSLAACFTLNKSHIQSLLSKR
        MDPC F+R++VGNLA++FP ++        PS SG++P+A  C+CKI+   FP +  +VP++   ++ S          S++AACF+L+K+ I++ L K 
Subjt:  MDPCPFLRLLVGNLAIKFPVAAR-------PSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSS----------SSLAACFTLNKSHIQSLLSKR

Query:  KDPSLKIEVFT-GRRD------SAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGR---KGYSSAQLHLTVRAEPDPRFVFRFDGEP
        K   L +E ++ G  D       A+CG    G   +LLGR  + +  L +AETK  +  NGW  +   +   K  S  +LH++VR EPDPRFVF+FDGEP
Subjt:  KDPSLKIEVFT-GRRD------SAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGR---KGYSSAQLHLTVRAEPDPRFVFRFDGEP

Query:  ECSPQVFQVQGNVRQPVFTCKFGFRNERDWDRS---RSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSR
        ECSPQVFQVQGN +Q VFTCKFG RN    DR+    SS+  + S+++S +  + SEK+QP+KERKGWSIT+HDLSGSPVA ASMVTPFVPSPGS+RV+R
Subjt:  ECSPQVFQVQGNVRQPVFTCKFGFRNERDWDRS---RSSISEQSSTSKSWLPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSR

Query:  SNPGAWLILRPVDGSWQPWGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSAS-----PAISPNGSFDL------GSRSGS
        S+PGAWLILRP   +W+PWGRLEAWRE+G SD++G RFEL     A A  A+S+IS   GG F ID TG  S     P  SP GS+DL      GSR  S
Subjt:  SNPGAWLILRPVDGSWQPWGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAGGKFTIDRTGSAS-----PAISPNGSFDL------GSRSGS

Query:  RPGSG---DFGYL-----SSPYQYKGFVMSATVDGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ
        RPGSG   DFGYL     S+  Q +GFVMSATV+G+ K++ +PEVEV V HVTCTEDAAA VALAAA+DLS+DACRLFS KLRKELRQ
Subjt:  RPGSG---DFGYL-----SSPYQYKGFVMSATVDGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ

AT4G29310.1 Protein of unknown function (DUF1005)8.4e-10049.45Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSG--VHPSASPCFCKIKLAGFPTQFATVPLLLHGDTS----SSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFT
        MDPCPF+RL + +LA++ P  A     G  VHPS++PC+CK+++  FP+Q A +PL    D S    SS+ A  F L+   I+ +  K+   SL++ V+ 
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSG--VHPSASPCFCKIKLAGFPTQFATVPLLLHGDTS----SSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFT

Query:  GRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGG-GRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTC
        G R    CG      S +LLG++ + +  L+ A ++ V F NGW ++GG G K   SA+LHL V AEPDPRFVF+F GEPECSP V+Q+Q N++QPVF+C
Subjt:  GRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGG-GRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTC

Query:  KFGFRNERDWDRSRSSISEQSSTSKSWLPKIRS----EKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSW
        KF   ++R+  RSRS  S  + +S+ W+ +  S    EK Q A+ERKGW ITIHDLSGSPVAAASM+TPFV SPGS RVSRSNPGAWLILRP      SW
Subjt:  KFGFRNERDWDRSRSSISEQSSTSKSWLPKIRS----EKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSW

Query:  QPWGRLEAWRESGGSDSIGSRFELL--PATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATV
        +PWGRLEAWRE G  D +G +FEL+   +TS    +A   +S+  GGKF+IDR  S                     G G+   +SSP   KGFVM ++V
Subjt:  QPWGRLEAWRESGGSDSIGSRFELL--PATSAAATLATSAISSSAGGKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATV

Query:  DGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKEL
        +G + K  +P V V  QHVTC  DAA FVAL+AA+DLS+DAC+LFS+KLRKEL
Subjt:  DGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKEL

AT5G17640.1 Protein of unknown function (DUF1005)4.6e-7440.17Show/hide
Query:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPS---ASPCFCKIKLAGFPTQFATVPLL--LHGDTSSSSLAACFTLNKSHIQSLLSK----RKDPSLKIE
        MDP  F+RL VG+LA++ P     S S  +     +S C C+IKL GFP Q  ++PL+  L       S++  F L +S +++LL+          L+I 
Subjt:  MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPS---ASPCFCKIKLAGFPTQFATVPLL--LHGDTSSSSLAACFTLNKSHIQSLLSK----RKDPSLKIE

Query:  VFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVF
        VFTG++ S  CG   +G   + +G   L +      E KP++  NGW  IG  ++   +A+LHL V+ +PDPR+VF+F+     SPQ+ Q++G+V+QP+F
Subjt:  VFTGRRDSAACGAGVLGTSARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVF

Query:  TCKFGFRNERDWDRSRSSISEQSSTSKSWLPK-IRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWQ
        +CKF          SR  +S+    +  W      +E +   +ERKGW + IHDLSGS VAAA + TPFVPS G   V++SNPGAWL++RP      SWQ
Subjt:  TCKFGFRNERDWDRSRSSISEQSSTSKSWLPK-IRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWQ

Query:  PWGRLEAWRESGGSDSIGSRFELLPATSAAATLATS--AISSSAGGKFTIDR-----TGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVM
        PWG+LEAWRE G  DS+  RF LL        +  S   IS+  GG+F ID      T +A+P  SP               SGDF  L       GFVM
Subjt:  PWGRLEAWRESGGSDSIGSRFELLPATSAAATLATS--AISSSAGGKFTIDR-----TGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVM

Query:  SATVDGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELR
        S+ V G + K+ +P V++A++HVTC EDAA F+ALAAA+DLS+ AC+ F +  R+  R
Subjt:  SATVDGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCCTGCCCCTTCCTCCGCCTTCTCGTCGGAAACTTGGCCATCAAGTTTCCGGTCGCCGCCAGACCCTCCTTCTCCGGCGTCCATCCCTCCGCCTCCCCATGCTT
CTGCAAAATCAAACTCGCCGGTTTCCCCACTCAATTCGCCACCGTTCCTCTCCTCCTCCACGGCGACACTTCTTCTTCTTCACTCGCCGCCTGCTTCACTCTCAACAAAT
CTCACATTCAGAGCCTTCTTTCCAAGCGGAAGGATCCGTCTCTCAAAATCGAAGTATTCACCGGCCGCCGTGATTCCGCCGCTTGCGGCGCCGGCGTCCTTGGAACCTCC
GCCAGGTTGCTCGGCCGAATCGTCCTGCCGATCACGGCTCTGAGTACGGCCGAAACTAAACCGGTCGTGTTTCAGAACGGCTGGACCAGAATCGGCGGGGGCAGAAAAGG
TTACTCCTCGGCTCAATTGCACTTGACGGTTCGTGCTGAGCCGGACCCGAGATTCGTGTTCCGGTTCGATGGAGAGCCGGAGTGTAGTCCTCAGGTTTTTCAGGTCCAAG
GGAATGTACGGCAACCGGTTTTTACTTGCAAATTCGGTTTCAGAAACGAACGTGATTGGGATCGTTCAAGGTCGTCAATCTCTGAACAGAGCAGCACTTCGAAGAGTTGG
TTACCGAAGATCCGATCCGAAAAGGACCAACCGGCAAAGGAACGGAAAGGATGGTCGATAACGATCCACGATCTATCCGGATCGCCGGTGGCCGCCGCGTCGATGGTGAC
GCCGTTCGTCCCGTCTCCGGGATCGCACCGTGTAAGCCGCTCAAATCCCGGCGCCTGGCTGATTCTCCGGCCGGTTGACGGAAGCTGGCAGCCGTGGGGCCGCCTCGAGG
CCTGGCGGGAGAGCGGCGGCTCCGACTCCATCGGCTCCAGATTTGAGCTCCTTCCGGCGACCTCCGCCGCCGCCACGCTGGCGACCTCCGCCATAAGCTCTAGCGCCGGC
GGGAAGTTCACGATCGACAGGACCGGCAGCGCGTCTCCGGCGATTAGCCCTAACGGAAGCTTCGATTTAGGGTCCAGATCCGGATCTCGACCCGGATCCGGGGATTTCGG
GTACTTGTCGTCGCCGTATCAGTACAAAGGATTTGTGATGTCGGCGACGGTGGATGGGATGAAGAAGAAGAACAGGCGGCCGGAGGTGGAGGTGGCGGTGCAGCACGTGA
CTTGCACGGAGGACGCGGCGGCGTTTGTGGCGTTGGCGGCGGCGATGGACCTGAGCATGGACGCCTGTAGGCTGTTCTCGCAGAAGCTTAGAAAGGAGCTGAGGCAATGA
mRNA sequenceShow/hide mRNA sequence
GTTACATCTTCACATCCTCAATGCTCCTTTTCCATTCTCTTATTATCTTCGTTCTTTTCTTAATCCCCATTCAAATTTCACTTCTCATCGGAAAGTTTCTCCGTTTCCTC
TCCGGCGAAACCCCCCGTCGCCGGCTTAATCTACTTCCGCCCTCATGGATCCCTGCCCCTTCCTCCGCCTTCTCGTCGGAAACTTGGCCATCAAGTTTCCGGTCGCCGCC
AGACCCTCCTTCTCCGGCGTCCATCCCTCCGCCTCCCCATGCTTCTGCAAAATCAAACTCGCCGGTTTCCCCACTCAATTCGCCACCGTTCCTCTCCTCCTCCACGGCGA
CACTTCTTCTTCTTCACTCGCCGCCTGCTTCACTCTCAACAAATCTCACATTCAGAGCCTTCTTTCCAAGCGGAAGGATCCGTCTCTCAAAATCGAAGTATTCACCGGCC
GCCGTGATTCCGCCGCTTGCGGCGCCGGCGTCCTTGGAACCTCCGCCAGGTTGCTCGGCCGAATCGTCCTGCCGATCACGGCTCTGAGTACGGCCGAAACTAAACCGGTC
GTGTTTCAGAACGGCTGGACCAGAATCGGCGGGGGCAGAAAAGGTTACTCCTCGGCTCAATTGCACTTGACGGTTCGTGCTGAGCCGGACCCGAGATTCGTGTTCCGGTT
CGATGGAGAGCCGGAGTGTAGTCCTCAGGTTTTTCAGGTCCAAGGGAATGTACGGCAACCGGTTTTTACTTGCAAATTCGGTTTCAGAAACGAACGTGATTGGGATCGTT
CAAGGTCGTCAATCTCTGAACAGAGCAGCACTTCGAAGAGTTGGTTACCGAAGATCCGATCCGAAAAGGACCAACCGGCAAAGGAACGGAAAGGATGGTCGATAACGATC
CACGATCTATCCGGATCGCCGGTGGCCGCCGCGTCGATGGTGACGCCGTTCGTCCCGTCTCCGGGATCGCACCGTGTAAGCCGCTCAAATCCCGGCGCCTGGCTGATTCT
CCGGCCGGTTGACGGAAGCTGGCAGCCGTGGGGCCGCCTCGAGGCCTGGCGGGAGAGCGGCGGCTCCGACTCCATCGGCTCCAGATTTGAGCTCCTTCCGGCGACCTCCG
CCGCCGCCACGCTGGCGACCTCCGCCATAAGCTCTAGCGCCGGCGGGAAGTTCACGATCGACAGGACCGGCAGCGCGTCTCCGGCGATTAGCCCTAACGGAAGCTTCGAT
TTAGGGTCCAGATCCGGATCTCGACCCGGATCCGGGGATTTCGGGTACTTGTCGTCGCCGTATCAGTACAAAGGATTTGTGATGTCGGCGACGGTGGATGGGATGAAGAA
GAAGAACAGGCGGCCGGAGGTGGAGGTGGCGGTGCAGCACGTGACTTGCACGGAGGACGCGGCGGCGTTTGTGGCGTTGGCGGCGGCGATGGACCTGAGCATGGACGCCT
GTAGGCTGTTCTCGCAGAAGCTTAGAAAGGAGCTGAGGCAATGAAAGATTGATTGTCGTTTCATGTGGCTCAGCTCGTCGTTTTGCCAATTTCAGCAGTTACATTTTGTA
ATTTTCCATATTCATTTTATTTTTGTTTATTTTTTCAATAGCTTTTTTGTACAGAGAAGAGATCAATCGATGCACGATTCTTCCCCTTTTTGTAATATATTATTTGTTGT
ATTATACCATTTGTTAATCTGTTCATTAAAAAATATTTAAACTTTTGTATTATATTATATGAGTAGTATGCAGGGGCAATTTAATTATTCTA
Protein sequenceShow/hide protein sequence
MDPCPFLRLLVGNLAIKFPVAARPSFSGVHPSASPCFCKIKLAGFPTQFATVPLLLHGDTSSSSLAACFTLNKSHIQSLLSKRKDPSLKIEVFTGRRDSAACGAGVLGTS
ARLLGRIVLPITALSTAETKPVVFQNGWTRIGGGRKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFTCKFGFRNERDWDRSRSSISEQSSTSKSW
LPKIRSEKDQPAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWQPWGRLEAWRESGGSDSIGSRFELLPATSAAATLATSAISSSAG
GKFTIDRTGSASPAISPNGSFDLGSRSGSRPGSGDFGYLSSPYQYKGFVMSATVDGMKKKNRRPEVEVAVQHVTCTEDAAAFVALAAAMDLSMDACRLFSQKLRKELRQ