; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012140 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012140
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCys-Gly metallodipeptidase DUG1
Genome locationscaffold1:7532076..7538560
RNA-Seq ExpressionSpg012140
SyntenySpg012140
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034256.1 hypothetical protein SDJN02_03983, partial [Cucurbita argyrosperma subsp. argyrosperma]4.2e-21277.39Show/hide
Query:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL
        MEKE+E KFAST  LQH  NGV  KNEKSV DGTD AK+ KSGCQFLENAA QNQQ T LLQR LNP+HAGE+SS  A  AAVNERLQPP+NL  LQHQL
Subjt:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL

Query:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV
         PPPQPQPQQF+ SSQPFWVQPQPSISLG TEGSWQ PAAFG GAS RCQPQAPNFCYPVG+PTYPGF GSWDASI WGQT PL FPG+SNYPRASYGF 
Subjt:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV

Query:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ
        SSQS PMP+P+C+T S GQP LRGVIKPPE+LSQKHQRLWEAQSAENVQLW+MIGQ QGELADCKGRLIKLEAEISSLR  AT+EPAV VGNGG+TVRGQ
Subjt:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ

Query:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ
        P+KRGRSKRA+A  GS    Q RTR RKPA+G TKV EV P++L K+SLNKV D  KDFT L+ITEQDKNEGISA I    GI EI++GTLKM  S  NQ
Subjt:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ

Query:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG
         LQQFP+IQSCGIEFKSPS LK NYE         GII  DS+ N L IASPTIYTNGNVSRQGITR NFEDE EA+ESGFP  G+KKEN+EMADEFSSG
Subjt:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG

Query:  PEEIETQNGSSWC
         EEIETQNGSSWC
Subjt:  PEEIETQNGSSWC

XP_022949720.1 uncharacterized protein LOC111453031 isoform X1 [Cucurbita moschata]1.7e-20876.41Show/hide
Query:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL
        MEKE+E KFAST NLQH  NGV  KNEKSV DGTD AK+ KSGCQFLENAA QNQQ T LLQR LNP+HAGE+SS  A  AAVNERLQPP+NL  LQHQL
Subjt:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL

Query:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV
             PQPQQF+ SSQPFWVQPQPSISLG TEGSWQ PAAFGAGAS RCQPQAPNFCYPVG+PTYPGF GSWDASI WGQT PL FPG+SNYPRASYG  
Subjt:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV

Query:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ
        SSQS PMP+P+C+TSS GQP LRGVIKPPE+LSQKHQRLWEAQSAENVQLW+MIGQLQGELADCKGRLIKLEAEIS LR  AT+EPAV VGNGG+TVRGQ
Subjt:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ

Query:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ
        P+KRGRSKRA+A  GS    Q RTR RKPA+G TKV EV P++L K+SLNKV D  K+FT L+ITEQDKNEGIS  I    GI EI++GTLKM  S  NQ
Subjt:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ

Query:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG
         LQQFP+IQSCGIEFKSPS LK NYE         GII  DS+ N L IASPTIYTNGNVSRQGITR NFEDE EA+ESGFP  G+KKEN+EMADEFSSG
Subjt:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG

Query:  PEEIETQNGSSWC
         EEIETQNG SWC
Subjt:  PEEIETQNGSSWC

XP_022979099.1 uncharacterized protein LOC111478840 isoform X1 [Cucurbita maxima]4.0e-21077Show/hide
Query:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL
        MEKE+E K AST NLQH  NGV  KNEKSV DGTD AK+AKSGCQFLENAA QNQQ T LLQR LNP+HAGE+SS  A  AAVNERLQPP+NL   QHQL
Subjt:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL

Query:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV
         PPPQPQPQQF+ SSQPFWVQPQ SISLG TEGSWQ PAAFGAGAS RCQPQAPNFCYPVG+PTYPGF GSWDASI WGQT PL FPG+SNYPRASYGF 
Subjt:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV

Query:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ
        SSQS PMP+PSC+ SS GQP LRGVIKPPE+LSQKHQRLWEAQSAENVQLW+MIGQLQ ELADCKGRLIKLEAEISSLR  ATDE AV VGNGG+TVRGQ
Subjt:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ

Query:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ
        P KRGRSKRA+A  GS    Q RTR RKP +G TKV EV P++L K+SLNKV D  +DFT L+ITEQDKNEGISA I    GI EI++GTLK+  S  NQ
Subjt:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ

Query:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG
         LQQFP+IQSCGIEFKSPS LK NYE         GII  DS+ N L IASPTIYTNGNVSRQGITR NFEDE EA+ESGFP  G+KKEN+EMADEFSSG
Subjt:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG

Query:  PEEIETQNGSSWC
         EEIETQNGSSWC
Subjt:  PEEIETQNGSSWC

XP_023544631.1 uncharacterized protein LOC111804156 isoform X1 [Cucurbita pepo subsp. pepo]4.1e-20776.41Show/hide
Query:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL
        MEKE+E KFAST NLQH  NGV  KNEKSV DGTD AK+AKSGCQFLENAA QNQQ T LLQR LNP+HAG +SS  A   AVNERLQPP+NL  LQHQL
Subjt:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL

Query:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV
         PPPQPQPQQF+ SSQPFWVQPQPSISLG TEGSWQ PAAFGAGAS RCQPQAPNFCYPVG+PTYPGF GSWDASI WGQT PL FPG+SNYPRASYGF 
Subjt:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV

Query:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ
        SSQS PMP+PSC+TSS GQP LRGVIKPPE+LSQKHQRLWEAQSAENVQLW+MIGQLQGELADCKGRLIKLEAEISSLR  AT+EPAV VGNGG+ VRGQ
Subjt:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ

Query:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ
        P+KRGRSKRA+A  GS    Q RTR RKP +G TKV EV P++L K+SLNKV D  KDFT L+ITEQDKNEGISA I    GI EI++GTLKM       
Subjt:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ

Query:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG
            FP+IQSCGIEFKSPS LK NYE         GII  DS+ N L IASPT+YTNGNVSRQGITR NFEDE EA+ESGFP  G+KKEN+EMADEFSSG
Subjt:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG

Query:  PEEIETQNGSSWC
         EEIETQNGSSWC
Subjt:  PEEIETQNGSSWC

XP_038883396.1 uncharacterized protein LOC120074371 [Benincasa hispida]2.3e-22680.93Show/hide
Query:  MEKEEEPKFASTPNLQHEANGVLS-KNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQ
        MEKEE+PKFASTPNL+H+ANGV S KNEKSVSDGTD AK+AKSGCQFLENA  QNQQCTA LQR LNPQHAGE+ SP    AAVNERLQ PQNLANLQHQ
Subjt:  MEKEEEPKFASTPNLQHEANGVLS-KNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQ

Query:  LSPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGF
        LSPP  PQPQQF+ SSQPFWVQPQPSIS G TEGSWQAP AFGAGAS RCQPQAPNF YPVG+PTY GFPG  DASI WGQTQPL FPG+SNYPRAS GF
Subjt:  LSPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGF

Query:  VSSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRG
         SSQSWPMP+PSC+TSS GQP LRGVIKPPEKLSQKHQRLWEAQSAENVQLW++IG+LQGELAD KGRL KLE EISSLR AATDEPAV VGN G+TVRG
Subjt:  VSSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRG

Query:  QPTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDN
        QP KRGRSKRA+A  GSQPPLQPRTRGRKPA  RTKVEE  P+ L K+SLNKV DK KDFTSL+ITEQDKNEGISA I QNNG  EI +GTLKM A +DN
Subjt:  QPTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDN

Query:  QVLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSS
        QVLQQ PEIQSCGIEFK  SLLK NYE  ENFIW  GIISEDSE+N   IASPTIYTNGNVSRQGI R NF+ E EA+E GFPA  HKKE+EEM DEFSS
Subjt:  QVLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSS

Query:  GPEEIETQNGSSWC
        GPEEIET+NGSSWC
Subjt:  GPEEIETQNGSSWC

TrEMBL top hitse value%identityAlignment
A0A0A0KGK4 Uncharacterized protein1.3e-19873.79Show/hide
Query:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL
        MEKEE+P+F STP+L+H+ANG+ SKNEKSVSDGTD AK AKSG QFLEN A  NQ  TALLQR   PQHA + SSP A  AAVNERLQ PQN ANL HQL
Subjt:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL

Query:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV
        S P  PQPQQF+ SSQPFWVQPQPSIS G TEGSWQ+P A  AGAS  CQPQAPNF YPVG+PTYPGFPGS D SI WGQTQP+ FPG+SNYPRAS GFV
Subjt:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV

Query:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ
        SSQSWPMP+PSC+TSS GQP LRGVIKPPEKLSQKHQ+LWEAQSAENVQLW+MIG+LQGELA  KGRL KLEAEIS LR AAT+EPAV VGN  + +RGQ
Subjt:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ

Query:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGD-KDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDN
        P KRGRSKRA A  GSQPPLQPRTR RKPA+ RTKVEE   ++L K+SLNK  D K K FTSL+IT+QDKNE ISA+I QNNGI EI+D TLKM  S+D 
Subjt:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGD-KDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDN

Query:  QVLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGF-PAAGHKKENEEMADEFS
        QVL+Q  EI  CGIEFK PS+LK NYE         GIIS+DSE N   IASPTIYTNGNV+RQGITR NF+ EG  +E GF PA  HK  NEEMADEFS
Subjt:  QVLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGF-PAAGHKKENEEMADEFS

Query:  SGPEEIETQNGSSWC
        SGPEEIETQNGSSWC
Subjt:  SGPEEIETQNGSSWC

A0A6J1GCT8 uncharacterized protein LOC111453031 isoform X28.9e-20076.42Show/hide
Query:  VLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQLSPPPQPQPQQFLFSSQPFWVQ
        V  KNEKSV DGTD AK+ KSGCQFLENAA QNQQ T LLQR LNP+HAGE+SS  A  AAVNERLQPP+NL  LQHQL     PQPQQF+ SSQPFWVQ
Subjt:  VLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQLSPPPQPQPQQFLFSSQPFWVQ

Query:  PQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFVSSQSWPMPVPSCLTSSFGQPP
        PQPSISLG TEGSWQ PAAFGAGAS RCQPQAPNFCYPVG+PTYPGF GSWDASI WGQT PL FPG+SNYPRASYG  SSQS PMP+P+C+TSS GQP 
Subjt:  PQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFVSSQSWPMPVPSCLTSSFGQPP

Query:  LRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQPTKRGRSKRAMAQAGSQPPLQ
        LRGVIKPPE+LSQKHQRLWEAQSAENVQLW+MIGQLQGELADCKGRLIKLEAEIS LR  AT+EPAV VGNGG+TVRGQP+KRGRSKRA+A  GS    Q
Subjt:  LRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQPTKRGRSKRAMAQAGSQPPLQ

Query:  PRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQVLQQFPEIQSCGIEFKSPSLL
         RTR RKPA+G TKV EV P++L K+SLNKV D  K+FT L+ITEQDKNEGIS  I    GI EI++GTLKM  S  NQ LQQFP+IQSCGIEFKSPS L
Subjt:  PRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQVLQQFPEIQSCGIEFKSPSLL

Query:  KFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSGPEEIETQNGSSWC
        K NYE         GII  DS+ N L IASPTIYTNGNVSRQGITR NFEDE EA+ESGFP  G+KKEN+EMADEFSSG EEIETQNG SWC
Subjt:  KFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSGPEEIETQNGSSWC

A0A6J1GDK6 uncharacterized protein LOC111453031 isoform X18.1e-20976.41Show/hide
Query:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL
        MEKE+E KFAST NLQH  NGV  KNEKSV DGTD AK+ KSGCQFLENAA QNQQ T LLQR LNP+HAGE+SS  A  AAVNERLQPP+NL  LQHQL
Subjt:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL

Query:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV
             PQPQQF+ SSQPFWVQPQPSISLG TEGSWQ PAAFGAGAS RCQPQAPNFCYPVG+PTYPGF GSWDASI WGQT PL FPG+SNYPRASYG  
Subjt:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV

Query:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ
        SSQS PMP+P+C+TSS GQP LRGVIKPPE+LSQKHQRLWEAQSAENVQLW+MIGQLQGELADCKGRLIKLEAEIS LR  AT+EPAV VGNGG+TVRGQ
Subjt:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ

Query:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ
        P+KRGRSKRA+A  GS    Q RTR RKPA+G TKV EV P++L K+SLNKV D  K+FT L+ITEQDKNEGIS  I    GI EI++GTLKM  S  NQ
Subjt:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ

Query:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG
         LQQFP+IQSCGIEFKSPS LK NYE         GII  DS+ N L IASPTIYTNGNVSRQGITR NFEDE EA+ESGFP  G+KKEN+EMADEFSSG
Subjt:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG

Query:  PEEIETQNGSSWC
         EEIETQNG SWC
Subjt:  PEEIETQNGSSWC

A0A6J1IS95 uncharacterized protein LOC111478840 isoform X21.9e-20277.24Show/hide
Query:  VLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQLSPPPQPQPQQFLFSSQPFWVQ
        V  KNEKSV DGTD AK+AKSGCQFLENAA QNQQ T LLQR LNP+HAGE+SS  A  AAVNERLQPP+NL   QHQL PPPQPQPQQF+ SSQPFWVQ
Subjt:  VLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQLSPPPQPQPQQFLFSSQPFWVQ

Query:  PQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFVSSQSWPMPVPSCLTSSFGQPP
        PQ SISLG TEGSWQ PAAFGAGAS RCQPQAPNFCYPVG+PTYPGF GSWDASI WGQT PL FPG+SNYPRASYGF SSQS PMP+PSC+ SS GQP 
Subjt:  PQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFVSSQSWPMPVPSCLTSSFGQPP

Query:  LRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQPTKRGRSKRAMAQAGSQPPLQ
        LRGVIKPPE+LSQKHQRLWEAQSAENVQLW+MIGQLQ ELADCKGRLIKLEAEISSLR  ATDE AV VGNGG+TVRGQP KRGRSKRA+A  GS    Q
Subjt:  LRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQPTKRGRSKRAMAQAGSQPPLQ

Query:  PRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQVLQQFPEIQSCGIEFKSPSLL
         RTR RKP +G TKV EV P++L K+SLNKV D  +DFT L+ITEQDKNEGISA I    GI EI++GTLK+  S  NQ LQQFP+IQSCGIEFKSPS L
Subjt:  PRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQVLQQFPEIQSCGIEFKSPSLL

Query:  KFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSGPEEIETQNGSSWC
        K NYE         GII  DS+ N L IASPTIYTNGNVSRQGITR NFEDE EA+ESGFP  G+KKEN+EMADEFSSG EEIETQNGSSWC
Subjt:  KFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSGPEEIETQNGSSWC

A0A6J1IVR4 uncharacterized protein LOC111478840 isoform X11.9e-21077Show/hide
Query:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL
        MEKE+E K AST NLQH  NGV  KNEKSV DGTD AK+AKSGCQFLENAA QNQQ T LLQR LNP+HAGE+SS  A  AAVNERLQPP+NL   QHQL
Subjt:  MEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNLANLQHQL

Query:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV
         PPPQPQPQQF+ SSQPFWVQPQ SISLG TEGSWQ PAAFGAGAS RCQPQAPNFCYPVG+PTYPGF GSWDASI WGQT PL FPG+SNYPRASYGF 
Subjt:  SPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFV

Query:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ
        SSQS PMP+PSC+ SS GQP LRGVIKPPE+LSQKHQRLWEAQSAENVQLW+MIGQLQ ELADCKGRLIKLEAEISSLR  ATDE AV VGNGG+TVRGQ
Subjt:  SSQSWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQ

Query:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ
        P KRGRSKRA+A  GS    Q RTR RKP +G TKV EV P++L K+SLNKV D  +DFT L+ITEQDKNEGISA I    GI EI++GTLK+  S  NQ
Subjt:  PTKRGRSKRAMAQAGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQ

Query:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG
         LQQFP+IQSCGIEFKSPS LK NYE         GII  DS+ N L IASPTIYTNGNVSRQGITR NFEDE EA+ESGFP  G+KKEN+EMADEFSSG
Subjt:  VLQQFPEIQSCGIEFKSPSLLKFNYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSG

Query:  PEEIETQNGSSWC
         EEIETQNGSSWC
Subjt:  PEEIETQNGSSWC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G22790.1 unknown protein6.0e-5543.88Show/hide
Query:  SFAAYAKTVIDHLKS-LQIPVLPGLSDPEFTSVESTFRFSFPPDLRSILQEGLPVGPGFPNWRSSSFQQLHILINLPKFCLLKEISQRKFWCQSWGAQPN
        S   Y KT+++H KS     V PGL++ E ++VES+  FSFP DLRSILQ GLPVG  FPNWR+ S +     + LP   L + + +  FW  SWG +P 
Subjt:  SFAAYAKTVIDHLKS-LQIPVLPGLSDPEFTSVESTFRFSFPPDLRSILQEGLPVGPGFPNWRSSSFQQLHILINLPKFCLLKEISQRKFWCQSWGAQPN

Query:  DPNDAVALAKQFLDRVPVLVPIYKNCYIPS-APNMAGNPVFHVDGGEIRVLSFDLAGFFQTHEYSHLGEPKPDRSVINSPAWAATEARAVEFWTEVASE-
        +  +A++L K+ ++  PVLVP+Y + Y+PS  PN+AGNPVF +DG  +R LS D+ GF +    S    P  DR     P       R VEFW++VA   
Subjt:  DPNDAVALAKQFLDRVPVLVPIYKNCYIPS-APNMAGNPVFHVDGGEIRVLSFDLAGFFQTHEYSHLGEPKPDRSVINSPAWAATEARAVEFWTEVASE-

Query:  KKAAARGTTEGWWNEGEFEMGLDGCLEDVFWKLREGGWREEDVRDMMMMDGDDRSSEQNEPTIEQLKVSVCDILLSGG
        +   AR  T  WW+   FE GL  CL+D FWKLRE GW E+DVRDMMMMD    S +QN    +Q +    D++ + G
Subjt:  KKAAARGTTEGWWNEGEFEMGLDGCLEDVFWKLREGGWREEDVRDMMMMDGDDRSSEQNEPTIEQLKVSVCDILLSGG

AT3G50340.1 unknown protein6.8e-4330.37Show/hide
Query:  AAVKPPRPKLACFSFAAYAKTVIDHLKSLQIPVLPGLSDPEFTSVESTFRFSFPPDLRSILQEGLPVGPGFPNWRSSSFQ-QLHILINLPKFCLLKEISQ
        AA   P  + +  SF++ A  VI HL + +I V PGL+D EF   E+ F F+FPPDLR++L  GLPVG GFP+WRS   +  L  +I+LP   +  +I++
Subjt:  AAVKPPRPKLACFSFAAYAKTVIDHLKSLQIPVLPGLSDPEFTSVESTFRFSFPPDLRSILQEGLPVGPGFPNWRSSSFQ-QLHILINLPKFCLLKEISQ

Query:  RKFWCQSWGAQPNDPNDAVALAKQFLDRVPVLVPIYKNCYIPSAPNMAGNPVFHVDGGEIRVLSFDLAGFFQTHEYSHLGEPKP-----DRSVINSPAWA
           W +SWG +P+DP  A+ +A+  L R P+++PI+ +CYIP  P++AGNPVF++D   I     DL+ FF+        +  P      RSV    A +
Subjt:  RKFWCQSWGAQPNDPNDAVALAKQFLDRVPVLVPIYKNCYIPSAPNMAGNPVFHVDGGEIRVLSFDLAGFFQTHEYSHLGEPKP-----DRSVINSPAWA

Query:  ATEA--------------------RAVEFWTEVASEKK----AAARGTTEGWWNE-------GEFEMGLDGCLEDVFWKLREGGWREEDVRDMMMMDGD-
        ++ +                    R VEFW++ A +++    A++  ++     E        E    +D  +  +   LR GGW E DV D++ +    
Subjt:  ATEA--------------------RAVEFWTEVASEKK----AAARGTTEGWWNE-------GEFEMGLDGCLEDVFWKLREGGWREEDVRDMMMMDGD-

Query:  ----DRSSEQNEPTIEQLKVSV---CDILLSGGWSRDDVAYSLDLEDNSAIVIPEEEESTFEINLHRQPIRIPQWAAGVDRS
            +     N+  ++ L +      + L   GWS ++V+ +L  +       PE+E+   +        RI + A  V RS
Subjt:  ----DRSSEQNEPTIEQLKVSV---CDILLSGGWSRDDVAYSLDLEDNSAIVIPEEEESTFEINLHRQPIRIPQWAAGVDRS

AT5G67020.1 unknown protein1.8e-4332.34Show/hide
Query:  AAVKPPRPKLACFSFAAYAKTVIDHLKSLQIPVLPGLSDPEFTSVESTFRFSFPPDLRSILQEGLPVGPGFPNWRSSSFQ-QLHILINLPKFCLLKEISQ
        AA   P  + +  SF+ +A  VI+HLK+  I + PGLSD EF  VE+ F F+FPPDLR IL  GL VG GFP+WRS   +  L  +I+LP   +  +I++
Subjt:  AAVKPPRPKLACFSFAAYAKTVIDHLKSLQIPVLPGLSDPEFTSVESTFRFSFPPDLRSILQEGLPVGPGFPNWRSSSFQ-QLHILINLPKFCLLKEISQ

Query:  RKFWCQSWGAQPNDPNDAVALAKQFLDRVPVLVPIYKNCYIPSAPNMAGNPVFHVDGGEIRVLSFDLAGFFQTHEYSHLGEPKP-----DRSVINSPAWA
           WC+SWG +P DP  A+ +A+  L R P+L+PI+ +CYIP  P++AGNPVF +D   I     DL+ FF+        E  P      RSV    A +
Subjt:  RKFWCQSWGAQPNDPNDAVALAKQFLDRVPVLVPIYKNCYIPSAPNMAGNPVFHVDGGEIRVLSFDLAGFFQTHEYSHLGEPKP-----DRSVINSPAWA

Query:  ----------------ATEARAVEFWTEVASEKKAAARGTTEGWWNEG------EFEMGLDGCLEDVFWKLREGGWREEDVRDMMMMDGD-----DRSSE
                        A ++R VEFW++ A ++      +T    +        E    ++  +  +   LR GGW E D+ +++ +        +    
Subjt:  ----------------ATEARAVEFWTEVASEKKAAARGTTEGWWNEG------EFEMGLDGCLEDVFWKLREGGWREEDVRDMMMMDGD-----DRSSE

Query:  QNEPTIEQLKVS---VCDILLSGGWSRDDVAYSLDLE
         N+  ++ L +    + + L   GWS ++V+ +L  +
Subjt:  QNEPTIEQLKVS---VCDILLSGGWSRDDVAYSLDLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAGAGGTGCCAATCACTTGGAACGGCCGTTTTCTTTTTCAATGCGATGGAAAAGGAAGAAGAACCCAAGTTTGCTTCTACCCCCAATCTTCAACACGAAGCCAA
TGGAGTTCTGAGCAAGAATGAGAAAAGCGTATCCGATGGGACAGATGAAGCAAAAAGCGCCAAATCTGGGTGCCAGTTTTTGGAGAATGCGGCTCGTCAAAATCAGCAGT
GCACTGCTCTTCTTCAAAGGGTACTCAATCCTCAACATGCAGGGGAAAGATCTTCACCGTTAGCCGTTTCTGCCGCCGTGAATGAGCGGCTTCAGCCACCGCAAAACCTG
GCCAATCTTCAGCACCAGCTCAGCCCGCCGCCGCAGCCGCAGCCGCAGCAATTTTTATTCTCCTCGCAACCCTTTTGGGTACAGCCGCAGCCGAGCATTTCCCTTGGAGA
AACTGAAGGTAGCTGGCAAGCTCCGGCGGCGTTTGGCGCCGGAGCCTCGACGAGATGTCAACCCCAAGCTCCTAATTTCTGTTACCCTGTTGGATTTCCAACATACCCTG
GTTTTCCAGGTTCTTGGGATGCCTCAATTTTGTGGGGTCAAACACAACCATTACCGTTTCCTGGGATGTCCAATTATCCAAGAGCATCATACGGTTTTGTCTCTTCTCAA
TCTTGGCCTATGCCAGTTCCTAGTTGTTTAACATCTTCCTTTGGACAACCCCCTTTAAGAGGAGTCATCAAGCCCCCTGAAAAGCTTTCTCAGAAGCATCAAAGACTTTG
GGAAGCCCAGTCTGCAGAAAATGTTCAATTGTGGAATATGATTGGACAGTTGCAAGGGGAATTAGCTGACTGCAAGGGCCGCTTGATTAAACTTGAAGCTGAAATTTCAT
CTTTAAGACCAGCAGCTACGGATGAGCCTGCTGTGGGAGTTGGAAATGGTGGCCTTACTGTGAGGGGACAACCAACAAAGCGAGGAAGGTCGAAACGAGCAATGGCCCAG
GCAGGTTCACAACCTCCATTGCAACCCCGGACACGGGGGCGAAAGCCAGCTATAGGAAGGACAAAAGTAGAAGAAGTAACACCATCTGTTCTTGAAAAAGAAAGCTTGAA
TAAGGTGGGTGATAAGGATAAGGATTTTACTTCCCTTAACATTACAGAGCAAGACAAGAATGAAGGCATTTCAGCTGCCATCACTCAAAACAATGGGATCACGGAGATTG
AAGATGGCACTCTCAAGATGGCTGCTTCTATAGACAATCAAGTTCTTCAGCAATTCCCAGAAATTCAATCATGTGGAATTGAATTCAAGTCACCATCATTATTGAAATTC
AATTATGAAGGTAAAGAAAATTTCATATGGCCTTCAGGAATCATCTCTGAAGACTCTGAACGAAACACGTTGATCATAGCCTCCCCAACAATCTACACCAATGGAAATGT
CAGCAGACAAGGAATAACTAGACAGAACTTCGAAGACGAGGGCGAAGCATCTGAATCCGGATTTCCTGCAGCAGGACACAAAAAGGAAAATGAAGAAATGGCAGATGAAT
TCAGCTCAGGACCTGAAGAGATTGAAACCCAAAATGGCTCCTCATGGTGTGGAAAGCAAACTCCATGCCTCTCCTCCAAATCCGCTCCTCCCAATCTCTACATTAACCCA
ACATTTCCCACTTTCTTTTTACCCTCTACAATGGCCGCCGCCACCACCGCCACCGCCACTGCCGCCGTCAAGCCTCCCCGCCCAAAACTTGCCTGTTTCTCCTTCGCCGC
CTATGCCAAAACCGTCATCGACCATCTCAAATCCCTCCAAATTCCCGTCCTCCCCGGGCTCTCGGATCCTGAATTCACCTCCGTCGAGTCCACTTTCCGATTCTCTTTCC
CGCCGGACCTCCGTTCAATTCTCCAGGAGGGGCTCCCGGTTGGGCCTGGTTTTCCCAATTGGCGATCCTCCTCCTTTCAACAGCTTCACATTCTCATCAATCTCCCCAAA
TTCTGCCTCCTCAAAGAAATTTCCCAAAGAAAATTCTGGTGCCAATCTTGGGGGGCTCAACCCAATGACCCAAACGACGCCGTTGCTCTGGCCAAGCAATTCTTGGACAG
AGTTCCTGTTCTTGTCCCCATTTACAAAAACTGTTACATCCCTTCCGCGCCGAACATGGCCGGAAACCCTGTATTTCACGTCGACGGTGGGGAGATTCGCGTTTTAAGCT
TTGATTTGGCTGGATTCTTCCAAACCCACGAATATTCTCATCTGGGTGAGCCCAAACCGGACCGGTCGGTGATCAACTCGCCGGCTTGGGCCGCGACGGAGGCCCGTGCG
GTGGAATTCTGGACGGAAGTGGCTTCGGAAAAGAAAGCGGCGGCGCGTGGAACAACGGAAGGGTGGTGGAACGAAGGGGAATTCGAAATGGGGTTGGACGGATGCCTTGA
GGACGTGTTTTGGAAGTTGCGAGAAGGCGGATGGAGGGAAGAAGATGTGAGAGATATGATGATGATGGACGGCGATGATCGGAGCTCAGAACAAAACGAACCGACGATCG
AGCAACTCAAAGTATCAGTATGTGATATTTTACTGAGTGGGGGATGGAGCAGAGACGACGTGGCGTACTCCCTCGATCTTGAAGACAATTCCGCCATTGTTATCCCTGAA
GAAGAAGAATCAACGTTTGAAATCAATCTTCATCGGCAGCCAATAAGAATCCCCCAATGGGCAGCCGGCGTTGACAGATCGGAGAGGGGATCCGTTGACGGCGGTGTAAA
GGTGGAGTTTAGGCCGGATGACGGTGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAGAGGTGCCAATCACTTGGAACGGCCGTTTTCTTTTTCAATGCGATGGAAAAGGAAGAAGAACCCAAGTTTGCTTCTACCCCCAATCTTCAACACGAAGCCAA
TGGAGTTCTGAGCAAGAATGAGAAAAGCGTATCCGATGGGACAGATGAAGCAAAAAGCGCCAAATCTGGGTGCCAGTTTTTGGAGAATGCGGCTCGTCAAAATCAGCAGT
GCACTGCTCTTCTTCAAAGGGTACTCAATCCTCAACATGCAGGGGAAAGATCTTCACCGTTAGCCGTTTCTGCCGCCGTGAATGAGCGGCTTCAGCCACCGCAAAACCTG
GCCAATCTTCAGCACCAGCTCAGCCCGCCGCCGCAGCCGCAGCCGCAGCAATTTTTATTCTCCTCGCAACCCTTTTGGGTACAGCCGCAGCCGAGCATTTCCCTTGGAGA
AACTGAAGGTAGCTGGCAAGCTCCGGCGGCGTTTGGCGCCGGAGCCTCGACGAGATGTCAACCCCAAGCTCCTAATTTCTGTTACCCTGTTGGATTTCCAACATACCCTG
GTTTTCCAGGTTCTTGGGATGCCTCAATTTTGTGGGGTCAAACACAACCATTACCGTTTCCTGGGATGTCCAATTATCCAAGAGCATCATACGGTTTTGTCTCTTCTCAA
TCTTGGCCTATGCCAGTTCCTAGTTGTTTAACATCTTCCTTTGGACAACCCCCTTTAAGAGGAGTCATCAAGCCCCCTGAAAAGCTTTCTCAGAAGCATCAAAGACTTTG
GGAAGCCCAGTCTGCAGAAAATGTTCAATTGTGGAATATGATTGGACAGTTGCAAGGGGAATTAGCTGACTGCAAGGGCCGCTTGATTAAACTTGAAGCTGAAATTTCAT
CTTTAAGACCAGCAGCTACGGATGAGCCTGCTGTGGGAGTTGGAAATGGTGGCCTTACTGTGAGGGGACAACCAACAAAGCGAGGAAGGTCGAAACGAGCAATGGCCCAG
GCAGGTTCACAACCTCCATTGCAACCCCGGACACGGGGGCGAAAGCCAGCTATAGGAAGGACAAAAGTAGAAGAAGTAACACCATCTGTTCTTGAAAAAGAAAGCTTGAA
TAAGGTGGGTGATAAGGATAAGGATTTTACTTCCCTTAACATTACAGAGCAAGACAAGAATGAAGGCATTTCAGCTGCCATCACTCAAAACAATGGGATCACGGAGATTG
AAGATGGCACTCTCAAGATGGCTGCTTCTATAGACAATCAAGTTCTTCAGCAATTCCCAGAAATTCAATCATGTGGAATTGAATTCAAGTCACCATCATTATTGAAATTC
AATTATGAAGGTAAAGAAAATTTCATATGGCCTTCAGGAATCATCTCTGAAGACTCTGAACGAAACACGTTGATCATAGCCTCCCCAACAATCTACACCAATGGAAATGT
CAGCAGACAAGGAATAACTAGACAGAACTTCGAAGACGAGGGCGAAGCATCTGAATCCGGATTTCCTGCAGCAGGACACAAAAAGGAAAATGAAGAAATGGCAGATGAAT
TCAGCTCAGGACCTGAAGAGATTGAAACCCAAAATGGCTCCTCATGGTGTGGAAAGCAAACTCCATGCCTCTCCTCCAAATCCGCTCCTCCCAATCTCTACATTAACCCA
ACATTTCCCACTTTCTTTTTACCCTCTACAATGGCCGCCGCCACCACCGCCACCGCCACTGCCGCCGTCAAGCCTCCCCGCCCAAAACTTGCCTGTTTCTCCTTCGCCGC
CTATGCCAAAACCGTCATCGACCATCTCAAATCCCTCCAAATTCCCGTCCTCCCCGGGCTCTCGGATCCTGAATTCACCTCCGTCGAGTCCACTTTCCGATTCTCTTTCC
CGCCGGACCTCCGTTCAATTCTCCAGGAGGGGCTCCCGGTTGGGCCTGGTTTTCCCAATTGGCGATCCTCCTCCTTTCAACAGCTTCACATTCTCATCAATCTCCCCAAA
TTCTGCCTCCTCAAAGAAATTTCCCAAAGAAAATTCTGGTGCCAATCTTGGGGGGCTCAACCCAATGACCCAAACGACGCCGTTGCTCTGGCCAAGCAATTCTTGGACAG
AGTTCCTGTTCTTGTCCCCATTTACAAAAACTGTTACATCCCTTCCGCGCCGAACATGGCCGGAAACCCTGTATTTCACGTCGACGGTGGGGAGATTCGCGTTTTAAGCT
TTGATTTGGCTGGATTCTTCCAAACCCACGAATATTCTCATCTGGGTGAGCCCAAACCGGACCGGTCGGTGATCAACTCGCCGGCTTGGGCCGCGACGGAGGCCCGTGCG
GTGGAATTCTGGACGGAAGTGGCTTCGGAAAAGAAAGCGGCGGCGCGTGGAACAACGGAAGGGTGGTGGAACGAAGGGGAATTCGAAATGGGGTTGGACGGATGCCTTGA
GGACGTGTTTTGGAAGTTGCGAGAAGGCGGATGGAGGGAAGAAGATGTGAGAGATATGATGATGATGGACGGCGATGATCGGAGCTCAGAACAAAACGAACCGACGATCG
AGCAACTCAAAGTATCAGTATGTGATATTTTACTGAGTGGGGGATGGAGCAGAGACGACGTGGCGTACTCCCTCGATCTTGAAGACAATTCCGCCATTGTTATCCCTGAA
GAAGAAGAATCAACGTTTGAAATCAATCTTCATCGGCAGCCAATAAGAATCCCCCAATGGGCAGCCGGCGTTGACAGATCGGAGAGGGGATCCGTTGACGGCGGTGTAAA
GGTGGAGTTTAGGCCGGATGACGGTGGCTGA
Protein sequenceShow/hide protein sequence
MKKRCQSLGTAVFFFNAMEKEEEPKFASTPNLQHEANGVLSKNEKSVSDGTDEAKSAKSGCQFLENAARQNQQCTALLQRVLNPQHAGERSSPLAVSAAVNERLQPPQNL
ANLQHQLSPPPQPQPQQFLFSSQPFWVQPQPSISLGETEGSWQAPAAFGAGASTRCQPQAPNFCYPVGFPTYPGFPGSWDASILWGQTQPLPFPGMSNYPRASYGFVSSQ
SWPMPVPSCLTSSFGQPPLRGVIKPPEKLSQKHQRLWEAQSAENVQLWNMIGQLQGELADCKGRLIKLEAEISSLRPAATDEPAVGVGNGGLTVRGQPTKRGRSKRAMAQ
AGSQPPLQPRTRGRKPAIGRTKVEEVTPSVLEKESLNKVGDKDKDFTSLNITEQDKNEGISAAITQNNGITEIEDGTLKMAASIDNQVLQQFPEIQSCGIEFKSPSLLKF
NYEGKENFIWPSGIISEDSERNTLIIASPTIYTNGNVSRQGITRQNFEDEGEASESGFPAAGHKKENEEMADEFSSGPEEIETQNGSSWCGKQTPCLSSKSAPPNLYINP
TFPTFFLPSTMAAATTATATAAVKPPRPKLACFSFAAYAKTVIDHLKSLQIPVLPGLSDPEFTSVESTFRFSFPPDLRSILQEGLPVGPGFPNWRSSSFQQLHILINLPK
FCLLKEISQRKFWCQSWGAQPNDPNDAVALAKQFLDRVPVLVPIYKNCYIPSAPNMAGNPVFHVDGGEIRVLSFDLAGFFQTHEYSHLGEPKPDRSVINSPAWAATEARA
VEFWTEVASEKKAAARGTTEGWWNEGEFEMGLDGCLEDVFWKLREGGWREEDVRDMMMMDGDDRSSEQNEPTIEQLKVSVCDILLSGGWSRDDVAYSLDLEDNSAIVIPE
EEESTFEINLHRQPIRIPQWAAGVDRSERGSVDGGVKVEFRPDDGG