CuGenDBv2

CmoCh09G006250 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh09G006250
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Peptidase_S9 domain-containing protein
Genome location: Cmo_Chr09:3117826..3123284
RNA-Seq Expression: CmoCh09G006250
Synteny: CmoCh09G006250
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domains: IPR001375 - Peptidase S9, prolyl oligopeptidase, catalytic domain
IPR029058 - Alpha/Beta hydrolase fold
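
The genome location above is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the span of the gene on the chromosome can be computed as follows. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Cmo_Chr09:3117826..3123284")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, span)
```

Under that assumption the locus spans 5459 bp, which is longer than the mRNA listed further down and would be consistent with a gene model that contains introns.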


Homology
GenBank top hits (e value, %identity, alignment)
XP_022936567.1 uncharacterized protein LOC111443135 isoform X1 [Cucurbita moschata]; e value: 1.5e-237; %identity: 100
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK
        MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK

Query:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR
        ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR
Subjt:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR

Query:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK
        GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK
Subjt:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK

Query:  EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI
        EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI
Subjt:  EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI

Query:  KDQLLD
        KDQLLD
Subjt:  KDQLLD

XP_022936568.1 uncharacterized protein LOC111443135 isoform X2 [Cucurbita moschata]; e value: 3.7e-231; %identity: 98.28
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK
        MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFS       K
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK

Query:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR
        ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR
Subjt:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR

Query:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK
        GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK
Subjt:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK

Query:  EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI
        EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI
Subjt:  EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI

Query:  KDQLLD
        KDQLLD
Subjt:  KDQLLD

XP_022975810.1 uncharacterized protein LOC111476405 isoform X1 [Cucurbita maxima]; e value: 2.0e-216; %identity: 92.87
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPAFSKAM
        MAILIPDALLRPSLTRLC AATSPWNRQ+SN IKS YRVAA     M EAVVDADKFRAEFLRVLRSRRSPEVPLNVKRT P     IQE NP  FSKAM
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPAFSKAM

Query:  ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALI
        ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNK+TYYDALI
Subjt:  ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALI

Query:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
        SSWK+GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
Subjt:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM

Query:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF
        NEI+ EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDA VS TQ+AYQ+FGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF
Subjt:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF

Query:  LGHRIKD
        LGH IK+
Subjt:  LGHRIKD

XP_022975811.1 uncharacterized protein LOC111476405 isoform X2 [Cucurbita maxima]; e value: 6.1e-210; %identity: 91.15
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPAFSKAM
        MAILIPDALLRPSLTRLC AATSPWNRQ+SN IKS YRVAA     M EAVVDADKFRAEFLRVLRSRRSPEVPLNVKRT P     IQE NP  FS   
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPAFSKAM

Query:  ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALI
            KATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNK+TYYDALI
Subjt:  ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALI

Query:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
        SSWK+GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
Subjt:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM

Query:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF
        NEI+ EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDA VS TQ+AYQ+FGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF
Subjt:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF

Query:  LGHRIKD
        LGH IK+
Subjt:  LGHRIKD

XP_023535291.1 uncharacterized protein LOC111796769 [Cucurbita pepo subsp. pepo]; e value: 3.8e-220; %identity: 92.94
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK
        MAILIPD LLRPSLTRLC AATSPWNRQ+SNEIKS YRVAA     M EAV+DADKFRAEFLR+LRSRRSPEVPLNVKRTMPIQE NPPAFSKAMASCPK
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK

Query:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTY-----YDALI
        ATFSNLKDLLHEENLHL TEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTY       ALI
Subjt:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTY-----YDALI

Query:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
        SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
Subjt:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM

Query:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF
        NEI+KEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVS+T+ AYQ+ GCPENFKFIAQPGIGHEMTPEMVKEASQWFD+F
Subjt:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF

Query:  LGHRIKDQLLD
        LGH IKDQLLD
Subjt:  LGHRIKDQLLD
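
The %identity values listed for these hits can be related to the alignment midlines shown between the Query and Subjt rows. The sketch below assumes the usual BLAST convention that a letter in the midline marks an identical residue, while '+' (conservative substitution) and a space (mismatch or gap) do not count. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline, aligned_length):
    """Percent identity from a BLAST-style midline.

    Letters in the midline mark identical residues; '+' marks a
    conservative substitution and a space marks a mismatch or gap.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)

# Last block of the XP_022975810.1 alignment: query 'LGHRIKD', midline 'LGH IK+'.
# Five of the seven columns are identical.
print(percent_identity("LGH IK+", len("LGHRIKD")))
```

Applied to a whole alignment rather than one block, the same count gives the reported figure. For XP_022936568.1, counting columns gives roughly 399 identical positions out of 406 aligned, or about 98.28%.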

TrEMBL top hits (e value, %identity, alignment)
A0A5A7TAI4 Putative esterase YitV; e value: 3.9e-194; %identity: 83.5
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNC-----QMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPA
        MAILI    L PSL  LC A T PWNRQS    KS YRVAA G       QM EA+VDADKFRAEFLRVLR+RRS EVPLNVK T P     IQE +PP 
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNC-----QMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPA

Query:  FSKAMASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTY
        FSKAMASCPK T  NLKDLLHEENLHLTTEEGEQG+LPILI+SMK+SRQQKRP IVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK KTTY
Subjt:  FSKAMASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTY

Query:  YDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEAR
         DALIS+WK+GDTMPFIFDTVWDLIKLADYLT+REDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQ F WA+DNDKWQARV+SIKPVFEEAR
Subjt:  YDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEAR

Query:  IELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQ
        I+LGMNEI+KEVVKKVWNRIAPGL SQF SIYSVPAIAPRPLLLLNGADDPRCP+AGLDAPVSR Q AYQKFGCPENFKFIAQ GIGHEMT EMVKEAS 
Subjt:  IELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQ

Query:  WFDRFLGHRIKD
        WFD+FL   IK+
Subjt:  WFDRFLGHRIKD

A0A6J1F8T3 uncharacterized protein LOC111443135 isoform X1; e value: 7.5e-238; %identity: 100
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK
        MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK

Query:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR
        ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR
Subjt:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR

Query:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK
        GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK
Subjt:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK

Query:  EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI
        EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI
Subjt:  EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI

Query:  KDQLLD
        KDQLLD
Subjt:  KDQLLD

A0A6J1FDL4 uncharacterized protein LOC111443135 isoform X2; e value: 1.8e-231; %identity: 98.28
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK
        MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFS       K
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPK

Query:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR
        ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR
Subjt:  ATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKR

Query:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK
        GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK
Subjt:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDK

Query:  EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI
        EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI
Subjt:  EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRI

Query:  KDQLLD
        KDQLLD
Subjt:  KDQLLD

A0A6J1IF89 uncharacterized protein LOC111476405 isoform X2; e value: 3.0e-210; %identity: 91.15
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPAFSKAM
        MAILIPDALLRPSLTRLC AATSPWNRQ+SN IKS YRVAA     M EAVVDADKFRAEFLRVLRSRRSPEVPLNVKRT P     IQE NP  FS   
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPAFSKAM

Query:  ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALI
            KATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNK+TYYDALI
Subjt:  ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALI

Query:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
        SSWK+GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
Subjt:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM

Query:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF
        NEI+ EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDA VS TQ+AYQ+FGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF
Subjt:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF

Query:  LGHRIKD
        LGH IK+
Subjt:  LGHRIKD

A0A6J1IHS0 uncharacterized protein LOC111476405 isoform X1; e value: 9.5e-217; %identity: 92.87
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPAFSKAM
        MAILIPDALLRPSLTRLC AATSPWNRQ+SN IKS YRVAA     M EAVVDADKFRAEFLRVLRSRRSPEVPLNVKRT P     IQE NP  FSKAM
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQETNPPAFSKAM

Query:  ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALI
        ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNK+TYYDALI
Subjt:  ASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALI

Query:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
        SSWK+GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM
Subjt:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGM

Query:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF
        NEI+ EVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDA VS TQ+AYQ+FGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF
Subjt:  NEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRF

Query:  LGHRIKD
        LGH IK+
Subjt:  LGHRIKD

SwissProt top hits (e value, %identity, alignment)
O34973 Putative hydrolase YtaP; e value: 4.7e-11; %identity: 24.79
Query:  SRGYVAIAIDSRYHGE-RAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQC
        S GY  +AID    G+ R K ++  +  ++ + K    M      ++D +   DY+  R D+ P RIG  G S+GG+ AW+ AA D R  V V +     
Subjt:  SRGYVAIAIDSRYHGE-RAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQC

Query:  FRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWN-------RIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIA
                                      +++D  V+ K  N          P L   F +      IAPRP L L G  D   P  G+D         
Subjt:  FRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWN-------RIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIA

Query:  YQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFL
        Y   G  + ++ + +   GH  T  +  EA ++  ++L
Subjt:  YQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFL

P29368 Uncharacterized 31.7 kDa protein in traX-finO intergenic region; e value: 1.3e-05; %identity: 31.09
Query:  KRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESL
        K P I+  H     +  L P    A+   G+  I  D R  GE             S  +RG  +P +     D+I + ++  K+E ID  RIG+ G SL
Subjt:  KRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESL

Query:  GGMHAWFAAAADTRYSVVV
        GG H + A A D R   +V
Subjt:  GGMHAWFAAAADTRYSVVV

P39839 Uncharacterized peptidase YuxL; e value: 4.1e-07; %identity: 26.64
Query:  EAYASRGYVAIAIDSR-YHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKRE-DIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPI
        +  A++GY  + I+ R  HG         Y     +  RGD     +D   D+++  D   KR+  IDP R+G+TG S GG    +      R+   V  
Subjt:  EAYASRGYVAIAIDSR-YHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKRE-DIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPI

Query:  IGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQ
              + +I N      V  I   F + ++E  M E D E   K+W+R          S     A    PLL+L+G  D RCPI        +  IA +
Subjt:  IGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSRTQIAYQ

Query:  KFGCPENFKFIAQPGIGHEMTP--------EMVKEASQWFDRFL
        K G  +  K +  P   H ++         + +   S WFD+ L
Subjt:  KFGCPENFKFIAQPGIGHEMTP--------EMVKEASQWFDRFL

Q99390 Uncharacterized 31.7 kDa protein in traX-finO intergenic region; e value: 3.5e-06; %identity: 31.93
Query:  KRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESL
        K P I+  H     +  L P    A+   G+  I  D R  GE             S  +RG  +P +     D+I + ++  K+E ID  RIG+ G SL
Subjt:  KRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESL

Query:  GGMHAWFAAAADTRYSVVV
        GG H + AAA D R   +V
Subjt:  GGMHAWFAAAADTRYSVVV

Arabidopsis top hits (e value, %identity, alignment)
AT5G25770.1 alpha/beta-Hydrolases superfamily protein; e value: 8.7e-138; %identity: 65.54
Query:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQ----ETNPPAFSKAMASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKR
        M   +     FR +FLR+L SRRSP+VPL    + PI+    + + P+ + A+ SCPK     LKD+L EEN+HL TE+ EQGKLP+LI+S+K+  ++KR
Subjt:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQ----ETNPPAFSKAMASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKR

Query:  PAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGM
        PAIVF+H TN  KEWLRP LEAYASRGYVAI +DSRYHGERA  KT Y DALISSW+ G+TMPFIFDTVWDLIKLA+YLT+R+DIDP +IGITG SLGGM
Subjt:  PAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGM

Query:  HAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPR
        HAWFAAAADTRYSVVVP+IGVQ FRWAI+ND+W+ARV SIKP+FEEARI+LG N IDKE+V+KVWNRIAPGL S+F S YS+P IAPRPL +LNGA+DPR
Subjt:  HAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPR

Query:  CPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFL
        CP+ GL+  + R + AY++   P NFKF A+ G+GHE T  M+KE+S WFD+FL
Subjt:  CPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFL

AT5G25770.2 alpha/beta-Hydrolases superfamily protein; e value: 8.7e-138; %identity: 65.54
Query:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQ----ETNPPAFSKAMASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKR
        M   +     FR +FLR+L SRRSP+VPL    + PI+    + + P+ + A+ SCPK     LKD+L EEN+HL TE+ EQGKLP+LI+S+K+  ++KR
Subjt:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQ----ETNPPAFSKAMASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKR

Query:  PAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGM
        PAIVF+H TN  KEWLRP LEAYASRGYVAI +DSRYHGERA  KT Y DALISSW+ G+TMPFIFDTVWDLIKLA+YLT+R+DIDP +IGITG SLGGM
Subjt:  PAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGM

Query:  HAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPR
        HAWFAAAADTRYSVVVP+IGVQ FRWAI+ND+W+ARV SIKP+FEEARI+LG N IDKE+V+KVWNRIAPGL S+F S YS+P IAPRPL +LNGA+DPR
Subjt:  HAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPR

Query:  CPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFL
        CP+ GL+  + R + AY++   P NFKF A+ G+GHE T  M+KE+S WFD+FL
Subjt:  CPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFL

AT5G25770.3 alpha/beta-Hydrolases superfamily protein; e value: 8.7e-138; %identity: 65.54
Query:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQ----ETNPPAFSKAMASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKR
        M   +     FR +FLR+L SRRSP+VPL    + PI+    + + P+ + A+ SCPK     LKD+L EEN+HL TE+ EQGKLP+LI+S+K+  ++KR
Subjt:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQ----ETNPPAFSKAMASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISMKDSRQQKR

Query:  PAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGM
        PAIVF+H TN  KEWLRP LEAYASRGYVAI +DSRYHGERA  KT Y DALISSW+ G+TMPFIFDTVWDLIKLA+YLT+R+DIDP +IGITG SLGGM
Subjt:  PAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGM

Query:  HAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPR
        HAWFAAAADTRYSVVVP+IGVQ FRWAI+ND+W+ARV SIKP+FEEARI+LG N IDKE+V+KVWNRIAPGL S+F S YS+P IAPRPL +LNGA+DPR
Subjt:  HAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPR

Query:  CPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFL
        CP+ GL+  + R + AY++   P NFKF A+ G+GHE T  M+KE+S WFD+FL
Subjt:  CPIAGLDAPVSRTQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFL


Sequences
CDS sequence
ATGCGCATCACTAGATGCGAAAATAGGATCAGTTGGTGGTCGTGGAGACGCAAGGGAGAATTGTATCAAAGTTTCGACATCAATATTGTATTTATACGCAGCCAAGCCGG
TCCTTATCGGCGTTTGGCGAGTGCCTGCAGACTGCAAAATTACGCAAACCATATTATAACGATAAGGAAAAAGAAAAGGCGGCAACTCTGCGTCAGCCGTTCAGTCTGTC
GGAAAAACGATGTGGGCCAACGACGCAGTACAATAATAATGGCAATTCTCATACCTGACGCCCTACTCCGCCCTTCCCTAACACGCCTCTGCCTTGCAGCAACTTCGCCA
TGGAACCGCCAAAGTTCCAATGAGATTAAATCCTTGTACAGGGTCGCAGCCTTGGGAAACTGTCAAATGGGTGAAGCTGTCGTTGACGCTGACAAGTTTCGGGCTGAATT
CCTTCGAGTTTTGCGTAGTAGACGATCTCCAGAAGTCCCGCTTAATGTGAAGCGCACAATGCCTATTCAGGAGACCAACCCGCCGGCCTTCAGTAAGGCTATGGCTTCTT
GTCCAAAGGCAACTTTTAGCAATTTGAAGGACTTGCTTCATGAGGAAAATCTTCACCTGACTACCGAGGAAGGAGAGCAAGGGAAGTTGCCTATATTGATTATAAGCATG
AAGGATAGCAGACAGCAAAAAAGACCTGCAATTGTTTTTCTGCACAGTACAAATAAGTGTAAAGAGTGGTTGAGACCATTGCTTGAGGCTTATGCATCAAGGGGTTATGT
AGCTATTGCCATTGATTCTCGTTACCATGGTGAAAGGGCCAAGAACAAAACCACTTACTATGATGCTCTTATATCTTCATGGAAAAGAGGCGATACCATGCCGTTCATAT
TTGACACGGTATGGGACTTGATAAAACTGGCGGATTATCTGACGAAAAGGGAGGACATTGACCCATGTAGAATAGGAATTACTGGCGAATCACTTGGAGGAATGCATGCA
TGGTTTGCTGCTGCTGCTGATACTCGTTACTCCGTGGTTGTCCCCATAATTGGCGTGCAGTGTTTTCGATGGGCCATAGATAACGATAAGTGGCAGGCACGAGTCGAGAG
TATAAAACCTGTTTTCGAGGAAGCCCGAATCGAATTAGGCATGAACGAGATCGACAAAGAGGTGGTGAAGAAGGTCTGGAACAGGATTGCTCCTGGTTTAGTTTCCCAAT
TCGGCTCGATTTATTCGGTTCCAGCTATCGCACCACGTCCTTTGTTGTTATTAAATGGTGCAGATGACCCTCGATGTCCGATTGCTGGTTTGGATGCTCCCGTGTCGAGA
ACCCAGATAGCTTATCAGAAGTTCGGTTGTCCAGAAAATTTTAAGTTCATTGCACAGCCTGGGATTGGCCACGAAATGACACCAGAGATGGTAAAAGAAGCTAGCCAATG
GTTTGATAGGTTCTTAGGCCACAGAATCAAAGATCAGCTGTTGGATTAA
mRNA sequence
ATGCGCATCACTAGATGCGAAAATAGGATCAGTTGGTGGTCGTGGAGACGCAAGGGAGAATTGTATCAAAGTTTCGACATCAATATTGTATTTATACGCAGCCAAGCCGG
TCCTTATCGGCGTTTGGCGAGTGCCTGCAGACTGCAAAATTACGCAAACCATATTATAACGATAAGGAAAAAGAAAAGGCGGCAACTCTGCGTCAGCCGTTCAGTCTGTC
GGAAAAACGATGTGGGCCAACGACGCAGTACAATAATAATGGCAATTCTCATACCTGACGCCCTACTCCGCCCTTCCCTAACACGCCTCTGCCTTGCAGCAACTTCGCCA
TGGAACCGCCAAAGTTCCAATGAGATTAAATCCTTGTACAGGGTCGCAGCCTTGGGAAACTGTCAAATGGGTGAAGCTGTCGTTGACGCTGACAAGTTTCGGGCTGAATT
CCTTCGAGTTTTGCGTAGTAGACGATCTCCAGAAGTCCCGCTTAATGTGAAGCGCACAATGCCTATTCAGGAGACCAACCCGCCGGCCTTCAGTAAGGCTATGGCTTCTT
GTCCAAAGGCAACTTTTAGCAATTTGAAGGACTTGCTTCATGAGGAAAATCTTCACCTGACTACCGAGGAAGGAGAGCAAGGGAAGTTGCCTATATTGATTATAAGCATG
AAGGATAGCAGACAGCAAAAAAGACCTGCAATTGTTTTTCTGCACAGTACAAATAAGTGTAAAGAGTGGTTGAGACCATTGCTTGAGGCTTATGCATCAAGGGGTTATGT
AGCTATTGCCATTGATTCTCGTTACCATGGTGAAAGGGCCAAGAACAAAACCACTTACTATGATGCTCTTATATCTTCATGGAAAAGAGGCGATACCATGCCGTTCATAT
TTGACACGGTATGGGACTTGATAAAACTGGCGGATTATCTGACGAAAAGGGAGGACATTGACCCATGTAGAATAGGAATTACTGGCGAATCACTTGGAGGAATGCATGCA
TGGTTTGCTGCTGCTGCTGATACTCGTTACTCCGTGGTTGTCCCCATAATTGGCGTGCAGTGTTTTCGATGGGCCATAGATAACGATAAGTGGCAGGCACGAGTCGAGAG
TATAAAACCTGTTTTCGAGGAAGCCCGAATCGAATTAGGCATGAACGAGATCGACAAAGAGGTGGTGAAGAAGGTCTGGAACAGGATTGCTCCTGGTTTAGTTTCCCAAT
TCGGCTCGATTTATTCGGTTCCAGCTATCGCACCACGTCCTTTGTTGTTATTAAATGGTGCAGATGACCCTCGATGTCCGATTGCTGGTTTGGATGCTCCCGTGTCGAGA
ACCCAGATAGCTTATCAGAAGTTCGGTTGTCCAGAAAATTTTAAGTTCATTGCACAGCCTGGGATTGGCCACGAAATGACACCAGAGATGGTAAAAGAAGCTAGCCAATG
GTTTGATAGGTTCTTAGGCCACAGAATCAAAGATCAGCTGTTGGATTAATAGCTCATTTCGGGTTGATAAAAGTTTGAAAATGTTTGCTTAATCCACGACACCTAAATGG
TGTGTGACGTCAAAGGTCTCTATTTATAATCAACTAAATTATCTGATTATCATATATGAAAATAGAAATAATTAGAAAAACCAATAAATAAAAAGATACTACAGATATCT
GTTATGATTCAAACTGTTCTCGTTGACTCAGTAG
Protein sequence
MRITRCENRISWWSWRRKGELYQSFDINIVFIRSQAGPYRRLASACRLQNYANHIITIRKKKRRQLCVSRSVCRKNDVGQRRSTIIMAILIPDALLRPSLTRLCLAATSP
WNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQETNPPAFSKAMASCPKATFSNLKDLLHEENLHLTTEEGEQGKLPILIISM
KDSRQQKRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYYDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHA
WFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVESIKPVFEEARIELGMNEIDKEVVKKVWNRIAPGLVSQFGSIYSVPAIAPRPLLLLNGADDPRCPIAGLDAPVSR
TQIAYQKFGCPENFKFIAQPGIGHEMTPEMVKEASQWFDRFLGHRIKDQLLD
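
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated on the page. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 9 nt of the CDS listed above.
print(translate("ATGCGCATCACTAGATGCGAAAATAGGATC"))
print(translate("TTGGATTAA"))
```

The two fragments translate to MRITRCENRI and LD, matching the start and end of the protein sequence. Passing the full CDS (with line breaks removed) to `translate` should reproduce the whole protein under the same assumption.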