CuGenDBv2

Carg03051 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg03051
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Peptidase_S9 domain-containing protein
Genome location: Carg_Chr09:3219061..3222705
RNA-Seq Expression: Carg03051
Synteny: Carg03051
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domains:
    IPR001375 - Peptidase S9, prolyl oligopeptidase, catalytic domain
    IPR029058 - Alpha/Beta hydrolase fold


Homology
GenBank top hits
KAG7024623.1 hypothetical protein SDJN02_13441, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 1.3e-116; identity: 100%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDAALIS
        MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDAALIS
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDAALIS

Query:  SWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVEKARIELGMNEINKEVV
        SWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVEKARIELGMNEINKEVV
Subjt:  SWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVEKARIELGMNEINKEVV

Query:  KKVCCFKFLV
        KKVCCFKFLV
Subjt:  KKVCCFKFLV

XP_022936567.1 uncharacterized protein LOC111443135 isoform X1 [Cucurbita moschata] (e-value: 9.0e-94; identity: 64.71%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDA----
        MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQE NPPAFSK  A    
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDA----

Query:  --------------------------------------------------------------------------------------------ALISSWKR
                                                                                                    ALISSWKR
Subjt:  --------------------------------------------------------------------------------------------ALISSWKR

Query:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINK
        GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV       E+ARIELGMNEI+K
Subjt:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINK

Query:  EVVKKV
        EVVKKV
Subjt:  EVVKKV

XP_022936568.1 uncharacterized protein LOC111443135 isoform X2 [Cucurbita moschata] (e-value: 2.4e-94; identity: 66.22%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSK-------
        MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQE NPPAFSK       
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSK-------

Query:  -------------------------------------------------------------------VDA---------------ALISSWKRGDTMPFI
                                                                           +D+               ALISSWKRGDTMPFI
Subjt:  -------------------------------------------------------------------VDA---------------ALISSWKRGDTMPFI

Query:  FDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINKEVVKKV
        FDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV       E+ARIELGMNEI+KEVVKKV
Subjt:  FDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINKEVVKKV

XP_022975811.1 uncharacterized protein LOC111476405 isoform X2 [Cucurbita maxima] (e-value: 6.0e-82; identity: 60.86%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQEPNPPAFSK--
        MAILIPDALLRPSLTRLC AATSPWNRQ+SN IKS YRVAA     M EAVVDADKFRAEFLRVLRSRRSPEVPLNVKRT P     IQEPNP  FSK  
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQEPNPPAFSK--

Query:  ------------------------------------------------------------------------VDA---------------ALISSWKRGD
                                                                                +D+               ALISSWK+GD
Subjt:  ------------------------------------------------------------------------VDA---------------ALISSWKRGD

Query:  TMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINKEV
        TMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV       E+ARIELGMNEIN EV
Subjt:  TMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINKEV

Query:  VKKV
        VKKV
Subjt:  VKKV

XP_023535291.1 uncharacterized protein LOC111796769 [Cucurbita pepo subsp. pepo] (e-value: 5.3e-86; identity: 60.45%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDA----
        MAILIPD LLRPSLTRLC AATSPWNRQ+SNEIKS YRVAA     M EAV+DADKFRAEFLR+LRSRRSPEVPLNVKRTMPIQEPNPPAFSK  A    
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDA----

Query:  -------------------------------------------------------------------------------------------------ALI
                                                                                                         ALI
Subjt:  -------------------------------------------------------------------------------------------------ALI

Query:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGM
        SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV       E+ARIELGM
Subjt:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGM

Query:  NEINKEVVKKV
        NEINKEVVKKV
Subjt:  NEINKEVVKKV

TrEMBL top hits
A0A5A7TAI4 Putative esterase YitV (e-value: 3.7e-69; identity: 52.53%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNC-----QMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQEPNPPA
        MAILI    L PSL  LC A T PWNRQS    KS YRVAA G       QM EA+VDADKFRAEFLRVLR+RRS EVPLNVK T P     IQE +PP 
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNC-----QMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQEPNPPA

Query:  FSKVDA----------------------------------------------------------------------------------------------
        FSK  A                                                                                              
Subjt:  FSKVDA----------------------------------------------------------------------------------------------

Query:  --ALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKAR
          ALIS+WK+GDTMPFIFDTVWDLIKLADYLT+REDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQ F WA+DNDKWQARV       E+AR
Subjt:  --ALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKAR

Query:  IELGMNEINKEVVKKV
        I+LGMNEINKEVVKKV
Subjt:  IELGMNEINKEVVKKV

A0A6J1F8T3 uncharacterized protein LOC111443135 isoform X1 (e-value: 4.4e-94; identity: 64.71%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDA----
        MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQE NPPAFSK  A    
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDA----

Query:  --------------------------------------------------------------------------------------------ALISSWKR
                                                                                                    ALISSWKR
Subjt:  --------------------------------------------------------------------------------------------ALISSWKR

Query:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINK
        GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV       E+ARIELGMNEI+K
Subjt:  GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINK

Query:  EVVKKV
        EVVKKV
Subjt:  EVVKKV

A0A6J1FDL4 uncharacterized protein LOC111443135 isoform X2 (e-value: 1.1e-94; identity: 66.22%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSK-------
        MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQE NPPAFSK       
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSK-------

Query:  -------------------------------------------------------------------VDA---------------ALISSWKRGDTMPFI
                                                                           +D+               ALISSWKRGDTMPFI
Subjt:  -------------------------------------------------------------------VDA---------------ALISSWKRGDTMPFI

Query:  FDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINKEVVKKV
        FDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV       E+ARIELGMNEI+KEVVKKV
Subjt:  FDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINKEVVKKV

A0A6J1IF89 uncharacterized protein LOC111476405 isoform X2 (e-value: 2.9e-82; identity: 60.86%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQEPNPPAFSK--
        MAILIPDALLRPSLTRLC AATSPWNRQ+SN IKS YRVAA     M EAVVDADKFRAEFLRVLRSRRSPEVPLNVKRT P     IQEPNP  FSK  
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQEPNPPAFSK--

Query:  ------------------------------------------------------------------------VDA---------------ALISSWKRGD
                                                                                +D+               ALISSWK+GD
Subjt:  ------------------------------------------------------------------------VDA---------------ALISSWKRGD

Query:  TMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINKEV
        TMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV       E+ARIELGMNEIN EV
Subjt:  TMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGMNEINKEV

Query:  VKKV
        VKKV
Subjt:  VKKV

A0A6J1IHS0 uncharacterized protein LOC111476405 isoform X1 (e-value: 1.1e-81; identity: 59.49%)
Query:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQEPNPPAFSKVD
        MAILIPDALLRPSLTRLC AATSPWNRQ+SN IKS YRVAA     M EAVVDADKFRAEFLRVLRSRRSPEVPLNVKRT P     IQEPNP  FSK  
Subjt:  MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMP-----IQEPNPPAFSKVD

Query:  A------------------------------------------------------------------------------------------------ALI
        A                                                                                                ALI
Subjt:  A------------------------------------------------------------------------------------------------ALI

Query:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGM
        SSWK+GDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV       E+ARIELGM
Subjt:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIELGM

Query:  NEINKEVVKKV
        NEIN EVVKKV
Subjt:  NEINKEVVKKV

SwissProt top hits
O34973 Putative hydrolase YtaP (e-value: 7.3e-06; identity: 42%)
Query:  VWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPI
        ++D +   DY+  R D+ P RIG  G S+GG+ AW+ AA D R  V V +
Subjt:  VWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPI

Q99390 Uncharacterized 31.7 kDa protein in traX-finO intergenic region (e-value: 6.9e-04; identity: 39.68%)
Query:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVV
        S  +RG  +P +     D+I + ++  K+E ID  RIG+ G SLGG H + AAA D R   +V
Subjt:  SSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVV
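
The identity percentages listed with each hit can be cross-checked against the alignment match line (the unlabeled line between Query and Subjt). The following is a minimal Python sketch, assuming the usual BLAST convention that a letter on the match line marks an identical residue, "+" marks a conservative substitution, and identity is the number of identical positions divided by the alignment length. The function name percent_identity is hypothetical and not part of CuGenDB.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block, read from its match line."""
    # Restore trailing spaces that may have been trimmed from the match line.
    midline = midline.ljust(len(query))
    # A letter marks an identical residue; '+' and spaces are not identities.
    identities = sum(1 for ch in midline if ch.isalpha())
    # The alignment length is the length of the (possibly gapped) query line.
    return round(100.0 * identities / len(query), 2)


# Query and match line copied from the O34973 (YtaP) hit above.
query = "VWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPI"
midline = "++D +   DY+  R D+ P RIG  G S+GG+ AW+ AA D R  V V +"
print(percent_identity(query, midline))
```

For hits whose alignment is split across several blocks, the Query lines and the match lines would first need to be concatenated block by block before calling the function.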

Arabidopsis top hits
AT5G25770.1 alpha/beta-Hydrolases superfamily protein (e-value: 1.3e-45; identity: 75.44%)
Query:  ALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIE
        ALISSW+ G+TMPFIFDTVWDLIKLA+YLT+R+DIDP +IGITG SLGGMHAWFAAAADTRYSVVVP+IGVQ FRWAI+ND+W+ARV       E+ARI+
Subjt:  ALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIE

Query:  LGMNEINKEVVKKV
        LG N I+KE+V+KV
Subjt:  LGMNEINKEVVKKV

AT5G25770.1 alpha/beta-Hydrolases superfamily protein (e-value: 4.6e-03; identity: 45%)
Query:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEP
        M   +     FR +FLR+L SRRSP+VPL    + PI+ P
Subjt:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEP

AT5G25770.2 alpha/beta-Hydrolases superfamily protein (e-value: 1.3e-45; identity: 75.44%)
Query:  ALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIE
        ALISSW+ G+TMPFIFDTVWDLIKLA+YLT+R+DIDP +IGITG SLGGMHAWFAAAADTRYSVVVP+IGVQ FRWAI+ND+W+ARV       E+ARI+
Subjt:  ALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIE

Query:  LGMNEINKEVVKKV
        LG N I+KE+V+KV
Subjt:  LGMNEINKEVVKKV

AT5G25770.2 alpha/beta-Hydrolases superfamily protein (e-value: 4.6e-03; identity: 45%)
Query:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEP
        M   +     FR +FLR+L SRRSP+VPL    + PI+ P
Subjt:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEP

AT5G25770.3 alpha/beta-Hydrolases superfamily protein (e-value: 1.3e-45; identity: 75.44%)
Query:  ALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIE
        ALISSW+ G+TMPFIFDTVWDLIKLA+YLT+R+DIDP +IGITG SLGGMHAWFAAAADTRYSVVVP+IGVQ FRWAI+ND+W+ARV       E+ARI+
Subjt:  ALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARV-------EKARIE

Query:  LGMNEINKEVVKKV
        LG N I+KE+V+KV
Subjt:  LGMNEINKEVVKKV

AT5G25770.3 alpha/beta-Hydrolases superfamily protein (e-value: 4.6e-03; identity: 45%)
Query:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEP
        M   +     FR +FLR+L SRRSP+VPL    + PI+ P
Subjt:  MGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEP


Sequences
CDS sequence
ATGGCAATTCTCATACCTGACGCCCTACTCCGCCCTTCCCTAACACGCCTCTGCCTTGCAGCAACTTCGCCATGGAACCGCCAAAGTTCCAATGAGATTAAATCCTTGTA
CAGGGTCGCAGCCTTGGGAAACTGTCAAATGGGTGAAGCTGTCGTTGACGCTGACAAGTTTCGGGCTGAATTCCTTCGAGTTTTGCGTAGTAGACGATCTCCAGAAGTCC
CGCTTAATGTGAAGCGCACAATGCCTATTCAGGAGCCCAACCCGCCAGCCTTCAGTAAGGTTGATGCAGCTCTTATATCTTCATGGAAAAGAGGCGATACCATGCCGTTC
ATATTTGACACGGTATGGGACTTGATAAAACTGGCGGATTATCTGACGAAAAGGGAGGACATTGACCCATGTAGAATAGGAATTACTGGCGAATCACTTGGAGGAATGCA
TGCATGGTTTGCTGCTGCTGCTGATACTCGCTACTCTGTGGTTGTCCCCATAATTGGCGTGCAGTGTTTTCGATGGGCCATAGATAACGATAAGTGGCAGGCACGAGTCG
AGAAAGCCCGAATCGAATTAGGCATGAACGAGATCAACAAAGAGGTGGTGAAGAAGGTTTGTTGTTTCAAGTTCCTGGTTTAG
mRNA sequence
ATGGCAATTCTCATACCTGACGCCCTACTCCGCCCTTCCCTAACACGCCTCTGCCTTGCAGCAACTTCGCCATGGAACCGCCAAAGTTCCAATGAGATTAAATCCTTGTA
CAGGGTCGCAGCCTTGGGAAACTGTCAAATGGGTGAAGCTGTCGTTGACGCTGACAAGTTTCGGGCTGAATTCCTTCGAGTTTTGCGTAGTAGACGATCTCCAGAAGTCC
CGCTTAATGTGAAGCGCACAATGCCTATTCAGGAGCCCAACCCGCCAGCCTTCAGTAAGGTTGATGCAGCTCTTATATCTTCATGGAAAAGAGGCGATACCATGCCGTTC
ATATTTGACACGGTATGGGACTTGATAAAACTGGCGGATTATCTGACGAAAAGGGAGGACATTGACCCATGTAGAATAGGAATTACTGGCGAATCACTTGGAGGAATGCA
TGCATGGTTTGCTGCTGCTGCTGATACTCGCTACTCTGTGGTTGTCCCCATAATTGGCGTGCAGTGTTTTCGATGGGCCATAGATAACGATAAGTGGCAGGCACGAGTCG
AGAAAGCCCGAATCGAATTAGGCATGAACGAGATCAACAAAGAGGTGGTGAAGAAGGTTTGTTGTTTCAAGTTCCTGGTTTAG
Protein sequence
MAILIPDALLRPSLTRLCLAATSPWNRQSSNEIKSLYRVAALGNCQMGEAVVDADKFRAEFLRVLRSRRSPEVPLNVKRTMPIQEPNPPAFSKVDAALISSWKRGDTMPF
IFDTVWDLIKLADYLTKREDIDPCRIGITGESLGGMHAWFAAAADTRYSVVVPIIGVQCFRWAIDNDKWQARVEKARIELGMNEINKEVVKKVCCFKFLV
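
To check that the CDS and protein sequences above agree, the CDS can be translated codon by codon. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names CODON_TABLE and translate are hypothetical helpers, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Drop line breaks and other whitespace from the wrapped sequence.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven and last four codons of the CDS listed above.
print(translate("ATGGCAATTCTCATACCTGAC"))
print(translate("TTCCTGGTTTAG"))
```

Passing the full CDS (with its line breaks) to translate and comparing the result with the protein sequence gives a quick consistency check for the record.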