; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g302000 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g302000
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionZn-dependent exopeptidases superfamily protein
Genome locationCsor_Chr18:10952928..10955306
RNA-Seq ExpressionCsor.00g302000
SyntenyCsor.00g302000
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004177 - aminopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001948 - Peptidase M18


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573883.1 putative aspartyl aminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]1.87e-171100Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQIIPLLCH
        MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQIIPLLCH
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQIIPLLCH

Query:  SIYVAYETTEEIWLVGHLTCFARFCSWSTHQFVAFLASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRS
        SIYVAYETTEEIWLVGHLTCFARFCSWSTHQFVAFLASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRS
Subjt:  SIYVAYETTEEIWLVGHLTCFARFCSWSTHQFVAFLASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRS

Query:  SEGSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVERAFRQSFLGTFNSK
        SEGSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVERAFRQSFLGTFNSK
Subjt:  SEGSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVERAFRQSFLGTFNSK

KAG7012948.1 putative aspartyl aminopeptidase [Cucurbita argyrosperma subsp. argyrosperma]3.26e-10271.89Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQIIPLLCH
        MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAA   ++     
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQIIPLLCH

Query:  SIYVAYETTEEI-----WLVGHLTCFARF-CSWSTHQFVAFLASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAA
                 EE      W++   T +  F C +   +FVAFLASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEV +                   
Subjt:  SIYVAYETTEEI-----WLVGHLTCFARF-CSWSTHQFVAFLASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAA

Query:  SRSFRSSEGSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVERAFRQSFL
                GSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVERAFRQSFL
Subjt:  SRSFRSSEGSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVERAFRQSFL

XP_004150844.3 LOW QUALITY PROTEIN: probable aspartyl aminopeptidase [Cucumis sativus]9.98e-4837.53Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-
        MA ISRLQ+ LLH TP +LKS S+ S FP FSRS+PRK F PRLLCS    TPQ+SSSEAGSSSS VGDLLDYLNESWTQFH+TAE+KR LVAA   ++ 
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-

Query:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------
                P  C+      S  VA+   E+                                               W    L+   R            
Subjt:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------

Query:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------
                                                                  F   S H                  V+F              
Subjt:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------

Query:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS
                     LASSYCALRALIDSCES SDLKSEQAVRMVALFD EEV +                           GSIQ AGAPTMFQAMRRIAS
Subjt:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS

Query:  CLGQGYVGEGAVERAFRQSFL
         L QGYVGEGA ERAFRQSFL
Subjt:  CLGQGYVGEGAVERAFRQSFL

XP_022945761.1 uncharacterized protein LOC111449906 [Cucurbita moschata]1.51e-5194.74Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII
        MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAA   ++
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII

XP_031742173.1 probable aspartyl aminopeptidase [Cucumis sativus]9.98e-4837.53Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-
        MA ISRLQ+ LLH TP +LKS S+ S FP FSRS+PRK F PRLLCS    TPQ+SSSEAGSSSS VGDLLDYLNESWTQFH+TAE+KR LVAA   ++ 
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-

Query:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------
                P  C+      S  VA+   E+                                               W    L+   R            
Subjt:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------

Query:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------
                                                                  F   S H                  V+F              
Subjt:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------

Query:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS
                     LASSYCALRALIDSCES SDLKSEQAVRMVALFD EEV +                           GSIQ AGAPTMFQAMRRIAS
Subjt:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS

Query:  CLGQGYVGEGAVERAFRQSFL
         L QGYVGEGA ERAFRQSFL
Subjt:  CLGQGYVGEGAVERAFRQSFL

TrEMBL top hitse value%identityAlignment
A0A0A0KUS9 Uncharacterized protein1.27e-4837.53Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-
        MA ISRLQ+ LLH TP +LKS S+ S FP FSRS+PRK F PRLLCS    TPQ+SSSEAGSSSS VGDLLDYLNESWTQFH+TAE+KR LVAA   ++ 
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-

Query:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------
                P  C+      S +VA+   E+                                               W    L+   R            
Subjt:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------

Query:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------
                                                                  F   S H                  V+F              
Subjt:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------

Query:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS
                     LASSYCALRALIDSCES SDLKSEQAVRMVALFD EEV +                           GSIQ AGAPTMFQAMRRIAS
Subjt:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS

Query:  CLGQGYVGEGAVERAFRQSFL
         L QGYVGEGA ERAFRQSFL
Subjt:  CLGQGYVGEGAVERAFRQSFL

A0A5A7SZY8 Putative aspartyl aminopeptidase isoform X13.87e-4536.34Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-
        MA ISRLQ+ LLH TP +LKS S+ S FP FSRS+PRK   PRLLCS    TPQ+SSSE GSSSS VGDLLDYLNESWTQFH+TAE+KR LVAA   ++ 
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-

Query:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------
                P  C+      S  VA+   E+                                               W    L+   R            
Subjt:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------

Query:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------
                                                                  F   + H                  V+F              
Subjt:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------

Query:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS
                     LASSYCALRALIDSCES SDLK+E+AVRMVALFD EEV +                           GSIQ AGAPTMFQAMRRIAS
Subjt:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS

Query:  CLGQGYVGEGAVERAFRQSFL
         L QGYVGEGA ERAFRQSFL
Subjt:  CLGQGYVGEGAVERAFRQSFL

A0A5D3CD05 Putative aspartyl aminopeptidase isoform X13.87e-4536.34Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-
        MA ISRLQ+ LLH TP +LKS S+ S FP FSRS+PRK   PRLLCS    TPQ+SSSE GSSSS VGDLLDYLNESWTQFH+TAE+KR LVAA   ++ 
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-

Query:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------
                P  C+      S  VA+   E+                                               W    L+   R            
Subjt:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFAR------------

Query:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------
                                                                  F   + H                  V+F              
Subjt:  ----------------------------------------------------------FCSWSTH----------------QFVAF--------------

Query:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS
                     LASSYCALRALIDSCES SDLK+E+AVRMVALFD EEV +                           GSIQ AGAPTMFQAMRRIAS
Subjt:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS

Query:  CLGQGYVGEGAVERAFRQSFL
         L QGYVGEGA ERAFRQSFL
Subjt:  CLGQGYVGEGAVERAFRQSFL

A0A6J1DC87 probable aspartyl aminopeptidase3.66e-4736.58Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-
        MA ISRLQVHLLH TPPALKS++LLS FPR SR++ R+    R LCS    TPQSSSSE+GSSSS VGDL+DYLNESWTQFH+TAE+KR LVAA  +++ 
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCS----TPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII-

Query:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFARFC----------
                P  C+      S  VA+   E+                                               W    L+   R            
Subjt:  --------PLLCH------SIYVAYETTEEI----------------------------------------------WLVGHLTCFARFC----------

Query:  ------------------------------SWSTH----------------------------------------------QFVAF--------------
                                      +  TH                                                V+F              
Subjt:  ------------------------------SWSTH----------------------------------------------QFVAF--------------

Query:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS
                     LASSYCALRALIDSCESPSDLKSEQ VRMVALFD EEV +                           GSIQ AGAPTMFQAMRRI S
Subjt:  -------------LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIAS

Query:  CLGQGYVGEGAVERAFRQSFL
         L QGYVGEGA ERAFRQSFL
Subjt:  CLGQGYVGEGAVERAFRQSFL

A0A6J1G1S9 uncharacterized protein LOC1114499067.30e-5294.74Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII
        MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAA   ++
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQII

SwissProt top hitse value%identityAlignment
B9RAJ0 Probable aspartyl aminopeptidase1.1e-0535.19Show/hide
Query:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVE
        L  S+C+L+ALID+  S S L++E  VRMVALFD EEV                              S Q AG+P MF A+ RI S           + 
Subjt:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVE

Query:  RAFRQSFL
        +A ++SFL
Subjt:  RAFRQSFL

Q2HJH1 Aspartyl aminopeptidase3.3e-0540Show/hide
Query:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPS--ILSKMSAASRSFRSSEGSIQEA
        L S +CAL+ALIDSC +P+ L ++  VRM+AL+D EEV +  AQ   S+ +  +L ++SA+ +   + E +I ++
Subjt:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPS--ILSKMSAASRSFRSSEGSIQEA

Q5RBT2 Aspartyl aminopeptidase2.8e-0440Show/hide
Query:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPS--ILSKMSAASRSFRSSEGSIQEA
        L S +CAL+ALIDSC  P  L +E  VRM+ L+D EEV +  AQ   S+ +  +L ++SA+ +   + E +I ++
Subjt:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPS--ILSKMSAASRSFRSSEGSIQEA

Q9ULA0 Aspartyl aminopeptidase2.1e-0441.33Show/hide
Query:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPS--ILSKMSAASRSFRSSEGSIQEA
        L S +CAL+ALIDSC  P  L +E  VRMV L+D EEV +  AQ   S+ +  +L ++SA+ +   + E +I ++
Subjt:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPS--ILSKMSAASRSFRSSEGSIQEA

Q9Z2W0 Aspartyl aminopeptidase5.6e-0542.67Show/hide
Query:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPS--ILSKMSAASRSFRSSEGSIQEA
        L S +CAL+ALIDSC SP+ L  +  VRMV L+D EEV +  AQ   S+ +  IL ++SA+ +   + E +I ++
Subjt:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPS--ILSKMSAASRSFRSSEGSIQEA

Arabidopsis top hitse value%identityAlignment
AT5G04710.1 Zn-dependent exopeptidases superfamily protein1.1e-1649.07Show/hide
Query:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVE
        LASS+CALRALIDSCES  +L +E  +RM+ALFD EEV                              S Q AGAPTMFQAMRRI S LG   V E   +
Subjt:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRIASCLGQGYVGEGAVE

Query:  RAFRQSFL
        RA R+SFL
Subjt:  RAFRQSFL

AT5G04710.1 Zn-dependent exopeptidases superfamily protein1.3e-1246.53Show/hide
Query:  MAGISRLQVHLLHSTPPALKSTSLLSM---FPRFSRSTPRKIF---TPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQI
        MA I+RL   L HS P     +S LS    FP +   +P + F   +P L  S   S S  + S++S VGDLLDYLNESWTQFH+TAE+KR L+AA   +
Subjt:  MAGISRLQVHLLHSTPPALKSTSLLSM---FPRFSRSTPRKIF---TPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQI

Query:  I
        +
Subjt:  I

AT5G60160.1 Zn-dependent exopeptidases superfamily protein1.8e-0643.84Show/hide
Query:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEA
        L  S+C+L+ALID+  S SDL+ E  +RMVALFD EEV ++ AQ   S P ++  MS  +  F S    +++A
Subjt:  LASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGGAATATCTCGCTTGCAAGTGCATCTCCTTCACTCTACTCCTCCCGCTCTCAAGTCGACTTCGCTCCTCTCGATGTTTCCTCGCTTTTCTCGCTCTACTCCGCG
TAAAATTTTCACTCCTCGTCTTCTCTGTTCGACTCCTCAGAGTTCTTCTTCGGAGGCTGGTTCGAGTTCGAGCAGTGTTGGTGATCTTCTCGATTATCTCAATGAGTCCT
GGACTCAGTTTCATTCTACAGCCGAATCGAAACGGCATTTAGTTGCTGCTGATCTGCAAATCATTCCGCTTCTTTGTCATTCAATTTATGTGGCTTATGAAACTACAGAA
GAAATCTGGCTAGTCGGCCACTTGACTTGTTTTGCAAGATTTTGCTCTTGGTCTACTCATCAGTTTGTAGCTTTCCTTGCGTCAAGCTATTGTGCTTTGAGAGCTCTTAT
TGATTCTTGTGAATCGCCTAGTGACTTAAAGAGTGAACAGGCAGTTCGAATGGTTGCTTTATTTGATATTGAAGAGGTTCCAACTCATCTAGCTCAAATTCTATATTCAA
TCCCAAGCATCCTTAGCAAGATGTCTGCTGCATCAAGATCCTTCAGATCTTCGGAAGGTTCAATTCAGGAAGCTGGTGCACCCACCATGTTTCAGGCCATGAGGCGCATA
GCCAGCTGCTTAGGCCAAGGATACGTTGGTGAAGGTGCTGTTGAGCGCGCTTTTAGGCAATCATTTCTCGGTACGTTCAATTCAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGGAATATCTCGCTTGCAAGTGCATCTCCTTCACTCTACTCCTCCCGCTCTCAAGTCGACTTCGCTCCTCTCGATGTTTCCTCGCTTTTCTCGCTCTACTCCGCG
TAAAATTTTCACTCCTCGTCTTCTCTGTTCGACTCCTCAGAGTTCTTCTTCGGAGGCTGGTTCGAGTTCGAGCAGTGTTGGTGATCTTCTCGATTATCTCAATGAGTCCT
GGACTCAGTTTCATTCTACAGCCGAATCGAAACGGCATTTAGTTGCTGCTGATCTGCAAATCATTCCGCTTCTTTGTCATTCAATTTATGTGGCTTATGAAACTACAGAA
GAAATCTGGCTAGTCGGCCACTTGACTTGTTTTGCAAGATTTTGCTCTTGGTCTACTCATCAGTTTGTAGCTTTCCTTGCGTCAAGCTATTGTGCTTTGAGAGCTCTTAT
TGATTCTTGTGAATCGCCTAGTGACTTAAAGAGTGAACAGGCAGTTCGAATGGTTGCTTTATTTGATATTGAAGAGGTTCCAACTCATCTAGCTCAAATTCTATATTCAA
TCCCAAGCATCCTTAGCAAGATGTCTGCTGCATCAAGATCCTTCAGATCTTCGGAAGGTTCAATTCAGGAAGCTGGTGCACCCACCATGTTTCAGGCCATGAGGCGCATA
GCCAGCTGCTTAGGCCAAGGATACGTTGGTGAAGGTGCTGTTGAGCGCGCTTTTAGGCAATCATTTCTCGGTACGTTCAATTCAAAGTAG
Protein sequenceShow/hide protein sequence
MAGISRLQVHLLHSTPPALKSTSLLSMFPRFSRSTPRKIFTPRLLCSTPQSSSSEAGSSSSSVGDLLDYLNESWTQFHSTAESKRHLVAADLQIIPLLCHSIYVAYETTE
EIWLVGHLTCFARFCSWSTHQFVAFLASSYCALRALIDSCESPSDLKSEQAVRMVALFDIEEVPTHLAQILYSIPSILSKMSAASRSFRSSEGSIQEAGAPTMFQAMRRI
ASCLGQGYVGEGAVERAFRQSFLGTFNSK