; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0012281 (gene) of Chayote v1 genome

Gene IDSed0012281
OrganismSechium edule (Chayote v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationLG01:10545385..10546392
RNA-Seq ExpressionSed0012281
SyntenySed0012281
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037097.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]3.1e-4030.4Show/hide
Query:  KKVTEEVQKTPAMEKKNEEEAKPKRRLPEAIANR----MLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYWFRCLENQNQMPKS
        +K     + T +    +++  + K    EA A+R     +E P+F G D  +W+ + E +F+ H + D   M+ V  +S  G AL W+R  E + +   S
Subjt:  KKVTEEVQKTPAMEKKNEEEAKPKRRLPEAIANR----MLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYWFRCLENQNQMPKS

Query:  WNEFRHALFERFEGG--DTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLVEDKNL------
        W   +  L  RF     + + ERF+ ++Q  TV +Y + F+   A L D+PD V++  FMNGL   IRAEVR+ +PK + +MM+ A+LVE++ +      
Subjt:  WNEFRHALFERFEGG--DTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLVEDKNL------

Query:  ----------------------------ASTSLPV--------GDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSG
                                     +T+ P+         +A   TMK+   I  + V++ +D G THNFIS  L + L+LPV + G   V+LGSG
Subjt:  ----------------------------ASTSLPV--------GDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSG

Query:  KAIKVDEVFRGVLLRIQNLTFVEDCLPIEMEGDFEVILGMPWLRGM---EVDWKARTMNMNLGEKTVTFQGDPFL
          ++   +   V +++ N    E+ LP+E+ G  +V+LGM WL  +    VDWK  T+  +   K +  +GDP L
Subjt:  KAIKVDEVFRGVLLRIQNLTFVEDCLPIEMEGDFEVILGMPWLRGM---EVDWKARTMNMNLGEKTVTFQGDPFL

KAA0056890.1 aminoacyl-tRNA ligase [Cucumis melo var. makuwa]1.2e-7146.69Show/hide
Query:  METKLKKV-TEEVQKTPAMEKKN--------------EEEAKPKRRLPEAIANRMLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQA
        ++ KLKKV  EEV   P     N               ++  P+      + N  LE P+F GTD   WILK+E +F+ H +DD   M+D I L MSGQA
Subjt:  METKLKKV-TEEVQKTPAMEKKN--------------EEEAKPKRRLPEAIANRMLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQA

Query:  LYWFRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTA
        L WFRC +N  + P+SW+EFR +L+ RF     +C +F+ L+Q G+V EYCS+FE  GALLP++   VL AKFMNGL   IR +VR+  PK I D+M  A
Subjt:  LYWFRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTA

Query:  RLVEDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPI
        RL E KN          AL  T + G T+  + V+VKV S   +N IS+NLA DLKL +D YG  SVVLGSGK +K D + RGVLL+I N T+VED  P+
Subjt:  RLVEDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPI

Query:  EMEGDFEVILGMPWLRG---MEVDWKARTMNMNLGEKTVTFQGDPFL
        +M  D EVILG  WL     MEVDWK   M + +G++TVT + DPFL
Subjt:  EMEGDFEVILGMPWLRG---MEVDWKARTMNMNLGEKTVTFQGDPFL

KAE8652678.1 hypothetical protein Csa_013756 [Cucumis sativus]5.0e-7047.09Show/hide
Query:  METKLKKV-TEEVQKTPAMEKKNEE---------EAKPKRRLPEAIANRM--LEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYW
        +E KLKKV  EEV   P  +  +            A  +R L  A  +RM  LE P+F GTD   WILK+E +F+ H +DD   M++ I L MSGQAL W
Subjt:  METKLKKV-TEEVQKTPAMEKKNEE---------EAKPKRRLPEAIANRM--LEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYW

Query:  FRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLV
        FRC +N    P+SW EFR +L++RF  G  +  RFI LQQ G+V EYCS+FE  GALLP++   V+ AKFMNGL   IR EVR+   + I D+M  ARL 
Subjt:  FRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLV

Query:  EDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPIEME
        E KN         +   ++ K   T+  + VVVKV S   +N IS+NLA DLKL +D YG  SVVLGSGK +K D + RGVLL+I N T+ ED  P++M 
Subjt:  EDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPIEME

Query:  GDFEVILGMPW---LRGMEVDWKARTMNMNLGEKTVTFQGDPFL
         D EVILG  W   L  MEVDWK  TM + +G++ VT + DP L
Subjt:  GDFEVILGMPW---LRGMEVDWKARTMNMNLGEKTVTFQGDPFL

TYK06549.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.8e-4030.4Show/hide
Query:  KKVTEEVQKTPAMEKKNEEEAKPKRRLPEAIANR----MLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYWFRCLENQNQMPKS
        +K     + T +    +++  + K    EA A+R     +E P+F G D  +W+ + E +F+ H + D   M+ V  +S  G AL W+R  E + +   S
Subjt:  KKVTEEVQKTPAMEKKNEEEAKPKRRLPEAIANR----MLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYWFRCLENQNQMPKS

Query:  WNEFRHALFERFEGG--DTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLVEDKNL------
        W   +  L  RF     + + ERF+ ++Q  TV +Y + F+   A L D+PD V++  FMNGL   IRAEVR+ +PK + +MM+ A+LVE++ +      
Subjt:  WNEFRHALFERFEGG--DTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLVEDKNL------

Query:  ----------------------------ASTSLPV--------GDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSG
                                     +T+ P+         +A   TMK+   I  + V++ +D G THNFIS  L + L+LPV + G   V+LGSG
Subjt:  ----------------------------ASTSLPV--------GDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSG

Query:  KAIKVDEVFRGVLLRIQNLTFVEDCLPIEMEGDFEVILGMPWLRGM---EVDWKARTMNMNLGEKTVTFQGDPFL
          ++   +   V +++ N    E+ LP+E+ G  +V+LGM WL  +    VDWK  T+  +   K ++ +GDP L
Subjt:  KAIKVDEVFRGVLLRIQNLTFVEDCLPIEMEGDFEVILGMPWLRGM---EVDWKARTMNMNLGEKTVTFQGDPFL

XP_016900762.1 PREDICTED: uncharacterized protein LOC107991016 [Cucumis melo]5.7e-6646.39Show/hide
Query:  METKLKKV-TEEVQKTPAMEKKN--------------EEEAKPKRRLPEAIANRMLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQA
        ++ KLKKV  EEV   P     N               ++  P+      + N  LE P+F GTD   WILK+E +F+ H +DD   M+D I L MSGQA
Subjt:  METKLKKV-TEEVQKTPAMEKKN--------------EEEAKPKRRLPEAIANRMLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQA

Query:  LYWFRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTA
        L WFRC +N  + P+SW+EFR +L+ RF     +C +F+ L+Q G+V EYCS+FE  GALLP++   VL AKFMNGL   IR +VR+  PK I D+M  A
Subjt:  LYWFRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTA

Query:  RLVEDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPI
        RL E KN          AL  T + G T+  + V+VKV S   +N IS+NLA DLKL +D YG  SVVLGSGK +K D + RGVLL+I N T+VED  P+
Subjt:  RLVEDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPI

Query:  EMEGDFEVILGMPWLRG---MEVDWKARTMNM
        +M  D EVILG  WL     MEVDWK   M +
Subjt:  EMEGDFEVILGMPWLRG---MEVDWKARTMNM

TrEMBL top hitse value%identityAlignment
A0A0A0LUB3 Retrotrans_gag domain-containing protein1.4e-3847.55Show/hide
Query:  METKLKKV-TEEVQKTPAMEKKNEE---------EAKPKRRLPEAIANRM--LEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYW
        +E KLKKV  EEV   P  +  +            A  +R L  A  +RM  LE P+F GTD   WILK+E +F+ H +DD   M++ I L MSGQAL W
Subjt:  METKLKKV-TEEVQKTPAMEKKNEE---------EAKPKRRLPEAIANRM--LEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYW

Query:  FRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLV
        FRC +N    P+SW EFR +L++RF  G  +  RFI LQQ G+V EYCS+FE  GALLP++   V+ AKFMNGL   IR EVR+   + I D+M  ARL 
Subjt:  FRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLV

Query:  EDKN
        E KN
Subjt:  EDKN

A0A1S4DXQ7 uncharacterized protein LOC1079910162.8e-6646.39Show/hide
Query:  METKLKKV-TEEVQKTPAMEKKN--------------EEEAKPKRRLPEAIANRMLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQA
        ++ KLKKV  EEV   P     N               ++  P+      + N  LE P+F GTD   WILK+E +F+ H +DD   M+D I L MSGQA
Subjt:  METKLKKV-TEEVQKTPAMEKKN--------------EEEAKPKRRLPEAIANRMLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQA

Query:  LYWFRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTA
        L WFRC +N  + P+SW+EFR +L+ RF     +C +F+ L+Q G+V EYCS+FE  GALLP++   VL AKFMNGL   IR +VR+  PK I D+M  A
Subjt:  LYWFRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTA

Query:  RLVEDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPI
        RL E KN          AL  T + G T+  + V+VKV S   +N IS+NLA DLKL +D YG  SVVLGSGK +K D + RGVLL+I N T+VED  P+
Subjt:  RLVEDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPI

Query:  EMEGDFEVILGMPWLRG---MEVDWKARTMNM
        +M  D EVILG  WL     MEVDWK   M +
Subjt:  EMEGDFEVILGMPWLRG---MEVDWKARTMNM

A0A5A7T6B1 Transposon Ty3-G Gag-Pol polyprotein1.5e-4030.4Show/hide
Query:  KKVTEEVQKTPAMEKKNEEEAKPKRRLPEAIANR----MLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYWFRCLENQNQMPKS
        +K     + T +    +++  + K    EA A+R     +E P+F G D  +W+ + E +F+ H + D   M+ V  +S  G AL W+R  E + +   S
Subjt:  KKVTEEVQKTPAMEKKNEEEAKPKRRLPEAIANR----MLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYWFRCLENQNQMPKS

Query:  WNEFRHALFERFEGG--DTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLVEDKNL------
        W   +  L  RF     + + ERF+ ++Q  TV +Y + F+   A L D+PD V++  FMNGL   IRAEVR+ +PK + +MM+ A+LVE++ +      
Subjt:  WNEFRHALFERFEGG--DTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLVEDKNL------

Query:  ----------------------------ASTSLPV--------GDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSG
                                     +T+ P+         +A   TMK+   I  + V++ +D G THNFIS  L + L+LPV + G   V+LGSG
Subjt:  ----------------------------ASTSLPV--------GDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSG

Query:  KAIKVDEVFRGVLLRIQNLTFVEDCLPIEMEGDFEVILGMPWLRGM---EVDWKARTMNMNLGEKTVTFQGDPFL
          ++   +   V +++ N    E+ LP+E+ G  +V+LGM WL  +    VDWK  T+  +   K +  +GDP L
Subjt:  KAIKVDEVFRGVLLRIQNLTFVEDCLPIEMEGDFEVILGMPWLRGM---EVDWKARTMNMNLGEKTVTFQGDPFL

A0A5D3BJD9 Aminoacyl-tRNA ligase5.7e-7246.69Show/hide
Query:  METKLKKV-TEEVQKTPAMEKKN--------------EEEAKPKRRLPEAIANRMLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQA
        ++ KLKKV  EEV   P     N               ++  P+      + N  LE P+F GTD   WILK+E +F+ H +DD   M+D I L MSGQA
Subjt:  METKLKKV-TEEVQKTPAMEKKN--------------EEEAKPKRRLPEAIANRMLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQA

Query:  LYWFRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTA
        L WFRC +N  + P+SW+EFR +L+ RF     +C +F+ L+Q G+V EYCS+FE  GALLP++   VL AKFMNGL   IR +VR+  PK I D+M  A
Subjt:  LYWFRCLENQNQMPKSWNEFRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTA

Query:  RLVEDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPI
        RL E KN          AL  T + G T+  + V+VKV S   +N IS+NLA DLKL +D YG  SVVLGSGK +K D + RGVLL+I N T+VED  P+
Subjt:  RLVEDKNLASTSLPVGDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPI

Query:  EMEGDFEVILGMPWLRG---MEVDWKARTMNMNLGEKTVTFQGDPFL
        +M  D EVILG  WL     MEVDWK   M + +G++TVT + DPFL
Subjt:  EMEGDFEVILGMPWLRG---MEVDWKARTMNMNLGEKTVTFQGDPFL

A0A5D3C860 Transposon Tf2-1 polyprotein isoform X18.9e-4130.4Show/hide
Query:  KKVTEEVQKTPAMEKKNEEEAKPKRRLPEAIANR----MLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYWFRCLENQNQMPKS
        +K     + T +    +++  + K    EA A+R     +E P+F G D  +W+ + E +F+ H + D   M+ V  +S  G AL W+R  E + +   S
Subjt:  KKVTEEVQKTPAMEKKNEEEAKPKRRLPEAIANR----MLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYWFRCLENQNQMPKS

Query:  WNEFRHALFERFEGG--DTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLVEDKNL------
        W   +  L  RF     + + ERF+ ++Q  TV +Y + F+   A L D+PD V++  FMNGL   IRAEVR+ +PK + +MM+ A+LVE++ +      
Subjt:  WNEFRHALFERFEGG--DTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLVEDKNL------

Query:  ----------------------------ASTSLPV--------GDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSG
                                     +T+ P+         +A   TMK+   I  + V++ +D G THNFIS  L + L+LPV + G   V+LGSG
Subjt:  ----------------------------ASTSLPV--------GDALRHTMKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSG

Query:  KAIKVDEVFRGVLLRIQNLTFVEDCLPIEMEGDFEVILGMPWLRGM---EVDWKARTMNMNLGEKTVTFQGDPFL
          ++   +   V +++ N    E+ LP+E+ G  +V+LGM WL  +    VDWK  T+  +   K ++ +GDP L
Subjt:  KAIKVDEVFRGVLLRIQNLTFVEDCLPIEMEGDFEVILGMPWLRGM---EVDWKARTMNMNLGEKTVTFQGDPFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G30770.1 Eukaryotic aspartyl protease family protein3.4e-0835.64Show/hide
Query:  MKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPIEM-EGDFEVILGMPWLRGMEVD
        M+    I    VVV +DSG T+NFIS  LA  LKLP      +SV+LG  + I+      G+ L +Q +   E+ L +++ + D +VILG    + +E  
Subjt:  MKLGATIDGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPIEM-EGDFEVILGMPWLRGMEVD

Query:  W
        W
Subjt:  W

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding4.4e-0830.08Show/hide
Query:  IDVIALSMSGQALYWFRCLENQNQMPKSWNEFRHALFERFEGGDTM----CERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAE
        + ++  ++ G    W + L  +N  P SW EF+  +    E   TM       +  +QQ G+VREY  +FE        +P   L A F+ GL   ++  
Subjt:  IDVIALSMSGQALYWFRCLENQNQMPKSWNEFRHALFERFEGGDTM----CERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAE

Query:  VRVFQPKHIRDMMKTARLVEDKN
        VR  +P  I  MM TA+ +E+ N
Subjt:  VRVFQPKHIRDMMKTARLVEDKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAAAATAACCAAATGGAAACCAAGTTGAAGAAGGTTACAGAGGAAGTTCAGAAGACCCCAGCAATGGAGAAGAAGAACGAAGAAGAAGCGAAACCAAAGCGACG
CCTACCCGAAGCCATAGCCAACCGGATGCTGGAGTTTCCTCTCTTCCATGGAACCGATGCGCCCACCTGGATCTTGAAAATAGAGCTCCACTTCAAATTTCACGCCATGG
ACGACTTTCCCACCATGATCGATGTCATCGCACTCTCTATGTCCGGCCAGGCCTTATACTGGTTCCGATGCCTCGAAAACCAGAATCAAATGCCGAAATCGTGGAATGAG
TTTCGCCATGCTCTGTTCGAGCGATTTGAAGGCGGCGACACCATGTGTGAACGGTTCATTGCGTTGCAGCAACATGGGACCGTGAGGGAGTATTGCAGCCAGTTCGAGTT
GTACGGGGCGCTCCTTCCAGACATTCCTGACTCCGTTCTTCGAGCCAAGTTTATGAACGGCTTACACGCACTCATTCGCGCCGAGGTCCGGGTGTTCCAGCCAAAGCACA
TACGAGACATGATGAAAACGGCGAGGTTGGTGGAAGATAAGAACCTCGCCTCGACGAGCCTACCCGTCGGAGATGCATTGCGTCATACCATGAAACTTGGAGCCACCATC
GACGGGAAGCCCGTGGTTGTTAAGGTCGACAGTGGGGAAACTCATAATTTCATATCCCGAAATTTGGCTAAGGATTTGAAGCTCCCAGTGGACGACTACGGCATCAGCAG
TGTGGTTTTGGGTTCCGGGAAGGCCATCAAAGTAGATGAAGTTTTTCGCGGCGTGCTGCTGCGAATCCAGAATCTAACGTTTGTGGAGGACTGTTTGCCGATTGAAATGG
AAGGGGATTTTGAGGTCATATTGGGAATGCCGTGGCTGCGTGGCATGGAAGTTGATTGGAAAGCTCGAACCATGAACATGAATTTGGGGGAAAAGACTGTAACATTTCAG
GGAGACCCATTTCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAAAATAACCAAATGGAAACCAAGTTGAAGAAGGTTACAGAGGAAGTTCAGAAGACCCCAGCAATGGAGAAGAAGAACGAAGAAGAAGCGAAACCAAAGCGACG
CCTACCCGAAGCCATAGCCAACCGGATGCTGGAGTTTCCTCTCTTCCATGGAACCGATGCGCCCACCTGGATCTTGAAAATAGAGCTCCACTTCAAATTTCACGCCATGG
ACGACTTTCCCACCATGATCGATGTCATCGCACTCTCTATGTCCGGCCAGGCCTTATACTGGTTCCGATGCCTCGAAAACCAGAATCAAATGCCGAAATCGTGGAATGAG
TTTCGCCATGCTCTGTTCGAGCGATTTGAAGGCGGCGACACCATGTGTGAACGGTTCATTGCGTTGCAGCAACATGGGACCGTGAGGGAGTATTGCAGCCAGTTCGAGTT
GTACGGGGCGCTCCTTCCAGACATTCCTGACTCCGTTCTTCGAGCCAAGTTTATGAACGGCTTACACGCACTCATTCGCGCCGAGGTCCGGGTGTTCCAGCCAAAGCACA
TACGAGACATGATGAAAACGGCGAGGTTGGTGGAAGATAAGAACCTCGCCTCGACGAGCCTACCCGTCGGAGATGCATTGCGTCATACCATGAAACTTGGAGCCACCATC
GACGGGAAGCCCGTGGTTGTTAAGGTCGACAGTGGGGAAACTCATAATTTCATATCCCGAAATTTGGCTAAGGATTTGAAGCTCCCAGTGGACGACTACGGCATCAGCAG
TGTGGTTTTGGGTTCCGGGAAGGCCATCAAAGTAGATGAAGTTTTTCGCGGCGTGCTGCTGCGAATCCAGAATCTAACGTTTGTGGAGGACTGTTTGCCGATTGAAATGG
AAGGGGATTTTGAGGTCATATTGGGAATGCCGTGGCTGCGTGGCATGGAAGTTGATTGGAAAGCTCGAACCATGAACATGAATTTGGGGGAAAAGACTGTAACATTTCAG
GGAGACCCATTTCTCTGA
Protein sequenceShow/hide protein sequence
MEKNNQMETKLKKVTEEVQKTPAMEKKNEEEAKPKRRLPEAIANRMLEFPLFHGTDAPTWILKIELHFKFHAMDDFPTMIDVIALSMSGQALYWFRCLENQNQMPKSWNE
FRHALFERFEGGDTMCERFIALQQHGTVREYCSQFELYGALLPDIPDSVLRAKFMNGLHALIRAEVRVFQPKHIRDMMKTARLVEDKNLASTSLPVGDALRHTMKLGATI
DGKPVVVKVDSGETHNFISRNLAKDLKLPVDDYGISSVVLGSGKAIKVDEVFRGVLLRIQNLTFVEDCLPIEMEGDFEVILGMPWLRGMEVDWKARTMNMNLGEKTVTFQ
GDPFL