; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G13450 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G13450
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr5:13119959..13121231
RNA-Seq ExpressionCSPI05G13450
SyntenyCSPI05G13450
Gene Ontology termsGO:0006796 - phosphate-containing compound metabolic process (biological process)
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039461.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.8e-9059.32Show/hide
Query:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVRA---------------------AKSGGSANSIDNKSDSTIPMRTITLRGVAGGEA
        +RVIE+TFMNGLL W+++EV F R   LA+MM++AQ VENREIVR                      A SGG A   ++KS+++ P+RTITLR  A  E 
Subjt:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVRA---------------------AKSGGSANSIDNKSDSTIPMRTITLRGVAGGEA

Query:  KKEGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKV
        ++EG  KRL D EF++RKEK L FRCNEKYS DHKC+++EQRELRM+VVTEG+EE+EIVE+ E EEKEL  + ++E+   ++ELSIN VVGL++PGTMKV
Subjt:  KKEGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKV

Query:  RGEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        RG++ G+EV++L+DCGATHNF+ EKLVK L +P K+TSHYGVILGS   ++GKGICE +E+++N WK+V +FLPLELGGVDV+L MQWLYSLG+T
Subjt:  RGEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

KAA0039975.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]7.4e-9160.07Show/hide
Query:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK
        +RVI +TFMNGLL W+++EV FCR   LA+MM++AQ VENREI R        SGG     NS+           DNK+++  P+RTITLR     E ++
Subjt:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK

Query:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG
        EG  KRL D EF+ARKEK L FRCNEKYS DH+C+M+EQRELRM+VVTEG+EE+EIVED E EEKEL  + I+E    ++ELSIN VVGL++PGTMKVRG
Subjt:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG

Query:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        ++ G+EVV+L+DCGATHNF+ EKLV+ L +P K+TSHYGVILGS   ++GKG+CE +E+++ DWK+V +FLPLELGGVDV+L MQWLYSLG+T
Subjt:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

TYK06640.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.3e-9160.41Show/hide
Query:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK
        +RVI +TFMNGLL W+++EV FCR   LA+MM++AQ VENREI R        SGG     NS+           DNK+++  P+RTITLR     E ++
Subjt:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK

Query:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG
        EG  KRL D EF+ARKEK L FRCNEKYS DHKC+M+EQRELRM+VVTEG+EE+EIVED E EEKEL  + I+E    ++ELSIN VVGL++PGTMKVRG
Subjt:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG

Query:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        ++ G+EVV+L+DCGATHNF+ EKLV+ L +P K+TSHYGVILGS   ++GKG+CE +E+++ DWK+V +FLPLELGGVDV+L MQWLYSLG+T
Subjt:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

TYK18846.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.3e-9160.41Show/hide
Query:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK
        +RVI +TFMNGLL W+++EV FCR   LA+MM++AQ VENREI R        SGG     NS+           DNK+++  P+RTITLR     E ++
Subjt:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK

Query:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG
        EG  KRL D EF+ARKEK L FRCNEKYS DHKC+M+EQRELRM+VVTEG+EE+EIVED E EEKEL  + I+E    ++ELSIN VVGL++PGTMKVRG
Subjt:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG

Query:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        ++ G+EVV+L+DCGATHNF+ EKLV+ L +P K+TSHYGVILGS   ++GKG+CE +E+++ DWK+V +FLPLELGGVDV+L MQWLYSLG+T
Subjt:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

XP_031745972.1 uncharacterized protein LOC116406393 [Cucumis sativus]1.6e-9362.59Show/hide
Query:  KDRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-------------------AAKSGGSANSIDNKSDSTIPMRTITLRGVAGGEAK
        +DRV+EETFMNGL  WIKAEV FC+   LA+MM  AQ VENREI+R                     +S  + N  ++K ++  P+RT+TLR  A GE K
Subjt:  KDRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-------------------AAKSGGSANSIDNKSDSTIPMRTITLRGVAGGEAK

Query:  KEGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVR
        KEGP+KRL D EF+ARKEK L FRCNEKY H H+CK REQRELRMYVV E  EE+EIVE+ E +E EL  V I+ EDQAI+ELSIN VVGL+NPGTMKVR
Subjt:  KEGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVR

Query:  GEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        G+I+ +EV++L+DCGATHNFI +K+V+ L +PTK TSHYGVILGS   +KGKGICE +EL++  WKV ANFLPLELGGVD VLEMQWLYSLG+T
Subjt:  GEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

TrEMBL top hitse value%identityAlignment
A0A5A7TET8 Ty3/gypsy retrotransposon protein3.6e-9160.07Show/hide
Query:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK
        +RVI +TFMNGLL W+++EV FCR   LA+MM++AQ VENREI R        SGG     NS+           DNK+++  P+RTITLR     E ++
Subjt:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK

Query:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG
        EG  KRL D EF+ARKEK L FRCNEKYS DH+C+M+EQRELRM+VVTEG+EE+EIVED E EEKEL  + I+E    ++ELSIN VVGL++PGTMKVRG
Subjt:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG

Query:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        ++ G+EVV+L+DCGATHNF+ EKLV+ L +P K+TSHYGVILGS   ++GKG+CE +E+++ DWK+V +FLPLELGGVDV+L MQWLYSLG+T
Subjt:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

A0A5A7U9J7 Ty3/gypsy retrotransposon protein5.2e-9060.07Show/hide
Query:  VIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR---------------------AAKSGGSANSIDNKSDSTIPMRTITLRGVAGGEAKK
        VIE+TFMNGLL W+++EV FCR   LA+MM+ AQ VENREIVR                        SGG A  I  K +++ P+RTITLR  A  E ++
Subjt:  VIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR---------------------AAKSGGSANSIDNKSDSTIPMRTITLRGVAGGEAKK

Query:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG
        EG  KRL D EF+ARKEK L FRCNEKYS DHKC+++EQRELRM+VVTEGKEE+EIVE+ E EEKEL  + ++E+   ++ELSIN VVGL++P TMKVRG
Subjt:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG

Query:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        ++ G+EV++L+DCGATHNF+ EKLVK L +P K+TSHYGVILGS   ++GKGICE +E+++N WKVV +FLPLELGGVDV+L MQWLYSLG+T
Subjt:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

A0A5D3BLF5 Ty3/gypsy retrotransposon protein2.3e-9059.32Show/hide
Query:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVRA---------------------AKSGGSANSIDNKSDSTIPMRTITLRGVAGGEA
        +RVIE+TFMNGLL W+++EV F R   LA+MM++AQ VENREIVR                      A SGG A   ++KS+++ P+RTITLR  A  E 
Subjt:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVRA---------------------AKSGGSANSIDNKSDSTIPMRTITLRGVAGGEA

Query:  KKEGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKV
        ++EG  KRL D EF++RKEK L FRCNEKYS DHKC+++EQRELRM+VVTEG+EE+EIVE+ E EEKEL  + ++E+   ++ELSIN VVGL++PGTMKV
Subjt:  KKEGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKV

Query:  RGEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        RG++ G+EV++L+DCGATHNF+ EKLVK L +P K+TSHYGVILGS   ++GKGICE +E+++N WK+V +FLPLELGGVDV+L MQWLYSLG+T
Subjt:  RGEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

A0A5D3D5P9 Ty3/gypsy retrotransposon protein1.6e-9160.41Show/hide
Query:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK
        +RVI +TFMNGLL W+++EV FCR   LA+MM++AQ VENREI R        SGG     NS+           DNK+++  P+RTITLR     E ++
Subjt:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK

Query:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG
        EG  KRL D EF+ARKEK L FRCNEKYS DHKC+M+EQRELRM+VVTEG+EE+EIVED E EEKEL  + I+E    ++ELSIN VVGL++PGTMKVRG
Subjt:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG

Query:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        ++ G+EVV+L+DCGATHNF+ EKLV+ L +P K+TSHYGVILGS   ++GKG+CE +E+++ DWK+V +FLPLELGGVDV+L MQWLYSLG+T
Subjt:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

A0A5D3DLL9 Ty3/gypsy retrotransposon protein1.6e-9160.41Show/hide
Query:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK
        +RVI +TFMNGLL W+++EV FCR   LA+MM++AQ VENREI R        SGG     NS+           DNK+++  P+RTITLR     E ++
Subjt:  DRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVR-----AAKSGG---SANSI-----------DNKSDSTIPMRTITLRGVAGGEAKK

Query:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG
        EG  KRL D EF+ARKEK L FRCNEKYS DHKC+M+EQRELRM+VVTEG+EE+EIVED E EEKEL  + I+E    ++ELSIN VVGL++PGTMKVRG
Subjt:  EGPSKRLSDMEFRARKEKALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRG

Query:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT
        ++ G+EVV+L+DCGATHNF+ EKLV+ L +P K+TSHYGVILGS   ++GKG+CE +E+++ DWK+V +FLPLELGGVDV+L MQWLYSLG+T
Subjt:  EIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.1e-1229.11Show/hide
Query:  EQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRGEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSH
        + R+  +  +T  + + ++V+  +    EL+   ++++   + +     V+ L+    M+  G I   +VV+ +D GAT NFI  +L  SL++PT  T+ 
Subjt:  EQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRGEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSH

Query:  YGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELG--GVDVVLEMQWLYSLGIT
          V+LG    I+  G C  + L + + ++  NFL L+L    VDV+L  +WL  LG T
Subjt:  YGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELG--GVDVVLEMQWLYSLGIT

AT3G30770.1 Eukaryotic aspartyl protease family protein7.6e-0929.56Show/hide
Query:  QRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRGEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHY
        Q EL  +++ EG   +    + E   ++ K +R     Q   + +  F  G      M+  G I   +VV+++D GAT+NFI ++L   L++PT  T+  
Subjt:  QRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRGEIEGKEVVLLVDCGATHNFIFEKLVKSLQIPTKDTSHY

Query:  GVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDV----------VLEMQWL
         V+LG    I+  G C  + L + + ++  NFL L+L   DV           LE QWL
Subjt:  GVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDV----------VLEMQWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACGCCAGAAGGCAGAACTGAATGCGAAGGACAGGGTGATTGAGGAAACTTTCATGAATGGGTTGCTGTCATGGATCAAGGCTGAGGTAGATTTTTGTCGAACAAC
CAGATTAGCCCAGATGATGCAATTAGCTCAGTCGGTGGAGAATCGGGAGATCGTCCGGGCTGCTAAATCTGGAGGGAGTGCCAATTCAATTGATAATAAGAGCGACTCTA
CAATTCCCATGAGAACTATCACCTTAAGAGGGGTGGCTGGTGGGGAAGCTAAGAAGGAGGGGCCTAGTAAGAGATTATCTGACATGGAGTTTCGAGCTAGAAAGGAGAAG
GCACTCTATTTTCGATGTAACGAAAAGTACTCCCATGACCATAAATGTAAGATGAGGGAGCAAAGAGAGCTTCGAATGTATGTTGTCACAGAGGGAAAGGAGGAATTTGA
GATTGTGGAGGATATGGAGAGTGAGGAGAAAGAATTGAAAATGGTGAGGATAGATGAGGAAGACCAAGCCATTATAGAATTATCCATTAATTTTGTGGTAGGATTATCGA
ACCCGGGTACTATGAAGGTGAGAGGGGAGATCGAAGGCAAGGAAGTGGTGCTCTTAGTAGATTGTGGGGCTACACACAACTTTATCTTTGAAAAGTTGGTGAAGAGTTTA
CAGATACCGACGAAAGACACTTCTCATTATGGGGTGATTTTGGGGTCCGACACAACTATTAAGGGGAAAGGAATCTGTGAAGCCGTAGAGTTGAAAATGAATGATTGGAA
GGTAGTAGCGAACTTTTTACCGTTGGAGTTAGGGGGTGTGGACGTGGTCTTAGAGATGCAGTGGCTTTACTCTCTTGGCATCACATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAACGCCAGAAGGCAGAACTGAATGCGAAGGACAGGGTGATTGAGGAAACTTTCATGAATGGGTTGCTGTCATGGATCAAGGCTGAGGTAGATTTTTGTCGAACAAC
CAGATTAGCCCAGATGATGCAATTAGCTCAGTCGGTGGAGAATCGGGAGATCGTCCGGGCTGCTAAATCTGGAGGGAGTGCCAATTCAATTGATAATAAGAGCGACTCTA
CAATTCCCATGAGAACTATCACCTTAAGAGGGGTGGCTGGTGGGGAAGCTAAGAAGGAGGGGCCTAGTAAGAGATTATCTGACATGGAGTTTCGAGCTAGAAAGGAGAAG
GCACTCTATTTTCGATGTAACGAAAAGTACTCCCATGACCATAAATGTAAGATGAGGGAGCAAAGAGAGCTTCGAATGTATGTTGTCACAGAGGGAAAGGAGGAATTTGA
GATTGTGGAGGATATGGAGAGTGAGGAGAAAGAATTGAAAATGGTGAGGATAGATGAGGAAGACCAAGCCATTATAGAATTATCCATTAATTTTGTGGTAGGATTATCGA
ACCCGGGTACTATGAAGGTGAGAGGGGAGATCGAAGGCAAGGAAGTGGTGCTCTTAGTAGATTGTGGGGCTACACACAACTTTATCTTTGAAAAGTTGGTGAAGAGTTTA
CAGATACCGACGAAAGACACTTCTCATTATGGGGTGATTTTGGGGTCCGACACAACTATTAAGGGGAAAGGAATCTGTGAAGCCGTAGAGTTGAAAATGAATGATTGGAA
GGTAGTAGCGAACTTTTTACCGTTGGAGTTAGGGGGTGTGGACGTGGTCTTAGAGATGCAGTGGCTTTACTCTCTTGGCATCACATAA
Protein sequenceShow/hide protein sequence
MKRQKAELNAKDRVIEETFMNGLLSWIKAEVDFCRTTRLAQMMQLAQSVENREIVRAAKSGGSANSIDNKSDSTIPMRTITLRGVAGGEAKKEGPSKRLSDMEFRARKEK
ALYFRCNEKYSHDHKCKMREQRELRMYVVTEGKEEFEIVEDMESEEKELKMVRIDEEDQAIIELSINFVVGLSNPGTMKVRGEIEGKEVVLLVDCGATHNFIFEKLVKSL
QIPTKDTSHYGVILGSDTTIKGKGICEAVELKMNDWKVVANFLPLELGGVDVVLEMQWLYSLGIT