; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh05G011940 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh05G011940
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr05:9361352..9362400
RNA-Seq ExpressionCmoCh05G011940
SyntenyCmoCh05G011940
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022975176.1 uncharacterized protein LOC111474215 [Cucurbita maxima]8.3e-16384.73Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MS TKSTPKAQADRLTLIEE+MLFLKEVPDTL FLE RVTELSEKVV++DAMGNRLDGLPIAEL+F+VTSLEERVAPTSSPKPS SP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT
        FDVLQNTMMSLFNGLADEFRTT+DD+QERMA+M TRIE+TMKAVE                       GNRDAKELENFIFDVEQYFKATTACTDDKKVT
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT

Query:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
        VA+MYL DDAKLWWRTKVQDIEDGLCTIDSWEDLK+ELR+QFLPENAGH+AMEK+VALKHTG+IRDYVRQFSTLMLDIRGT+EKDKVFFFINGLQPWAKT
Subjt:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT

Query:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP
        K+HEN+VQTLAAAMACAERL+D GNEAGSQRR TPAPN GGKPY+PP
Subjt:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP

XP_022975706.1 uncharacterized protein LOC111475733, partial [Cucurbita maxima]1.8e-16285.01Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MSTTKSTPKAQADRLTLIEEEMLFLKEVP TLRFLE RVTELS+KVV +DAMGNRLDGLPIAEL+FRVTSLEERVAPTSSPKPS SP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT
        FDVLQNTMMSLFNGLADEFRTT+DD+QERMA+M TRIE+TMKAVE                       GNRDAKELENFIFDVEQYFKATTACTDDKKVT
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT

Query:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
        VA+MYL DDAKLWWRTKVQDIEDGLCTIDSWEDLK+ELR+QFLPENAGH+AMEK+VALKHTG IRDYVRQFSTLMLDIRGT+EKDKVFFFINGLQPWAKT
Subjt:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT

Query:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP
        K+HEN+VQTLAAAMAC ERL+D GNEAGSQRR TPAPN GGKPY+PP
Subjt:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]1.6e-17491.67Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT
        FD+LQNTMMSLFNGLADEFR+TVDDLQERM+AMSTRIE+TMKAVE                       GNRDAKELENFIFDVEQYFKATTACTDDKKVT
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT

Query:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
        VAAMYL+DDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
Subjt:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT

Query:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPPG
        KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPPG
Subjt:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPPG

XP_023526180.1 uncharacterized protein LOC111789739 [Cucurbita pepo subsp. pepo]2.1e-16693.85Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVEGNRDAKELENFIFDVEQYFKATTACTDDKKVTVAAMYLIDDAKLWWRTKVQDIED
        FDVLQNTMMSLFNGLADEFR+TVDDLQERMAAMSTRIE+TMKAVEG                YFKATTACTDDKKVTVAAMYL+DDAKLWWRTKVQDIED
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVEGNRDAKELENFIFDVEQYFKATTACTDDKKVTVAAMYLIDDAKLWWRTKVQDIED

Query:  GLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKTKIHENRVQTLAAAMACAERLVDC
        GLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKTKIHENRVQTLAAAMACAERLVDC
Subjt:  GLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKTKIHENRVQTLAAAMACAERLVDC

Query:  GNEAGSQRRATPAPNNGGKPYRPPG
        GNEAGSQRRATPAPNNGGKPYRPPG
Subjt:  GNEAGSQRRATPAPNNGGKPYRPPG

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]1.6e-17491.67Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT
        FD+LQNTMMSLFNGLADEFR+TVDDLQERM+AMSTRIE+TMKAVE                       GNRDAKELENFIFDVEQYFKATTACTDDKKVT
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT

Query:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
        VAAMYL+DDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
Subjt:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT

Query:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPPG
        KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPPG
Subjt:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPPG

TrEMBL top hitse value%identityAlignment
A0A6J1GE48 uncharacterized protein LOC1114533332.3e-9487.1Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT
        FDVLQNTMMSLFNGLADEFR+TVDDLQERMAAMSTRIEITMKAVE                       GN+DAKELENFIFDVEQYFKATTACTDDKKVT
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT

Query:  VAAMYLIDDAKLWWRTK
        VAAMYL+DD KLWWRTK
Subjt:  VAAMYLIDDAKLWWRTK

A0A6J1ID35 uncharacterized protein LOC1114714738.4e-16184.44Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MSTTKSTPKAQADRLTLIEEEML LKEVPDTLRFLE RVTELSEKVV +DAMGNRLDGLPIAEL+FRVTSLEERVAPTSSPKPS SP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVEG-----------------------NRDAKELENFIFDVEQYFKATTACTDDKKVT
        FDVLQNTMMSLFNGLADEFRTT+DD+QERMA+M TRIE+TMKAVE                        NRDAKELENFIFDVEQYFKATTACTDDKKVT
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVEG-----------------------NRDAKELENFIFDVEQYFKATTACTDDKKVT

Query:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
        VA+MYLIDDAKLWWRTKVQDIEDGL TIDSWEDLK+ELR++FLPENAGH+AMEK+VALKHTG IRDYVRQFSTLMLDI GT EKDK+FFFINGLQPWAKT
Subjt:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT

Query:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP
        K+HEN+VQTLAAAMACAERL+D GNEAGSQRR TPAPN GGKPY+PP
Subjt:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP

A0A6J1IDF7 uncharacterized protein LOC1114742154.0e-16384.73Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MS TKSTPKAQADRLTLIEE+MLFLKEVPDTL FLE RVTELSEKVV++DAMGNRLDGLPIAEL+F+VTSLEERVAPTSSPKPS SP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT
        FDVLQNTMMSLFNGLADEFRTT+DD+QERMA+M TRIE+TMKAVE                       GNRDAKELENFIFDVEQYFKATTACTDDKKVT
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT

Query:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
        VA+MYL DDAKLWWRTKVQDIEDGLCTIDSWEDLK+ELR+QFLPENAGH+AMEK+VALKHTG+IRDYVRQFSTLMLDIRGT+EKDKVFFFINGLQPWAKT
Subjt:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT

Query:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP
        K+HEN+VQTLAAAMACAERL+D GNEAGSQRR TPAPN GGKPY+PP
Subjt:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP

A0A6J1IEF9 uncharacterized protein LOC1114749452.6e-16284.73Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLE RVTELSEKVV +DAMGNRLDGLPIAEL+FRVTSLEERVAPTSSPKPS SP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT
        FDVLQNTMMSLFNGLADEFRTT+DD+QERMA+M TRIE+TMKAVE                       GN+DAKELENFIFDVEQYFKATT C DDKKVT
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT

Query:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
        VA+MYL DDAKLWWRTKVQDIEDGLCTIDSWEDLK+ELR+QFLPENA H+AMEK+VALKHTG+IRDYVRQFSTLMLDIRGT+EKDKVFFFINGLQPWAKT
Subjt:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT

Query:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP
        K+HEN+VQTLAAAMACAERL+D GNEAGSQRR TPAPN GGKPY+PP
Subjt:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP

A0A6J1IEY4 uncharacterized protein LOC1114757338.9e-16385.01Show/hide
Query:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE
        MSTTKSTPKAQADRLTLIEEEMLFLKEVP TLRFLE RVTELS+KVV +DAMGNRLDGLPIAEL+FRVTSLEERVAPTSSPKPS SP SSVAHKEGRGEE
Subjt:  MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEE

Query:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT
        FDVLQNTMMSLFNGLADEFRTT+DD+QERMA+M TRIE+TMKAVE                       GNRDAKELENFIFDVEQYFKATTACTDDKKVT
Subjt:  FDVLQNTMMSLFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVE-----------------------GNRDAKELENFIFDVEQYFKATTACTDDKKVT

Query:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT
        VA+MYL DDAKLWWRTKVQDIEDGLCTIDSWEDLK+ELR+QFLPENAGH+AMEK+VALKHTG IRDYVRQFSTLMLDIRGT+EKDKVFFFINGLQPWAKT
Subjt:  VAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFLPENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKT

Query:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP
        K+HEN+VQTLAAAMAC ERL+D GNEAGSQRR TPAPN GGKPY+PP
Subjt:  KIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGACGACAAAGTCGACACCCAAAGCACAAGCCGACCGGCTTACGCTAATAGAGGAGGAAATGCTGTTCCTCAAAGAAGTCCCTGACACCCTCCGTTTCCTGGAAAC
ACGGGTGACCGAACTGAGTGAGAAAGTCGTCCAAGTCGACGCAATGGGCAACCGCCTGGATGGGTTGCCAATCGCAGAACTGTTGTTTCGAGTGACCTCGCTCGAAGAAA
GAGTTGCTCCTACGAGCAGCCCAAAACCGTCTGATAGTCCGGGTAGCTCTGTCGCACACAAAGAGGGACGTGGCGAAGAGTTCGACGTACTACAAAACACAATGATGAGT
TTGTTCAATGGATTGGCTGATGAATTCAGAACAACAGTCGACGATCTCCAAGAAAGGATGGCCGCCATGAGCACTCGAATTGAAATTACCATGAAAGCCGTAGAAGGGAA
TCGGGACGCCAAAGAGTTGGAGAACTTCATTTTTGACGTCGAACAGTATTTCAAAGCCACAACGGCTTGTACCGACGACAAGAAGGTGACTGTAGCCGCGATGTATCTCA
TAGACGACGCCAAACTGTGGTGGCGTACGAAGGTGCAAGACATCGAGGATGGATTGTGCACCATCGACTCCTGGGAGGACCTCAAGAGAGAGTTAAGGGAACAATTCCTC
CCCGAAAACGCAGGGCATATAGCAATGGAGAAAATAGTAGCCCTAAAACACACTGGAAACATACGGGACTACGTCAGACAGTTCTCAACCCTGATGTTGGATATCAGGGG
CACAGCAGAGAAGGACAAGGTGTTCTTCTTTATAAATGGTCTACAGCCGTGGGCCAAAACAAAGATACATGAGAACAGGGTCCAAACCCTAGCTGCCGCAATGGCCTGCG
CCGAGAGACTCGTAGACTGTGGGAACGAAGCAGGATCCCAAAGAAGAGCGACACCAGCCCCAAACAATGGGGGCAAACCATACCGACCACCAGGCCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGACGACAAAGTCGACACCCAAAGCACAAGCCGACCGGCTTACGCTAATAGAGGAGGAAATGCTGTTCCTCAAAGAAGTCCCTGACACCCTCCGTTTCCTGGAAAC
ACGGGTGACCGAACTGAGTGAGAAAGTCGTCCAAGTCGACGCAATGGGCAACCGCCTGGATGGGTTGCCAATCGCAGAACTGTTGTTTCGAGTGACCTCGCTCGAAGAAA
GAGTTGCTCCTACGAGCAGCCCAAAACCGTCTGATAGTCCGGGTAGCTCTGTCGCACACAAAGAGGGACGTGGCGAAGAGTTCGACGTACTACAAAACACAATGATGAGT
TTGTTCAATGGATTGGCTGATGAATTCAGAACAACAGTCGACGATCTCCAAGAAAGGATGGCCGCCATGAGCACTCGAATTGAAATTACCATGAAAGCCGTAGAAGGGAA
TCGGGACGCCAAAGAGTTGGAGAACTTCATTTTTGACGTCGAACAGTATTTCAAAGCCACAACGGCTTGTACCGACGACAAGAAGGTGACTGTAGCCGCGATGTATCTCA
TAGACGACGCCAAACTGTGGTGGCGTACGAAGGTGCAAGACATCGAGGATGGATTGTGCACCATCGACTCCTGGGAGGACCTCAAGAGAGAGTTAAGGGAACAATTCCTC
CCCGAAAACGCAGGGCATATAGCAATGGAGAAAATAGTAGCCCTAAAACACACTGGAAACATACGGGACTACGTCAGACAGTTCTCAACCCTGATGTTGGATATCAGGGG
CACAGCAGAGAAGGACAAGGTGTTCTTCTTTATAAATGGTCTACAGCCGTGGGCCAAAACAAAGATACATGAGAACAGGGTCCAAACCCTAGCTGCCGCAATGGCCTGCG
CCGAGAGACTCGTAGACTGTGGGAACGAAGCAGGATCCCAAAGAAGAGCGACACCAGCCCCAAACAATGGGGGCAAACCATACCGACCACCAGGCCAGTGA
Protein sequenceShow/hide protein sequence
MSTTKSTPKAQADRLTLIEEEMLFLKEVPDTLRFLETRVTELSEKVVQVDAMGNRLDGLPIAELLFRVTSLEERVAPTSSPKPSDSPGSSVAHKEGRGEEFDVLQNTMMS
LFNGLADEFRTTVDDLQERMAAMSTRIEITMKAVEGNRDAKELENFIFDVEQYFKATTACTDDKKVTVAAMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKRELREQFL
PENAGHIAMEKIVALKHTGNIRDYVRQFSTLMLDIRGTAEKDKVFFFINGLQPWAKTKIHENRVQTLAAAMACAERLVDCGNEAGSQRRATPAPNNGGKPYRPPGQ