; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G09420 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G09420
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr4:7366651..7367533
RNA-Seq ExpressionCSPI04G09420
SyntenyCSPI04G09420
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040427.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]7.9e-9273.73Show/hide
Query:  MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKRE
        MDLVKSSVLNEEMRRKSQSS  QSD L                 +G+ K                     S G  VNL  QQSSWVIDSGAS+HATSKRE
Subjt:  MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKRE

Query:  FFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIM
        FFASYTPGDFGSV M NDG TN VGIGDVHLKNRNGSRLILKNVKHIPDI MNLIS  KLDDEGF NTF+NGIWKLTKGSMVIA GQKFSSLYYM+AKI+
Subjt:  FFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIM

Query:  DSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        D DINT NDEANVELWHKRLSH+SEKGLKILTKKNHL DLKSTPLK+CPHCLAGK
Subjt:  DSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

KAA0047570.1 putative retrotransposon [Cucumis melo var. makuwa]5.1e-10771.62Show/hide
Query:  MDLVKSSVLNEEMRRKSQSSCSQSDVL-------------------------------------RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVA
        MDLVKSSVLNEEMRRKSQSS  QSDVL                                      HIKKYCRKLKRD KNHKGKEKKN+D+SDTDTI VA
Subjt:  MDLVKSSVLNEEMRRKSQSSCSQSDVL-------------------------------------RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVA

Query:  TEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDE
        TE+F++LS+GDVVNLA QQSSWVIDSGAS++ATSK +FFASYTP DFGSV M NDGS N VGIGDVHL NRNGSRLILKNVKHI DIRMNLIS  KLDDE
Subjt:  TEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDE

Query:  GFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKK-----------NHLLDLKSTPLKQCPHCL
        GF NTF+NGIWKLTKGS+VIA+G KFSSLYYM+AKI+DSDINT NDE N+ELWHKRLSH+SEKGLKILTKK           NHL DLKSTPLK+CPHCL
Subjt:  GFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKK-----------NHLLDLKSTPLKQCPHCL

Query:  AGK
        AGK
Subjt:  AGK

KAA0065636.1 putative retrotransposon [Cucumis melo var. makuwa]3.5e-9275.1Show/hide
Query:  RKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVM
        R +   C       HIK YCRKLKRD KNHKG+EKKN++DS+ DTI +A EDF++LS+GDVVNLATQQ+S VIDSGAS HATSKRE  ASYTPGDFG+V 
Subjt:  RKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVM

Query:  MDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVE
        M NDGSTN VGIG VHLKN NGSRLILKNVKHIPDIRMNLIS  KLD+EGF NTF+NGIWKLT+G MVIAKGQK S LYY++AKI+DSDINT N EANVE
Subjt:  MDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        LW  RLSH+SEKGLKIL KKNHL DLKS PLK   H LAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

TXG70578.1 hypothetical protein EZV62_005513 [Acer yangbiense]7.9e-8458.56Show/hide
Query:  MDLVKSSVLNEEMRRKSQSSCSQSDVL-------------------------------------RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVA
        MDL KSSVLNEEMRRKSQ S SQS+VL                                      HIKKYCR+LKRD KN KGKEKK +D +D D ++  
Subjt:  MDLVKSSVLNEEMRRKSQSSCSQSDVL-------------------------------------RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVA

Query:  TEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDE
        T+DF V+ D DVVNLA  ++SWVIDSGASIHATS+R+FFASYT GDFG V M N+G    VG+GDV L+  NG  L+LKNVKHIPDIR+NLIS  KLDDE
Subjt:  TEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDE

Query:  GFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        GF NTF++G WKLTKGSM++A+G+K SSLY+M AK+ D  INT ++E+  ELWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  GFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

TYJ98688.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.7e-9274.12Show/hide
Query:  MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKRE
        MDLVKSSVLNEEMRRKSQSS  QSD L                 +G+ K                     S G  VNLA QQSSWVIDSGAS+HATSKRE
Subjt:  MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKRE

Query:  FFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIM
        FFASYTPGDFGSV M NDG TN VGIGDVHLKNRNGSRLILKNVKHIPDI MNLIS  KLDDEGF NTF+NGIWKLTKGSMVIA GQKFSSLYYM+AKI+
Subjt:  FFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIM

Query:  DSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        D DINT NDEANVELWHKRLSH+SEKGLKILTKKNHL DLKSTPLK+CPHCLAGK
Subjt:  DSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

TrEMBL top hitse value%identityAlignment
A0A5A7TFU1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-9273.73Show/hide
Query:  MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKRE
        MDLVKSSVLNEEMRRKSQSS  QSD L                 +G+ K                     S G  VNL  QQSSWVIDSGAS+HATSKRE
Subjt:  MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKRE

Query:  FFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIM
        FFASYTPGDFGSV M NDG TN VGIGDVHLKNRNGSRLILKNVKHIPDI MNLIS  KLDDEGF NTF+NGIWKLTKGSMVIA GQKFSSLYYM+AKI+
Subjt:  FFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIM

Query:  DSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        D DINT NDEANVELWHKRLSH+SEKGLKILTKKNHL DLKSTPLK+CPHCLAGK
Subjt:  DSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

A0A5C7IN93 CCHC-type domain-containing protein3.8e-8458.56Show/hide
Query:  MDLVKSSVLNEEMRRKSQSSCSQSDVL-------------------------------------RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVA
        MDL KSSVLNEEMRRKSQ S SQS+VL                                      HIKKYCR+LKRD KN KGKEKK +D +D D ++  
Subjt:  MDLVKSSVLNEEMRRKSQSSCSQSDVL-------------------------------------RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVA

Query:  TEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDE
        T+DF V+ D DVVNLA  ++SWVIDSGASIHATS+R+FFASYT GDFG V M N+G    VG+GDV L+  NG  L+LKNVKHIPDIR+NLIS  KLDDE
Subjt:  TEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDE

Query:  GFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        GF NTF++G WKLTKGSM++A+G+K SSLY+M AK+ D  INT ++E+  ELWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  GFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

A0A5D3BKF7 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-9274.12Show/hide
Query:  MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKRE
        MDLVKSSVLNEEMRRKSQSS  QSD L                 +G+ K                     S G  VNLA QQSSWVIDSGAS+HATSKRE
Subjt:  MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKRE

Query:  FFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIM
        FFASYTPGDFGSV M NDG TN VGIGDVHLKNRNGSRLILKNVKHIPDI MNLIS  KLDDEGF NTF+NGIWKLTKGSMVIA GQKFSSLYYM+AKI+
Subjt:  FFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIM

Query:  DSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        D DINT NDEANVELWHKRLSH+SEKGLKILTKKNHL DLKSTPLK+CPHCLAGK
Subjt:  DSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

A0A5D3C706 Putative retrotransposon1.7e-9275.1Show/hide
Query:  RKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVM
        R +   C       HIK YCRKLKRD KNHKG+EKKN++DS+ DTI +A EDF++LS+GDVVNLATQQ+S VIDSGAS HATSKRE  ASYTPGDFG+V 
Subjt:  RKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVM

Query:  MDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVE
        M NDGSTN VGIG VHLKN NGSRLILKNVKHIPDIRMNLIS  KLD+EGF NTF+NGIWKLT+G MVIAKGQK S LYY++AKI+DSDINT N EANVE
Subjt:  MDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        LW  RLSH+SEKGLKIL KKNHL DLKS PLK   H LAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

A0A5D3CVK2 Putative retrotransposon2.5e-10771.62Show/hide
Query:  MDLVKSSVLNEEMRRKSQSSCSQSDVL-------------------------------------RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVA
        MDLVKSSVLNEEMRRKSQSS  QSDVL                                      HIKKYCRKLKRD KNHKGKEKKN+D+SDTDTI VA
Subjt:  MDLVKSSVLNEEMRRKSQSSCSQSDVL-------------------------------------RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVA

Query:  TEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDE
        TE+F++LS+GDVVNLA QQSSWVIDSGAS++ATSK +FFASYTP DFGSV M NDGS N VGIGDVHL NRNGSRLILKNVKHI DIRMNLIS  KLDDE
Subjt:  TEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDE

Query:  GFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKK-----------NHLLDLKSTPLKQCPHCL
        GF NTF+NGIWKLTKGS+VIA+G KFSSLYYM+AKI+DSDINT NDE N+ELWHKRLSH+SEKGLKILTKK           NHL DLKSTPLK+CPHCL
Subjt:  GFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKK-----------NHLLDLKSTPLKQCPHCL

Query:  AGK
        AGK
Subjt:  AGK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.5e-4540.74Show/hide
Query:  RRKSQ-SSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGS
        R KS+  +C   +   H K+ C   ++ +    G  +KN+D++            F+  + + ++L+  +S WV+D+ AS HAT  R+ F  Y  GDFG+
Subjt:  RRKSQ-SSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGS

Query:  VMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEAN
        V M N   +   GIGD+ +K   G  L+LK+V+H+PD+RMNLIS   LD +G+ + F N  W+LTKGS+VIAKG    +LY  NA+I   ++N   DE +
Subjt:  VMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEAN

Query:  VELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        V+LWHKR+ H+SEKGL+IL KK+ +   K T +K C +CL GK
Subjt:  VELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

P93293 Uncharacterized mitochondrial protein AtMg003001.0e-0937.08Show/hide
Query:  NNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDIN---TGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        + G+ K+ KG   I KG +  SLY +   +   + N   T  DE    LWH RL+H+S++G+++L KK  L   K + LK C  C+ GK
Subjt:  NNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDIN---TGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein7.3e-1137.08Show/hide
Query:  NNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDIN---TGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK
        + G+ K+ KG   I KG +  SLY +   +   + N   T  DE    LWH RL+H+S++G+++L KK  L   K + LK C  C+ GK
Subjt:  NNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDIN---TGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTAGTCAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTGTTCACAGTCAGATGTTTTGAGGCATATAAAGAAGTATTGTCGAAAATT
GAAAAGAGACCGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGAGGATGATAGTGATACTGATACAATCACTGTAGCCACTGAAGATTTTTTCGTCTTGTCTGATGGTG
ATGTTGTAAATCTTGCCACACAACAGAGCAGTTGGGTGATTGATAGTGGTGCCTCAATTCATGCTACTTCAAAGAGGGAATTTTTTGCATCCTACACTCCTGGTGATTTT
GGCAGTGTTATGATGGATAATGACGGATCAACAAATACAGTTGGCATCGGAGATGTACACTTGAAAAACAGAAATGGTTCTAGGCTGATTTTGAAAAATGTGAAACATAT
TCCTGATATTCGCATGAACTTGATTTCTATATGTAAGCTTGATGACGAAGGTTTCTACAATACCTTCAACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAG
CAAAGGGACAAAAATTTTCTTCACTGTACTATATGAATGCAAAAATCATGGATTCTGATATAAATACAGGGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTT
AGCCATATAAGTGAAAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCTTGATTTAAAGAGTACACCTCTAAAACAGTGTCCTCATTGTTTGGCAGGAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACCTAGTCAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTGTTCACAGTCAGATGTTTTGAGGCATATAAAGAAGTATTGTCGAAAATT
GAAAAGAGACCGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGAGGATGATAGTGATACTGATACAATCACTGTAGCCACTGAAGATTTTTTCGTCTTGTCTGATGGTG
ATGTTGTAAATCTTGCCACACAACAGAGCAGTTGGGTGATTGATAGTGGTGCCTCAATTCATGCTACTTCAAAGAGGGAATTTTTTGCATCCTACACTCCTGGTGATTTT
GGCAGTGTTATGATGGATAATGACGGATCAACAAATACAGTTGGCATCGGAGATGTACACTTGAAAAACAGAAATGGTTCTAGGCTGATTTTGAAAAATGTGAAACATAT
TCCTGATATTCGCATGAACTTGATTTCTATATGTAAGCTTGATGACGAAGGTTTCTACAATACCTTCAACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAG
CAAAGGGACAAAAATTTTCTTCACTGTACTATATGAATGCAAAAATCATGGATTCTGATATAAATACAGGGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTT
AGCCATATAAGTGAAAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCTTGATTTAAAGAGTACACCTCTAAAACAGTGTCCTCATTGTTTGGCAGGAAAGTAG
Protein sequenceShow/hide protein sequence
MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDF
GSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRL
SHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK