CuGenDBv2

Moc04g15030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g15030
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr4:11438352..11444831
RNA-Seq Expression: Moc04g15030
Synteny: Moc04g15030
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
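
The genome location above uses a `chromosome:start..end` notation. A minimal Python sketch for splitting it into its parts follows; the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr4:11438352..11444831")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```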


Homology
GenBank top hits    e value    % identity    Alignment
KAA0031662.1 gag/pol protein [Cucumis melo var. makuwa]    4.8e-64    44.09
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFNRGSSSGT
        MKEG SVREHVLN +V+ NVA+ NGAV DE++Q+S+IL+SL KSFL FRSN  MNK+EY +TTLL ELQ +QSLM+ K  EGEAN   S+RF        
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFNRGSSSGT

Query:  RSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN---------------------------------
          APSSSGS+  +K K  GKG  P + AA  KGKVK A K KCFHCN+D HWKRNCP YLA+ K+                                   
Subjt:  RSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN---------------------------------

Query:  ------------------------------------------------------------------------------------------EEGHVRDHKP
                                                                                                  EE H+RDHK 
Subjt:  ------------------------------------------------------------------------------------------EEGHVRDHKP

Query:  RSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMD
        RSK+V+N    EATN STRVVD+ G ++R VDE  TS QSH  Q L++PR SGR+VSQP+RY+GLTETQVVIPDDGVEDPL+YK+AM D DK++ VK +D
Subjt:  RSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMD

Query:  LGMESM
        L MESM
Subjt:  LGMESM

KAA0032016.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]    1.1e-65    49.22
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSK-RFNRGSSSG
        MKEG+SVREHVL++M+H ++AE NG  IDE +QISFILESL KSF+PF++N  +NK+E+ LT LLNELQ +Q+L K KG+E EANV T+K +F RGSSS 
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSK-RFNRGSSSG

Query:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN------------------------------EE
        ++S PS    +  KK    GKG  P       KGK K  EK KC+HC  + H  RNCP YLA+ K+                                EE
Subjt:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN------------------------------EE

Query:  GHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDK
         H+R+H+ RSK+V+ EIS+ AT       D+  ++T+VVD+     Q+HP Q L  PR SGR+V QPDRY+GL+E Q++IPDDG+EDPLTYK+AM D D 
Subjt:  GHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDK

Query:  DKCVKAMDLGMESMGVGGVVT
        D+ +KAMD  MESM    V T
Subjt:  DKCVKAMDLGMESMGVGGVVT

KAA0050670.1 gag/pol protein [Cucumis melo var. makuwa]    1.7e-66    41.47
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG
        M EG+SVREHVLN+MVH NVAE NGAVIDE SQ+SFILESL++SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RG + G
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG

Query:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN--------------------------------
        T+S PSSSG++ +KKKK  G+G+K +  AA    K K A K  CFHCN + HWKRNCP YLAE KKA                                 
Subjt:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------EEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVI
                          EE H+R+HKP SK+V+N++S+E T  STRVV++    TRVV  A +S ++H PQ L+ PR SGR+ + P RY+ LTET  VI
Subjt:  ------------------EEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVI

Query:  PDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM
         D  +EDPLT+KKAMED DKD+ +KAM+L +ESM
Subjt:  PDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]    8.7e-66    48.44
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG
        M EG+SVREHVLN+MVH NVAE NGAVIDE SQ+SFILESL +SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SG
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG

Query:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFA--------------EKEKCFHCNMDRH--------------------WKRNC---------
        T+S PSSSG++ +KKKK  G+G+K + AAA    K K A              E+      +M R                     +  NC         
Subjt:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFA--------------EKEKCFHCNMDRH--------------------WKRNC---------

Query:  --------------------------PMYLAENKKANEEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSG
                                   ++++ N    EE H+R+HKPRSK+V+NE+S+E T  STRVV++    TRVV   S++R +H PQ L+ PR SG
Subjt:  --------------------------PMYLAENKKANEEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSG

Query:  RMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM
        R+ + P RY+ LTET  VI D  +EDPLT+KKAMED DKD+ +KAM+L +ESM
Subjt:  RMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM

KAA0066490.1 gag/pol protein [Cucumis melo var. makuwa]    4.8e-64    39.09
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG
        M EG+SVREHVLN+MVH ++AE NGAVIDE SQ+SFILESL +SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SG
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG

Query:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN--------------------------------
        T+S PSSSG++ +KKKK  G+G+K + AAA    K K A K  CFHCN + HWKRNCP YLAE KKA                                 
Subjt:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------EEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPP
                                                       EE H+R+HKPRSK+V+NE+S+E T  STRVV++     RVV   S++R +H P
Subjt:  -----------------------------------------------EEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPP

Query:  QVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM
        Q L+ PR SGR+ + P RY+ LTET  VI D  +EDPLT+KKAMED DKD+ +KAM+L ++SM
Subjt:  QVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM

TrEMBL top hits    e value    % identity    Alignment
A0A5A7SKQ7 Gag/pol protein    2.3e-64    44.09
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFNRGSSSGT
        MKEG SVREHVLN +V+ NVA+ NGAV DE++Q+S+IL+SL KSFL FRSN  MNK+EY +TTLL ELQ +QSLM+ K  EGEAN   S+RF        
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFNRGSSSGT

Query:  RSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN---------------------------------
          APSSSGS+  +K K  GKG  P + AA  KGKVK A K KCFHCN+D HWKRNCP YLA+ K+                                   
Subjt:  RSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN---------------------------------

Query:  ------------------------------------------------------------------------------------------EEGHVRDHKP
                                                                                                  EE H+RDHK 
Subjt:  ------------------------------------------------------------------------------------------EEGHVRDHKP

Query:  RSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMD
        RSK+V+N    EATN STRVVD+ G ++R VDE  TS QSH  Q L++PR SGR+VSQP+RY+GLTETQVVIPDDGVEDPL+YK+AM D DK++ VK +D
Subjt:  RSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMD

Query:  LGMESM
        L MESM
Subjt:  LGMESM

A0A5A7U676 Gag/pol protein    8.4e-67    41.47
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG
        M EG+SVREHVLN+MVH NVAE NGAVIDE SQ+SFILESL++SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RG + G
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG

Query:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN--------------------------------
        T+S PSSSG++ +KKKK  G+G+K +  AA    K K A K  CFHCN + HWKRNCP YLAE KKA                                 
Subjt:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------EEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVI
                          EE H+R+HKP SK+V+N++S+E T  STRVV++    TRVV  A +S ++H PQ L+ PR SGR+ + P RY+ LTET  VI
Subjt:  ------------------EEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVI

Query:  PDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM
         D  +EDPLT+KKAMED DKD+ +KAM+L +ESM
Subjt:  PDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM

A0A5A7V6N0 Gag/pol protein    4.2e-66    48.44
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG
        M EG+SVREHVLN+MVH NVAE NGAVIDE SQ+SFILESL +SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SG
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG

Query:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFA--------------EKEKCFHCNMDRH--------------------WKRNC---------
        T+S PSSSG++ +KKKK  G+G+K + AAA    K K A              E+      +M R                     +  NC         
Subjt:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFA--------------EKEKCFHCNMDRH--------------------WKRNC---------

Query:  --------------------------PMYLAENKKANEEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSG
                                   ++++ N    EE H+R+HKPRSK+V+NE+S+E T  STRVV++    TRVV   S++R +H PQ L+ PR SG
Subjt:  --------------------------PMYLAENKKANEEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSG

Query:  RMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM
        R+ + P RY+ LTET  VI D  +EDPLT+KKAMED DKD+ +KAM+L +ESM
Subjt:  RMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM

A0A5A7VH46 Gag/pol protein    2.3e-64    39.09
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG
        M EG+SVREHVLN+MVH ++AE NGAVIDE SQ+SFILESL +SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS+SG
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSSSG

Query:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN--------------------------------
        T+S PSSSG++ +KKKK  G+G+K + AAA    K K A K  CFHCN + HWKRNCP YLAE KKA                                 
Subjt:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------EEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPP
                                                       EE H+R+HKPRSK+V+NE+S+E T  STRVV++     RVV   S++R +H P
Subjt:  -----------------------------------------------EEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPP

Query:  QVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM
        Q L+ PR SGR+ + P RY+ LTET  VI D  +EDPLT+KKAMED DKD+ +KAM+L ++SM
Subjt:  QVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESM

A0A5D3CYG9 Retrovirus-related pol polyprotein from transposon tnt 1-94    5.5e-66    49.22
Query:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSK-RFNRGSSSG
        MKEG+SVREHVL++M+H ++AE NG  IDE +QISFILESL KSF+PF++N  +NK+E+ LT LLNELQ +Q+L K KG+E EANV T+K +F RGSSS 
Subjt:  MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSK-RFNRGSSSG

Query:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN------------------------------EE
        ++S PS    +  KK    GKG  P       KGK K  EK KC+HC  + H  RNCP YLA+ K+                                EE
Subjt:  TRSAPSSSGSRTFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKAN------------------------------EE

Query:  GHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDK
         H+R+H+ RSK+V+ EIS+ AT       D+  ++T+VVD+     Q+HP Q L  PR SGR+V QPDRY+GL+E Q++IPDDG+EDPLTYK+AM D D 
Subjt:  GHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDK

Query:  DKCVKAMDLGMESMGVGGVVT
        D+ +KAMD  MESM    V T
Subjt:  DKCVKAMDLGMESMGVGGVVT

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGAAGGAGGGTTCATCAGTGCGAGAACACGTTCTCAACCTAATGGTCCACTTGAACGTGGCTGAGTCGAATGGGGCCGTCATAGACGAGCAGAGTCAGATCAGCTTCAT
TCTGGAATCTCTTATGAAGAGTTTCCTGCCATTCCGCAGCAATACAGTTATGAATAAGTTGGAGTACACTCTTACCACGCTCCTAAACGAGCTGCAGACTTACCAGTCTC
TTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAGAGGATCATCCTCTGGAACCAGGTCTGCGCCCTCTTCTTCTGGAAGTAGG
ACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGTTTGCAGAGAAAGAAAAATGTTTCCACTGCAA
CATGGACAGGCATTGGAAGCGCAACTGCCCAATGTACTTGGCCGAAAATAAGAAAGCCAACGAAGAAGGCCATGTTCGAGATCATAAACCACGGAGTAAGGTAGTAATTA
ACGAGATTTCCGAAGAGGCTACAAACACGTCAACAAGAGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGCACATCGCGTCAGTCACATCCACCT
CAAGTGTTGAAGGTGCCTCGACATAGTGGGAGGATGGTGTCACAACCTGACCGCTACGTGGGTTTAACTGAAACTCAAGTTGTCATACCTGATGACGGCGTCGAGGATCC
ATTGACCTATAAGAAGGCAATGGAAGATACTGACAAGGACAAATGCGTCAAAGCAATGGACCTGGGAATGGAGTCGATGGGAGTGGGAGGTGTTGTGACGTGGGGAGTTA
CCTCACATCACCATTCTTCTTCTTCTCCAAGTGTTGGAGAAGGTGAAGAACAGTGCTACCACAAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGATCCAGTGGTG
GTGTTCAAGGGGAATTCACTGAAGAAACATTCTTCAAAGAGTTGGAGCAAGGGTTTAGCTGCTCTCTTGAGGATGCTACTAGAGAACAGAGTTGTAGTTTTAGCTAAGGA
CAAGAGGAGTGCTGCCAAGGTCGTGGGGTCTATGTTGGAGGGCAGTTGGAAGAGGCGTGATTGTTGGAGACTATAG
mRNA sequence
ATGAAGGAGGGTTCATCAGTGCGAGAACACGTTCTCAACCTAATGGTCCACTTGAACGTGGCTGAGTCGAATGGGGCCGTCATAGACGAGCAGAGTCAGATCAGCTTCAT
TCTGGAATCTCTTATGAAGAGTTTCCTGCCATTCCGCAGCAATACAGTTATGAATAAGTTGGAGTACACTCTTACCACGCTCCTAAACGAGCTGCAGACTTACCAGTCTC
TTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAGAGGATCATCCTCTGGAACCAGGTCTGCGCCCTCTTCTTCTGGAAGTAGG
ACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGTTTGCAGAGAAAGAAAAATGTTTCCACTGCAA
CATGGACAGGCATTGGAAGCGCAACTGCCCAATGTACTTGGCCGAAAATAAGAAAGCCAACGAAGAAGGCCATGTTCGAGATCATAAACCACGGAGTAAGGTAGTAATTA
ACGAGATTTCCGAAGAGGCTACAAACACGTCAACAAGAGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGCACATCGCGTCAGTCACATCCACCT
CAAGTGTTGAAGGTGCCTCGACATAGTGGGAGGATGGTGTCACAACCTGACCGCTACGTGGGTTTAACTGAAACTCAAGTTGTCATACCTGATGACGGCGTCGAGGATCC
ATTGACCTATAAGAAGGCAATGGAAGATACTGACAAGGACAAATGCGTCAAAGCAATGGACCTGGGAATGGAGTCGATGGGAGTGGGAGGTGTTGTGACGTGGGGAGTTA
CCTCACATCACCATTCTTCTTCTTCTCCAAGTGTTGGAGAAGGTGAAGAACAGTGCTACCACAAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGATCCAGTGGTG
GTGTTCAAGGGGAATTCACTGAAGAAACATTCTTCAAAGAGTTGGAGCAAGGGTTTAGCTGCTCTCTTGAGGATGCTACTAGAGAACAGAGTTGTAGTTTTAGCTAAGGA
CAAGAGGAGTGCTGCCAAGGTCGTGGGGTCTATGTTGGAGGGCAGTTGGAAGAGGCGTGATTGTTGGAGACTATAG
Protein sequence
MKEGSSVREHVLNLMVHLNVAESNGAVIDEQSQISFILESLMKSFLPFRSNTVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFNRGSSSGTRSAPSSSGSR
TFKKKKAAGKGSKPDSAAAAQKGKVKFAEKEKCFHCNMDRHWKRNCPMYLAENKKANEEGHVRDHKPRSKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPP
QVLKVPRHSGRMVSQPDRYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKCVKAMDLGMESMGVGGVVTWGVTSHHHSSSSPSVGEGEEQCYHKHDPETQEDSEEDPVV
VFKGNSLKKHSSKSWSKGLAALLRMLLENRVVVLAKDKRSAAKVVGSMLEGSWKRRDCWRL
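
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below is illustrative only: it assumes the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly, and the `translate` helper is a hypothetical name. Only the first and last few codons of the CDS are used here; in practice the CDS lines would be joined into one string first.

```python
# Build the standard genetic code (NCBI table 1) from its compact
# representation; the TCAG base order matches the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 11 codons and last 5 codons of the CDS listed above.
cds_start = "ATGAAGGAGGGTTCATCAGTGCGAGAACACGTT"
cds_end = "TGTTGGAGACTATAG"

print(translate(cds_start))  # MKEGSSVREHV
print(translate(cds_end))    # CWRL
```

Both fragments agree with the start (MKEGSSVREHV) and end (CWRL) of the protein sequence listed above.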