CuGenDBv2

Clc04G11433 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G11433
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ClcChr04:24963390..24965181
RNA-Seq Expression: Clc04G11433
Synteny: Clc04G11433
Gene Ontology terms: NA
InterPro domains: NA
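
The genome location above follows the pattern chromosome:start..end. As a minimal sketch, the snippet below parses that string and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr04:24963390..24965181")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 1792 bp; with a 0-based, half-open convention it would be one base shorter.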


Homology
GenBank top hits | e value | % identity
KAA0043186.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | 2.0e-43 | 37.99
Query:  DLPNKIFLRERLFSFKMNATKSMDENLDEFKKLTSEINRTDEEL--------------------------------------------------------
        DLPNK+F+RE+LFSFKMN  K++DENLDEFKK T+ +N+T E+L                                                        
Subjt:  DLPNKIFLRERLFSFKMNATKSMDENLDEFKKLTSEINRTDEEL--------------------------------------------------------

Query:  ------GKESAKNLNSDQKQEAQKTVIKCYFCHK-GHIKRNCPERKKT-----------------------ELRDYKKPREHGKADFSIGDDSYEYFETL
              GK S +  N +Q+    K  +KC+ CHK GH KRNCP+R K                        +  D ++ REHG+    +G++++EY E L
Subjt:  ------GKESAKNLNSDQKQEAQKTVIKCYFCHK-GHIKRNCPERKKT-----------------------ELRDYKKPREHGKADFSIGDDSYEYFETL

Query:  VATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHI
        VATN+KAM  +TE+EDWVL SGCT+HMTS + WFV YK  +G SV+MGNN   +++G+G V+LKL +NRE+LLK  + +
Subjt:  VATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHI

KAA0051442.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 3.0e-55 | 37.34
Query:  GTTITLQRLGYPKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLRL-------------------EADLPNKIFLRERLFSFKMNATKSMDENLDEFKKL
        G+   L+ L  PK LP TLT  E++ +E++AY TLI+NITDNVLR                    + DLPNK+F++E+LFSFK N  K++DENLDEFKKL
Subjt:  GTTITLQRLGYPKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLRL-------------------EADLPNKIFLRERLFSFKMNATKSMDENLDEFKKL

Query:  TSEINRTDEELG--KESAKNLNS----------------------------DQKQEAQKTVIKCYFCHKGHIKRNCPER-------------KKTELRDY
        T+ +N+T E+LG   E+A  +NS                              K+   KT  K     +G   R    R             +  +  D 
Subjt:  TSEINRTDEELG--KESAKNLNS----------------------------DQKQEAQKTVIKCYFCHKGHIKRNCPER-------------KKTELRDY

Query:  KKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNV
        ++ REHG+    +G++++EY E L ATN+KAM   TE+ED VL SGCT+HMTS + WFV YK  +G SV+MGNN  C+++G+G V+LKL  NRE+LLK  
Subjt:  KKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNV

Query:  RHIPEIRRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE
                                       V +  + +L + K+  L  +K+V+   + LI+++EK  E ELWHR+LSHISE
Subjt:  RHIPEIRRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE

KAA0054988.1 hypothetical protein E6C27_scaffold43052G001360 [Cucumis melo var. makuwa] | 3.7e-45 | 32.1
Query:  PKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLR--LEADLPNKIFLRERLFSFKMNATK------------SMDENLDEFK------------------
        PK LP TLT  E++ +E++AY TLI+NITDNVLR  +E  +      ++   +F     K            S+ +   E K                  
Subjt:  PKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLR--LEADLPNKIFLRERLFSFKMNATK------------SMDENLDEFK------------------

Query:  --------KLTSEINRTDEEL---GKES-AKNLNSDQKQEAQKTVIKCYFCHKGHIKRNCPERKKT-----------------------ELRDYKKPREH
                K  ++ +   E L   GK S  KN N +Q+    K  +KC+ CHKGH KRNCP+R K                        +  D ++ REH
Subjt:  --------KLTSEINRTDEEL---GKES-AKNLNSDQKQEAQKTVIKCYFCHKGHIKRNCPERKKT-----------------------ELRDYKKPREH

Query:  GKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHIPEI
        G+    +G++++EY E L  TN++ M   TE+EDWVL SGCT+HMTS + W                                         + RH+P++
Subjt:  GKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHIPEI

Query:  RRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE
        +RNLIS+GMLD++GC I +  GF+KV +  + +L + K+  L  +K+V+   + LI+++EK+ E ELWH++LSHISE
Subjt:  RRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE

TYK12279.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | 2.0e-43 | 37.99
Query:  DLPNKIFLRERLFSFKMNATKSMDENLDEFKKLTSEINRTDEEL--------------------------------------------------------
        DLPNK+F+RE+LFSFKMN  K++DENLDEFKK T+ +N+T E+L                                                        
Subjt:  DLPNKIFLRERLFSFKMNATKSMDENLDEFKKLTSEINRTDEEL--------------------------------------------------------

Query:  ------GKESAKNLNSDQKQEAQKTVIKCYFCHK-GHIKRNCPERKKT-----------------------ELRDYKKPREHGKADFSIGDDSYEYFETL
              GK S +  N +Q+    K  +KC+ CHK GH KRNCP+R K                        +  D ++ REHG+    +G++++EY E L
Subjt:  ------GKESAKNLNSDQKQEAQKTVIKCYFCHK-GHIKRNCPERKKT-----------------------ELRDYKKPREHGKADFSIGDDSYEYFETL

Query:  VATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHI
        VATN+KAM  +TE+EDWVL SGCT+HMTS + WFV YK  +G SV+MGNN   +++G+G V+LKL +NRE+LLK  + +
Subjt:  VATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHI

TYK27723.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 2.2e-50 | 36.78
Query:  GTTITLQRLGYPKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLRL-------------------EADLPNKIFLRERLFSFKMNATKSMDENLDEFKKL
        G+   L+ L  PK LP TLT  E++ +E++AY TLI+NITDNVLR                    + DL NK+F+RE+LFSFKMN  K++DENLDEFKKL
Subjt:  GTTITLQRLGYPKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLRL-------------------EADLPNKIFLRERLFSFKMNATKSMDENLDEFKKL

Query:  TSEINRTDEELGKES----------------------------------------------------------------AKNLNSDQKQEAQKTVIKCYF
        T+ +N+T E+LG ES                                                                 KN N +Q+    K  +KC+ 
Subjt:  TSEINRTDEELGKES----------------------------------------------------------------AKNLNSDQKQEAQKTVIKCYF

Query:  CHK-GHIKRNCPERKKTELRD-YKKPREHGKADFS----------------------IGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNE
        CHK GH KRNCP+R K   RD  ++ R +G+ DF+                      +G++++EY + L ATN++AM    E+EDWVL SGCT++MTS +
Subjt:  CHK-GHIKRNCPERKKTELRD-YKKPREHGKADFS----------------------IGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNE

Query:  KWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHI
         WFV YK  +G SV MGNN  C+++G+G V+LK  +N E+LLK  + +
Subjt:  KWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHI

TrEMBL top hits | e value | % identity
A0A176WFF8 CCHC-type domain-containing protein | 3.6e-38 | 36.36
Query:  LPNKIFLRERLFSFKMNATKSMDENLDEFKKLTSEINRT--DEELGKESAKNLNSDQKQEAQKTV----------IKCYFCHK-GHIKRNCPE-RKKTEL
        L NKI+L+ERLF F+M+  KS++ NLDEF K+T E+  +   EEL  E+   +  +   EA + +           KC++C+K GH+K++C    KKT+ 
Subjt:  LPNKIFLRERLFSFKMNATKSMDENLDEFKKLTSEINRT--DEELGKESAKNLNSDQKQEAQKTV----------IKCYFCHK-GHIKRNCPE-RKKTEL

Query:  RDYKKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILL
              + H   D +   D Y   E L+A+ +       +  +W+L SGCT+HMT N+ W   Y++ + G V MGNNH C V GIG V +K+ D    +L
Subjt:  RDYKKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILL

Query:  KNVRHIPEIRRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE
        +NVR +PE+ RNLISIG+LD++G    + +G + ++K + T+L+  K+  L  +          +A ++   +  LWHR+L HISE
Subjt:  KNVRHIPEIRRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE

A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.4e-55 | 37.34
Query:  GTTITLQRLGYPKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLRL-------------------EADLPNKIFLRERLFSFKMNATKSMDENLDEFKKL
        G+   L+ L  PK LP TLT  E++ +E++AY TLI+NITDNVLR                    + DLPNK+F++E+LFSFK N  K++DENLDEFKKL
Subjt:  GTTITLQRLGYPKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLRL-------------------EADLPNKIFLRERLFSFKMNATKSMDENLDEFKKL

Query:  TSEINRTDEELG--KESAKNLNS----------------------------DQKQEAQKTVIKCYFCHKGHIKRNCPER-------------KKTELRDY
        T+ +N+T E+LG   E+A  +NS                              K+   KT  K     +G   R    R             +  +  D 
Subjt:  TSEINRTDEELG--KESAKNLNS----------------------------DQKQEAQKTVIKCYFCHKGHIKRNCPER-------------KKTELRDY

Query:  KKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNV
        ++ REHG+    +G++++EY E L ATN+KAM   TE+ED VL SGCT+HMTS + WFV YK  +G SV+MGNN  C+++G+G V+LKL  NRE+LLK  
Subjt:  KKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNV

Query:  RHIPEIRRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE
                                       V +  + +L + K+  L  +K+V+   + LI+++EK  E ELWHR+LSHISE
Subjt:  RHIPEIRRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE

A0A5A7UJ23 Integrase catalytic domain-containing protein | 1.8e-45 | 32.1
Query:  PKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLR--LEADLPNKIFLRERLFSFKMNATK------------SMDENLDEFK------------------
        PK LP TLT  E++ +E++AY TLI+NITDNVLR  +E  +      ++   +F     K            S+ +   E K                  
Subjt:  PKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLR--LEADLPNKIFLRERLFSFKMNATK------------SMDENLDEFK------------------

Query:  --------KLTSEINRTDEEL---GKES-AKNLNSDQKQEAQKTVIKCYFCHKGHIKRNCPERKKT-----------------------ELRDYKKPREH
                K  ++ +   E L   GK S  KN N +Q+    K  +KC+ CHKGH KRNCP+R K                        +  D ++ REH
Subjt:  --------KLTSEINRTDEEL---GKES-AKNLNSDQKQEAQKTVIKCYFCHKGHIKRNCPERKKT-----------------------ELRDYKKPREH

Query:  GKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHIPEI
        G+    +G++++EY E L  TN++ M   TE+EDWVL SGCT+HMTS + W                                         + RH+P++
Subjt:  GKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHIPEI

Query:  RRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE
        +RNLIS+GMLD++GC I +  GF+KV +  + +L + K+  L  +K+V+   + LI+++EK+ E ELWH++LSHISE
Subjt:  RRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE

A0A5D3DNU1 Putative gag-pol polyprotein | 3.2e-39 | 29.16
Query:  LPGTLTAEEKQNIEKIAYRTLILNITDNVLR-------------------LEADLPNKIFLRERLFSFKMNATKSMDENLDEFKKLTSEINRTDEELGKE
        LP  +T  EK++++++AY T++L ++D VLR                   L   LPNKI+++E+ F +KM+ +KS++ENLDEF+K+  ++N   E++  E
Subjt:  LPGTLTAEEKQNIEKIAYRTLILNITDNVLR-------------------LEADLPNKIFLRERLFSFKMNATKSMDENLDEFKKLTSEINRTDEELGKE

Query:  -----------------------------------SAKNLNSDQKQEAQKTVI----------------------------KCYFCHK-GHIKRNCPERK
                                           + K  N + K+E +   +                            KC+ CHK GH K+NCP  K
Subjt:  -----------------------------------SAKNLNSDQKQEAQKTVI----------------------------KCYFCHK-GHIKRNCPERK

Query:  KTELRDYKKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKED-WVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDN
          E    +     G     I  D Y+  ET   + E  M  + + +D W++ SGCTFHMT +  +   +++ DGG V +G+N  C V G G V +   D 
Subjt:  KTELRDYKKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKED-WVLGSGCTFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDN

Query:  REILLKNVRHIPEIRRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE
           +L NVR++P+++RNLIS+G LD  GC I    G +KV+K S   L     + L +++         IA  +  +   LWH++L+H+SE
Subjt:  REILLKNVRHIPEIRRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE

A0A5D3DVM0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.1e-50 | 36.78
Query:  GTTITLQRLGYPKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLRL-------------------EADLPNKIFLRERLFSFKMNATKSMDENLDEFKKL
        G+   L+ L  PK LP TLT  E++ +E++AY TLI+NITDNVLR                    + DL NK+F+RE+LFSFKMN  K++DENLDEFKKL
Subjt:  GTTITLQRLGYPKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLRL-------------------EADLPNKIFLRERLFSFKMNATKSMDENLDEFKKL

Query:  TSEINRTDEELGKES----------------------------------------------------------------AKNLNSDQKQEAQKTVIKCYF
        T+ +N+T E+LG ES                                                                 KN N +Q+    K  +KC+ 
Subjt:  TSEINRTDEELGKES----------------------------------------------------------------AKNLNSDQKQEAQKTVIKCYF

Query:  CHK-GHIKRNCPERKKTELRD-YKKPREHGKADFS----------------------IGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNE
        CHK GH KRNCP+R K   RD  ++ R +G+ DF+                      +G++++EY + L ATN++AM    E+EDWVL SGCT++MTS +
Subjt:  CHK-GHIKRNCPERKKTELRD-YKKPREHGKADFS----------------------IGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNE

Query:  KWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHI
         WFV YK  +G SV MGNN  C+++G+G V+LK  +N E+LLK  + +
Subjt:  KWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHI

SwissProt top hits | e value | % identity
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.6e-17 | 27.39
Query:  RTDEELGKESAKNLNSDQKQEAQKTVIKCYFCHK-GHIKRNCPERKKTELRDYKKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGC
        R+    G+  A+      K  ++  V  CY C++ GH KR+CP  +K +     +  +   A     +D+   F   +   E+ M  +  + +WV+ +  
Subjt:  RTDEELGKESAKNLNSDQKQEAQKTVIKCYFCHK-GHIKRNCPERKKTELRDYKKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGC

Query:  TFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHIPEIRRNLISIGMLDEIGCLISVYEGFL-----KVSKNSKTVLEA
        + H T     F  Y   D G+V MGN    K+ GIGD+ +K      ++LK+VRH+P++R NLIS   LD  G     YE +      +++K S  + + 
Subjt:  TFHMTSNEKWFVFYKQWDGGSVFMGNNHGCKVVGIGDVMLKLEDNREILLKNVRHIPEIRRNLISIGMLDEIGCLISVYEGFL-----KVSKNSKTVLEA

Query:  PKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE
             L    + + +  +  AQ E     +LWH+++ H+SE
Subjt:  PKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSHISE

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGCTGATCTTGGAGTGGACGTAGGCTCAAAGAGCGGAACCACTATAACTCTCCAGCGTTTAGGTTATCCTAAATCTTTGCCTGGGACTTTAACTGCTGAAGAGAAACA
AAATATTGAAAAAATTGCTTACAGAACGTTAATCTTGAACATTACTGATAATGTGTTGAGATTAGAGGCCGATCTCCCTAACAAAATCTTTCTTAGGGAAAGATTATTCT
CTTTCAAAATGAATGCTACAAAGTCGATGGATGAAAACTTGGACGAGTTCAAGAAGTTAACCTCAGAGATAAATCGAACAGACGAGGAACTTGGGAAGGAAAGTGCAAAG
AACTTGAATTCAGATCAGAAACAAGAGGCTCAGAAGACTGTGATCAAATGTTATTTTTGTCATAAAGGACACATCAAGAGGAACTGTCCAGAGAGGAAGAAGACAGAATT
AAGAGACTATAAGAAACCAAGGGAACATGGAAAGGCAGATTTCTCTATTGGAGATGATTCTTATGAGTATTTTGAAACATTGGTAGCAACCAATGAGAAAGCCATGACTA
AAAATACCGAGAAGGAAGACTGGGTCCTTGGCTCAGGTTGTACCTTTCATATGACTTCTAACGAGAAGTGGTTTGTATTCTACAAACAGTGGGATGGAGGCTCAGTATTC
ATGGGTAACAACCACGGATGTAAAGTGGTGGGAATAGGAGATGTGATGCTCAAACTTGAAGACAATAGAGAAATCTTGCTTAAAAATGTAAGACACATCCCAGAAATTAG
AAGGAACTTAATCTCAATTGGAATGCTTGATGAGATTGGATGTCTAATTTCAGTATATGAAGGGTTTCTAAAAGTCTCTAAGAACTCTAAGACAGTACTTGAAGCCCCAA
AACTGAATTGTCTGTCTATCATGAAGAGTGTTCTCTCGAAGGACCATGTACTGATAGCTCAAAGTGAAAAAGATGAAGAATTTGAGCTTTGGCATCGCAAGTTATCTCAC
ATAAGCGAGGGAGAATTTTTGTCTCTGCCGATTCTTTTTTTGCCAACGCAACAAAAAACGTTAGCAGAGTTAGTTTCTGCCGACGAATTTTTAATGCGTGGGCATAACAT
CATATCTGCCGACGAAAATATATGCGTGGGTAAAAAGTAG
mRNA sequence
ATGGCTGATCTTGGAGTGGACGTAGGCTCAAAGAGCGGAACCACTATAACTCTCCAGCGTTTAGGTTATCCTAAATCTTTGCCTGGGACTTTAACTGCTGAAGAGAAACA
AAATATTGAAAAAATTGCTTACAGAACGTTAATCTTGAACATTACTGATAATGTGTTGAGATTAGAGGCCGATCTCCCTAACAAAATCTTTCTTAGGGAAAGATTATTCT
CTTTCAAAATGAATGCTACAAAGTCGATGGATGAAAACTTGGACGAGTTCAAGAAGTTAACCTCAGAGATAAATCGAACAGACGAGGAACTTGGGAAGGAAAGTGCAAAG
AACTTGAATTCAGATCAGAAACAAGAGGCTCAGAAGACTGTGATCAAATGTTATTTTTGTCATAAAGGACACATCAAGAGGAACTGTCCAGAGAGGAAGAAGACAGAATT
AAGAGACTATAAGAAACCAAGGGAACATGGAAAGGCAGATTTCTCTATTGGAGATGATTCTTATGAGTATTTTGAAACATTGGTAGCAACCAATGAGAAAGCCATGACTA
AAAATACCGAGAAGGAAGACTGGGTCCTTGGCTCAGGTTGTACCTTTCATATGACTTCTAACGAGAAGTGGTTTGTATTCTACAAACAGTGGGATGGAGGCTCAGTATTC
ATGGGTAACAACCACGGATGTAAAGTGGTGGGAATAGGAGATGTGATGCTCAAACTTGAAGACAATAGAGAAATCTTGCTTAAAAATGTAAGACACATCCCAGAAATTAG
AAGGAACTTAATCTCAATTGGAATGCTTGATGAGATTGGATGTCTAATTTCAGTATATGAAGGGTTTCTAAAAGTCTCTAAGAACTCTAAGACAGTACTTGAAGCCCCAA
AACTGAATTGTCTGTCTATCATGAAGAGTGTTCTCTCGAAGGACCATGTACTGATAGCTCAAAGTGAAAAAGATGAAGAATTTGAGCTTTGGCATCGCAAGTTATCTCAC
ATAAGCGAGGGAGAATTTTTGTCTCTGCCGATTCTTTTTTTGCCAACGCAACAAAAAACGTTAGCAGAGTTAGTTTCTGCCGACGAATTTTTAATGCGTGGGCATAACAT
CATATCTGCCGACGAAAATATATGCGTGGGTAAAAAGTAG
Protein sequence
MADLGVDVGSKSGTTITLQRLGYPKSLPGTLTAEEKQNIEKIAYRTLILNITDNVLRLEADLPNKIFLRERLFSFKMNATKSMDENLDEFKKLTSEINRTDEELGKESAK
NLNSDQKQEAQKTVIKCYFCHKGHIKRNCPERKKTELRDYKKPREHGKADFSIGDDSYEYFETLVATNEKAMTKNTEKEDWVLGSGCTFHMTSNEKWFVFYKQWDGGSVF
MGNNHGCKVVGIGDVMLKLEDNREILLKNVRHIPEIRRNLISIGMLDEIGCLISVYEGFLKVSKNSKTVLEAPKLNCLSIMKSVLSKDHVLIAQSEKDEEFELWHRKLSH
ISEGEFLSLPILFLPTQQKTLAELVSADEFLMRGHNIISADENICVGKK
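
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch of that check, not the pipeline used to produce this record. It assumes the standard nuclear codon table and an in-frame CDS that starts at the first base; `translate`, `cds_start`, and `cds_end` are hypothetical names, and the two fragments are the first 30 and last 12 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS listed above.
cds_start = "ATGGCTGATCTTGGAGTGGACGTAGGCTCA"
cds_end = "GGTAAAAAGTAG"

print(translate(cds_start))  # MADLGVDVGS
print(translate(cds_end))    # GKK
```

Both fragments agree with the start (MADLGVDVGS) and end (GKK) of the protein sequence above. To check the whole record, join the CDS lines into one string and compare the translation with the joined protein lines.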