; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G09020 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G09020
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr4:6813567..6816229
RNA-Seq ExpressionCSPI04G09020
SyntenyCSPI04G09020
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050409.1 hypothetical protein E6C27_scaffold1166G00260 [Cucumis melo var. makuwa]1.1e-10260.88Show/hide
Query:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N
        KVAT C +E+ K+ E  P+KEQ+K  T WTET+FIC NLILNGLTDELYDYYSTM+ AK+VW+ LQKKYDTEEA SKKY                    +
Subjt:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N

Query:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLITNCR------------------------------------------N
        EIQK A EII+EGM LD+QFQV +IIDKL  LWKDFK TLRHKTKEFSLE+L T  R                                           
Subjt:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLITNCR------------------------------------------N

Query:  RNHP--AAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRK
        +N+P   + ANL+E+E VAMI EVNVIGGS GWWLDTGA RHVCHDLS+FRKYNE+ DKNILLGDHH+TKV GI +VELKFTSGK LV+KKVLHT  IRK
Subjt:  RNHP--AAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRK

Query:  NLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGM
        NLV  YLL KAGF+QTIGSNLFTL+KNN FVG GY T+GM
Subjt:  NLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGM

KAA0059670.1 putative Polyprotein [Cucumis melo var. makuwa]6.4e-9057.4Show/hide
Query:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYVNEIQKKADEIISEGMRLDNQ
        KVAT C  E+ K+ E  P+KEQ+ +   WTET+FIC NLILNGLTDELYDYYSTM+ AKEVWDVLQ KYD EE  SKKYV          +S  +R    
Subjt:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYVNEIQKKADEIISEGMRLDNQ

Query:  FQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLITNCR------------------------------------------NRNHPAAQ--ANLVEDEYVA
           +V         +DFK TLRHKTKE SL+SLIT  R                                           +N+P ++  ANL+EDE VA
Subjt:  FQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLITNCR------------------------------------------NRNHPAAQ--ANLVEDEYVA

Query:  MIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGS
        MI EVNVI G  GWWLDTGA  HVCHDLS+FRKYNE+ DK ILLGDHH TKVAGIGKVELKFTSGK+LVLK+VLHTL IRKNLV  YLL KAGF+QTIGS
Subjt:  MIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGS

Query:  NLFTLTKNNAFVGNGYRTEGMSKLNLEMIRI
        +LFTLTKNN FVG GY T+GM KLNL++ +I
Subjt:  NLFTLTKNNAFVGNGYRTEGMSKLNLEMIRI

KAA0067915.1 putative Polyprotein [Cucumis melo var. makuwa]1.2e-8557.59Show/hide
Query:  MSIAKEVWDVLQKKYDTEEARSKKYV-------------------NEIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT
        M+ AKEVWD LQKKYDTEEAR KKY                    +EIQK A EIISEGM LD+QFQVAVIIDKLP LWKDFK TLRHKTKEF LESLIT
Subjt:  MSIAKEVWDVLQKKYDTEEARSKKYV-------------------NEIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT

Query:  -----------------------------------------------------------------------NCRNRNHPAAQANLVEDEYVAMIFEVNVI
                                                                               NCRNR+HP AQANL+E+E VAMIFEVNVI
Subjt:  -----------------------------------------------------------------------NCRNRNHPAAQANLVEDEYVAMIFEVNVI

Query:  GGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFTLTKN
        GGS GWWLDTGA+ HVCHDLSLFRKYNE+ DKNILLGDHH TKV  IG+VELKFT  K+LVLK+VLHT  IRKNLV  YLL KAGF+QTIGS+LFTLTKN
Subjt:  GGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFTLTKN

Query:  NAFVGNGYRTEGMSKLNLEMIRI
        N FV  GY T+GM KLNLE+ +I
Subjt:  NAFVGNGYRTEGMSKLNLEMIRI

PON99483.1 Zinc finger, CCHC-type, partial [Trema orientale]1.7e-8746.46Show/hide
Query:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N
        KVA +C SE+  +P + P + Q K + +W E +F+C N ILNGL+D+LYDYY++   AKE+WD LQKKYDTEEA +KKY                    +
Subjt:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N

Query:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT----------------------------------------------
        E+Q  + EIISEGM LD QFQVAV+IDKLP  WKDFK+ LRHKTKEFSLESLIT                                              
Subjt:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT----------------------------------------------

Query:  --------------------------------------------NCRNRNHPAAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYN
                                                    NCRNR  PA QANL E++ +AMI E+N++GGS GWW+DTGA RHVC++ +LF+ Y+
Subjt:  --------------------------------------------NCRNRNHPAAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYN

Query:  EINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGMSKLNLEMIRI
        E  DK +LLGD H T VAG G+VELKFTSGK L+LK VLHT  +RKNLV GYLL K GF+QTIG++LFT+TKNN FVG GY T+GM KLN++  +I
Subjt:  EINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGMSKLNLEMIRI

TYJ98000.1 hypothetical protein E5676_scaffold487G00230 [Cucumis melo var. makuwa]2.5e-10260.59Show/hide
Query:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N
        KVAT C  E+ K+ E  P+KEQ+K  T WTET+FIC NLILNGLTDELYDYYSTM+ AK+VW+ LQKKYDTEEA SKKY                    +
Subjt:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N

Query:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT--------------------------------------------NC
        EIQK A EII+EGM LD+QFQV +IIDKLP LWKDFK TLRHKTKEFSLE+LIT                                            N 
Subjt:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT--------------------------------------------NC

Query:  RNRNHPAAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRK
        +N     + ANL+E+E VAMI EVNVIGGS GWWLDT A RHVCHDLS+FRKYNE+ DKNILLGDHH+TKV GI +VEL+FTSGK LVLKKV HT  IRK
Subjt:  RNRNHPAAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRK

Query:  NLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGM
        NLV  YLL KAGF+QTIGSNLFTL+KNN FVG GY T+GM
Subjt:  NLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGM

TrEMBL top hitse value%identityAlignment
A0A2P5FP19 Zinc finger, CCHC-type (Fragment)8.4e-8846.46Show/hide
Query:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N
        KVA +C SE+  +P + P + Q K + +W E +F+C N ILNGL+D+LYDYY++   AKE+WD LQKKYDTEEA +KKY                    +
Subjt:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N

Query:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT----------------------------------------------
        E+Q  + EIISEGM LD QFQVAV+IDKLP  WKDFK+ LRHKTKEFSLESLIT                                              
Subjt:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT----------------------------------------------

Query:  --------------------------------------------NCRNRNHPAAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYN
                                                    NCRNR  PA QANL E++ +AMI E+N++GGS GWW+DTGA RHVC++ +LF+ Y+
Subjt:  --------------------------------------------NCRNRNHPAAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYN

Query:  EINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGMSKLNLEMIRI
        E  DK +LLGD H T VAG G+VELKFTSGK L+LK VLHT  +RKNLV GYLL K GF+QTIG++LFT+TKNN FVG GY T+GM KLN++  +I
Subjt:  EINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGMSKLNLEMIRI

A0A5A7UA92 Uncharacterized protein5.4e-10360.88Show/hide
Query:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N
        KVAT C +E+ K+ E  P+KEQ+K  T WTET+FIC NLILNGLTDELYDYYSTM+ AK+VW+ LQKKYDTEEA SKKY                    +
Subjt:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N

Query:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLITNCR------------------------------------------N
        EIQK A EII+EGM LD+QFQV +IIDKL  LWKDFK TLRHKTKEFSLE+L T  R                                           
Subjt:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLITNCR------------------------------------------N

Query:  RNHP--AAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRK
        +N+P   + ANL+E+E VAMI EVNVIGGS GWWLDTGA RHVCHDLS+FRKYNE+ DKNILLGDHH+TKV GI +VELKFTSGK LV+KKVLHT  IRK
Subjt:  RNHP--AAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRK

Query:  NLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGM
        NLV  YLL KAGF+QTIGSNLFTL+KNN FVG GY T+GM
Subjt:  NLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGM

A0A5A7VQD4 Putative Polyprotein6.0e-8657.59Show/hide
Query:  MSIAKEVWDVLQKKYDTEEARSKKYV-------------------NEIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT
        M+ AKEVWD LQKKYDTEEAR KKY                    +EIQK A EIISEGM LD+QFQVAVIIDKLP LWKDFK TLRHKTKEF LESLIT
Subjt:  MSIAKEVWDVLQKKYDTEEARSKKYV-------------------NEIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT

Query:  -----------------------------------------------------------------------NCRNRNHPAAQANLVEDEYVAMIFEVNVI
                                                                               NCRNR+HP AQANL+E+E VAMIFEVNVI
Subjt:  -----------------------------------------------------------------------NCRNRNHPAAQANLVEDEYVAMIFEVNVI

Query:  GGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFTLTKN
        GGS GWWLDTGA+ HVCHDLSLFRKYNE+ DKNILLGDHH TKV  IG+VELKFT  K+LVLK+VLHT  IRKNLV  YLL KAGF+QTIGS+LFTLTKN
Subjt:  GGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFTLTKN

Query:  NAFVGNGYRTEGMSKLNLEMIRI
        N FV  GY T+GM KLNLE+ +I
Subjt:  NAFVGNGYRTEGMSKLNLEMIRI

A0A5D3BDS3 Uncharacterized protein1.2e-10260.59Show/hide
Query:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N
        KVAT C  E+ K+ E  P+KEQ+K  T WTET+FIC NLILNGLTDELYDYYSTM+ AK+VW+ LQKKYDTEEA SKKY                    +
Subjt:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYV-------------------N

Query:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT--------------------------------------------NC
        EIQK A EII+EGM LD+QFQV +IIDKLP LWKDFK TLRHKTKEFSLE+LIT                                            N 
Subjt:  EIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLIT--------------------------------------------NC

Query:  RNRNHPAAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRK
        +N     + ANL+E+E VAMI EVNVIGGS GWWLDT A RHVCHDLS+FRKYNE+ DKNILLGDHH+TKV GI +VEL+FTSGK LVLKKV HT  IRK
Subjt:  RNRNHPAAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRK

Query:  NLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGM
        NLV  YLL KAGF+QTIGSNLFTL+KNN FVG GY T+GM
Subjt:  NLVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGM

A0A5D3DRT2 Putative Polyprotein3.1e-9057.4Show/hide
Query:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYVNEIQKKADEIISEGMRLDNQ
        KVAT C  E+ K+ E  P+KEQ+ +   WTET+FIC NLILNGLTDELYDYYSTM+ AKEVWDVLQ KYD EE  SKKYV          +S  +R    
Subjt:  KVATLCISEEQKIPESSPSKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYVNEIQKKADEIISEGMRLDNQ

Query:  FQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLITNCR------------------------------------------NRNHPAAQ--ANLVEDEYVA
           +V         +DFK TLRHKTKE SL+SLIT  R                                           +N+P ++  ANL+EDE VA
Subjt:  FQVAVIIDKLPHLWKDFKTTLRHKTKEFSLESLITNCR------------------------------------------NRNHPAAQ--ANLVEDEYVA

Query:  MIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGS
        MI EVNVI G  GWWLDTGA  HVCHDLS+FRKYNE+ DK ILLGDHH TKVAGIGKVELKFTSGK+LVLK+VLHTL IRKNLV  YLL KAGF+QTIGS
Subjt:  MIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGS

Query:  NLFTLTKNNAFVGNGYRTEGMSKLNLEMIRI
        +LFTLTKNN FVG GY T+GM KLNL++ +I
Subjt:  NLFTLTKNNAFVGNGYRTEGMSKLNLEMIRI

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-0828.23Show/hide
Query:  VNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFT
        +++ G    W +DT A  H      LF +Y   +   + +G+   +K+AGIG + +K   G  LVLK V H   +R NL+ G  L + G+     +  + 
Subjt:  VNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKNLVFGYLLYKAGFSQTIGSNLFT

Query:  LTKNNAFVGNGYRTEGMSKLNLEM
        LTK +  +  G     + + N E+
Subjt:  LTKNNAFVGNGYRTEGMSKLNLEM

Arabidopsis top hitse value%identityAlignment
AT4G00980.1 zinc knuckle (CCHC-type) family protein3.1e-1033.08Show/hide
Query:  PESSPSK--EQVKTHTAWTETNFICNNLILNGLTDELYDYYS-TMSIAKEVWDVLQKKYDTEEARSKK-------------------YVNEIQKKADEII
        PE++P +      T   W   +++C   ++N L+D LY  YS     AKE+WD L+  Y  +E++SK+                    V    K AD I+
Subjt:  PESSPSK--EQVKTHTAWTETNFICNNLILNGLTDELYDYYS-TMSIAKEVWDVLQKKYDTEEARSKK-------------------YVNEIQKKADEII

Query:  SEGMRLDNQFQVAVIIDKLPHLWKDFKTTL
        S GM LD  F V+ II K P  W+ F T L
Subjt:  SEGMRLDNQFQVAVIIDKLPHLWKDFKTTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCCCTCGTCTTTCAAGGAGTATTGCAAACATCTTTCCCTAAAAACTGCCTATCTTCTCCATTCCCCTTCTCATCGACTTAACTTCCTCTTCCAAGTGGAGAAGCT
GGTTTTGATCAATGCTTGTGTGTATAGTAAATGCACCGGGCACCCTACAAAACTGCCGAATGTGGTAGCATATGCTGGGGAACGGTACATTAAGCGAGATAAAGGGCGAA
GTGCAACAACAATTTTGAGACGTATCAATCTGCTGCAATGGTCGGATTATTTCATGAAAGTGGCTACATTATGTATCTCTGAAGAGCAAAAAATCCCAGAGTCCAGTCCA
TCTAAGGAGCAAGTCAAAACTCATACGGCGTGGACAGAGACCAATTTTATATGTAATAATTTAATACTTAATGGTCTTACTGATGAGTTGTATGATTATTACAGCACAAT
GTCCATCGCGAAAGAAGTTTGGGATGTGTTACAAAAGAAGTATGACACTGAGGAAGCTAGATCCAAGAAGTATGTGAATGAGATTCAGAAGAAAGCAGATGAGATCATAA
GCGAAGGTATGCGGCTTGACAATCAATTTCAAGTTGCAGTTATTATTGATAAATTACCTCATCTGTGGAAGGATTTCAAAACTACTCTAAGGCACAAAACCAAGGAGTTC
TCGCTGGAAAGTCTCATCACGAATTGTAGAAATAGGAATCATCCTGCTGCACAGGCGAACCTGGTAGAAGATGAATATGTAGCCATGATTTTTGAAGTTAATGTCATAGG
GGGGTCTGGAGGTTGGTGGCTAGACACAGGTGCTTATCGCCATGTCTGTCATGACCTTAGTTTATTTAGAAAGTATAATGAGATAAATGATAAGAATATCCTTCTAGGAG
ACCATCACATGACAAAGGTGGCAGGCATTGGAAAAGTAGAACTGAAATTCACATCTGGCAAAGTGCTTGTGTTGAAGAAAGTTCTTCATACGCTTGGAATTCGAAAGAAT
TTGGTCTTTGGATATCTGCTTTACAAAGCTGGCTTCTCGCAAACTATAGGGTCAAATTTATTTACTTTAACTAAGAACAATGCTTTTGTGGGAAATGGCTACAGGACAGA
AGGCATGTCTAAACTAAATTTAGAAATGATAAGAATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCCCTCGTCTTTCAAGGAGTATTGCAAACATCTTTCCCTAAAAACTGCCTATCTTCTCCATTCCCCTTCTCATCGACTTAACTTCCTCTTCCAAGTGGAGAAGCT
GGTTTTGATCAATGCTTGTGTGTATAGTAAATGCACCGGGCACCCTACAAAACTGCCGAATGTGGTAGCATATGCTGGGGAACGGTACATTAAGCGAGATAAAGGGCGAA
GTGCAACAACAATTTTGAGACGTATCAATCTGCTGCAATGGTCGGATTATTTCATGAAAGTGGCTACATTATGTATCTCTGAAGAGCAAAAAATCCCAGAGTCCAGTCCA
TCTAAGGAGCAAGTCAAAACTCATACGGCGTGGACAGAGACCAATTTTATATGTAATAATTTAATACTTAATGGTCTTACTGATGAGTTGTATGATTATTACAGCACAAT
GTCCATCGCGAAAGAAGTTTGGGATGTGTTACAAAAGAAGTATGACACTGAGGAAGCTAGATCCAAGAAGTATGTGAATGAGATTCAGAAGAAAGCAGATGAGATCATAA
GCGAAGGTATGCGGCTTGACAATCAATTTCAAGTTGCAGTTATTATTGATAAATTACCTCATCTGTGGAAGGATTTCAAAACTACTCTAAGGCACAAAACCAAGGAGTTC
TCGCTGGAAAGTCTCATCACGAATTGTAGAAATAGGAATCATCCTGCTGCACAGGCGAACCTGGTAGAAGATGAATATGTAGCCATGATTTTTGAAGTTAATGTCATAGG
GGGGTCTGGAGGTTGGTGGCTAGACACAGGTGCTTATCGCCATGTCTGTCATGACCTTAGTTTATTTAGAAAGTATAATGAGATAAATGATAAGAATATCCTTCTAGGAG
ACCATCACATGACAAAGGTGGCAGGCATTGGAAAAGTAGAACTGAAATTCACATCTGGCAAAGTGCTTGTGTTGAAGAAAGTTCTTCATACGCTTGGAATTCGAAAGAAT
TTGGTCTTTGGATATCTGCTTTACAAAGCTGGCTTCTCGCAAACTATAGGGTCAAATTTATTTACTTTAACTAAGAACAATGCTTTTGTGGGAAATGGCTACAGGACAGA
AGGCATGTCTAAACTAAATTTAGAAATGATAAGAATTTAA
Protein sequenceShow/hide protein sequence
MLPSSFKEYCKHLSLKTAYLLHSPSHRLNFLFQVEKLVLINACVYSKCTGHPTKLPNVVAYAGERYIKRDKGRSATTILRRINLLQWSDYFMKVATLCISEEQKIPESSP
SKEQVKTHTAWTETNFICNNLILNGLTDELYDYYSTMSIAKEVWDVLQKKYDTEEARSKKYVNEIQKKADEIISEGMRLDNQFQVAVIIDKLPHLWKDFKTTLRHKTKEF
SLESLITNCRNRNHPAAQANLVEDEYVAMIFEVNVIGGSGGWWLDTGAYRHVCHDLSLFRKYNEINDKNILLGDHHMTKVAGIGKVELKFTSGKVLVLKKVLHTLGIRKN
LVFGYLLYKAGFSQTIGSNLFTLTKNNAFVGNGYRTEGMSKLNLEMIRI