CuGenDBv2

Clc06G08960 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G08960
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ClcChr06:11298128..11301115
RNA-Seq Expression: Clc06G08960
Synteny: Clc06G08960
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa] | e-value: 2.6e-66 | % identity: 46.77
Query:  CDNLAALHISSYQNP-NLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILTMWDESKLMVTEVLKGGHSSSVKCMTTYKKICWITNVYNPND
        C   + +   S+ +P +LV+   ++  E D A IK LWSSKDIGW  VES G+ G ILTMWD SK+ V E LKGG+S S+  +T+ KK CWITNVY P D
Subjt:  CDNLAALHISSYQNP-NLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILTMWDESKLMVTEVLKGGHSSSVKCMTTYKKICWITNVYNPND

Query:  YKERRYIWQELSSLADYCIEQWCLRGDFNITRSPQERSPIGSVTRGMRKFNKFIIVTQLMEIPLSNGRFTWSREGSSISRSLIGRFLVTKAWDDMFENSR
        Y+ERR++W  L SL+ YC   WC+ G  NITR   E  P+   TRGMR+FN  I    + E+PL NGR TWSREGSSISRSL+  F + K WD++ ENSR
Subjt:  YKERRYIWQELSSLADYCIEQWCLRGDFNITRSPQERSPIGSVTRGMRKFNKFIIVTQLMEIPLSNGRFTWSREGSSISRSLIGRFLVTKAWDDMFENSR

Query:  VSRQARTFSNHFPLLLKAAK-KRGNLVTKLNNEQGVPSKTFREIESIVLGFYSSIYLESPRLTSLPLNFCWTKIFKEQNALLTASCIVEEIFQALKALGK
        V R+A T S+HFPLLL+A   K G    + +N        F E   I+   ++        +TS+     +  I        T +    EIF+ALKALGK
Subjt:  VSRQARTFSNHFPLLLKAAK-KRGNLVTKLNNEQGVPSKTFREIESIVLGFYSSIYLESPRLTSLPLNFCWTKIFKEQNALLTASCIVEEIFQALKALGK

Query:  NKAPGPDGFT
        NK+P P+GFT
Subjt:  NKAPGPDGFT

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus] | e-value: 1.2e-58 | % identity: 69.14
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M SPTVDHWA+VEQILCYLKAAPG GILYKDH HTRVECF DADWAGSREDRRSTS YCVFVGGNLV  KSKKQNVVSRSS ES YRAMAQSVCEIVWI+
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT
        QLLS++GF +T+ AKLWCDN AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus] | e-value: 1.2e-58 | % identity: 69.14
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M SPTVDHWA+VEQILCYLKAAPG GILYKDH HTRVECF DADWAGSREDRRSTS YCVFVGGNLV  KSKKQNVVSRSS ES YRAMAQSVCEIVWI+
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT
        QLLS++GF +T+ AKLWCDN AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus] | e-value: 1.2e-58 | % identity: 69.14
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M SPTVDHWA+VEQILCYLKAAPG GILYKDH HTRVECF DADWAGSREDRRSTS YCVFVGGNLV  KSKKQNVVSRSS ES YRAMAQSVCEIVWI+
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT
        QLLS++GF +T+ AKLWCDN AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus] | e-value: 1.2e-58 | % identity: 69.14
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M SPTVDHWA+VEQILCYLKAAPG GILYKDH HTRVECF DADWAGSREDRRSTS YCVFVGGNLV  KSKKQNVVSRSS ES YRAMAQSVCEIVWI+
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT
        QLLS++GF +T+ AKLWCDN AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7SQ84 Putative mitochondrial protein | e-value: 4.3e-54 | % identity: 68.71
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M SPTVDHWA+VEQILCY KAAPG GILYKDH HTRVECF DADWA SREDRRSTS YCVFVGG LV  KSKKQNVVSRSS +S YRA AQSVCEI WI+
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFI----KLLWSSKDI
        QLLS++GF +T+ AKLWCDN  ALHI+S    N V  + +K  E D  FI    K+ W  +D+
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFI----KLLWSSKDI

A0A5A7UHS1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 4.7e-53 | % identity: 63.43
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M  PTVDHWA+VEQILCYLKAA G GILYKDH HT+V+CF DADW GSREDRRS S YCVFVGGNLV  KSKKQNVVS SS +S YRAMAQSVCEIVWI+
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT
        QLLS++GF +T+  KLWCDN  ALHI+S    N V  +++K  E D  FI+       +   +V++  + G ILT
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT

A0A5D3BHE3 Uncharacterized protein | e-value: 1.3e-66 | % identity: 46.77
Query:  CDNLAALHISSYQNP-NLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILTMWDESKLMVTEVLKGGHSSSVKCMTTYKKICWITNVYNPND
        C   + +   S+ +P +LV+   ++  E D A IK LWSSKDIGW  VES G+ G ILTMWD SK+ V E LKGG+S S+  +T+ KK CWITNVY P D
Subjt:  CDNLAALHISSYQNP-NLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILTMWDESKLMVTEVLKGGHSSSVKCMTTYKKICWITNVYNPND

Query:  YKERRYIWQELSSLADYCIEQWCLRGDFNITRSPQERSPIGSVTRGMRKFNKFIIVTQLMEIPLSNGRFTWSREGSSISRSLIGRFLVTKAWDDMFENSR
        Y+ERR++W  L SL+ YC   WC+ G  NITR   E  P+   TRGMR+FN  I    + E+PL NGR TWSREGSSISRSL+  F + K WD++ ENSR
Subjt:  YKERRYIWQELSSLADYCIEQWCLRGDFNITRSPQERSPIGSVTRGMRKFNKFIIVTQLMEIPLSNGRFTWSREGSSISRSLIGRFLVTKAWDDMFENSR

Query:  VSRQARTFSNHFPLLLKAAK-KRGNLVTKLNNEQGVPSKTFREIESIVLGFYSSIYLESPRLTSLPLNFCWTKIFKEQNALLTASCIVEEIFQALKALGK
        V R+A T S+HFPLLL+A   K G    + +N        F E   I+   ++        +TS+     +  I        T +    EIF+ALKALGK
Subjt:  VSRQARTFSNHFPLLLKAAK-KRGNLVTKLNNEQGVPSKTFREIESIVLGFYSSIYLESPRLTSLPLNFCWTKIFKEQNALLTASCIVEEIFQALKALGK

Query:  NKAPGPDGFT
        NK+P P+GFT
Subjt:  NKAPGPDGFT

A0A5D3CID2 Putative mitochondrial protein | e-value: 5.6e-54 | % identity: 65.71
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M  PTVDHWA VEQILCYLKAAPGCGIL KDH HTRVECF DADWAGSREDRRST  YCVFVGGNLV  KSKKQNVVS  S ES YRAM QSVCEIVWI+
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT
        QLLS++GF +T+ AKL CDN AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT

A0A5D3DZU1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 4.7e-53 | % identity: 63.43
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M  PTVDHWA+VEQILCYLKAA G GILYKDH HT+V+CF DADW GSREDRRS S YCVFVGGNLV  KSKKQNVVS SS +S YRAMAQSVCEIVWI+
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT
        QLLS++GF +T+  KLWCDN  ALHI+S    N V  +++K  E D  FI+       +   +V++  + G ILT
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein | e-value: 1.9e-11 | % identity: 31.25
Query:  WASVEQILCYLKAAPGCGILYKDH--DHTRVECFPDADWAGSREDRRSTSRYCV-FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSK
        W +++++L YLK      +++K +     ++  + D+DWAGS  DR+ST+ Y       NL+   +K+QN V+ SS E+ Y A+ ++V E +W+  LL+ 
Subjt:  WASVEQILCYLKAAPGCGILYKDH--DHTRVECFPDADWAGSREDRRSTSRYCV-FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSK

Query:  MGFKVTILAKLWCDNLAALHISSYQNPN
        +  K+    K++ DN   + I++  NP+
Subjt:  MGFKVTILAKLWCDNLAALHISSYQNPN

P0CV72 Secreted RxLR effector protein 161 | e-value: 3.0e-12 | % identity: 36.46
Query:  PTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWI
        P   HW +++++L YL++    G+ +      ++  + DADWAG  E RRSTS Y   + G  V  +SKKQ  V+ SS E  Y A++++  E VW+
Subjt:  PTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWI

P92519 Uncharacterized mitochondrial protein AtMg00810 | e-value: 7.1e-14 | % identity: 36.73
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVW
        M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+ +C F+G N++   +K+Q  VSRSS E+ YRA+A +  E+ W
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 8.7e-20 | % identity: 35.1
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M  PT +H  ++++IL YL   P  GI  K  +   +  + DADWAG ++D  ST+ Y V++G + +   SKKQ  V RSS E+ YR++A +  E+ WI 
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIK
         LL+++G ++T    ++CDN+ A ++ +    N V     K    D  FI+
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 1.7e-20 | % identity: 35.1
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY
        M  PT DHW +++++L YL   P  GI  K  +   +  + DADWAG  +D  ST+ Y V++G + +   SKKQ  V RSS E+ YR++A +  E+ WI 
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIY

Query:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIK
         LL+++G +++    ++CDN+ A ++ +    N V     K    D  FI+
Subjt:  QLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIK

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e-value: 1.9e-22 | % identity: 34.9
Query:  SPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQL
        +P + H  +V +IL Y+K   G G+ Y      +++ F DA +   ++ RRST+ YC+F+G +L+  KSKKQ VVS+SS E+ YRA++ +  E++W+ Q 
Subjt:  SPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQL

Query:  LSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIK
          ++   ++    L+CDN AA+HI++    N V  + +K  E D   ++
Subjt:  LSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIK

ATMG00240.1 Gag-Pol-related retrotransposon family protein | e-value: 6.8e-04 | % identity: 32.69
Query:  SVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFV
        +V ++L Y+K   G G+ Y      +++ F D+DWA   + RRS + +C  V
Subjt:  SVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFV

ATMG00810.1 DNA/RNA polymerases superfamily protein | e-value: 5.0e-15 | % identity: 36.73
Query:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVW
        M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+ +C F+G N++   +K+Q  VSRSS E+ YRA+A +  E+ W
Subjt:  MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVW


Sequences
CDS sequence
ATGTTTTCTCCTACAGTGGACCATTGGGCTTCAGTAGAACAAATTCTATGTTATTTGAAAGCTGCACCTGGATGTGGGATCTTATATAAAGATCATGACCATACAAGAGT
TGAATGTTTTCCAGATGCTGATTGGGCTGGATCTCGAGAAGATAGGAGATCGACCTCTAGATATTGTGTCTTTGTAGGAGGAAACTTAGTATTGTGTAAGAGTAAGAAAC
AAAATGTAGTTTCACGTTCGAGTGTTGAGTCAAACTATAGGGCCATGGCACAATCTGTGTGCGAAATAGTGTGGATATATCAACTATTATCCAAGATGGGATTCAAAGTT
ACCATACTAGCTAAATTATGGTGTGATAATCTAGCTGCACTTCACATTTCATCTTACCAGAACCCGAATTTGGTTTTAATCCAAGAATCTAAAAAGGATGAGTTTGATGC
TGCATTTATTAAGTTGTTATGGAGCTCAAAGGATATTGGGTGGGCATTTGTGGAATCAATTGGCAAATCAGGTAGGATTTTAACTATGTGGGATGAAAGCAAGCTAATGG
TAACTGAAGTATTAAAAGGTGGTCACTCCTCATCGGTCAAATGTATGACTACTTACAAAAAAATTTGTTGGATTACAAACGTGTACAACCCTAATGACTACAAAGAACGA
AGATATATTTGGCAAGAGCTATCTTCTTTGGCAGATTATTGCATTGAACAATGGTGTCTTCGAGGAGATTTCAACATTACAAGATCACCTCAAGAACGATCTCCAATTGG
CAGTGTCACACGAGGCATGAGAAAATTCAACAAATTTATCATTGTCACTCAATTGATGGAAATCCCTTTATCAAATGGTAGATTCACTTGGTCAAGGGAAGGAAGCTCAA
TTTCAAGATCTCTTATTGGTAGATTCCTTGTTACAAAGGCTTGGGATGACATGTTTGAAAATTCTAGAGTTTCAAGGCAAGCTCGTACTTTTTCAAACCATTTTCCGTTA
CTACTTAAAGCTGCAAAGAAGAGGGGAAATTTAGTTACGAAACTGAATAATGAGCAGGGTGTTCCTTCCAAAACATTCCGTGAAATCGAGAGTATTGTGTTGGGCTTCTA
CTCCTCTATTTATCTCGAATCCCCTAGATTAACATCCTTACCCCTCAACTTCTGCTGGACGAAGATTTTTAAGGAACAAAATGCATTATTGACTGCCAGTTGCATAGTAG
AGGAAATTTTTCAGGCCTTAAAAGCACTTGGCAAGAATAAAGCTCCTGGACCAGATGGCTTTACAACTGAATTCTTACTTAAATACTGGAGTTCATTCCGATCAAATTTC
TTGAAGTTATTTGAGGAATTTTCTTCAAATTGGAGCGTATTATGA
mRNA sequence
ATGTTTTCTCCTACAGTGGACCATTGGGCTTCAGTAGAACAAATTCTATGTTATTTGAAAGCTGCACCTGGATGTGGGATCTTATATAAAGATCATGACCATACAAGAGT
TGAATGTTTTCCAGATGCTGATTGGGCTGGATCTCGAGAAGATAGGAGATCGACCTCTAGATATTGTGTCTTTGTAGGAGGAAACTTAGTATTGTGTAAGAGTAAGAAAC
AAAATGTAGTTTCACGTTCGAGTGTTGAGTCAAACTATAGGGCCATGGCACAATCTGTGTGCGAAATAGTGTGGATATATCAACTATTATCCAAGATGGGATTCAAAGTT
ACCATACTAGCTAAATTATGGTGTGATAATCTAGCTGCACTTCACATTTCATCTTACCAGAACCCGAATTTGGTTTTAATCCAAGAATCTAAAAAGGATGAGTTTGATGC
TGCATTTATTAAGTTGTTATGGAGCTCAAAGGATATTGGGTGGGCATTTGTGGAATCAATTGGCAAATCAGGTAGGATTTTAACTATGTGGGATGAAAGCAAGCTAATGG
TAACTGAAGTATTAAAAGGTGGTCACTCCTCATCGGTCAAATGTATGACTACTTACAAAAAAATTTGTTGGATTACAAACGTGTACAACCCTAATGACTACAAAGAACGA
AGATATATTTGGCAAGAGCTATCTTCTTTGGCAGATTATTGCATTGAACAATGGTGTCTTCGAGGAGATTTCAACATTACAAGATCACCTCAAGAACGATCTCCAATTGG
CAGTGTCACACGAGGCATGAGAAAATTCAACAAATTTATCATTGTCACTCAATTGATGGAAATCCCTTTATCAAATGGTAGATTCACTTGGTCAAGGGAAGGAAGCTCAA
TTTCAAGATCTCTTATTGGTAGATTCCTTGTTACAAAGGCTTGGGATGACATGTTTGAAAATTCTAGAGTTTCAAGGCAAGCTCGTACTTTTTCAAACCATTTTCCGTTA
CTACTTAAAGCTGCAAAGAAGAGGGGAAATTTAGTTACGAAACTGAATAATGAGCAGGGTGTTCCTTCCAAAACATTCCGTGAAATCGAGAGTATTGTGTTGGGCTTCTA
CTCCTCTATTTATCTCGAATCCCCTAGATTAACATCCTTACCCCTCAACTTCTGCTGGACGAAGATTTTTAAGGAACAAAATGCATTATTGACTGCCAGTTGCATAGTAG
AGGAAATTTTTCAGGCCTTAAAAGCACTTGGCAAGAATAAAGCTCCTGGACCAGATGGCTTTACAACTGAATTCTTACTTAAATACTGGAGTTCATTCCGATCAAATTTC
TTGAAGTTATTTGAGGAATTTTCTTCAAATTGGAGCGTATTATGA
Protein sequence
MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKV
TILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILTMWDESKLMVTEVLKGGHSSSVKCMTTYKKICWITNVYNPNDYKER
RYIWQELSSLADYCIEQWCLRGDFNITRSPQERSPIGSVTRGMRKFNKFIIVTQLMEIPLSNGRFTWSREGSSISRSLIGRFLVTKAWDDMFENSRVSRQARTFSNHFPL
LLKAAKKRGNLVTKLNNEQGVPSKTFREIESIVLGFYSSIYLESPRLTSLPLNFCWTKIFKEQNALLTASCIVEEIFQALKALGKNKAPGPDGFTTEFLLKYWSSFRSNF
LKLFEEFSSNWSVL