; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G003380 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G003380
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase domain-containing protein
Genome locationCG_Chr04:13059006..13061823
RNA-Seq ExpressionClCG04G003380
SyntenyClCG04G003380
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH07150.1 TatD related DNase [Prunus dulcis]1.5e-3838.38Show/hide
Query:  PDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLS
        PD+V++ ETKK+ ++  ++  +W S+   W++  + G+SGG   +W+   VS+++S+ G +S+S++         W++ IYGP   +ER   W ELA L 
Subjt:  PDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLS

Query:  AYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP
         YC + WCLGGD N+ R   E+S  G  TK M+ FN FI+  +L +  L N  FTWS     + C  LD+FLVS +WE+ F   R     R+ SDH P
Subjt:  AYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]1.4e-5250Show/hide
Query:  NPD-LVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS
        +PD LV+    +   I++ +IK++WSSKDI W  VE+FG+ GG LTMWD SK+ ++E+LKGGYSLS+   +  KK  W+TN+YGP  Y+ERR +W  L S
Subjt:  NPD-LVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS

Query:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP
        LS YCT AWC+GG  N+TR   E  P    T+GM++FN  I++ ++ E+PL NG  TWSREG   S SLLD F +   W+E  +++RV R+A   SDHFP
Subjt:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis]1.1e-3838.38Show/hide
Query:  PDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLS
        PD+V++ ETKK+ ++  ++  +W S+   W++  + G+SGG   +W+   VS+++S+ G +S+S++         W++ IYGP   +ER   W ELA L 
Subjt:  PDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLS

Query:  AYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP
         YC + WCLGGD N+ R   E+S  G  TK M+ FN FI+  +L +  L N  FTWS     + C  LD+FLVS +WE+ F   R     R+ SDH P
Subjt:  AYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP

XP_030478286.1 uncharacterized protein LOC115695356 [Cannabis sativa]7.6e-4340.2Show/hide
Query:  TTKASNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWP
        T    NPD+V++QE KK +++   I +IW S+   WI + A G+SGGTL +WD   +++L+S+ G +S+SV  ++  K+  W + +YGP  Y ER   W 
Subjt:  TTKASNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWP

Query:  ELASLSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYS
        E+A LSA C + WCLG D N+ R +QE+  + SWTK MK F++ +    L++  L+NG FTWS       C  LD+F  +NNW   F   +     R+ S
Subjt:  ELASLSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYS

Query:  DHFP
        DH P
Subjt:  DHFP

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida]2.1e-4851.04Show/hide
Query:  RTTKASNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIW
        R  K  NPD+VLIQETKKD IE   IK++WSSK++   +VEA GKSGG LT+WD+SK+ +    K  +SLS+KC++++KK+ W+TN+YGP  Y+ERR +W
Subjt:  RTTKASNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIW

Query:  PELASLSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTR
         EL+SL+    + WC+GGD N  R   ER P G  T+ M  FNKFI   +LLEIPLSNG FTWS+EG   S S L  FL+    ++  +  R
Subjt:  PELASLSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTR

TrEMBL top hitse value%identityAlignment
A0A5D3BHE3 Uncharacterized protein6.7e-5350Show/hide
Query:  NPD-LVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS
        +PD LV+    +   I++ +IK++WSSKDI W  VE+FG+ GG LTMWD SK+ ++E+LKGGYSLS+   +  KK  W+TN+YGP  Y+ERR +W  L S
Subjt:  NPD-LVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS

Query:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP
        LS YCT AWC+GG  N+TR   E  P    T+GM++FN  I++ ++ E+PL NG  TWSREG   S SLLD F +   W+E  +++RV R+A   SDHFP
Subjt:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP

A0A803P8A0 Uncharacterized protein1.2e-4143Show/hide
Query:  SNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS
        +NPD+V++QE K+  ++   I +IW S+   WI + A G+SGGTL +WD   +S+L+SL G +S+SV   +  K+  W + +YGP  YK R   W ELA 
Subjt:  SNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS

Query:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP
        LS+ C E+WC+GGD N+TR + E+  + S T+ MK F+  I    L++  L NG FTWS       CS LD+FL  NNW   F   R     RL SDH P
Subjt:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP

A0A803QEA6 Uncharacterized protein1.7e-4035.19Show/hide
Query:  LKSQESVVSASSDELKVQRKETPSNDLETEDSINEDLNRLFQAVEGNDSGALDEAIVSH--YPSKDIPKHLKLIIDICRISLRTTKASNPDLVLIQETKK
        L   E      +DE+ ++     SN +E+   +  ++ +     E  DS    E I++     S D  K   +   IC+        +NPDLV++QE K+
Subjt:  LKSQESVVSASSDELKVQRKETPSNDLETEDSINEDLNRLFQAVEGNDSGALDEAIVSH--YPSKDIPKHLKLIIDICRISLRTTKASNPDLVLIQETKK

Query:  DAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGG
          ++   I +IW S+   WI + A G+SGGTL +WD   +S+L+SL G +S+SV   +  K+  W + +YGP  YK R   W ELA LS+ C ++WC+ G
Subjt:  DAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGG

Query:  DLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP
        D N+TR + E+  + S+T+ MK F+  I    L++  L NG FTWS       CS LD+FL +NNW   F   R     R+ SDH P
Subjt:  DLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP

A0A803QI00 Uncharacterized protein2.4e-4243Show/hide
Query:  SNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS
        +NPDLV++QE K+ +++   I +IW S+   WI + A G+SGGTL +WD   +++L+SL G +S+SV  K+  K   W + +YGP  YK R   W ELA 
Subjt:  SNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS

Query:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP
        LSA C ++WC+GGD N+TR   E+  + S T+ MK F+  I    L++  L NG FTWS       CS LD+FL +NNW   +   R     RL SDH P
Subjt:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP

A0A803QQM3 Uncharacterized protein4.8e-4343.5Show/hide
Query:  SNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS
        +NPDLV++QE K+  ++   I +IW S+   WI + A G+SGGTL +WD   +S+L+SL G +S+SV   +  K+  W + +YGP  YK R   W ELA 
Subjt:  SNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELAS

Query:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP
        LS+ C E+WC+GGD N+TR + E+  + S T+ MK F+  I    L++  L NG FTWS       CS LD+FL SNNW   +   R     RL SDH P
Subjt:  LSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTCGAAAAGCAATTCACTTATCGGAGTCGCATATGCAGTGGTTGGTAAAGCAACTTGCAGAGCTTCTTAACGTTTCAAGCGTTCGTTTTTTCTTAAAGGAAAT
GCGAGACACCTCAGGAGCAATGAGATTATCAAAATTCAAAAACAACATGGGGTGGAATTTGAGATGCGTGGTTTGGCCAGCTATCGGAGGTCGGTTCTTCATTCATATAC
CTGAAGGGTCTGCACAACAAGGATGGTCAGAGTTCTTGGAAATGCTTATTAGCTTCACCAATCGATCCAAGATTTATAGAGAAAAGACTGCGAGAAAAGAGAAGGCTCAT
GAATTTATTCTTCCCCTAATTAAGCAACAGCTGAGTTACGCAGAGGTACTATCAAATGGAATTAGTCAGCAAAAATCCCCTGTTCGGGCACCACTAAAACAAAGCAAGTT
GGCGGCAACTCTCGTGCATTTGAACTCCTCTATTTTTGCACCTGATAAACAGAGCAAGCATCAATGCTGGATCCGAGAAGAAATTGAAGTGTTTAAAGAAGATTTTATCA
AATTAAACCGGCCAACTTGCATCAAAGGCTACAGAGGATGGTTGGCAATAAGAAACCTGCTGCTGGAATACTGGAACAGAGCCACCTTTGAAGCTTTACGTTCCCATTTT
GGAGAAGCCAAAATCAAAGTCAAAAGGAATTTATGTGGTTTTATGCCATCAATTGAAATTAAGAATGAATTGAGAGGTATAATTTTTGTGAACTTCGGGGATATTGAAGC
CATGGAAACTCTTTCTTACGTACACAAAGAATTATTCTTTAAAGATTTCAACAATCCAATTGATCAAAGTCGTTTGATTAAAGTAGCAATGGATGAAGATGTTAGGATTA
TTTTATCAAACCACAACAGTGAAGTCCAAAAACCGATAGGCTCACCGGAGAAGACAACGGAAAAATCTGATCTTGAGAAAGTTGCCGTGACTACTTCAAGGAATGTCGAG
GGTCTCTTTCAACTTGAAAGTGCGCCACTCCAGGACACCAACCAAAAAGCTGAAAAGCTTATTGCGGGCGTAAGTGAAGTAGCTAAAAAGTCGGAAGTTATTAGGAGAGA
GAATCTCGTTGGCCGAAATTGTAGTCTTCCAAGCAACCAAAGCAAAAATGCTCTAAATTCCTTAATAATTATAGAATCCAGGGTGTCGACATCCCCTAGTCTCCCCTCGA
TTGAAAATGCATTCAATCCTGCCCTCAGGTTTGTTACCAATATCGCAGAGGAGATCAACACATTAGAAACTAAACCATCATTTTTGCCGAGTCTCAAGAAAGAACACATT
AACTCAAATGGGATCTCACTGATGATCCCTTTCATTCGCTCCGGTAATACGCTCCCCAATAAGAAATCGCCATTGTCTGCGGTCAGAAAAACAACCAACCTCTTCAAACC
CTTTCCAAAACATTTTGTGAAAGGAAGAACATCATTCCTTAATTCATGGTCAGCCCTCTCTAATTTAAATTTGCTTGAAGAAAGCTATGCCAAACTTCAAATACCAGAAT
ATCCTTCTAGTAGATCCGTAAAATATCCCTTTCCTTCTCAACTCTCTCAACAATCAGTGATAGTTCTCCTGGGCTCAAATATTCATTTTGTACGAGGTACCTTTTGTCCT
TATCCTACCAAATCAAGTCTTAAATCTCAAGAATCAGTGGTGAGTGCAAGCAGTGATGAATTGAAAGTGCAAAGAAAAGAAACTCCATCAAATGATCTCGAAACAGAGGA
CTCTATTAACGAGGATCTCAACAGGCTATTTCAAGCTGTAGAAGGTAATGATAGTGGTGCACTTGATGAAGCAATCGTCTCCCATTATCCAAGCAAAGACATTCCTAAGC
ACTTAAAATTAATCATTGACATTTGCAGGATATCCCTGAGAACAACTAAGGCTTCAAACCCGGATTTGGTGCTTATCCAAGAAACCAAGAAAGATGCCATTGAATTGGAT
GTTATCAAAGCTATCTGGAGTTCTAAGGACATTAGATGGATTTATGTTGAAGCCTTTGGCAAATCAGGGGGAACACTCACCATGTGGGATGAAAGTAAAGTATCAATTCT
AGAATCCCTCAAAGGAGGTTACTCACTCTCGGTAAAATGCAAATCCCTAAGTAAAAAGGTAGATTGGGTGACTAATATATATGGTCCTACTTATTACAAAGAAAGAAGAC
ACATTTGGCCGGAATTAGCTTCCCTCTCAGCTTATTGTACAGAGGCATGGTGTTTGGGGGGAGACTTGAACATGACCAGATTGATTCAAGAAAGATCCCCAACTGGAAGC
TGGACAAAAGGTATGAAGAAATTCAATAAATTCATAGAAAACGCTCATCTCTTGGAAATTCCACTCTCAAATGGTCACTTTACTTGGTCAAGAGAAGGAAGAGGTTCGTC
TTGCTCACTGCTGGACAAGTTTCTTGTTTCCAATAATTGGGAGGAGGCCTTCGATGACACGAGAGTGGCAAGGCAAGCAAGATTGTATTCTGATCACTTCCCCTATTATT
AG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTTCGAAAAGCAATTCACTTATCGGAGTCGCATATGCAGTGGTTGGTAAAGCAACTTGCAGAGCTTCTTAACGTTTCAAGCGTTCGTTTTTTCTTAAAGGAAAT
GCGAGACACCTCAGGAGCAATGAGATTATCAAAATTCAAAAACAACATGGGGTGGAATTTGAGATGCGTGGTTTGGCCAGCTATCGGAGGTCGGTTCTTCATTCATATAC
CTGAAGGGTCTGCACAACAAGGATGGTCAGAGTTCTTGGAAATGCTTATTAGCTTCACCAATCGATCCAAGATTTATAGAGAAAAGACTGCGAGAAAAGAGAAGGCTCAT
GAATTTATTCTTCCCCTAATTAAGCAACAGCTGAGTTACGCAGAGGTACTATCAAATGGAATTAGTCAGCAAAAATCCCCTGTTCGGGCACCACTAAAACAAAGCAAGTT
GGCGGCAACTCTCGTGCATTTGAACTCCTCTATTTTTGCACCTGATAAACAGAGCAAGCATCAATGCTGGATCCGAGAAGAAATTGAAGTGTTTAAAGAAGATTTTATCA
AATTAAACCGGCCAACTTGCATCAAAGGCTACAGAGGATGGTTGGCAATAAGAAACCTGCTGCTGGAATACTGGAACAGAGCCACCTTTGAAGCTTTACGTTCCCATTTT
GGAGAAGCCAAAATCAAAGTCAAAAGGAATTTATGTGGTTTTATGCCATCAATTGAAATTAAGAATGAATTGAGAGGTATAATTTTTGTGAACTTCGGGGATATTGAAGC
CATGGAAACTCTTTCTTACGTACACAAAGAATTATTCTTTAAAGATTTCAACAATCCAATTGATCAAAGTCGTTTGATTAAAGTAGCAATGGATGAAGATGTTAGGATTA
TTTTATCAAACCACAACAGTGAAGTCCAAAAACCGATAGGCTCACCGGAGAAGACAACGGAAAAATCTGATCTTGAGAAAGTTGCCGTGACTACTTCAAGGAATGTCGAG
GGTCTCTTTCAACTTGAAAGTGCGCCACTCCAGGACACCAACCAAAAAGCTGAAAAGCTTATTGCGGGCGTAAGTGAAGTAGCTAAAAAGTCGGAAGTTATTAGGAGAGA
GAATCTCGTTGGCCGAAATTGTAGTCTTCCAAGCAACCAAAGCAAAAATGCTCTAAATTCCTTAATAATTATAGAATCCAGGGTGTCGACATCCCCTAGTCTCCCCTCGA
TTGAAAATGCATTCAATCCTGCCCTCAGGTTTGTTACCAATATCGCAGAGGAGATCAACACATTAGAAACTAAACCATCATTTTTGCCGAGTCTCAAGAAAGAACACATT
AACTCAAATGGGATCTCACTGATGATCCCTTTCATTCGCTCCGGTAATACGCTCCCCAATAAGAAATCGCCATTGTCTGCGGTCAGAAAAACAACCAACCTCTTCAAACC
CTTTCCAAAACATTTTGTGAAAGGAAGAACATCATTCCTTAATTCATGGTCAGCCCTCTCTAATTTAAATTTGCTTGAAGAAAGCTATGCCAAACTTCAAATACCAGAAT
ATCCTTCTAGTAGATCCGTAAAATATCCCTTTCCTTCTCAACTCTCTCAACAATCAGTGATAGTTCTCCTGGGCTCAAATATTCATTTTGTACGAGGTACCTTTTGTCCT
TATCCTACCAAATCAAGTCTTAAATCTCAAGAATCAGTGGTGAGTGCAAGCAGTGATGAATTGAAAGTGCAAAGAAAAGAAACTCCATCAAATGATCTCGAAACAGAGGA
CTCTATTAACGAGGATCTCAACAGGCTATTTCAAGCTGTAGAAGGTAATGATAGTGGTGCACTTGATGAAGCAATCGTCTCCCATTATCCAAGCAAAGACATTCCTAAGC
ACTTAAAATTAATCATTGACATTTGCAGGATATCCCTGAGAACAACTAAGGCTTCAAACCCGGATTTGGTGCTTATCCAAGAAACCAAGAAAGATGCCATTGAATTGGAT
GTTATCAAAGCTATCTGGAGTTCTAAGGACATTAGATGGATTTATGTTGAAGCCTTTGGCAAATCAGGGGGAACACTCACCATGTGGGATGAAAGTAAAGTATCAATTCT
AGAATCCCTCAAAGGAGGTTACTCACTCTCGGTAAAATGCAAATCCCTAAGTAAAAAGGTAGATTGGGTGACTAATATATATGGTCCTACTTATTACAAAGAAAGAAGAC
ACATTTGGCCGGAATTAGCTTCCCTCTCAGCTTATTGTACAGAGGCATGGTGTTTGGGGGGAGACTTGAACATGACCAGATTGATTCAAGAAAGATCCCCAACTGGAAGC
TGGACAAAAGGTATGAAGAAATTCAATAAATTCATAGAAAACGCTCATCTCTTGGAAATTCCACTCTCAAATGGTCACTTTACTTGGTCAAGAGAAGGAAGAGGTTCGTC
TTGCTCACTGCTGGACAAGTTTCTTGTTTCCAATAATTGGGAGGAGGCCTTCGATGACACGAGAGTGGCAAGGCAAGCAAGATTGTATTCTGATCACTTCCCCTATTATT
AG
Protein sequenceShow/hide protein sequence
MEVRKAIHLSESHMQWLVKQLAELLNVSSVRFFLKEMRDTSGAMRLSKFKNNMGWNLRCVVWPAIGGRFFIHIPEGSAQQGWSEFLEMLISFTNRSKIYREKTARKEKAH
EFILPLIKQQLSYAEVLSNGISQQKSPVRAPLKQSKLAATLVHLNSSIFAPDKQSKHQCWIREEIEVFKEDFIKLNRPTCIKGYRGWLAIRNLLLEYWNRATFEALRSHF
GEAKIKVKRNLCGFMPSIEIKNELRGIIFVNFGDIEAMETLSYVHKELFFKDFNNPIDQSRLIKVAMDEDVRIILSNHNSEVQKPIGSPEKTTEKSDLEKVAVTTSRNVE
GLFQLESAPLQDTNQKAEKLIAGVSEVAKKSEVIRRENLVGRNCSLPSNQSKNALNSLIIIESRVSTSPSLPSIENAFNPALRFVTNIAEEINTLETKPSFLPSLKKEHI
NSNGISLMIPFIRSGNTLPNKKSPLSAVRKTTNLFKPFPKHFVKGRTSFLNSWSALSNLNLLEESYAKLQIPEYPSSRSVKYPFPSQLSQQSVIVLLGSNIHFVRGTFCP
YPTKSSLKSQESVVSASSDELKVQRKETPSNDLETEDSINEDLNRLFQAVEGNDSGALDEAIVSHYPSKDIPKHLKLIIDICRISLRTTKASNPDLVLIQETKKDAIELD
VIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLNMTRLIQERSPTGS
WTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFPYY