; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC11G211530 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC11G211530
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionDUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein
Genome locationCiama_Chr11:24099598..24100871
RNA-Seq ExpressionCaUC11G211530
SyntenyCaUC11G211530
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR007021 - Domain of unknown function DUF659
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFZ00831.1 hypothetical protein Acr_14g0004660 [Actinidia rufa]4.3e-6274.85Show/hide
Query:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK
        G PMFLKA++CSGE KDKYFIANL KEVI+EVG +NV+Q+ITDNA NCKGA Q+IE+QFP I+WTPCVVHTLNLALKNICA KNVENN+L Y +CSWIS 
Subjt:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK

Query:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDK
        +  DVM +K+FI NHSMR+A++NEFVPL+LLSVA+TRFAS ++ML  FKLIK GLQ MVISDK
Subjt:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDK

RWR74797.1 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein [Cinnamomum micranthum f. kanehirae]9.5e-6274.12Show/hide
Query:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK
        G PMFLKA++CSGE KDKYFIANL KEVI +VG +NV+Q+ITDNA NCKGA QIIESQFP I+WTPCVVHTLNLAL NICA KNVENN L Y +CSWI  
Subjt:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK

Query:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDKMDMISRG
        I  DVM +K+FIMNHSMR+A+FNEFV L+LLSVA+TRFAS I+ML  FKLIK GLQAMVISDK      G
Subjt:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDKMDMISRG

XP_031743157.1 uncharacterized protein LOC116404561 [Cucumis sativus]5.1e-6357.81Show/hide
Query:  GYGIGVCSKVTSQDKADMQRLEDEVQDRMAKKTPRNIPLPPSFVPSVGKSPCPSIFEQKKRK------SAPNGDPMFLKAMECSGEVKDKYFIANLPKEV
        GY +   S +  ++KA+++RL   ++    KK               G S     +   +R+      +  NG PMFLK+++CSGE+KDKYFIAN  KEV
Subjt:  GYGIGVCSKVTSQDKADMQRLEDEVQDRMAKKTPRNIPLPPSFVPSVGKSPCPSIFEQKKRK------SAPNGDPMFLKAMECSGEVKDKYFIANLPKEV

Query:  IIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPL
        I EVG +NV+Q+ITDNA NCKGA Q+IE+QFP I+WTPCVVHTLNLALKNICA KNVENN +VY +CSWI  IA D++ +K FIMNHSM +A+FNEFVPL
Subjt:  IIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPL

Query:  RLLSVAETRFASIIIML-WFKLIKGGLQAMVISDKMD
        +LLS+AETRFAS+IIML  FKLIKGGLQAMVISDK +
Subjt:  RLLSVAETRFASIIIML-WFKLIKGGLQAMVISDKMD

XP_038721054.1 uncharacterized protein LOC120013346 isoform X2 [Tripterygium wilfordii]1.0e-6348.56Show/hide
Query:  SSSTSSNSGQSSTIPSIGATISSSSVEDESKPLRQYVTKNQRLNEGGLGG-----------------------------YGIGVCSKVTSQDKADMQRLE
        S +TS ++  S++ PS     SS  VED  KPL +YV + ++  +GG GG                              GI  C  VTS+D A+MQRLE
Subjt:  SSSTSSNSGQSSTIPSIGATISSSSVEDESKPLRQYVTKNQRLNEGGLGG-----------------------------YGIGVCSKVTSQDKADMQRLE

Query:  DEVQDRMAKKTPRNIPLPPSFVPSVGKSPCPSIFEQKKRKSAPN------GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKG
        +E ++R      R +    S     G S     +   +R+   N        PMFLKA++CSGE KDK+FI NL KEVI EVGPQNV+Q+ITDNA+NC G
Subjt:  DEVQDRMAKKTPRNIPLPPSFVPSVGKSPCPSIFEQKKRKSAPN------GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKG

Query:  AEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKL
        A  ++E  +P IVWTPCVVHTLNLAL+NICA KN+ENN +VY++CSWI+ ++ DV  +KNFIMNHSMR+AIFNEFVPL+LLS+A TRFAS+++ML  F L
Subjt:  AEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKL

Query:  IKGGLQAMVISDK
        IK  L +MVIS++
Subjt:  IKGGLQAMVISDK

XP_038891577.1 uncharacterized protein LOC120080967 [Benincasa hispida]7.5e-6741.87Show/hide
Query:  SVEDESKPLRQYVTKNQRLNEG---------------------------GLGGYGIGVCSKVTSQDKADMQRLEDEVQDRMAKKTPRNIPLPP-------
        + +D++KPL QYVTKN+RLNEG                            L G+GIG+C K+T +D A+MQ+LEDE + R+A+  P+ +PLPP       
Subjt:  SVEDESKPLRQYVTKNQRLNEG---------------------------GLGGYGIGVCSKVTSQDKADMQRLEDEVQDRMAKKTPRNIPLPP-------

Query:  --SFVPSVGK-----SPCPSIFEQKKRK------------------------------------------------------------------------
          SF    G      S   S  EQKKRK                                                                        
Subjt:  --SFVPSVGK-----SPCPSIFEQKKRK------------------------------------------------------------------------

Query:  ---------------------------------------SAPNGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIES
                                                   G PMFLKA++CSGE KDKYFIANL KEVI EVG +NVIQ+ITDNA NCKGA QIIES
Subjt:  ---------------------------------------SAPNGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIES

Query:  QFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFAS-IIIMLWFKLIKGGLQA
        QFP+IVWTPCVVHTLNLALKNICA +N+ +N  V+ + SWIS+I++DVM +K+FIMNHSMR+A+FNEFV L+LL+VAETRF+S III+  FKLIKGGLQ 
Subjt:  QFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFAS-IIIMLWFKLIKGGLQA

Query:  MVISDK
        +VIS+K
Subjt:  MVISDK

TrEMBL top hitse value%identityAlignment
A0A443N8D6 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein4.6e-6274.12Show/hide
Query:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK
        G PMFLKA++CSGE KDKYFIANL KEVI +VG +NV+Q+ITDNA NCKGA QIIESQFP I+WTPCVVHTLNLAL NICA KNVENN L Y +CSWI  
Subjt:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK

Query:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDKMDMISRG
        I  DVM +K+FIMNHSMR+A+FNEFV L+LLSVA+TRFAS I+ML  FKLIK GLQAMVISDK      G
Subjt:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDKMDMISRG

A0A5B7AFB0 Uncharacterized protein9.3e-6341.12Show/hide
Query:  SSTIPSIGATISSSSVEDESKPLRQYVTKNQRLNEGG---------------------------LGGYGIGVCSKVTSQDKADMQRLEDEVQDRMAKKTP
        +ST PS     SSS  ED +KPL +YV K  +L++GG                           L G GI  CSKVT++D  +MQ+LEDEV+ R+     
Subjt:  SSTIPSIGATISSSSVEDESKPLRQYVTKNQRLNEGG---------------------------LGGYGIGVCSKVTSQDKADMQRLEDEVQDRMAKKTP

Query:  RNIPLPPSFVPSVGK-SPCPSIFEQKKRKSAPNGD-----------------------------------------------------------------
        + +PLP S +   G  S     ++ KKRK+  +G                                                                  
Subjt:  RNIPLPPSFVPSVGK-SPCPSIFEQKKRKSAPNGD-----------------------------------------------------------------

Query:  --------------------------------------------------PMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAE
                                                          PMFLK ++CSGE KDKYFIANL +EVI EVG +NVIQ+ITDNA NCKGA 
Subjt:  --------------------------------------------------PMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAE

Query:  QIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIK
        Q+IESQF  I WTPCVVHTLNLALKNICA KNVENN L Y +CSWIS IA DVM +K+FIMNHS+R+ +FNEFV L+LLSVA+TRFAS+I+M   FKLIK
Subjt:  QIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIK

Query:  GGLQAMVISDK
         GLQAMVISDK
Subjt:  GGLQAMVISDK

A0A6J1DUJ6 uncharacterized protein LOC111023231 isoform X23.0e-6141.03Show/hide
Query:  TISSSSVEDESKPLRQYVTKNQRLNEGG---------------------------LGGYGIGVCSKVTSQDKADMQRLEDEVQDRMAKKTPRNIPLPPSF
        +IS S++EDE  PL +YVTKNQRLNE G                           L GYGIG+C KVT +D A+MQRLEDE + R  K  P+ + LPP  
Subjt:  TISSSSVEDESKPLRQYVTKNQRLNEGG---------------------------LGGYGIGVCSKVTSQDKADMQRLEDEVQDRMAKKTPRNIPLPPSF

Query:  VPSVGKS-PCPSIFE---------QKKRKSA---------------------------------------------------------------------
         PS  ++  C S+ +          KKRKS+                                                                     
Subjt:  VPSVGKS-PCPSIFE---------QKKRKSA---------------------------------------------------------------------

Query:  ------------------------------------------PNGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIE
                                                   +G P+FLK ++CSGEVKDKYFI NL KEVI EVG QN+IQ+ITDN  NC+ A QIIE
Subjt:  ------------------------------------------PNGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIE

Query:  SQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQ
        SQF  IVWTPCVV TLNLALKNIC+ KN+E N  V+ +C WISK + DVM +K+FIMNH MR+A+F EFV L+LLS+AETRFA  I ML  FKLIK GLQ
Subjt:  SQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQ

Query:  AMVISDK
        AM ISDK
Subjt:  AMVISDK

A0A7J0EFU0 DUF659 domain-containing protein7.4e-6073.62Show/hide
Query:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK
        G PMFLKA++CSGE+KDKYF ANL K+VI EVGPQNV+Q+ITDNA NCK A Q+IE+QFP I+WTPCVV TLNLALKNICA K+VENN   Y +CSWIS 
Subjt:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK

Query:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDK
        IA D   +KNFI NHSMR+A++NEFV L+LLSVAE RFAS I+ML  FKLIKGGLQAMVI+DK
Subjt:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDK

A0A7J0FQA7 DUF659 domain-containing protein2.1e-6274.85Show/hide
Query:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK
        G PMFLKA++CSGE KDKYFIANL KEVI+EVG +NV+Q+ITDNA NCKGA Q+IE+QFP I+WTPCVVHTLNLALKNICA KNVENN+L Y +CSWIS 
Subjt:  GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISK

Query:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDK
        +  DVM +K+FI NHSMR+A++NEFVPL+LLSVA+TRFAS ++ML  FKLIK GLQ MVISDK
Subjt:  IASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIML-WFKLIKGGLQAMVISDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22220.1 hAT transposon superfamily1.0e-0826.83Show/hide
Query:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIAS
        +FLK+++ S  +  +  +  L KEV+ E+G  NV+Q+IT    +   A + +   +P++ W PC  H ++  L+              + K  WI +I  
Subjt:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIAS

Query:  DVMTLKNFIMNHS-----MRVAIFNEFVPLRLLSVAETRFASIIIMLWFKLIKGGLQAMVISDK
           T+   I NHS     MR   F   +   + + + T F +   M     +K  LQAMV S +
Subjt:  DVMTLKNFIMNHS-----MRVAIFNEFVPLRLLSVAETRFASIIIMLWFKLIKGGLQAMVISDK

AT3G22220.2 hAT transposon superfamily1.0e-0826.83Show/hide
Query:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIAS
        +FLK+++ S  +  +  +  L KEV+ E+G  NV+Q+IT    +   A + +   +P++ W PC  H ++  L+              + K  WI +I  
Subjt:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIAS

Query:  DVMTLKNFIMNHS-----MRVAIFNEFVPLRLLSVAETRFASIIIMLWFKLIKGGLQAMVISDK
           T+   I NHS     MR   F   +   + + + T F +   M     +K  LQAMV S +
Subjt:  DVMTLKNFIMNHS-----MRVAIFNEFVPLRLLSVAETRFASIIIMLWFKLIKGGLQAMVISDK

AT4G08267.1 hAT transposon superfamily protein3.7e-1946.88Show/hide
Query:  LITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVK-NVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVA
        ++T+NA N   +  +I ++F TI WTPCVVHTLNLALKN CA   +  NN +VY+ C WI  I+ +V  +KN IMN+ +R+ +F E   L+LL+++
Subjt:  LITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVK-NVENNLLVYEKCSWISKIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVA

AT4G15020.1 hAT transposon superfamily1.9e-0725.93Show/hide
Query:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIAS
        +FLK+++ S  +     +  L  E++ EVG  NV+Q+IT        A + +   +P++ W PC  H ++  L+              + K  WIS+   
Subjt:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKIAS

Query:  DVMTLKNFIMNHSMRVAIFNEF-----VPLRLLSVAETRFASIIIMLWFKLIKGGLQAMVIS
            +  F+ NHS  + +  +F     + L   S + T FA++  +     +K  LQAMV S
Subjt:  DVMTLKNFIMNHSMRVAIFNEF-----VPLRLLSVAETRFASIIIMLWFKLIKGGLQAMVIS

AT5G31412.1 hAT transposon superfamily protein1.7e-0834.78Show/hide
Query:  KRKSAPN------GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNI
        KR+S  N      G   FL + + S       +I       I +VG +NV+Q++TDNA N   A ++++ + P I WT CV HT++L L+ I
Subjt:  KRKSAPN------GDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCTTTCTCTTCATCTACATCTAGTAACAGTGGTCAATCATCAACAATTCCTTCAATAGGAGCTACAATCTCATCCTCTAGTGTTGAAGATGAATCAAAACCACT
TCGGCAATATGTGACCAAGAATCAAAGATTAAATGAAGGAGGCTTAGGTGGTTATGGAATTGGAGTGTGTAGTAAAGTTACCTCTCAAGATAAGGCCGACATGCAAAGAT
TAGAAGATGAGGTGCAAGATCGTATGGCTAAAAAGACCCCTAGAAACATTCCTTTACCACCTTCGTTTGTACCTTCTGTGGGAAAATCTCCCTGTCCTTCTATCTTTGAG
CAAAAGAAAAGGAAAAGTGCTCCAAATGGTGATCCAATGTTTCTAAAAGCGATGGAATGCTCAGGTGAAGTCAAAGATAAATATTTTATTGCAAACCTGCCGAAGGAAGT
GATTATTGAAGTTGGCCCTCAAAATGTAATTCAATTGATTACTGATAATGCTCGTAATTGCAAGGGTGCAGAGCAAATTATTGAATCACAATTTCCGACAATTGTATGGA
CACCATGTGTAGTACATACTCTTAATCTTGCCTTGAAGAATATATGTGCAGTGAAAAATGTTGAAAACAATTTGCTTGTCTATGAGAAATGTAGTTGGATTTCTAAGATT
GCTAGTGATGTGATGACGCTGAAGAATTTTATTATGAATCATTCAATGAGGGTTGCTATTTTTAATGAGTTTGTACCTTTGAGATTACTTTCTGTGGCAGAAACACGTTT
TGCATCAATCATTATCATGCTTTGGTTCAAGCTTATTAAAGGTGGGTTGCAAGCTATGGTTATTAGTGACAAAATGGACATGATATCAAGAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCTTTCTCTTCATCTACATCTAGTAACAGTGGTCAATCATCAACAATTCCTTCAATAGGAGCTACAATCTCATCCTCTAGTGTTGAAGATGAATCAAAACCACT
TCGGCAATATGTGACCAAGAATCAAAGATTAAATGAAGGAGGCTTAGGTGGTTATGGAATTGGAGTGTGTAGTAAAGTTACCTCTCAAGATAAGGCCGACATGCAAAGAT
TAGAAGATGAGGTGCAAGATCGTATGGCTAAAAAGACCCCTAGAAACATTCCTTTACCACCTTCGTTTGTACCTTCTGTGGGAAAATCTCCCTGTCCTTCTATCTTTGAG
CAAAAGAAAAGGAAAAGTGCTCCAAATGGTGATCCAATGTTTCTAAAAGCGATGGAATGCTCAGGTGAAGTCAAAGATAAATATTTTATTGCAAACCTGCCGAAGGAAGT
GATTATTGAAGTTGGCCCTCAAAATGTAATTCAATTGATTACTGATAATGCTCGTAATTGCAAGGGTGCAGAGCAAATTATTGAATCACAATTTCCGACAATTGTATGGA
CACCATGTGTAGTACATACTCTTAATCTTGCCTTGAAGAATATATGTGCAGTGAAAAATGTTGAAAACAATTTGCTTGTCTATGAGAAATGTAGTTGGATTTCTAAGATT
GCTAGTGATGTGATGACGCTGAAGAATTTTATTATGAATCATTCAATGAGGGTTGCTATTTTTAATGAGTTTGTACCTTTGAGATTACTTTCTGTGGCAGAAACACGTTT
TGCATCAATCATTATCATGCTTTGGTTCAAGCTTATTAAAGGTGGGTTGCAAGCTATGGTTATTAGTGACAAAATGGACATGATATCAAGAGGATGA
Protein sequenceShow/hide protein sequence
MASFSSSTSSNSGQSSTIPSIGATISSSSVEDESKPLRQYVTKNQRLNEGGLGGYGIGVCSKVTSQDKADMQRLEDEVQDRMAKKTPRNIPLPPSFVPSVGKSPCPSIFE
QKKRKSAPNGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNARNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISKI
ASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLWFKLIKGGLQAMVISDKMDMISRG