; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G13155 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G13155
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionCCHC-type domain-containing protein
Genome locationClcChr11:23394743..23396064
RNA-Seq ExpressionClc11G13155
SyntenyClc11G13155
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG68535.1 hypothetical protein EZV62_003470 [Acer yangbiense]1.2e-3835.37Show/hide
Query:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPW
        M + +L +  +   + +E+   R +    +D    + ++HCLVGK+L+ + +     K      W    N  ++ +G N ++F F+   D++ + + GPW
Subjt:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPW

Query:  LFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWI
         FD  L+VLE P+    +SQL FNKA FW+++ DIP+   NK MAK L E  G  +++   +   CW + LR+K+ +DISKPL+R + + L  S   + +
Subjt:  LFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWI

Query:  QAKYERLPEFCYDCGRIGHLVKECRSC-LKENDIEG--WQFKSWLQ
          KYERLPEFCY CG++GH + +C     K+  IEG   +F SWL+
Subjt:  QAKYERLPEFCYDCGRIGHLVKECRSC-LKENDIEG--WQFKSWLQ

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]9.1e-4735.22Show/hide
Query:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLN-FSVDSLGRNVYLFKFDAKRDKEMVLKLGP
        M    L+++WK F+L+ EE++  ++ D++        +   L+ KLL+ R+I   ++KNT    W+ +   FSVD +G N++LF F+   D+  +L++GP
Subjt:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLN-FSVDSLGRNVYLFKFDAKRDKEMVLKLGP

Query:  WLFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLW
        W FD  L+++++P    +   + F   + W+   D+ L   NK MA +LG A G F  V+   +NFCW   LR+++  D+ KPL RGI +NL       W
Subjt:  WLFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLW

Query:  IQAKYERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKSWLQIQG
        I  +YERLP+F Y CGR+ H++K+C  C  ++  +  Q+  WL+ QG
Subjt:  IQAKYERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKSWLQIQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]4.2e-5239.33Show/hide
Query:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPW
        MD E L+ DW+KF+L+ EE+   ++ D +   +    + + LVGKLL  R I   ++       W+     +V+S+G+N++LF F  + D   V+K GPW
Subjt:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPW

Query:  LFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWI
         FD  L+VL+ P  +  +S+L FN+ AFWI L D+P+ + NK MA +LG A G F+ VDC +  F W  +LRI++ +DI+KPLRRGI IN+       WI
Subjt:  LFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWI

Query:  QAKYERLPEFCYDCGRIGHLVKEC--RSCLKENDIEGW-QFKSWLQIQGQN---PRRRRKASPRRDD
          +YERLP+FCY CG IGH   +C  R    ++D     ++  WL+  G      + R+  SP R+D
Subjt:  QAKYERLPEFCYDCGRIGHLVKEC--RSCLKENDIEGW-QFKSWLQIQGQN---PRRRRKASPRRDD

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.9e-4535.22Show/hide
Query:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLN-FSVDSLGRNVYLFKFDAKRDKEMVLKLGP
        M T +L+++WK F+L+ EEE T I+ D +  +    ++   LVGKL   R I   ++KNT    W+   N F V SLG N++LF F    D+  + K GP
Subjt:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLN-FSVDSLGRNVYLFKFDAKRDKEMVLKLGP

Query:  WLFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLW
        W FD  L+++  P   +  S+L F K   W+R  D+PL    + MA +LG A G F + DC   N  W  NLR+++ +DISKPLRRGI +NL       W
Subjt:  WLFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLW

Query:  IQAKYERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKSWLQIQG----QNPRRRRKASPRRDDKGDVPFNASAAKFGSPNEFDHTRDVEGSHLDMPI
        I  +YERLP+FCY CG      K              Q+ SWL+ QG      P+ ++      D  G+  F++S +  G+ ++      V+ +    PI
Subjt:  IQAKYERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKSWLQIQG----QNPRRRRKASPRRDDKGDVPFNASAAKFGSPNEFDHTRDVEGSHLDMPI

Query:  PAKLNS-VGGVPKGGEVP
           + S V   PK G  P
Subjt:  PAKLNS-VGGVPKGGEVP

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]3.5e-3835.39Show/hide
Query:  SEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPWLFDNILLVLEDPKVN
        SEE+   RI  ++   SL+  + + CLVGKLLT R      +KNT  + W+      V  +G N+++F F    DK  VL  GPW FD  LL+L +   N
Subjt:  SEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPWLFDNILLVLEDPKVN

Query:  LRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWIQAKYERLPEFCYDCG
        ++ S +   +  FW+ + ++PL   NK + + +G A G+F+ +D       W   +RI++++D+ KPLRRG+ + L +S E +W+  KYERLP +CY CG
Subjt:  LRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWIQAKYERLPEFCYDCG

Query:  RIGHLVKECRSCLKEND---IEGWQFKSWLQIQGQNPRRRRKA
        R+GH  +EC   L   D   ++  Q+ +WL++     +  R+A
Subjt:  RIGHLVKECRSCLKEND---IEGWQFKSWLQIQGQNPRRRRKA

TrEMBL top hitse value%identityAlignment
A0A2N9GF83 CCHC-type domain-containing protein1.5e-3935.19Show/hide
Query:  EELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPWLFD
        EEL++DW++F L+ E+E      D           +HCL+GKLL S++     +K T    W          +G N++LF+F  + D + V K  PWLFD
Subjt:  EELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPWLFD

Query:  NILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWIQAK
        N LLVL +   +   +Q+ FN   FW++L  +PL +  K   +++G A G   KVD  +D   W   LR++ISVDI+KP++RG  +  G++ + +WI  K
Subjt:  NILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWIQAK

Query:  YERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKS---WLQ----------IQGQNPRRRRKASPRRDDKGDVPFNASAAKFGSP
        YERLP FC+ CG++GH  +EC   L+  ++    FK    WL+          + G + RR   +S      G V  NA A KF +P
Subjt:  YERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKS---WLQ----------IQGQNPRRRRKASPRRDDKGDVPFNASAAKFGSP

A0A5C7IHI0 CCHC-type domain-containing protein5.8e-3935.37Show/hide
Query:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPW
        M + +L +  +   + +E+   R +    +D    + ++HCLVGK+L+ + +     K      W    N  ++ +G N ++F F+   D++ + + GPW
Subjt:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPW

Query:  LFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWI
         FD  L+VLE P+    +SQL FNKA FW+++ DIP+   NK MAK L E  G  +++   +   CW + LR+K+ +DISKPL+R + + L  S   + +
Subjt:  LFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWI

Query:  QAKYERLPEFCYDCGRIGHLVKECRSC-LKENDIEG--WQFKSWLQ
          KYERLPEFCY CG++GH + +C     K+  IEG   +F SWL+
Subjt:  QAKYERLPEFCYDCGRIGHLVKECRSC-LKENDIEG--WQFKSWLQ

A0A6J1BSZ1 uncharacterized protein LOC1110054814.4e-4735.22Show/hide
Query:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLN-FSVDSLGRNVYLFKFDAKRDKEMVLKLGP
        M    L+++WK F+L+ EE++  ++ D++        +   L+ KLL+ R+I   ++KNT    W+ +   FSVD +G N++LF F+   D+  +L++GP
Subjt:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLN-FSVDSLGRNVYLFKFDAKRDKEMVLKLGP

Query:  WLFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLW
        W FD  L+++++P    +   + F   + W+   D+ L   NK MA +LG A G F  V+   +NFCW   LR+++  D+ KPL RGI +NL       W
Subjt:  WLFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLW

Query:  IQAKYERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKSWLQIQG
        I  +YERLP+F Y CGR+ H++K+C  C  ++  +  Q+  WL+ QG
Subjt:  IQAKYERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKSWLQIQG

A0A6J1DU55 uncharacterized protein LOC1110231352.1e-5239.33Show/hide
Query:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPW
        MD E L+ DW+KF+L+ EE+   ++ D +   +    + + LVGKLL  R I   ++       W+     +V+S+G+N++LF F  + D   V+K GPW
Subjt:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPW

Query:  LFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWI
         FD  L+VL+ P  +  +S+L FN+ AFWI L D+P+ + NK MA +LG A G F+ VDC +  F W  +LRI++ +DI+KPLRRGI IN+       WI
Subjt:  LFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWI

Query:  QAKYERLPEFCYDCGRIGHLVKEC--RSCLKENDIEGW-QFKSWLQIQGQN---PRRRRKASPRRDD
          +YERLP+FCY CG IGH   +C  R    ++D     ++  WL+  G      + R+  SP R+D
Subjt:  QAKYERLPEFCYDCGRIGHLVKEC--RSCLKENDIEGW-QFKSWLQIQGQN---PRRRRKASPRRDD

A0A6J1DX30 uncharacterized protein LOC1110248741.4e-4535.22Show/hide
Query:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLN-FSVDSLGRNVYLFKFDAKRDKEMVLKLGP
        M T +L+++WK F+L+ EEE T I+ D +  +    ++   LVGKL   R I   ++KNT    W+   N F V SLG N++LF F    D+  + K GP
Subjt:  MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLN-FSVDSLGRNVYLFKFDAKRDKEMVLKLGP

Query:  WLFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLW
        W FD  L+++  P   +  S+L F K   W+R  D+PL    + MA +LG A G F + DC   N  W  NLR+++ +DISKPLRRGI +NL       W
Subjt:  WLFDNILLVLEDPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLW

Query:  IQAKYERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKSWLQIQG----QNPRRRRKASPRRDDKGDVPFNASAAKFGSPNEFDHTRDVEGSHLDMPI
        I  +YERLP+FCY CG      K              Q+ SWL+ QG      P+ ++      D  G+  F++S +  G+ ++      V+ +    PI
Subjt:  IQAKYERLPEFCYDCGRIGHLVKECRSCLKENDIEGWQFKSWLQIQG----QNPRRRRKASPRRDDKGDVPFNASAAKFGSPNEFDHTRDVEGSHLDMPI

Query:  PAKLNS-VGGVPKGGEVP
           + S V   PK G  P
Subjt:  PAKLNS-VGGVPKGGEVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein4.1e-1329.14Show/hide
Query:  FKFDAKRDKEMVLKLGPWLFDNILLVLE--DPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDIS
        F F  +   E VL+ GPW F++ +++L+  +P++ L     PF    FW+++  IP +F N+ + + +G A G+ L  D   +     +  R+ +  DI+
Subjt:  FKFDAKRDKEMVLKLGPWLFDNILLVLE--DPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDIS

Query:  KPLRRGIWINLGTSQESLWIQAKYERLPEFCYDCGRIGHLVKECRSCLKEN
         PLR            +L ++ +YERL  FC  CG + H   +  +CL +N
Subjt:  KPLRRGIWINLGTSQESLWIQAKYERLPEFCYDCGRIGHLVKECRSCLKEN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACACAGAGGAACTAATGAAAGATTGGAAGAAATTTCGGCTATCCGAAGAAGAGGAAAGGACACGAATTGAGTTTGATACAAATCTTGATTCTTTGATCTCTGATCA
AATGAATCACTGTTTAGTGGGGAAACTCTTGACTTCCAGAACAATTGTGCCGGGAATTATCAAGAATACTTTTTCCAATGAATGGAGAACAAACCTTAATTTTAGTGTGG
ATAGCCTTGGAAGAAATGTTTATCTGTTCAAATTTGATGCCAAGAGAGATAAAGAAATGGTCTTAAAATTAGGTCCTTGGTTATTTGATAATATTTTATTAGTGTTGGAA
GATCCAAAAGTGAACTTACGTTTATCTCAACTCCCTTTTAATAAAGCAGCTTTTTGGATCCGGTTGATAGACATTCCGCTGAAATTTCAAAACAAATTTATGGCAAAGAA
ACTTGGTGAAGCGTTTGGTGAATTCTTAAAAGTGGACTGTTATCAAGATAATTTTTGTTGGAGGGAAAATCTGAGGATCAAAATTTCGGTGGATATTTCTAAACCATTGA
GAAGAGGAATCTGGATAAACTTGGGAACAAGTCAAGAAAGTCTATGGATACAGGCTAAATATGAAAGATTGCCGGAGTTCTGTTACGATTGTGGTCGAATTGGTCATCTG
GTTAAAGAGTGCAGATCTTGTTTAAAAGAAAATGACATTGAAGGCTGGCAATTCAAAAGTTGGCTACAAATTCAAGGTCAGAACCCAAGAAGAAGGAGGAAAGCCTCACC
GCGAAGAGATGATAAGGGAGATGTCCCTTTCAATGCCTCTGCGGCAAAATTTGGAAGTCCAAATGAGTTTGATCATACCAGAGATGTTGAAGGGAGTCATTTAGATATGC
CTATACCTGCCAAACTAAATTCGGTTGGAGGGGTACCCAAGGGTGGAGAAGTTCCTAAATCTGTTGCTTTAGAATTCACTAAAATAAGAGGAAACCTGGGGTTGAAGCAA
AAGAAATGGAAAAGAAGAGCTCGATTAGTGATGGTGGAAAACACACACATGAAAGAGATACAATCTTCCATGAAAAGAGCAGAGGAAGGAGATTTGAACCAGTCTGCCAA
CAAAAGAAGAAGGGAAGTTTCTCTCCCTCCTGTTGAAACACCTTTTAATTCTAAATCAGCGGAGCTGTTTGAGCAGCTTTGCCGGAAACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACACAGAGGAACTAATGAAAGATTGGAAGAAATTTCGGCTATCCGAAGAAGAGGAAAGGACACGAATTGAGTTTGATACAAATCTTGATTCTTTGATCTCTGATCA
AATGAATCACTGTTTAGTGGGGAAACTCTTGACTTCCAGAACAATTGTGCCGGGAATTATCAAGAATACTTTTTCCAATGAATGGAGAACAAACCTTAATTTTAGTGTGG
ATAGCCTTGGAAGAAATGTTTATCTGTTCAAATTTGATGCCAAGAGAGATAAAGAAATGGTCTTAAAATTAGGTCCTTGGTTATTTGATAATATTTTATTAGTGTTGGAA
GATCCAAAAGTGAACTTACGTTTATCTCAACTCCCTTTTAATAAAGCAGCTTTTTGGATCCGGTTGATAGACATTCCGCTGAAATTTCAAAACAAATTTATGGCAAAGAA
ACTTGGTGAAGCGTTTGGTGAATTCTTAAAAGTGGACTGTTATCAAGATAATTTTTGTTGGAGGGAAAATCTGAGGATCAAAATTTCGGTGGATATTTCTAAACCATTGA
GAAGAGGAATCTGGATAAACTTGGGAACAAGTCAAGAAAGTCTATGGATACAGGCTAAATATGAAAGATTGCCGGAGTTCTGTTACGATTGTGGTCGAATTGGTCATCTG
GTTAAAGAGTGCAGATCTTGTTTAAAAGAAAATGACATTGAAGGCTGGCAATTCAAAAGTTGGCTACAAATTCAAGGTCAGAACCCAAGAAGAAGGAGGAAAGCCTCACC
GCGAAGAGATGATAAGGGAGATGTCCCTTTCAATGCCTCTGCGGCAAAATTTGGAAGTCCAAATGAGTTTGATCATACCAGAGATGTTGAAGGGAGTCATTTAGATATGC
CTATACCTGCCAAACTAAATTCGGTTGGAGGGGTACCCAAGGGTGGAGAAGTTCCTAAATCTGTTGCTTTAGAATTCACTAAAATAAGAGGAAACCTGGGGTTGAAGCAA
AAGAAATGGAAAAGAAGAGCTCGATTAGTGATGGTGGAAAACACACACATGAAAGAGATACAATCTTCCATGAAAAGAGCAGAGGAAGGAGATTTGAACCAGTCTGCCAA
CAAAAGAAGAAGGGAAGTTTCTCTCCCTCCTGTTGAAACACCTTTTAATTCTAAATCAGCGGAGCTGTTTGAGCAGCTTTGCCGGAAACAATGA
Protein sequenceShow/hide protein sequence
MDTEELMKDWKKFRLSEEEERTRIEFDTNLDSLISDQMNHCLVGKLLTSRTIVPGIIKNTFSNEWRTNLNFSVDSLGRNVYLFKFDAKRDKEMVLKLGPWLFDNILLVLE
DPKVNLRLSQLPFNKAAFWIRLIDIPLKFQNKFMAKKLGEAFGEFLKVDCYQDNFCWRENLRIKISVDISKPLRRGIWINLGTSQESLWIQAKYERLPEFCYDCGRIGHL
VKECRSCLKENDIEGWQFKSWLQIQGQNPRRRRKASPRRDDKGDVPFNASAAKFGSPNEFDHTRDVEGSHLDMPIPAKLNSVGGVPKGGEVPKSVALEFTKIRGNLGLKQ
KKWKRRARLVMVENTHMKEIQSSMKRAEEGDLNQSANKRRREVSLPPVETPFNSKSAELFEQLCRKQ