CuGenDBv2

HG10011625 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10011625
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: CCHC-type domain-containing protein
Genome location: Chr01:8404781..8410292
RNA-Seq Expression: HG10011625
Synteny: HG10011625
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like
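
As a small illustration of how the genome location string in this record can be read, the sketch below splits `Chr01:8404781..8410292` into chromosome, start, and end and computes the span of the gene model. It assumes the coordinates are 1-based and inclusive, which the page does not state. The names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'Chrom:start..end' string (hypothetical helper)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)))


loc = parse_location("Chr01:8404781..8410292")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span is the full genomic extent of the gene model, which need not equal the length of the CDS listed further down.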


Homology
GenBank top hits | e value | % identity
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] | 3.1e-41 | 33.59
Query:  LSHDDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVE----------------------NGLQVALLSP
        ++  +L+EEW+ F LT+ +++ A D+D +A+  T   L   L+ KLL  R I+   + N  + AWK++                      +  ++  + P
Subjt:  LSHDDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVE----------------------NGLQVALLSP

Query:  WFLDNKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGC
        W  D + L+I+ +P+   +P D++F+ V LWVHF+++ L C N  MA R+GN +G+F++ E++         LR++++ D+ KP+ R +K+NL+GP+ GC
Subjt:  WFLDNKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGC

Query:  WIPMRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKYSKRPSSFN
        WIP++Y+RLPD    CG + H  KDC+     D   K   QYGPWL++    +S N
Subjt:  WIPMRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKYSKRPSSFN

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia] | 2.9e-26 | 29.41
Query:  DDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVEN-------GLQVALL--------------SPWFLD
        D++ + W  F  T  + ET   +D      T + +  C+V KL   + I++ A+ ++ +  W+V N       G+ + ++               PW   
Subjt:  DDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVEN-------GLQVALL--------------SPWFLD

Query:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKI-NLEGPIRGCWIP
        NK LL+L SP    +P D+ F +   W+  +NIP  C +  MA  +G  +G  +E E D         +R+++KID++KP+RR +K+ N +G  +  W P
Subjt:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKI-NLEGPIRGCWIP

Query:  MRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKYSKRPSSFNSP
        +RY++LPD C  CG IGH+ ++C    K         QYG WL+ +    S + P
Subjt:  MRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKYSKRPSSFNSP

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] | 1.1e-46 | 37.5
Query:  LSHDDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVENGLQVALL---------------------SPW
        + H++L+ +W+KF LT+ ++E A DVD  AV   E  L   LVGKLL  R I++  +  +   AWKVE+ L V  +                      PW
Subjt:  LSHDDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVENGLQVALL---------------------SPW

Query:  FLDNKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFE-NDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGC
        F D K L++LQ P      ++LEF  V  W+H +++P+   N  MA R+GN +G F + + N++G   G  SLRI++ IDITKP+RR +KIN++GP+ GC
Subjt:  FLDNKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFE-NDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGC

Query:  WIPMRYKRLPDLCSFCGIIGHNFKDCNSFY-KTDHKGKKFHQYGPWLKYSKRPSSFNSPLKIINP-QMSSTLDSVRNLKTNSVEKRER
        WIP++Y+RLPD C FCG+IGH+  DC++ Y       +   +YGPWL++    +      K  +P +  S   S  N K   VE+ ++
Subjt:  WIPMRYKRLPDLCSFCGIIGHNFKDCNSFY-KTDHKGKKFHQYGPWLKYSKRPSSFNSPLKIINP-QMSSTLDSVRNLKTNSVEKRER

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | 1.8e-41 | 38.93
Query:  DLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVEN--------GLQVALLS--------------PWFLD
        DL+EEW+ F LT+ +EETA DVD +A   T + L   LVGKL   R IT   M N  R AWK+EN        G  + L S              PW  D
Subjt:  DLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVEN--------GLQVALLS--------------PWFLD

Query:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGCWIPM
         + L+++  P+  + P++L+F  +P+WV F+++PL C    MA R+GN +G F+E + D        +LR+++ +DI+KP+RR +K+NL+GPI G WIP+
Subjt:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGCWIPM

Query:  RYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKY
        +Y+RLPD C  CG+                  +K HQYG WL+Y
Subjt:  RYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKY

XP_028117212.1 uncharacterized protein LOC114314884 [Camellia sinensis] | 5.8e-27 | 32.1
Query:  DLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVENGL---------------------QVALLSPWFLDN
        D IE+     + T +EE    VD   + +TE   G CLVGKLL  R     A+ +     WK   G+                     +V +  PW  D 
Subjt:  DLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVENGL---------------------QVALLSPWFLDN

Query:  KHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGCWIPMR
        K L++L+     ++P+++ F  +  WVH  N+PL      +   +GNT+G F + E   G +    +L I+I+I+I KP+RR +K+ L G     WI ++
Subjt:  KHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGCWIPMR

Query:  YKRLPDLCSFCGIIGHNFKDC-NSFYKTDHKGKKFHQYGPWLK
        Y+RLP+ C  CG++GH+  DC N   +  H+ +   QYGPWL+
Subjt:  YKRLPDLCSFCGIIGHNFKDC-NSFYKTDHKGKKFHQYGPWLK

TrEMBL top hits | e value | % identity
A0A2N9F7A6 Uncharacterized protein | 5.8e-25 | 26.74
Query:  DDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVENGLQVALL---------------------SPWFLD
        + L+EEWRKF+L T  E   F VD  A+G++++    CL+G+L+ D++    A+ +   + W V  G+ +  +                     SPW  +
Subjt:  DDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVENGLQVALL---------------------SPWFLD

Query:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVG---IFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGCW
        N H+L L         T ++F     WV  + +PL        ER+G T+G     D  EN  G  +    LR +I +DI+KPI R   IN    +   W
Subjt:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVG---IFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGCW

Query:  IPMRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHK-GKKFHQYGPWLKYS-----KRPSSFNSPLKIINPQMSSTLDSVRNLKTNSVEKRERITESPQGSS
        +  +Y+RLP LC  CG+IGH  +DC +  +   K G+   QYGPWL+ +     +R   +    K    + ++  +S  NL  N   + +    S +G+S
Subjt:  IPMRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHK-GKKFHQYGPWLKYS-----KRPSSFNSPLKIINPQMSSTLDSVRNLKTNSVEKRERITESPQGSS

Query:  LTQIAVGTLPKRNREADEEEEGKKELPQTFHF----SKLKERAFMSFEEEHVEGSSIKCKKNLTMDFE------GILVEPALDPNLPMQ
               ++P +N E+      +K      H     +K KE + + F +E  +G       N+ +  +      G++  P++  N+ M+
Subjt:  LTQIAVGTLPKRNREADEEEEGKKELPQTFHF----SKLKERAFMSFEEEHVEGSSIKCKKNLTMDFE------GILVEPALDPNLPMQ

A0A6J1BSZ1 uncharacterized protein LOC111005481 | 1.5e-41 | 33.59
Query:  LSHDDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVE----------------------NGLQVALLSP
        ++  +L+EEW+ F LT+ +++ A D+D +A+  T   L   L+ KLL  R I+   + N  + AWK++                      +  ++  + P
Subjt:  LSHDDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVE----------------------NGLQVALLSP

Query:  WFLDNKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGC
        W  D + L+I+ +P+   +P D++F+ V LWVHF+++ L C N  MA R+GN +G+F++ E++         LR++++ D+ KP+ R +K+NL+GP+ GC
Subjt:  WFLDNKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGC

Query:  WIPMRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKYSKRPSSFN
        WIP++Y+RLPD    CG + H  KDC+     D   K   QYGPWL++    +S N
Subjt:  WIPMRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKYSKRPSSFN

A0A6J1D765 uncharacterized protein LOC111017902 | 1.4e-26 | 29.41
Query:  DDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVEN-------GLQVALL--------------SPWFLD
        D++ + W  F  T  + ET   +D      T + +  C+V KL   + I++ A+ ++ +  W+V N       G+ + ++               PW   
Subjt:  DDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVEN-------GLQVALL--------------SPWFLD

Query:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKI-NLEGPIRGCWIP
        NK LL+L SP    +P D+ F +   W+  +NIP  C +  MA  +G  +G  +E E D         +R+++KID++KP+RR +K+ N +G  +  W P
Subjt:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKI-NLEGPIRGCWIP

Query:  MRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKYSKRPSSFNSP
        +RY++LPD C  CG IGH+ ++C    K         QYG WL+ +    S + P
Subjt:  MRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKYSKRPSSFNSP

A0A6J1DU55 uncharacterized protein LOC111023135 | 5.4e-47 | 37.5
Query:  LSHDDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVENGLQVALL---------------------SPW
        + H++L+ +W+KF LT+ ++E A DVD  AV   E  L   LVGKLL  R I++  +  +   AWKVE+ L V  +                      PW
Subjt:  LSHDDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVENGLQVALL---------------------SPW

Query:  FLDNKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFE-NDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGC
        F D K L++LQ P      ++LEF  V  W+H +++P+   N  MA R+GN +G F + + N++G   G  SLRI++ IDITKP+RR +KIN++GP+ GC
Subjt:  FLDNKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFE-NDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGC

Query:  WIPMRYKRLPDLCSFCGIIGHNFKDCNSFY-KTDHKGKKFHQYGPWLKYSKRPSSFNSPLKIINP-QMSSTLDSVRNLKTNSVEKRER
        WIP++Y+RLPD C FCG+IGH+  DC++ Y       +   +YGPWL++    +      K  +P +  S   S  N K   VE+ ++
Subjt:  WIPMRYKRLPDLCSFCGIIGHNFKDCNSFY-KTDHKGKKFHQYGPWLKYSKRPSSFNSPLKIINP-QMSSTLDSVRNLKTNSVEKRER

A0A6J1DX30 uncharacterized protein LOC111024874 | 8.9e-42 | 38.93
Query:  DLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVEN--------GLQVALLS--------------PWFLD
        DL+EEW+ F LT+ +EETA DVD +A   T + L   LVGKL   R IT   M N  R AWK+EN        G  + L S              PW  D
Subjt:  DLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVEN--------GLQVALLS--------------PWFLD

Query:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGCWIPM
         + L+++  P+  + P++L+F  +P+WV F+++PL C    MA R+GN +G F+E + D        +LR+++ +DI+KP+RR +K+NL+GPI G WIP+
Subjt:  NKHLLILQSPMVDVRPTDLEFKWVPLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGCWIPM

Query:  RYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKY
        +Y+RLPD C  CG+                  +K HQYG WL+Y
Subjt:  RYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKKFHQYGPWLKY

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT3G09510.1 Ribonuclease H-like superfamily protein | 5.6e-04 | 27.5
Query:  PKFRDSSPRAPSQKLAHAKGKNPKATDWKKKGPGVDVIPGCPVCKRKEETVTHALFDCSRAKVVWASLFSGSGRLDVQNKDILDIWTELAMKFDDSELTQ
        PK +    RA SQ L         AT  +    G+ + P CP C R+ E++ HALF C  A + W    S   R  + + D  +  + +     D+ ++ 
Subjt:  PKFRDSSPRAPSQKLAHAKGKNPKATDWKKKGPGVDVIPGCPVCKRKEETVTHALFDCSRAKVVWASLFSGSGRLDVQNKDILDIWTELAMKFDDSELTQ

Query:  -----ACINSWAIWGDRNKV
                  W IW  RN V
Subjt:  -----ACINSWAIWGDRNKV


Sequences
CDS sequence
ATGGAAGCGGATCTCTCACATGATGATTTGATTGAGGAGTGGAGAAAGTTTAATTTAACGACGGCAAAGGAGGAAACAGCCTTTGATGTTGATCATACGGCGGTGGGAAA
TACTGAGAACGAATTGGGATGTTGTTTGGTAGGGAAGTTGTTATGTGATCGGTTCATTACCAGTGTAGCCATGTACAACATCTTTCGAAAAGCTTGGAAAGTGGAAAATG
GTTTACAAGTAGCGCTTTTATCCCCTTGGTTTTTGGACAACAAACACCTGCTCATTCTGCAATCTCCAATGGTGGATGTTAGACCAACTGATCTGGAATTCAAATGGGTG
CCACTTTGGGTCCATTTCTACAATATACCCCTCTGTTGTTTCAACCACCTCATGGCCGAAAGAATTGGTAATACCGTCGGAATATTTGATGAATTTGAAAACGACCAAGG
TATGATGATCGGGAAGGAGAGTTTACGGATCCAGATCAAGATAGATATAACCAAACCAATTCGCCGTGATGTGAAAATTAATCTCGAAGGACCAATTAGGGGCTGCTGGA
TTCCAATGAGGTACAAGAGACTCCCAGACCTCTGTTCTTTTTGTGGAATCATAGGGCATAATTTCAAAGATTGCAATTCATTTTACAAGACTGATCACAAAGGAAAAAAA
TTTCATCAGTATGGCCCCTGGTTGAAATATTCCAAGCGCCCATCATCCTTCAATTCGCCTTTGAAAATAATCAATCCGCAGATGAGTTCGACGCTTGACTCAGTTAGGAA
TTTAAAAACCAATTCTGTGGAAAAACGGGAACGAATCACGGAATCGCCTCAGGGAAGCTCGCTGACCCAAATAGCAGTCGGAACGTTACCGAAGAGGAATCGCGAAGCGG
ATGAGGAGGAAGAAGGAAAAAAGGAGTTGCCCCAAACGTTTCATTTTTCGAAGCTGAAGGAAAGGGCATTTATGTCTTTTGAGGAGGAACACGTGGAGGGATCAAGCATT
AAGTGCAAAAAGAATTTAACAATGGATTTTGAAGGGATCTTGGTGGAGCCCGCGCTCGATCCTAATTTGCCCATGCAAATACCAATTCTGGGTAGTACTGAACCATTACA
AACTTCTGGTACCGTAGGCTTCAATTATTTTGGGCCACAAAATGGCCCAAATTTTGGGCTAACTTTAAAGAAATCAGAGGAGTTTACTGCCCAGACAACAAAGCCCAAAT
TCCGTGATTCTAGCCCAAGAGCTCCATCTCAGAAATTGGCTCATGCAAAGGGGAAAAACCCAAAGGCAACAGATTGGAAAAAAAAAGGCCCGGGTGTGGATGTTATACCT
GGTTGCCCGGTATGCAAAAGAAAAGAGGAAACTGTGACTCATGCTTTGTTTGATTGTTCTAGGGCCAAGGTTGTGTGGGCATCTTTATTTAGTGGCAGCGGGCGTTTGGA
TGTTCAGAATAAAGACATTTTAGATATTTGGACAGAGTTGGCCATGAAATTTGATGATTCTGAACTAACTCAGGCTTGCATTAATTCCTGGGCAATTTGGGGGGATAGGA
ATAAAGTTCACAAATCAGAAAGTCTCCCTTCGTTAGAGCATCAGTGTGAATGGATTTTGGAGTATATGATGGAAATTGGGACGAAACCAAATGAACAATTTAGTTCTATT
ATAGGACCCCAAGTGTACAATATCTCAAATAGTGCTTTGAACATTCTTCATGTTGATGCGGCGTGTAGATTGGATTCCCCTATGGTGGGTTATGGTGGGGTTATTAGTAC
TCCAACTGGGTGCTTGGTGGGTACTATGCATGGTTTCAAGAATACATCTCTGAGTCCATTAGGAGCTGAAACTTTGGCTATTTATGAGGGCCTTCGTTTAGCCATTAGAA
TGGAGTTACCTCATATTTTGGTATTTATCTGA
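
To show how the CDS above relates to the protein sequence listed at the end of this record, here is a minimal sketch that translates the first 30 nucleotides of the CDS with the standard genetic code. The function name `translate` and the table construction are illustrative assumptions, not the pipeline used to produce this record, and the sketch assumes the standard nuclear code applies.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon (hypothetical helper).

    Stops at the first stop codon; trailing bases that do not fill a
    complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS shown above.
cds_start = "ATGGAAGCGGATCTCTCACATGATGATTTG"
print(translate(cds_start))
```

The printed peptide can be compared with the first ten residues of the protein sequence in this record.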
mRNA sequence
ATGGAAGCGGATCTCTCACATGATGATTTGATTGAGGAGTGGAGAAAGTTTAATTTAACGACGGCAAAGGAGGAAACAGCCTTTGATGTTGATCATACGGCGGTGGGAAA
TACTGAGAACGAATTGGGATGTTGTTTGGTAGGGAAGTTGTTATGTGATCGGTTCATTACCAGTGTAGCCATGTACAACATCTTTCGAAAAGCTTGGAAAGTGGAAAATG
GTTTACAAGTAGCGCTTTTATCCCCTTGGTTTTTGGACAACAAACACCTGCTCATTCTGCAATCTCCAATGGTGGATGTTAGACCAACTGATCTGGAATTCAAATGGGTG
CCACTTTGGGTCCATTTCTACAATATACCCCTCTGTTGTTTCAACCACCTCATGGCCGAAAGAATTGGTAATACCGTCGGAATATTTGATGAATTTGAAAACGACCAAGG
TATGATGATCGGGAAGGAGAGTTTACGGATCCAGATCAAGATAGATATAACCAAACCAATTCGCCGTGATGTGAAAATTAATCTCGAAGGACCAATTAGGGGCTGCTGGA
TTCCAATGAGGTACAAGAGACTCCCAGACCTCTGTTCTTTTTGTGGAATCATAGGGCATAATTTCAAAGATTGCAATTCATTTTACAAGACTGATCACAAAGGAAAAAAA
TTTCATCAGTATGGCCCCTGGTTGAAATATTCCAAGCGCCCATCATCCTTCAATTCGCCTTTGAAAATAATCAATCCGCAGATGAGTTCGACGCTTGACTCAGTTAGGAA
TTTAAAAACCAATTCTGTGGAAAAACGGGAACGAATCACGGAATCGCCTCAGGGAAGCTCGCTGACCCAAATAGCAGTCGGAACGTTACCGAAGAGGAATCGCGAAGCGG
ATGAGGAGGAAGAAGGAAAAAAGGAGTTGCCCCAAACGTTTCATTTTTCGAAGCTGAAGGAAAGGGCATTTATGTCTTTTGAGGAGGAACACGTGGAGGGATCAAGCATT
AAGTGCAAAAAGAATTTAACAATGGATTTTGAAGGGATCTTGGTGGAGCCCGCGCTCGATCCTAATTTGCCCATGCAAATACCAATTCTGGGTAGTACTGAACCATTACA
AACTTCTGGTACCGTAGGCTTCAATTATTTTGGGCCACAAAATGGCCCAAATTTTGGGCTAACTTTAAAGAAATCAGAGGAGTTTACTGCCCAGACAACAAAGCCCAAAT
TCCGTGATTCTAGCCCAAGAGCTCCATCTCAGAAATTGGCTCATGCAAAGGGGAAAAACCCAAAGGCAACAGATTGGAAAAAAAAAGGCCCGGGTGTGGATGTTATACCT
GGTTGCCCGGTATGCAAAAGAAAAGAGGAAACTGTGACTCATGCTTTGTTTGATTGTTCTAGGGCCAAGGTTGTGTGGGCATCTTTATTTAGTGGCAGCGGGCGTTTGGA
TGTTCAGAATAAAGACATTTTAGATATTTGGACAGAGTTGGCCATGAAATTTGATGATTCTGAACTAACTCAGGCTTGCATTAATTCCTGGGCAATTTGGGGGGATAGGA
ATAAAGTTCACAAATCAGAAAGTCTCCCTTCGTTAGAGCATCAGTGTGAATGGATTTTGGAGTATATGATGGAAATTGGGACGAAACCAAATGAACAATTTAGTTCTATT
ATAGGACCCCAAGTGTACAATATCTCAAATAGTGCTTTGAACATTCTTCATGTTGATGCGGCGTGTAGATTGGATTCCCCTATGGTGGGTTATGGTGGGGTTATTAGTAC
TCCAACTGGGTGCTTGGTGGGTACTATGCATGGTTTCAAGAATACATCTCTGAGTCCATTAGGAGCTGAAACTTTGGCTATTTATGAGGGCCTTCGTTTAGCCATTAGAA
TGGAGTTACCTCATATTTTGGTATTTATCTGA
Protein sequence
MEADLSHDDLIEEWRKFNLTTAKEETAFDVDHTAVGNTENELGCCLVGKLLCDRFITSVAMYNIFRKAWKVENGLQVALLSPWFLDNKHLLILQSPMVDVRPTDLEFKWV
PLWVHFYNIPLCCFNHLMAERIGNTVGIFDEFENDQGMMIGKESLRIQIKIDITKPIRRDVKINLEGPIRGCWIPMRYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKK
FHQYGPWLKYSKRPSSFNSPLKIINPQMSSTLDSVRNLKTNSVEKRERITESPQGSSLTQIAVGTLPKRNREADEEEEGKKELPQTFHFSKLKERAFMSFEEEHVEGSSI
KCKKNLTMDFEGILVEPALDPNLPMQIPILGSTEPLQTSGTVGFNYFGPQNGPNFGLTLKKSEEFTAQTTKPKFRDSSPRAPSQKLAHAKGKNPKATDWKKKGPGVDVIP
GCPVCKRKEETVTHALFDCSRAKVVWASLFSGSGRLDVQNKDILDIWTELAMKFDDSELTQACINSWAIWGDRNKVHKSESLPSLEHQCEWILEYMMEIGTKPNEQFSSI
IGPQVYNISNSALNILHVDAACRLDSPMVGYGGVISTPTGCLVGTMHGFKNTSLSPLGAETLAIYEGLRLAIRMELPHILVFI
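
The InterPro annotation IPR025836 names a zinc knuckle with the spacing CX2CX4HX4C. As a rough sketch, that spacing can be written as a regular expression and searched in a fragment of the protein sequence above. The regex is a literal reading of the pattern name only; InterPro signatures are built from profile models rather than a plain regex, and the record does not say which residues the annotation was assigned to, so a match here is suggestive at most. The names `ZINC_KNUCKLE` and `find_knuckles` are hypothetical.

```python
import re

# Literal reading of "CX2CX4HX4C": Cys, 2 any, Cys, 4 any, His, 4 any, Cys.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_knuckles(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


# Fragment copied from the protein sequence in this record.
fragment = "RYKRLPDLCSFCGIIGHNFKDCNSFYKTDHKGKK"
hits = find_knuckles(fragment)
print(hits)
```

Running the same function on the full protein sequence would report every stretch with this spacing; positions are relative to whatever string is passed in.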