; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014906 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014906
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold3:45456510..45462159
RNA-Seq ExpressionSpg014906
SyntenySpg014906
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG57331.1 hypothetical protein EZV62_018644 [Acer yangbiense]3.3e-2035.29Show/hide
Query:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSF
        AD++ +    LSL E+E   ++ +++T  +   K L      KILS K+VN D FM  IP+IW I+  V IE  G N+F   F   +D+ +V +GGPW F
Subjt:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSF

Query:  DSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR
        D +LL+ AEP+G   +  + F   +FWV  H +P +C +++    LG  IG +  +D    G  + + +R
Subjt:  DSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR

TXG73339.1 hypothetical protein EZV62_001918 [Acer yangbiense]3.3e-2036.65Show/hide
Query:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSF
        A ++ +  +NLS+ E E   V+E  ET+IE   KD+      K+L+ K VN++ F   I +IW   G+V +E    N+F+  F+ Q+D+NRV + GPW F
Subjt:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSF

Query:  DSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEK
          SL++  +PKG  + S L F  A+FWV  H  P +C +R+ A+ +   IG    +  D K
Subjt:  DSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEK

XP_015380691.1 uncharacterized protein LOC107174364 [Citrus sinensis]4.3e-2036.99Show/hide
Query:  ETDIEAT-NKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSFDSSLLLFAEPKGNISVSALDFRYA
        E +I+AT  K L+     K+L T+ VN++ F   + ++W     V IE  G N F+ KF  + DK RV  GGPW FD +LL+  EPKG   ++   F + 
Subjt:  ETDIEAT-NKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSFDSSLLLFAEPKGNISVSALDFRYA

Query:  SFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR
        +FW+    +P  C  ++  + LG  IG  E ++TDE G   GE  R
Subjt:  SFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR

XP_024042230.1 uncharacterized protein LOC112099248 [Citrus clementina]1.0e-2134.91Show/hide
Query:  DQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSFD
        D+L  + K +++ E++K RV  +E +      K L+N    K+L T++VN++     + ++W     V IE  G N+F+ KF  + DK RV  GGPW FD
Subjt:  DQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSFD

Query:  SSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR
         +L++  EP+G   V    F + SFW+    +P  C  +++   LG  IG+ E V+TDE G+  GE  R
Subjt:  SSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR

XP_024043038.1 uncharacterized protein LOC112099799 [Citrus clementina]8.6e-2133.72Show/hide
Query:  DQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAAC---KILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPW
        D+L  + K +++ E++K RV      +++   K    +A C   K+L ++ VN++     + ++W     V IE  G N+F+ +F  + DK RV  GGPW
Subjt:  DQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAAC---KILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPW

Query:  SFDSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR
         FD +L++  EP+G   V+   F + SFWV    +P  C  + + +ALG  IGI E V+TD+ G+  GE  R
Subjt:  SFDSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR

TrEMBL top hitse value%identityAlignment
A0A5C7HJY7 DUF4283 domain-containing protein1.6e-2035.29Show/hide
Query:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSF
        AD++ +    LSL E+E   ++ +++T  +   K L      KILS K+VN D FM  IP+IW I+  V IE  G N+F   F   +D+ +V +GGPW F
Subjt:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSF

Query:  DSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR
        D +LL+ AEP+G   +  + F   +FWV  H +P +C +++    LG  IG +  +D    G  + + +R
Subjt:  DSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR

A0A5C7I6E5 CCHC-type domain-containing protein4.6e-2031.79Show/hide
Query:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAAC---KILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGP
        A+++ +    LS+ E+E    +     ++  T++    +A C   K+L++++V +++F++ + +IW + G V IE    N+F   F N +D+ RV +GGP
Subjt:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAAC---KILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGP

Query:  WSFDSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR
        WSFD ++++F EP G   +S L F Y  FWV  H LP +C + +    LG+ IG     D+    + SG  +R
Subjt:  WSFDSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLR

A0A5C7IND8 CCHC-type domain-containing protein3.5e-2034.71Show/hide
Query:  MAGADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGP
        M  A+++ +  + L+L E+E   +++++E       K L+   A K+LS+K VN+D FM  +P+IW       IE    N F   F N+KD+ R+  G P
Subjt:  MAGADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGP

Query:  WSFDSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGE
        WSFD +LL+  EPKG   +  + F   +FWV  H++P +C + +   ALGN IG  + +D    G+  G+
Subjt:  WSFDSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGE

A0A5C7IVM6 DUF4283 domain-containing protein3.5e-2041.41Show/hide
Query:  KILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSFDSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKY
        KILS KMVN+D FM  I +IW +   V IE    N+F  +F ++ D  RV  GGPWSFD++L+    P+G  S+ +L+F +A FWV  H++P +C +++ 
Subjt:  KILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSFDSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKY

Query:  AEALGNSIGIYEYVDTDEKGNISGETLR
           LG  IG    VD    G  SG+ +R
Subjt:  AEALGNSIGIYEYVDTDEKGNISGETLR

A0A5C7IW83 CCHC-type domain-containing protein1.6e-2036.65Show/hide
Query:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSF
        A ++ +  +NLS+ E E   V+E  ET+IE   KD+      K+L+ K VN++ F   I +IW   G+V +E    N+F+  F+ Q+D+NRV + GPW F
Subjt:  ADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSF

Query:  DSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEK
          SL++  +PKG  + S L F  A+FWV  H  P +C +R+ A+ +   IG    +  D K
Subjt:  DSSLLLFAEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-1024.45Show/hide
Query:  ANSKGKFTDSVWAIIAEHLKAE-ELEDAAII-------LWSIWTFRNKVTVAKDKANILNLSRQIKRHLREQRSSRNPNLMEAILESQKNHEV-WSPPPT
        A  +G++TDS++A +   L  E E+     I       LW +W  RN++     + +   + R+      E  + R      +  + ++N  V W  PP 
Subjt:  ANSKGKFTDSVWAIIAEHLKAE-ELEDAAII-------LWSIWTFRNKVTVAKDKANILNLSRQIKRHLREQRSSRNPNLMEAILESQKNHEV-WSPPPT

Query:  NFVKINVDASWNASQSKGGTGWIIRDSLGSPIGMDCSTIRRNWSIKALEALAIKQGLEAYLQHQNNRASGALVESDSMEAVRALNHEEVDVSEMKVIIDD
         +VK N DA+W     + G GWI+R+  G  + M    + R  ++   E  A++  +    +    R    + ESD+   V  LN ++     ++  ++D
Subjt:  NFVKINVDASWNASQSKGGTGWIIRDSLGSPIGMDCSTIRRNWSIKALEALAIKQGLEAYLQHQNNRASGALVESDSMEAVRALNHEEVDVSEMKVIIDD

Query:  IEGLAESTGGISFVKCRRSLNKIAHVLAR
        I+ L      + F    R  NK+A  +AR
Subjt:  IEGLAESTGGISFVKCRRSLNKIAHVLAR

AT3G09510.1 Ribonuclease H-like superfamily protein2.8e-0927.41Show/hide
Query:  ILWSIWTFRNKVTVAK--DKANILNLSRQIKRH---LREQRSSRNPNLMEAILESQKNHEVWSPPPTNFVKINVDASWNASQSKGGTGWIIRDSLGSPIG
        ++W IW  RN V   K  +  +   LS + + H      Q   + P+    I E   N   W  PP  +VK N DA ++  + +   GWIIR+  G+PI 
Subjt:  ILWSIWTFRNKVTVAK--DKANILNLSRQIKRH---LREQRSSRNPNLMEAILESQKNHEVWSPPPTNFVKINVDASWNASQSKGGTGWIIRDSLGSPIG

Query:  MDCSTIRRNWSIKALEALAIKQGLEAYLQHQNNRASGALVESDSMEAVRALNHEEVDVSEMKVIIDDIEGLAESTGGISFVKCRRSLNKIAHVLARF
             +    +    E  A+   L A  Q      +   +E D    +  +N      S +   ++DI   A     I F   RR  NK+AHVLA++
Subjt:  MDCSTIRRNWSIKALEALAIKQGLEAYLQHQNNRASGALVESDSMEAVRALNHEEVDVSEMKVIIDDIEGLAESTGGISFVKCRRSLNKIAHVLARF

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1)3.7e-0635.94Show/hide
Query:  WSPPPTNFVKINVDASWNASQSKGGTGWIIRDSLGSPIGMDCSTIRRNWSIKALEALAIKQGLE
        WSPPP  ++K N D+ +   +    T WIIRDS G  I   C+ +++++S    EAL     L+
Subjt:  WSPPPTNFVKINVDASWNASQSKGGTGWIIRDSLGSPIGMDCSTIRRNWSIKALEALAIKQGLE

AT4G29090.1 Ribonuclease H-like superfamily protein4.5e-1227Show/hide
Query:  ILWSIWTFRNKVTVAKDKANILNLSRQIKRHLREQRSSRNPNLMEAILESQKNHEV---WSPPPTNFVKINVDASWNASQSKGGTGWIIRDSLGSPIGMD
        +LW +W  RN++     + N   + R+ +  L E R            + Q N      W PPP  +VK N DA+WN    + G GW++R+  G    M 
Subjt:  ILWSIWTFRNKVTVAKDKANILNLSRQIKRHLREQRSSRNPNLMEAILESQKNHEV---WSPPPTNFVKINVDASWNASQSKGGTGWIIRDSLGSPIGMD

Query:  CSTIRRNWSIKALEALAIKQGLEAYLQHQNNRASGALVESDSMEAVRALNHEEVDVSEMKVIIDDIEGLAESTGGISFVKCRRSLNKIAHVLARFAAGLL
           + +  S+   E  A++  + +  + Q N     + ESDS   +  LN++E+  S +K  I D++ L      + FV   R  N +A  +AR +   L
Subjt:  CSTIRRNWSIKALEALAIKQGLEAYLQHQNNRASGALVESDSMEAVRALNHEEVDVSEMKVIIDDIEGLAESTGGISFVKCRRSLNKIAHVLARFAAGLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAGCTGATCAACTGCAAGAGCAAATAAAAAATCTCAGTCTCGTCGAGCAGGAAAAAAGAAGAGTGGTGGAGATTGAAGAAACAGATATTGAAGCAACAAACAA
AGATCTTTCCAATGTGGCAGCGTGCAAAATCCTATCCACCAAAATGGTCAACCAAGATATGTTCATGGAAAAGATTCCACGAATTTGGGGGATAGAAGGAAGAGTCACCA
TTGAGAAGGATGGAAGAAATTTGTTCCTCTGCAAATTTATGAACCAAAAAGACAAAAACAGAGTTACAAAAGGGGGGCCTTGGAGCTTTGACAGCAGCCTCCTCTTGTTT
GCCGAACCGAAAGGAAACATCAGTGTGAGCGCTCTGGATTTCAGGTACGCATCTTTCTGGGTCCATTTCCATAAACTCCCACGAGTGTGTTATAGCAGGAAGTATGCAGA
AGCGCTGGGGAACTCTATAGGCATCTATGAGTATGTGGACACAGACGAGAAAGGAAACATAAGTGGAGAAACTCTGCGGGGAAGCAAGGGCATATACAGAAACAAAAAAC
ATCACGACAGATTCGAATTCTTCAGAGGAAGAGGTAGGGGAAGAGGAGTCGGAAATAGAGGCTGGAATGAGGTCACACACGATCAAGATGGAGCTACAGAAGTTCACGAA
AATGGGCATGAAACGCCGGCCAACCAGCCGGAAAAACCTCCGGAGAAAGACACGGGCGAAGCTGGTAGGAACTCTGACAACCAAACAGAACACTCTAAGAATATAGCCGT
TGAAAACACAATGATGGAAATCATTGAGACCGAGGTCAGGCCAAGCAATGAACATGGTGCTAAAGGACAAACGAATGAGGGACCCACAGGACAGCATGAAAAGCACTTTA
TCTCCTTAAACAACGAAAAAGGGCTGTTAGAAATAAAAGGAAAAGGCCAAGCCCATCAAGAAAACAACACTCTGACCGACAGAGACCTGGAGGAAAACAATAGAATGAAG
AATGAAAACCAGACAGTGAGGAATAAGAAAGAAGTTGAAAACCTCAAATGGGCTAGAGTTCATGAGCTCATGCTTGATTCTGGTGACTGGAACATCCCCCTCATCAAGAG
CAATTTCATCCCCGTTGATGTGGAAGATATTCTTGCTATTCCGCTGGGAAGAAGGGCTGCAAAGGACGAAATTATTTGGGATGCCAATTCCAAAGGAAAATTCACGGATT
CAGTTTGGGCAATAATAGCAGAACACTTAAAGGCTGAGGAGCTAGAAGATGCAGCAATCATATTATGGTCGATATGGACTTTCCGGAACAAAGTGACAGTGGCAAAAGAC
AAAGCAAATATTCTCAATCTTTCCAGGCAAATCAAAAGACATTTGAGGGAGCAGAGGTCTAGCAGGAACCCAAACCTGATGGAAGCAATATTGGAGAGCCAGAAGAATCA
TGAAGTTTGGTCCCCTCCTCCGACCAATTTCGTGAAGATCAACGTCGATGCCTCTTGGAACGCATCACAATCAAAAGGTGGTACTGGTTGGATCATTCGTGATTCTCTTG
GATCTCCGATTGGTATGGACTGCTCGACAATTCGCCGGAATTGGTCAATAAAAGCTCTCGAAGCGCTAGCTATTAAGCAAGGGTTGGAAGCTTACCTCCAACACCAGAAC
AATCGAGCCTCAGGCGCGCTAGTCGAGTCGGACTCCATGGAAGCTGTTCGAGCCCTCAATCATGAAGAGGTTGATGTCTCCGAGATGAAGGTCATTATCGATGATATAGA
GGGGCTTGCCGAGTCGACCGGAGGAATCTCCTTCGTTAAATGTCGCAGGTCCCTCAACAAAATCGCGCATGTCCTCGCGCGCTTTGCTGCCGGTTTGTTGCCGAATTTTG
AATTGTCCCCCGTCGTTGGGAACGGTTTTTTTGGGCTTGTTTTTCCCTCTTCCACGCTGGAAGAGAGTTGGGTTTGGTGGGAAGGTGAACTGCCTTTCTTGATTTCCCAT
TTATTATGGGAAGATTTAGGTGTATCTAATTTTTCAGTCTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGAGCTGATCAACTGCAAGAGCAAATAAAAAATCTCAGTCTCGTCGAGCAGGAAAAAAGAAGAGTGGTGGAGATTGAAGAAACAGATATTGAAGCAACAAACAA
AGATCTTTCCAATGTGGCAGCGTGCAAAATCCTATCCACCAAAATGGTCAACCAAGATATGTTCATGGAAAAGATTCCACGAATTTGGGGGATAGAAGGAAGAGTCACCA
TTGAGAAGGATGGAAGAAATTTGTTCCTCTGCAAATTTATGAACCAAAAAGACAAAAACAGAGTTACAAAAGGGGGGCCTTGGAGCTTTGACAGCAGCCTCCTCTTGTTT
GCCGAACCGAAAGGAAACATCAGTGTGAGCGCTCTGGATTTCAGGTACGCATCTTTCTGGGTCCATTTCCATAAACTCCCACGAGTGTGTTATAGCAGGAAGTATGCAGA
AGCGCTGGGGAACTCTATAGGCATCTATGAGTATGTGGACACAGACGAGAAAGGAAACATAAGTGGAGAAACTCTGCGGGGAAGCAAGGGCATATACAGAAACAAAAAAC
ATCACGACAGATTCGAATTCTTCAGAGGAAGAGGTAGGGGAAGAGGAGTCGGAAATAGAGGCTGGAATGAGGTCACACACGATCAAGATGGAGCTACAGAAGTTCACGAA
AATGGGCATGAAACGCCGGCCAACCAGCCGGAAAAACCTCCGGAGAAAGACACGGGCGAAGCTGGTAGGAACTCTGACAACCAAACAGAACACTCTAAGAATATAGCCGT
TGAAAACACAATGATGGAAATCATTGAGACCGAGGTCAGGCCAAGCAATGAACATGGTGCTAAAGGACAAACGAATGAGGGACCCACAGGACAGCATGAAAAGCACTTTA
TCTCCTTAAACAACGAAAAAGGGCTGTTAGAAATAAAAGGAAAAGGCCAAGCCCATCAAGAAAACAACACTCTGACCGACAGAGACCTGGAGGAAAACAATAGAATGAAG
AATGAAAACCAGACAGTGAGGAATAAGAAAGAAGTTGAAAACCTCAAATGGGCTAGAGTTCATGAGCTCATGCTTGATTCTGGTGACTGGAACATCCCCCTCATCAAGAG
CAATTTCATCCCCGTTGATGTGGAAGATATTCTTGCTATTCCGCTGGGAAGAAGGGCTGCAAAGGACGAAATTATTTGGGATGCCAATTCCAAAGGAAAATTCACGGATT
CAGTTTGGGCAATAATAGCAGAACACTTAAAGGCTGAGGAGCTAGAAGATGCAGCAATCATATTATGGTCGATATGGACTTTCCGGAACAAAGTGACAGTGGCAAAAGAC
AAAGCAAATATTCTCAATCTTTCCAGGCAAATCAAAAGACATTTGAGGGAGCAGAGGTCTAGCAGGAACCCAAACCTGATGGAAGCAATATTGGAGAGCCAGAAGAATCA
TGAAGTTTGGTCCCCTCCTCCGACCAATTTCGTGAAGATCAACGTCGATGCCTCTTGGAACGCATCACAATCAAAAGGTGGTACTGGTTGGATCATTCGTGATTCTCTTG
GATCTCCGATTGGTATGGACTGCTCGACAATTCGCCGGAATTGGTCAATAAAAGCTCTCGAAGCGCTAGCTATTAAGCAAGGGTTGGAAGCTTACCTCCAACACCAGAAC
AATCGAGCCTCAGGCGCGCTAGTCGAGTCGGACTCCATGGAAGCTGTTCGAGCCCTCAATCATGAAGAGGTTGATGTCTCCGAGATGAAGGTCATTATCGATGATATAGA
GGGGCTTGCCGAGTCGACCGGAGGAATCTCCTTCGTTAAATGTCGCAGGTCCCTCAACAAAATCGCGCATGTCCTCGCGCGCTTTGCTGCCGGTTTGTTGCCGAATTTTG
AATTGTCCCCCGTCGTTGGGAACGGTTTTTTTGGGCTTGTTTTTCCCTCTTCCACGCTGGAAGAGAGTTGGGTTTGGTGGGAAGGTGAACTGCCTTTCTTGATTTCCCAT
TTATTATGGGAAGATTTAGGTGTATCTAATTTTTCAGTCTTTTAA
Protein sequenceShow/hide protein sequence
MAGADQLQEQIKNLSLVEQEKRRVVEIEETDIEATNKDLSNVAACKILSTKMVNQDMFMEKIPRIWGIEGRVTIEKDGRNLFLCKFMNQKDKNRVTKGGPWSFDSSLLLF
AEPKGNISVSALDFRYASFWVHFHKLPRVCYSRKYAEALGNSIGIYEYVDTDEKGNISGETLRGSKGIYRNKKHHDRFEFFRGRGRGRGVGNRGWNEVTHDQDGATEVHE
NGHETPANQPEKPPEKDTGEAGRNSDNQTEHSKNIAVENTMMEIIETEVRPSNEHGAKGQTNEGPTGQHEKHFISLNNEKGLLEIKGKGQAHQENNTLTDRDLEENNRMK
NENQTVRNKKEVENLKWARVHELMLDSGDWNIPLIKSNFIPVDVEDILAIPLGRRAAKDEIIWDANSKGKFTDSVWAIIAEHLKAEELEDAAIILWSIWTFRNKVTVAKD
KANILNLSRQIKRHLREQRSSRNPNLMEAILESQKNHEVWSPPPTNFVKINVDASWNASQSKGGTGWIIRDSLGSPIGMDCSTIRRNWSIKALEALAIKQGLEAYLQHQN
NRASGALVESDSMEAVRALNHEEVDVSEMKVIIDDIEGLAESTGGISFVKCRRSLNKIAHVLARFAAGLLPNFELSPVVGNGFFGLVFPSSTLEESWVWWEGELPFLISH
LLWEDLGVSNFSVF