; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007722 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007722
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr9:3747088..3748768
RNA-Seq ExpressionLag0007722
SyntenyLag0007722
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015380691.1 uncharacterized protein LOC107174364 [Citrus sinensis]7.9e-2533.17Show/hide
Query:  LENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI-IHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEEPKGDIYDEDMDF--------------
        L   ++ K+   R ++ E F S + ++W     + I  +G N F+ KF     K R++  GPW +D+A+L++ EPKG        F              
Subjt:  LENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI-IHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEEPKGDIYDEDMDF--------------

Query:  ----KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKECEVNYGSNNEELS
            KEL+  +G ++G VE+++ DE+     EF+ RI++ I++++PLK+ +FL+  +G++D  + V YE+L DFCY CG++GH  KEC    G   E+L 
Subjt:  ----KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKECEVNYGSNNEELS

Query:  YG
        YG
Subjt:  YG

XP_021713609.1 uncharacterized protein LOC110681792 [Chenopodium quinoa]1.2e-2530.49Show/hide
Query:  DSIIKQLGELKVTDAKRACM-YKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWN-QEHTIIHHMGFNLFLCKFWNTRIKGRIIDFGPWFY
        D ++K   +L++T+ +   +  +  E  T+ TK QL   ++ K++T +  +LE     +  +W  +E   +  +  NLF  +F+    K R+++  PWF+
Subjt:  DSIIKQLGELKVTDAKRACM-YKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWN-QEHTIIHHMGFNLFLCKFWNTRIKGRIIDFGPWFY

Query:  DKAMLLMEEPKGDIYDEDMDFKELVV------------AIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKL
        D+ +LL++E  G+    ++ FK   +            +I  +    E ++LDE +   W  S+RIK+ +D++ PL+RG+FL + + ++ RWI V YE+L
Subjt:  DKAMLLMEEPKGDIYDEDMDFKELVV------------AIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKL

Query:  LDFCYGCGLLGHIIKECEVNYGSNNEE----LSYGPSLRKPVKLKT
         DFC+ CG L H  KEC+    +   E      YGP LR   K KT
Subjt:  LDFCYGCGLLGHIIKECEVNYGSNNEE----LSYGPSLRKPVKLKT

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.1e-2329.92Show/hide
Query:  AADSIIKQLGELKVTDAKRACMYKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI--IHHMGFNLFLCKFWNTRIKGRIIDFGPW
        AA +++++    K+T  +      +   A + T   LE +++CK+ + R IS  V  + +   W  +     +  +GFN+FL  F  +  + RI+  GPW
Subjt:  AADSIIKQLGELKVTDAKRACMYKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI--IHHMGFNLFLCKFWNTRIKGRIIDFGPW

Query:  FYDKAMLLMEEPKGDIYDEDMDF------------------KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRW
         +D+A+++++ P       DMDF                  K +   +G+ +G  E V+    NN  W   LR++++ DV  PL RGI L         W
Subjt:  FYDKAMLLMEEPKGDIYDEDMDF------------------KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRW

Query:  IKVTYEKLLDFCYGCGLLGHIIKEC-EVNYGSNNEELSYGPSLR
        I + YE+L DF Y CG L HI+K+C +    S ++ L YGP LR
Subjt:  IKVTYEKLLDFCYGCGLLGHIIKEC-EVNYGSNNEELSYGPSLR

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]5.1e-2428.93Show/hide
Query:  TKHQLENAILCKIFTNRKISLEVFCSMMPKIWN-QEHTIIHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEEPKGDIYDEDMDFK---------
        T   ++  ++ K+ T+++IS E   S+M  +W     T    +G N+++  F +   K R++  GPW ++K++L++  P       DM+F          
Subjt:  TKHQLENAILCKIFTNRKISLEVFCSMMPKIWN-QEHTIIHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEEPKGDIYDEDMDFK---------

Query:  ---------ELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKECE--VNYGS
                 E+   +G+ LG VE+++ D  +     F +R++++IDVS PL+RGI L++S G  D W  + YEKL DFCY CG +GH  +ECE      +
Subjt:  ---------ELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKECE--VNYGS

Query:  NNEELSYGPSLRKPVKLKTHEFTHGSTSTMFGR-GRG---RGADGGRGSWRYTVAEEVEKEQEGTNSRFQAAKEAGQQEIGQSSRPESDRPEVASLVRKL
         N    YG  LR  +  K+            GR GRG    G  GGRG WR    +E  ++ +G  S  + A E G   +   +       E+     K+
Subjt:  NNEELSYGPSLRKPVKLKTHEFTHGSTSTMFGR-GRG---RGADGGRGSWRYTVAEEVEKEQEGTNSRFQAAKEAGQQEIGQSSRPESDRPEVASLVRKL

Query:  TAGKCNKKCAENQEQLIR
        T+    ++   N  Q +R
Subjt:  TAGKCNKKCAENQEQLIR

XP_023921482.1 uncharacterized protein LOC112032948 [Quercus suber]1.9e-2334.83Show/hide
Query:  ENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI-IHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEEPKGDIYDEDMDF--KELVVAIGSLLGK
        +N +L KI  ++ IS+      M  +W     + I  +  +LFL +F + R K +++D  PW Y+K ++L++E  G    ++++   KE+ +AIG+ LG+
Subjt:  ENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI-IHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEEPKGDIYDEDMDF--KELVVAIGSLLGK

Query:  VEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKECEVNYGSN
        V ++D+ E    QWE  LR+K+++DV+  L RG  + + +G   RW+   YE+L +FCY CGLL H +K+C  N   N
Subjt:  VEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKECEVNYGSN

TrEMBL top hitse value%identityAlignment
A0A6J1BSZ1 uncharacterized protein LOC1110054815.5e-2429.92Show/hide
Query:  AADSIIKQLGELKVTDAKRACMYKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI--IHHMGFNLFLCKFWNTRIKGRIIDFGPW
        AA +++++    K+T  +      +   A + T   LE +++CK+ + R IS  V  + +   W  +     +  +GFN+FL  F  +  + RI+  GPW
Subjt:  AADSIIKQLGELKVTDAKRACMYKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI--IHHMGFNLFLCKFWNTRIKGRIIDFGPW

Query:  FYDKAMLLMEEPKGDIYDEDMDF------------------KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRW
         +D+A+++++ P       DMDF                  K +   +G+ +G  E V+    NN  W   LR++++ DV  PL RGI L         W
Subjt:  FYDKAMLLMEEPKGDIYDEDMDF------------------KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRW

Query:  IKVTYEKLLDFCYGCGLLGHIIKEC-EVNYGSNNEELSYGPSLR
        I + YE+L DF Y CG L HI+K+C +    S ++ L YGP LR
Subjt:  IKVTYEKLLDFCYGCGLLGHIIKEC-EVNYGSNNEELSYGPSLR

A0A6J1D765 uncharacterized protein LOC1110179022.5e-2428.93Show/hide
Query:  TKHQLENAILCKIFTNRKISLEVFCSMMPKIWN-QEHTIIHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEEPKGDIYDEDMDFK---------
        T   ++  ++ K+ T+++IS E   S+M  +W     T    +G N+++  F +   K R++  GPW ++K++L++  P       DM+F          
Subjt:  TKHQLENAILCKIFTNRKISLEVFCSMMPKIWN-QEHTIIHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEEPKGDIYDEDMDFK---------

Query:  ---------ELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKECE--VNYGS
                 E+   +G+ LG VE+++ D  +     F +R++++IDVS PL+RGI L++S G  D W  + YEKL DFCY CG +GH  +ECE      +
Subjt:  ---------ELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKECE--VNYGS

Query:  NNEELSYGPSLRKPVKLKTHEFTHGSTSTMFGR-GRG---RGADGGRGSWRYTVAEEVEKEQEGTNSRFQAAKEAGQQEIGQSSRPESDRPEVASLVRKL
         N    YG  LR  +  K+            GR GRG    G  GGRG WR    +E  ++ +G  S  + A E G   +   +       E+     K+
Subjt:  NNEELSYGPSLRKPVKLKTHEFTHGSTSTMFGR-GRG---RGADGGRGSWRYTVAEEVEKEQEGTNSRFQAAKEAGQQEIGQSSRPESDRPEVASLVRKL

Query:  TAGKCNKKCAENQEQLIR
        T+    ++   N  Q +R
Subjt:  TAGKCNKKCAENQEQLIR

A0A6J1DU55 uncharacterized protein LOC1110231351.6e-2327.46Show/hide
Query:  DSIIKQLGELKVTDAKRACMYKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI-IHHMGFNLFLCKFWNTRIKGRIIDFGPWFYD
        ++++    + K+T  +      +  DA    +  L  +++ K+   R IS +V   ++   W  EH + +  +G NLFL  F       R++  GPWF+D
Subjt:  DSIIKQLGELKVTDAKRACMYKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWNQEHTI-IHHMGFNLFLCKFWNTRIKGRIIDFGPWFYD

Query:  KAMLLMEEPKGDIYDEDMDF------------------KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKV
        KA++++++P       +++F                  K + + +G+ +G    VD +E     W  SLRI++ ID++ PL+RGI +         WI +
Subjt:  KAMLLMEEPKGDIYDEDMDF------------------KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKV

Query:  TYEKLLDFCYGCGLLGHIIKECEVNYGSNNEE----LSYGPSLR
         YE+L DFCY CG++GH   +C+  Y +  ++      YGP LR
Subjt:  TYEKLLDFCYGCGLLGHIIKECEVNYGSNNEE----LSYGPSLR

A0A6P9EGW2 uncharacterized protein LOC1183486451.2e-2330.19Show/hide
Query:  LKVTDAKRACMYKLQEDATDKTKHQLENA-ILCKIFTNRKISLEVFCSMMPKIWNQEHTI-IHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEE
        LK+T+ ++   Y  +E+ T    H + N+ ++  +  +R+++   F + M ++WN E  I    +G N FL +F N+ ++ R++   PW +D+ ++ ++E
Subjt:  LKVTDAKRACMYKLQEDATDKTKHQLENA-ILCKIFTNRKISLEVFCSMMPKIWNQEHTI-IHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEE

Query:  PKGDIYDEDMDF------------------KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDF
         KG +  +D+DF                  K     +G+  GKV  VD+DE   + W   LR+K+ ID+S PL RG  +  + GD   WI   YEKL  F
Subjt:  PKGDIYDEDMDF------------------KELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDF

Query:  CYGCGLLGHIIKECEVNYGSNN----EELSYGPSLR-KPVKLKTHEFTHGSTSTMFGRGRGRGAD
        CY CG++ H    C  ++   +      L YGP LR  P +   H  TH ++      G GR A+
Subjt:  CYGCGLLGHIIKECEVNYGSNN----EELSYGPSLR-KPVKLKTHEFTHGSTSTMFGRGRGRGAD

A0A803MME0 Uncharacterized protein2.9e-2529.64Show/hide
Query:  ADSIIKQLGELKVT-DAKRACMYKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWN-QEHTIIHHMGFNLFLCKFWNTRIKGRIIDFGPWF
        AD +IK+  +L++T D       +L  +  D +K QL  A++ K+ T +  ++E     + K+W   ++ ++  +  N F+ +F+N   K R++D  PWF
Subjt:  ADSIIKQLGELKVT-DAKRACMYKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWN-QEHTIIHHMGFNLFLCKFWNTRIKGRIIDFGPWF

Query:  YDKAMLLMEEPKGDIYDEDMDFK------------------ELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWI
        +D  +LL++E +G++   ++DF+                  + + +IG  LG    ++LD++    W   +R K+ ++V  PL+RG+FL + +    RWI
Subjt:  YDKAMLLMEEPKGDIYDEDMDFK------------------ELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGDNDRWI

Query:  KVTYEKLLDFCYGCGLLGHIIKECEVNYGSNNEE----LSYGPSLR-KPVKLK
         + YE+L DFC+ CG+L H  KEC      + E+      YGP +R  P+KL+
Subjt:  KVTYEKLLDFCYGCGLLGHIIKECEVNYGSNNEE----LSYGPSLR-KPVKLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding2.1e-0438.78Show/hide
Query:  QIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKEC
        +I  S+  + G+FL+++ G +   +K  YEKL +FC  CG+L H   EC
Subjt:  QIDVSVPLKRGIFLQSSKGDNDRWIKVTYEKLLDFCYGCGLLGHIIKEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTCTGTTTTATCTCTGAAGCGGTGGACGAAGGCCGCCTATGTGCTGAAAGGGCAGCTTGTCTGCTTGCGGCGGAATTTTTTCGGTAGATTTTCTAGTTGTTTTCT
GACGACGATGGCGACACAAGATGCTGCCGACTCCATTATAAAGCAATTGGGTGAACTGAAAGTGACAGATGCAAAAAGAGCTTGCATGTATAAGTTGCAAGAGGATGCAA
CTGACAAGACAAAACATCAACTGGAGAATGCAATACTTTGTAAAATTTTCACAAACAGGAAGATTAGTCTAGAAGTTTTTTGTTCCATGATGCCAAAGATATGGAATCAA
GAGCATACAATAATCCACCATATGGGTTTTAACTTGTTCTTATGCAAATTTTGGAATACTCGGATTAAAGGAAGAATCATTGATTTTGGACCTTGGTTTTACGACAAAGC
AATGCTTCTAATGGAAGAACCAAAAGGAGATATCTACGATGAAGATATGGACTTCAAGGAATTGGTTGTGGCAATTGGAAGTCTTCTAGGAAAAGTTGAACAAGTAGATC
TTGATGAGGATAACAATAAACAATGGGAATTCTCCCTTAGAATAAAGATTCAAATTGATGTCTCCGTCCCTTTGAAACGTGGAATTTTCTTGCAATCGAGTAAGGGTGAT
AATGATAGATGGATCAAAGTTACATATGAAAAATTACTTGATTTTTGCTATGGGTGTGGATTACTGGGGCATATAATAAAAGAATGTGAGGTCAATTATGGGTCAAATAA
TGAGGAGTTGTCCTATGGTCCTTCGTTACGGAAACCAGTAAAACTAAAAACTCATGAATTCACTCATGGATCAACCTCCACCATGTTTGGAAGAGGGAGAGGGAGGGGTG
CAGATGGTGGAAGAGGAAGCTGGAGGTATACAGTAGCTGAAGAGGTGGAAAAGGAACAAGAAGGTACCAACAGTCGGTTCCAGGCTGCAAAAGAAGCTGGTCAGCAGGAA
ATTGGTCAAAGCTCACGACCGGAATCTGACCGGCCGGAAGTTGCCTCTCTGGTGAGAAAGCTAACGGCTGGAAAATGTAATAAGAAATGCGCGGAAAATCAGGAGCAGTT
AATTAGAAACGACGAAAATAAAGGGGAAGGCGCAAAAATTGGGTCCAACGGTAATATTAAAGGGCTGTCCACAATAAACACAGATTCGGATCTCCCATTAATGGATATTG
ACAAGAATTGGGCTGGGCATTATGAAGATATGGACAACCAACATAGGGAAATAAATTCAGTTTTAAATAATACTCGTTCCATATCTTCCAAACAAGAAATCTCTGCAGAA
ATTCATGGTGTGACAGAAGCTAGTCTGGATTCCACAAATCTAGTGAAGGGAAATAAGCTTTTAAAGGAAGAAAGTGGAAAAAAGAATTTGAAAGGAAACATTACAACTTG
GAAGAGGATAGCCAGAATGATAGAACCAGAGGGGACGTGTGATAAGCAGCAACCTTTGATGGGAAGTGTAGTTGGACAGAAACATAATATTGACTTTGATGAGGTAGAGC
AGGCTAAACAGAAGGTGTTGATTATTGGTGGTCAAAACAATGCAATATCGGTGGAGGCTGCTGGACAGCCCCGCCGAGCACAATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCTCTGTTTTATCTCTGAAGCGGTGGACGAAGGCCGCCTATGTGCTGAAAGGGCAGCTTGTCTGCTTGCGGCGGAATTTTTTCGGTAGATTTTCTAGTTGTTTTCT
GACGACGATGGCGACACAAGATGCTGCCGACTCCATTATAAAGCAATTGGGTGAACTGAAAGTGACAGATGCAAAAAGAGCTTGCATGTATAAGTTGCAAGAGGATGCAA
CTGACAAGACAAAACATCAACTGGAGAATGCAATACTTTGTAAAATTTTCACAAACAGGAAGATTAGTCTAGAAGTTTTTTGTTCCATGATGCCAAAGATATGGAATCAA
GAGCATACAATAATCCACCATATGGGTTTTAACTTGTTCTTATGCAAATTTTGGAATACTCGGATTAAAGGAAGAATCATTGATTTTGGACCTTGGTTTTACGACAAAGC
AATGCTTCTAATGGAAGAACCAAAAGGAGATATCTACGATGAAGATATGGACTTCAAGGAATTGGTTGTGGCAATTGGAAGTCTTCTAGGAAAAGTTGAACAAGTAGATC
TTGATGAGGATAACAATAAACAATGGGAATTCTCCCTTAGAATAAAGATTCAAATTGATGTCTCCGTCCCTTTGAAACGTGGAATTTTCTTGCAATCGAGTAAGGGTGAT
AATGATAGATGGATCAAAGTTACATATGAAAAATTACTTGATTTTTGCTATGGGTGTGGATTACTGGGGCATATAATAAAAGAATGTGAGGTCAATTATGGGTCAAATAA
TGAGGAGTTGTCCTATGGTCCTTCGTTACGGAAACCAGTAAAACTAAAAACTCATGAATTCACTCATGGATCAACCTCCACCATGTTTGGAAGAGGGAGAGGGAGGGGTG
CAGATGGTGGAAGAGGAAGCTGGAGGTATACAGTAGCTGAAGAGGTGGAAAAGGAACAAGAAGGTACCAACAGTCGGTTCCAGGCTGCAAAAGAAGCTGGTCAGCAGGAA
ATTGGTCAAAGCTCACGACCGGAATCTGACCGGCCGGAAGTTGCCTCTCTGGTGAGAAAGCTAACGGCTGGAAAATGTAATAAGAAATGCGCGGAAAATCAGGAGCAGTT
AATTAGAAACGACGAAAATAAAGGGGAAGGCGCAAAAATTGGGTCCAACGGTAATATTAAAGGGCTGTCCACAATAAACACAGATTCGGATCTCCCATTAATGGATATTG
ACAAGAATTGGGCTGGGCATTATGAAGATATGGACAACCAACATAGGGAAATAAATTCAGTTTTAAATAATACTCGTTCCATATCTTCCAAACAAGAAATCTCTGCAGAA
ATTCATGGTGTGACAGAAGCTAGTCTGGATTCCACAAATCTAGTGAAGGGAAATAAGCTTTTAAAGGAAGAAAGTGGAAAAAAGAATTTGAAAGGAAACATTACAACTTG
GAAGAGGATAGCCAGAATGATAGAACCAGAGGGGACGTGTGATAAGCAGCAACCTTTGATGGGAAGTGTAGTTGGACAGAAACATAATATTGACTTTGATGAGGTAGAGC
AGGCTAAACAGAAGGTGTTGATTATTGGTGGTCAAAACAATGCAATATCGGTGGAGGCTGCTGGACAGCCCCGCCGAGCACAATTGTAA
Protein sequenceShow/hide protein sequence
MVSVLSLKRWTKAAYVLKGQLVCLRRNFFGRFSSCFLTTMATQDAADSIIKQLGELKVTDAKRACMYKLQEDATDKTKHQLENAILCKIFTNRKISLEVFCSMMPKIWNQ
EHTIIHHMGFNLFLCKFWNTRIKGRIIDFGPWFYDKAMLLMEEPKGDIYDEDMDFKELVVAIGSLLGKVEQVDLDEDNNKQWEFSLRIKIQIDVSVPLKRGIFLQSSKGD
NDRWIKVTYEKLLDFCYGCGLLGHIIKECEVNYGSNNEELSYGPSLRKPVKLKTHEFTHGSTSTMFGRGRGRGADGGRGSWRYTVAEEVEKEQEGTNSRFQAAKEAGQQE
IGQSSRPESDRPEVASLVRKLTAGKCNKKCAENQEQLIRNDENKGEGAKIGSNGNIKGLSTINTDSDLPLMDIDKNWAGHYEDMDNQHREINSVLNNTRSISSKQEISAE
IHGVTEASLDSTNLVKGNKLLKEESGKKNLKGNITTWKRIARMIEPEGTCDKQQPLMGSVVGQKHNIDFDEVEQAKQKVLIIGGQNNAISVEAAGQPRRAQL