; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015844 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015844
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr12:27077886..27078905
RNA-Seq ExpressionLag0015844
SyntenyLag0015844
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006484927.1 uncharacterized protein LOC102626623 [Citrus sinensis]9.9e-4236.67Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW
        M TEELI   K  +L++ E D    +   +  D +   +H LVGK++ +R +    ++ AM  AWKT K+  VE  G N+F FKF +++D+  VL  GPW
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW

Query:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV
         F ++L++L EP          F  T FW+++  +P+    K     +G+KIG+  EV++++EG   G   R+R+ +DITKPL R  +LK +G E D  +
Subjt:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV

Query:  TIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLK----FQGLRANRQQDSPTRDSNDKD
         I+Y+RLPDFCF CG IGH  +EC    + +GQ ++ L +GGW+K    F+  + +R ++  +R  +  D
Subjt:  TIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLK----FQGLRANRQQDSPTRDSNDKD

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.5e-4538.15Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTR-KKFDVEGSGRNMFAFKFQSQEDRNWVLHNGP
        MA   L+ EWK F L   E      ++S  +E     +  SL+ KL+S R+I+ + +KN +  AWK   K F V+  G N+F F F    DRN +L  GP
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTR-KKFDVEGSGRNMFAFKFQSQEDRNWVLHNGP

Query:  WLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCW
        W F R+L+I++ P    +  ++ F     W+   +L +   NK  A  +G  IG F +V+S      WG  +R+RVR D+ KPL RG  L  DG  G CW
Subjt:  WLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCW

Query:  VTIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQG
        + I+YERLPDF ++CG + H +K+CS    +     K+L++G WL+FQG
Subjt:  VTIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]5.6e-5336.15Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW
        M  E L+ +W++F L   E +    ++   V+  +  +++SLVGKL++ R I+   +   +  AWK   +  VE  G+N+F F F  + D N V+  GPW
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW

Query:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV
         F ++L++L++P  +   SEL FN+  FW+ L +LP+ + NK  A  +G  IG F++VD  ++G  WG S+RIRV +DITKPL RG  +  DG  G CW+
Subjt:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV

Query:  TIRYERLPDFCFNCGCIGHTVKECSGEH-KEEGQGRKSLEFGGWLKFQGLRANRQQ----DSPTRDSNDKDEFKDKDEQNINST--EITRNNHDDG
         I+YERLPDFC+ CG IGH+  +C   +   +   R + E+G WL+F G +A  Q+     SP R+ +      +  E+ +  T  +++   + DG
Subjt:  TIRYERLPDFCFNCGCIGHTVKECSGEH-KEEGQGRKSLEFGGWLKFQGLRANRQQ----DSPTRDSNDKDEFKDKDEQNINST--EITRNNHDDG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]6.2e-4436.94Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKK-FDVEGSGRNMFAFKFQSQEDRNWVLHNGP
        MAT +L+ EWK F L   E++T   +++       +++   LVGKL   R I    +KN M  AWK     F+V+  G N+F F F    DRN +  +GP
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKK-FDVEGSGRNMFAFKFQSQEDRNWVLHNGP

Query:  WLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCW
        W F R+LV++ +P   +  SEL F K   W+R  +LP+G   +  A  +G  +G F E D +     WG ++R+RV LDI+KPL RG  L  DG  G  W
Subjt:  WLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCW

Query:  VTIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQGL--RANRQQDSPTRDSNDK
        + I+YERLPDFC++CG                   RK  ++G WL++QG       Q   P  D  DK
Subjt:  VTIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQGL--RANRQQDSPTRDSNDK

XP_024033132.1 uncharacterized protein LOC112095437 [Citrus clementina]9.2e-4036.63Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW
        M TEELI + +  +L   E D F     ++ E     V+  LVGK++ +R +    +K A++ AW+T   F VE  G N+F FKF S+ D+  V + GPW
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW

Query:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV
         F R+L++L+EPK      + SF+   FW+R+ N+P+   +      +G ++G+  ++  + +G  +GE +RI+V ++ITKPL +  +LK +  E D  +
Subjt:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV

Query:  TIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQGLRANRQQDSPTRDSNDKDEFKDKDE
         + YERLPDFCF CGCIGH  +EC  E+K  GQ +++L FG WLK   L A+R + +  ++   +   K ++E
Subjt:  TIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQGLRANRQQDSPTRDSNDKDEFKDKDE

TrEMBL top hitse value%identityAlignment
A0A5C7GU64 CCHC-type domain-containing protein6.5e-3934.96Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW
        M+  E+   ++  SL E E     ++    ++D    V   LVGK+++ + +   A K  +   W    + +VE  G N F F F ++E RN V + GPW
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW

Query:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV
        LFG+SL++LE+PK     ++L FN+ +FW+++ ++P+   N+     + + IGE +E+ +E     WG+ +R++V++DITKPL R   +K    E    V
Subjt:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV

Query:  TIRYERLPDFCFNCGCIGHTVKECSGEH-KEEGQGRKSLEFGGWLK
         ++YERLPDFCF CG IGH+V+EC  E  K      +  +FG W++
Subjt:  TIRYERLPDFCFNCGCIGHTVKECSGEH-KEEGQGRKSLEFGGWLK

A0A5C7IJL3 CCHC-type domain-containing protein2.9e-3933.33Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW
        M+  E+    +  S+++ +++   ++   + E+    V H LVGK++S + +   A  + +   W    K ++E  G N+F F FQ+ EDR+ V   GPW
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW

Query:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV
         F +SL++LE P+     ++L FNK +FW+++ ++P+   N+ +A  + ++IGE +E+ SE     WG+ +R++VR+DI++PL R   L  D       V
Subjt:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV

Query:  TIRYERLPDFCFNCGCIGHTVKECSG-EHKEEGQGRKSLEFGGWLK
         ++YER+P+FCF CG +GH + ECS  E K+E     +  FG W++
Subjt:  TIRYERLPDFCFNCGCIGHTVKECSG-EHKEEGQGRKSLEFGGWLK

A0A6J1BSZ1 uncharacterized protein LOC1110054817.1e-4638.15Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTR-KKFDVEGSGRNMFAFKFQSQEDRNWVLHNGP
        MA   L+ EWK F L   E      ++S  +E     +  SL+ KL+S R+I+ + +KN +  AWK   K F V+  G N+F F F    DRN +L  GP
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTR-KKFDVEGSGRNMFAFKFQSQEDRNWVLHNGP

Query:  WLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCW
        W F R+L+I++ P    +  ++ F     W+   +L +   NK  A  +G  IG F +V+S      WG  +R+RVR D+ KPL RG  L  DG  G CW
Subjt:  WLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCW

Query:  VTIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQG
        + I+YERLPDF ++CG + H +K+CS    +     K+L++G WL+FQG
Subjt:  VTIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQG

A0A6J1DU55 uncharacterized protein LOC1110231352.7e-5336.15Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW
        M  E L+ +W++F L   E +    ++   V+  +  +++SLVGKL++ R I+   +   +  AWK   +  VE  G+N+F F F  + D N V+  GPW
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPW

Query:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV
         F ++L++L++P  +   SEL FN+  FW+ L +LP+ + NK  A  +G  IG F++VD  ++G  WG S+RIRV +DITKPL RG  +  DG  G CW+
Subjt:  LFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWV

Query:  TIRYERLPDFCFNCGCIGHTVKECSGEH-KEEGQGRKSLEFGGWLKFQGLRANRQQ----DSPTRDSNDKDEFKDKDEQNINST--EITRNNHDDG
         I+YERLPDFC+ CG IGH+  +C   +   +   R + E+G WL+F G +A  Q+     SP R+ +      +  E+ +  T  +++   + DG
Subjt:  TIRYERLPDFCFNCGCIGHTVKECSGEH-KEEGQGRKSLEFGGWLKFQGLRANRQQ----DSPTRDSNDKDEFKDKDEQNINST--EITRNNHDDG

A0A6J1DX30 uncharacterized protein LOC1110248743.0e-4436.94Show/hide
Query:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKK-FDVEGSGRNMFAFKFQSQEDRNWVLHNGP
        MAT +L+ EWK F L   E++T   +++       +++   LVGKL   R I    +KN M  AWK     F+V+  G N+F F F    DRN +  +GP
Subjt:  MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKK-FDVEGSGRNMFAFKFQSQEDRNWVLHNGP

Query:  WLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCW
        W F R+LV++ +P   +  SEL F K   W+R  +LP+G   +  A  +G  +G F E D +     WG ++R+RV LDI+KPL RG  L  DG  G  W
Subjt:  WLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCW

Query:  VTIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQGL--RANRQQDSPTRDSNDK
        + I+YERLPDFC++CG                   RK  ++G WL++QG       Q   P  D  DK
Subjt:  VTIRYERLPDFCFNCGCIGHTVKECSGEHKEEGQGRKSLEFGGWLKFQGL--RANRQQDSPTRDSNDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding5.3e-0923.12Show/hide
Query:  KKFDVEGSGR----NMFAFKFQSQEDRNWVLHNGPWLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEG
        K  D E  GR    +   F FQS+E    +L  GPW F   + +++  +     S+  F +  FW+++  +P+ F        +G+++G FLE +  ++ 
Subjt:  KKFDVEGSGR----NMFAFKFQSQEDRNWVLHNGPWLFGRSLVILEEPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEG

Query:  LHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWVTIRYERLPDFCFNCGCIGHTVKEC
                    + + K                     +YE+L +FC  CG + H   EC
Subjt:  LHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWVTIRYERLPDFCFNCGCIGHTVKEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACTGAAGAATTGATCAATGAATGGAAGAGATTCAGTCTTCGGGAAAGCGAGAAGGATACATTTTTCAAGCTCGAATCAAAGATTGTTGAAGATGTTAAAAATCA
AGTAAGCCACTCGTTAGTTGGTAAGCTTATATCAAGCAGAACAATTGCAGTCTCGGCCATCAAGAACGCTATGAACGGAGCTTGGAAGACCAGAAAGAAGTTTGATGTTG
AAGGGTCCGGTAGAAACATGTTCGCCTTCAAGTTCCAGAGTCAAGAAGACAGAAATTGGGTTCTCCACAATGGTCCGTGGCTTTTTGGCAGGAGCTTGGTAATTTTGGAG
GAGCCTAAAGTCAATCTTCGAACGTCAGAACTAAGTTTTAACAAAACAGAATTCTGGCTACGTTTGATCAATCTCCCAGTCGGATTCAAGAATAAGCACGCAGCGGCAAT
GATGGGTCAAAAAATTGGAGAATTTCTTGAGGTTGATAGCGAAAAAGAAGGGCTACATTGGGGAGAAAGCATGAGAATCAGAGTTCGTCTGGACATAACAAAACCTCTTC
TACGAGGATTTATGCTAAAAGCAGACGGAATTGAAGGTGATTGCTGGGTCACAATCAGATATGAGAGGCTCCCGGATTTCTGCTTCAATTGTGGCTGCATTGGACATACT
GTAAAGGAATGTTCTGGGGAACATAAGGAAGAGGGACAAGGGCGGAAAAGCTTGGAATTTGGGGGATGGCTGAAATTTCAGGGATTGAGAGCAAATAGGCAACAAGACTC
TCCCACCCGTGACTCCAACGACAAAGATGAATTCAAGGACAAAGATGAGCAAAATATTAATTCGACTGAGATTACAAGAAACAATCATGATGATGGCAGGCCGATTGGAA
GAAGCTTGGATCCAAGAGAACCATTTGTTCCAAGACAATCTGATTTAGAAAATGTGGTTGCTCCTGACATAGTTATGAGAAAAGATAAAGGAAAAATGGTGGATCTGAAT
GATACAAACATGGAGAATTGGGATTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAACTGAAGAATTGATCAATGAATGGAAGAGATTCAGTCTTCGGGAAAGCGAGAAGGATACATTTTTCAAGCTCGAATCAAAGATTGTTGAAGATGTTAAAAATCA
AGTAAGCCACTCGTTAGTTGGTAAGCTTATATCAAGCAGAACAATTGCAGTCTCGGCCATCAAGAACGCTATGAACGGAGCTTGGAAGACCAGAAAGAAGTTTGATGTTG
AAGGGTCCGGTAGAAACATGTTCGCCTTCAAGTTCCAGAGTCAAGAAGACAGAAATTGGGTTCTCCACAATGGTCCGTGGCTTTTTGGCAGGAGCTTGGTAATTTTGGAG
GAGCCTAAAGTCAATCTTCGAACGTCAGAACTAAGTTTTAACAAAACAGAATTCTGGCTACGTTTGATCAATCTCCCAGTCGGATTCAAGAATAAGCACGCAGCGGCAAT
GATGGGTCAAAAAATTGGAGAATTTCTTGAGGTTGATAGCGAAAAAGAAGGGCTACATTGGGGAGAAAGCATGAGAATCAGAGTTCGTCTGGACATAACAAAACCTCTTC
TACGAGGATTTATGCTAAAAGCAGACGGAATTGAAGGTGATTGCTGGGTCACAATCAGATATGAGAGGCTCCCGGATTTCTGCTTCAATTGTGGCTGCATTGGACATACT
GTAAAGGAATGTTCTGGGGAACATAAGGAAGAGGGACAAGGGCGGAAAAGCTTGGAATTTGGGGGATGGCTGAAATTTCAGGGATTGAGAGCAAATAGGCAACAAGACTC
TCCCACCCGTGACTCCAACGACAAAGATGAATTCAAGGACAAAGATGAGCAAAATATTAATTCGACTGAGATTACAAGAAACAATCATGATGATGGCAGGCCGATTGGAA
GAAGCTTGGATCCAAGAGAACCATTTGTTCCAAGACAATCTGATTTAGAAAATGTGGTTGCTCCTGACATAGTTATGAGAAAAGATAAAGGAAAAATGGTGGATCTGAAT
GATACAAACATGGAGAATTGGGATTCTTGA
Protein sequenceShow/hide protein sequence
MATEELINEWKRFSLRESEKDTFFKLESKIVEDVKNQVSHSLVGKLISSRTIAVSAIKNAMNGAWKTRKKFDVEGSGRNMFAFKFQSQEDRNWVLHNGPWLFGRSLVILE
EPKVNLRTSELSFNKTEFWLRLINLPVGFKNKHAAAMMGQKIGEFLEVDSEKEGLHWGESMRIRVRLDITKPLLRGFMLKADGIEGDCWVTIRYERLPDFCFNCGCIGHT
VKECSGEHKEEGQGRKSLEFGGWLKFQGLRANRQQDSPTRDSNDKDEFKDKDEQNINSTEITRNNHDDGRPIGRSLDPREPFVPRQSDLENVVAPDIVMRKDKGKMVDLN
DTNMENWDS