; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030184 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030184
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold6:14452829..14460579
RNA-Seq ExpressionSpg030184
SyntenySpg030184
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]4.9e-2532.21Show/hide
Query:  DEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQ-EQTIIDHVGANIFLCKFKNARIKGFIQEAEPWFYNKSLLVFEEPRGDIN
        DE A+V  + E+ I   ++ +   L+ K LT KK+N E F+G++ +IW Q  Q  ++ VG N F+  F N   +  +    PW + KSL+V E+P+G  +
Subjt:  DEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQ-EQTIIDHVGANIFLCKFKNARIKGFIQEAEPWFYNKSLLVFEEPRGDIN

Query:  AEDMDFRNA------------------ATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWIGITYEKLPDFCYGCGW
           + F  A                     +   +G+V  V+I  E+    G  + +K+Q+D+ KPLKR + +K G       + + YE+LPDFC+ CG 
Subjt:  AEDMDFRNA------------------ATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWIGITYEKLPDFCYGCGW

Query:  LGHTIREC
        +GH++REC
Subjt:  LGHTIREC

TXG53376.1 hypothetical protein EZV62_022545 [Acer yangbiense]7.1e-2430.94Show/hide
Query:  AEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQ-EQTIIDHVGANIFLCKFKNARIKGFIQEAEPWFY
        A E+A    +L + A+E   V  + E       K +   L+ K L++KK+N E F+G++ +IW    Q  ++ VG N+F+  F N   +  + +  PW +
Subjt:  AEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQ-EQTIIDHVGANIFLCKFKNARIKGFIQEAEPWFY

Query:  NKSLLVFEEPRGDINAEDMDF------------------RNAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWIG
          SL+V E+  G+ N   + F                  R  A  +   +G+V  V+I  E+    G  +W+K++ID+ KPLKR + +K G       + 
Subjt:  NKSLLVFEEPRGDINAEDMDF------------------RNAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWIG

Query:  ITYEKLPDFCYGCGWLGHTIREC
        + YEKLPDFCY CG +G+ ++EC
Subjt:  ITYEKLPDFCYGCGWLGHTIREC

XP_015382889.1 uncharacterized protein LOC102626150 [Citrus sinensis]9.2e-2431.6Show/hide
Query:  KSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQEQTI-IDHVGANIFLCKFKNARIKGFIQEAEPWFYNKSLLVFEEPRGDINAEDMDF---------
        K EK +   L+ K L  +K+++E  +  M R+W   + + I+ +G N+F+ KF +   K  I    PW ++++L+V  EP G  + +   F         
Subjt:  KSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQEQTI-IDHVGANIFLCKFKNARIKGFIQEAEPWFYNKSLLVFEEPRGDINAEDMDF---------

Query:  ---------RNAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKW-IGITYEKLPDFCYGCGWLGHTIRECEVCVNS
                 +  AT++G+++GKVE+VD ++  E  +G  L ++I +D+ KPLK+ I ++      E   + + YE+LPDFC+ CG +GH  REC   ++ 
Subjt:  ---------RNAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKW-IGITYEKLPDFCYGCGWLGHTIRECEVCVNS

Query:  KEEDLPYGPRLR
         +++L YGP LR
Subjt:  KEEDLPYGPRLR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.2e-2330.04Show/hide
Query:  AEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQE--QTIIDHVGANIFLCKFKNARIKGFIQEAEPWF
        A  L  +  + K+T++E      +  + ++ + K L  +L+CK L+ + I+  V +  +   W  +     +D +G NIFL  F  +  +  I    PW 
Subjt:  AEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQE--QTIIDHVGANIFLCKFKNARIKGFIQEAEPWF

Query:  YNKSLLVFEEPRGDINAEDMDFRNA------------------ATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWI
        ++++L++ + P       DMDFRN                   AT +G+ +G  E V+         G  L ++++ DV KPL RGI +      G  WI
Subjt:  YNKSLLVFEEPRGDINAEDMDFRNA------------------ATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWI

Query:  GITYEKLPDFCYGCGWLGHTIREC-EVCVNSKEEDLPYGPRLR
         I YE+LPDF Y CG L H +++C + CV+S  ++L YGP LR
Subjt:  GITYEKLPDFCYGCGWLGHTIREC-EVCVNSKEEDLPYGPRLR

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]7.1e-2432.53Show/hide
Query:  EELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLI--NALLC---KSLTHKKINLEVFRGMMPRIWG-QEQTIIDHVGANIFLCKFKNARIKGFIQEAE
        +E+       K T DE  +V       ID+ +  L   N  LC   K  T K+I+ E  R +M  +W     T  + +G NI++  FK+   K  +  + 
Subjt:  EELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLI--NALLC---KSLTHKKINLEVFRGMMPRIWG-QEQTIIDHVGANIFLCKFKNARIKGFIQEAE

Query:  PWFYNKSLLVFEEPRGDINAEDMDFR------------------NAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGE
        PW +NKSLLV   P       DM+F                     A  +G+ LG VE+++ D   +   G  + ++++IDV KPL+RGI +K+ ++G +
Subjt:  PWFYNKSLLVFEEPRGDINAEDMDFR------------------NAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGE

Query:  KWIGITYEKLPDFCYGCGWLGHTIRECE-----VCVNSKEEDLPYGPRLREHVNLIGREINFFPRYVNYFAGR---------GRGRMGDSWR
         W  + YEKLPDFCY CG +GH+ RECE     V  NS E+   YG  LR    L+ + ++     V +  GR         GRG  GD WR
Subjt:  KWIGITYEKLPDFCYGCGWLGHTIRECE-----VCVNSKEEDLPYGPRLREHVNLIGREINFFPRYVNYFAGR---------GRGRMGDSWR

TrEMBL top hitse value%identityAlignment
A0A5C7GU64 CCHC-type domain-containing protein2.4e-2532.21Show/hide
Query:  DEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQ-EQTIIDHVGANIFLCKFKNARIKGFIQEAEPWFYNKSLLVFEEPRGDIN
        DE A+V  + E+ I   ++ +   L+ K LT KK+N E F+G++ +IW Q  Q  ++ VG N F+  F N   +  +    PW + KSL+V E+P+G  +
Subjt:  DEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQ-EQTIIDHVGANIFLCKFKNARIKGFIQEAEPWFYNKSLLVFEEPRGDIN

Query:  AEDMDFRNA------------------ATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWIGITYEKLPDFCYGCGW
           + F  A                     +   +G+V  V+I  E+    G  + +K+Q+D+ KPLKR + +K G       + + YE+LPDFC+ CG 
Subjt:  AEDMDFRNA------------------ATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWIGITYEKLPDFCYGCGW

Query:  LGHTIREC
        +GH++REC
Subjt:  LGHTIREC

A0A5C7HB59 Uncharacterized protein3.4e-2430.94Show/hide
Query:  AEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQ-EQTIIDHVGANIFLCKFKNARIKGFIQEAEPWFY
        A E+A    +L + A+E   V  + E       K +   L+ K L++KK+N E F+G++ +IW    Q  ++ VG N+F+  F N   +  + +  PW +
Subjt:  AEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQ-EQTIIDHVGANIFLCKFKNARIKGFIQEAEPWFY

Query:  NKSLLVFEEPRGDINAEDMDF------------------RNAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWIG
          SL+V E+  G+ N   + F                  R  A  +   +G+V  V+I  E+    G  +W+K++ID+ KPLKR + +K G       + 
Subjt:  NKSLLVFEEPRGDINAEDMDF------------------RNAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWIG

Query:  ITYEKLPDFCYGCGWLGHTIREC
        + YEKLPDFCY CG +G+ ++EC
Subjt:  ITYEKLPDFCYGCGWLGHTIREC

A0A6J1BSZ1 uncharacterized protein LOC1110054815.8e-2430.04Show/hide
Query:  AEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQE--QTIIDHVGANIFLCKFKNARIKGFIQEAEPWF
        A  L  +  + K+T++E      +  + ++ + K L  +L+CK L+ + I+  V +  +   W  +     +D +G NIFL  F  +  +  I    PW 
Subjt:  AEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQE--QTIIDHVGANIFLCKFKNARIKGFIQEAEPWF

Query:  YNKSLLVFEEPRGDINAEDMDFRNA------------------ATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWI
        ++++L++ + P       DMDFRN                   AT +G+ +G  E V+         G  L ++++ DV KPL RGI +      G  WI
Subjt:  YNKSLLVFEEPRGDINAEDMDFRNA------------------ATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWI

Query:  GITYEKLPDFCYGCGWLGHTIREC-EVCVNSKEEDLPYGPRLR
         I YE+LPDF Y CG L H +++C + CV+S  ++L YGP LR
Subjt:  GITYEKLPDFCYGCGWLGHTIREC-EVCVNSKEEDLPYGPRLR

A0A6J1D765 uncharacterized protein LOC1110179023.4e-2432.53Show/hide
Query:  EELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLI--NALLC---KSLTHKKINLEVFRGMMPRIWG-QEQTIIDHVGANIFLCKFKNARIKGFIQEAE
        +E+       K T DE  +V       ID+ +  L   N  LC   K  T K+I+ E  R +M  +W     T  + +G NI++  FK+   K  +  + 
Subjt:  EELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLI--NALLC---KSLTHKKINLEVFRGMMPRIWG-QEQTIIDHVGANIFLCKFKNARIKGFIQEAE

Query:  PWFYNKSLLVFEEPRGDINAEDMDFR------------------NAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGE
        PW +NKSLLV   P       DM+F                     A  +G+ LG VE+++ D   +   G  + ++++IDV KPL+RGI +K+ ++G +
Subjt:  PWFYNKSLLVFEEPRGDINAEDMDFR------------------NAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGE

Query:  KWIGITYEKLPDFCYGCGWLGHTIRECE-----VCVNSKEEDLPYGPRLREHVNLIGREINFFPRYVNYFAGR---------GRGRMGDSWR
         W  + YEKLPDFCY CG +GH+ RECE     V  NS E+   YG  LR    L+ + ++     V +  GR         GRG  GD WR
Subjt:  KWIGITYEKLPDFCYGCGWLGHTIRECE-----VCVNSKEEDLPYGPRLREHVNLIGREINFFPRYVNYFAGR---------GRGRMGDSWR

A0A6J1DU55 uncharacterized protein LOC1110231351.0e-2327.36Show/hide
Query:  DAEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQE-QTIIDHVGANIFLCKFKNARIKGFIQEAEPWF
        D E L A     K+T++E      +  + +  +E+ L  +L+ K L  + I+ +V   ++   W  E Q  ++ +G N+FL  F        + +  PWF
Subjt:  DAEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRIWGQE-QTIIDHVGANIFLCKFKNARIKGFIQEAEPWF

Query:  YNKSLLVFEEPRGDINAEDMDF------------------RNAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWI
        ++K+L+V ++P    N  +++F                  +  A  +G+ +G    VD +E+     G SL I++ ID+ KPL+RGI +      G  WI
Subjt:  YNKSLLVFEEPRGDINAEDMDF------------------RNAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSGTNGGEKWI

Query:  GITYEKLPDFCYGCGWLGHTIRECEVCVNSKEED----LPYGPRLREHVNLIGREINFFPRYVNYFAGRGRGRMGDSWRNNIYVDEDDGTNGMQNKESET
         I YE+LPDFCY CG +GH+  +C+    + ++D      YGP L               R+V   AG  +GR G S       ++  G++ M +KE   
Subjt:  GITYEKLPDFCYGCGWLGHTIRECEVCVNSKEED----LPYGPRLREHVNLIGREINFFPRYVNYFAGRGRGRMGDSWRNNIYVDEDDGTNGMQNKESET

Query:  NKVQAYRKKPLSPEVHED
         +     K+ LS + ++D
Subjt:  NKVQAYRKKPLSPEVHED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein3.0e-0423.66Show/hide
Query:  LIWR----PNHEEAQRKQSTPSNAHLLQHEETHD-------ERCTSESVREEVGISLRWMEPPVGMWKLNSDASWCAKMNRGGIGWILRRWDGTPLTAGY
        LIWR     N+    + + +PS   L    ETHD        + T    R+     + W  PP    K N DA +  +      GWI+R   GTP++ G 
Subjt:  LIWR----PNHEEAQRKQSTPSNAHLLQHEETHD-------ERCTSESVREEVGISLRWMEPPVGMWKLNSDASWCAKMNRGGIGWILRRWDGTPLTAGY

Query:  KVVRQIWKLSWLEALALVEGMKSV-SRSSPKLIIELDSVQVVHQLQGKHKDLTGLSLFIAEVKRLLSEFQVHEIRHINRKYNGMAH
          +         E  AL+  ++    R   ++ +E D   +++ + G     + L+  + ++    ++F   +   I RK N +AH
Subjt:  KVVRQIWKLSWLEALALVEGMKSV-SRSSPKLIIELDSVQVVHQLQGKHKDLTGLSLFIAEVKRLLSEFQVHEIRHINRKYNGMAH

AT4G29090.1 Ribonuclease H-like superfamily protein4.7e-1024.11Show/hide
Query:  LENSWATQDYMEYFWRDKDGRIENSTLCRSLIV---CWQIWQQRNEVVHEHTTVDIEHLQEKIYRYLEEFHIKTEGEETNLIWRPNHEEAQRKQSTPSNA
        L   WA   Y+  +W    G         S +V    W++W+ RNE+V      + + +  +    LEE+ I+TE E      + N              
Subjt:  LENSWATQDYMEYFWRDKDGRIENSTLCRSLIV---CWQIWQQRNEVVHEHTTVDIEHLQEKIYRYLEEFHIKTEGEETNLIWRPNHEEAQRKQSTPSNA

Query:  HLLQHEETHDERCTSESVREEVGISLRWMEPPVGMWKLNSDASWCAKMNRGGIGWILRRWDGTPLTAGYKVVRQIWKLSWLEALALVEGMKSVSRSSPKL
                          R   G   RW  PP    K N+DA+W     R GIGW+LR   G     G + + ++  +   E  A+   + S+SR     
Subjt:  HLLQHEETHDERCTSESVREEVGISLRWMEPPVGMWKLNSDASWCAKMNRGGIGWILRRWDGTPLTAGYKVVRQIWKLSWLEALALVEGMKSVSRSSPKL

Query:  IIELDSVQVVHQLQGKHKDLTGLSLFIAEVKRLLSEFQVHEIRHINRKYNGMA
        +I     QV+ ++    +    L   I +++RLLS+F   +   I R+ N +A
Subjt:  IIELDSVQVVHQLQGKHKDLTGLSLFIAEVKRLLSEFQVHEIRHINRKYNGMA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.8e-0520.59Show/hide
Query:  FWRDKDGRIENSTLCRSLIVCWQIWQQRNEVVHEHTTVDIEHLQEKIYRYLEEFHIKTEGEETNLIWRPNHEEAQRKQSTPSNAHLLQHEETHDERCTSE
        F    D R++  T      + W+IW+  N++V  HT    +   E      +E+   T           N ++   + + PS                  
Subjt:  FWRDKDGRIENSTLCRSLIVCWQIWQQRNEVVHEHTTVDIEHLQEKIYRYLEEFHIKTEGEETNLIWRPNHEEAQRKQSTPSNAHLLQHEETHDERCTSE

Query:  SVREEVGISLRWMEPPVGMWKLNSDASWCAKMNRGGIGWILRRWDGTPLTAGYKVVRQIWKLSWLEALALVEGMK-SVSRSSPKLIIELDSVQVVHQLQG
                + +W  P     K N DAS   +    G+GWILR   GT +  G    +        E   L+  ++ S      K+I E D+  +   +  
Subjt:  SVREEVGISLRWMEPPVGMWKLNSDASWCAKMNRGGIGWILRRWDGTPLTAGYKVVRQIWKLSWLEALALVEGMK-SVSRSSPKLIIELDSVQVVHQLQG

Query:  KHKDLTGLSLFIAEVKRLLSEFQVHEIRHINRKYNGMA
        K  +   L  F+  ++  +  F+  E    +R+ NG A
Subjt:  KHKDLTGLSLFIAEVKRLLSEFQVHEIRHINRKYNGMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGAAAAAAAAAAACTTTGTCGGCGGTGGCCGGCCGTCGACAGCAGGCGGCCGGTCGCCGGAGAAGAAGAAGAAATGGGAGGGAGAAAAGGGAGGGAGAGATGA
GAGTGTGCTGGTAAAGATGGATCCAAAAGAAGATGCTGAAGAACTAGCAGCACAATTAACAAGCTTGAAGGTCACGGCTGATGAAAAGGCAAGTGTCTTTCACCTGCAGG
AGAATGACATAGATAAGTCAGAGAAAAAATTGATCAATGCACTGCTTTGCAAGAGCTTAACTCACAAAAAGATCAACCTAGAGGTCTTCAGGGGAATGATGCCTCGAATA
TGGGGGCAAGAACAAACAATTATAGATCATGTGGGCGCAAATATTTTTCTTTGCAAATTCAAAAATGCAAGAATAAAGGGTTTCATACAAGAAGCTGAACCATGGTTTTA
TAATAAATCACTCCTTGTATTTGAAGAACCAAGAGGAGATATCAATGCAGAGGATATGGATTTCAGGAATGCGGCGACGGAAATTGGAAGCTTACTGGGGAAGGTTGAAC
AAGTAGATATAGATGAAGAGACGGAACCAAAAATGGGATGCTCTCTCTGGATCAAAATTCAAATCGATGTGAAAAAACCTCTGAAAAGAGGAATATTCATGAAATCCGGA
ACCAATGGGGGTGAAAAATGGATTGGAATCACATACGAAAAGCTACCCGACTTCTGTTACGGATGTGGATGGCTAGGACATACAATCAGGGAATGCGAAGTCTGTGTCAA
CTCAAAAGAAGAAGATCTGCCATATGGTCCTAGGTTACGTGAACATGTTAATCTTATAGGAAGAGAAATTAATTTTTTTCCAAGGTATGTGAATTATTTCGCTGGCAGAG
GAAGGGGCAGAATGGGGGATAGCTGGAGAAATAACATTTATGTGGATGAAGACGATGGAACAAATGGAATGCAGAACAAAGAATCAGAAACAAACAAGGTCCAAGCTTAC
CGGAAAAAGCCACTTTCGCCGGAAGTGCATGAAGATCCGGTCGGAAGAGGTCAGAGGGATCAAACGGTAAAAAATTCAGAAAAGGAAAGAGAGAAGCCAGAGATTGATGA
GGGCAATAATGTTTCAATTAATGCTGATGGAATTTCAGGGGATATCATGACCGAAAAGGAAACTGAAAATATGGGAATAAATAGCACCAACGGACAGAATGATATAGGAA
TGAATAGTACCAACGGACAGAAGGAAGGATTTGCAATGATGGAGGTAGATCTAAATGGGCCTGAACAATTAAAGGACAGTACAAGCAGTAACTCTCAGGATGTCTCAAAA
CCTATAGTAGAGCGGGATAAAAACAAGGCAAAAATGGGACAAGAGGAGACCGACAATACAAAACAAAAAAAAAGAAAACAACATGGAAGAGGAAGGAGAGGAAATATGAT
CCATAAGAGGGCTATTGCTAGAAAGGAGCGAGAGATTCAAGAGCTTAGTAATGGAAATGATCAAGATAGCATGCTAGAGAACTCATGGGCTACACAAGATTACATGGAAT
ATTTCTGGCGGGACAAAGATGGAAGGATCGAAAATAGCACATTATGCAGGAGCTTGATAGTGTGCTGGCAGATTTGGCAACAGAGGAATGAAGTAGTGCATGAACATACC
ACAGTAGATATAGAGCATTTGCAGGAAAAAATCTATCGGTATCTAGAGGAATTTCACATTAAAACAGAAGGCGAAGAGACGAACCTGATCTGGAGACCTAATCATGAGGA
AGCTCAACGGAAGCAGTCGACGCCAAGCAACGCTCACCTGTTGCAACACGAGGAAACCCATGACGAGCGCTGCACCAGTGAATCCGTGAGAGAAGAAGTGGGTATCTCCT
TGCGATGGATGGAACCGCCGGTGGGGATGTGGAAGCTAAATAGCGATGCCTCATGGTGTGCGAAGATGAACCGTGGTGGAATAGGGTGGATTTTACGCCGATGGGATGGT
ACTCCTTTGACAGCAGGATACAAAGTGGTTAGACAAATCTGGAAACTCAGTTGGTTAGAGGCCTTAGCGTTGGTGGAAGGGATGAAATCCGTGAGTCGATCCTCTCCTAA
GTTGATTATCGAGCTTGACTCAGTGCAGGTGGTACATCAGCTTCAGGGGAAACACAAAGATCTCACCGGACTTTCTCTCTTTATCGCAGAAGTTAAGCGCCTTCTATCTG
AGTTTCAGGTTCACGAGATTAGGCATATCAATAGGAAGTACAATGGCATGGCACACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGAAAAAAAAAAACTTTGTCGGCGGTGGCCGGCCGTCGACAGCAGGCGGCCGGTCGCCGGAGAAGAAGAAGAAATGGGAGGGAGAAAAGGGAGGGAGAGATGA
GAGTGTGCTGGTAAAGATGGATCCAAAAGAAGATGCTGAAGAACTAGCAGCACAATTAACAAGCTTGAAGGTCACGGCTGATGAAAAGGCAAGTGTCTTTCACCTGCAGG
AGAATGACATAGATAAGTCAGAGAAAAAATTGATCAATGCACTGCTTTGCAAGAGCTTAACTCACAAAAAGATCAACCTAGAGGTCTTCAGGGGAATGATGCCTCGAATA
TGGGGGCAAGAACAAACAATTATAGATCATGTGGGCGCAAATATTTTTCTTTGCAAATTCAAAAATGCAAGAATAAAGGGTTTCATACAAGAAGCTGAACCATGGTTTTA
TAATAAATCACTCCTTGTATTTGAAGAACCAAGAGGAGATATCAATGCAGAGGATATGGATTTCAGGAATGCGGCGACGGAAATTGGAAGCTTACTGGGGAAGGTTGAAC
AAGTAGATATAGATGAAGAGACGGAACCAAAAATGGGATGCTCTCTCTGGATCAAAATTCAAATCGATGTGAAAAAACCTCTGAAAAGAGGAATATTCATGAAATCCGGA
ACCAATGGGGGTGAAAAATGGATTGGAATCACATACGAAAAGCTACCCGACTTCTGTTACGGATGTGGATGGCTAGGACATACAATCAGGGAATGCGAAGTCTGTGTCAA
CTCAAAAGAAGAAGATCTGCCATATGGTCCTAGGTTACGTGAACATGTTAATCTTATAGGAAGAGAAATTAATTTTTTTCCAAGGTATGTGAATTATTTCGCTGGCAGAG
GAAGGGGCAGAATGGGGGATAGCTGGAGAAATAACATTTATGTGGATGAAGACGATGGAACAAATGGAATGCAGAACAAAGAATCAGAAACAAACAAGGTCCAAGCTTAC
CGGAAAAAGCCACTTTCGCCGGAAGTGCATGAAGATCCGGTCGGAAGAGGTCAGAGGGATCAAACGGTAAAAAATTCAGAAAAGGAAAGAGAGAAGCCAGAGATTGATGA
GGGCAATAATGTTTCAATTAATGCTGATGGAATTTCAGGGGATATCATGACCGAAAAGGAAACTGAAAATATGGGAATAAATAGCACCAACGGACAGAATGATATAGGAA
TGAATAGTACCAACGGACAGAAGGAAGGATTTGCAATGATGGAGGTAGATCTAAATGGGCCTGAACAATTAAAGGACAGTACAAGCAGTAACTCTCAGGATGTCTCAAAA
CCTATAGTAGAGCGGGATAAAAACAAGGCAAAAATGGGACAAGAGGAGACCGACAATACAAAACAAAAAAAAAGAAAACAACATGGAAGAGGAAGGAGAGGAAATATGAT
CCATAAGAGGGCTATTGCTAGAAAGGAGCGAGAGATTCAAGAGCTTAGTAATGGAAATGATCAAGATAGCATGCTAGAGAACTCATGGGCTACACAAGATTACATGGAAT
ATTTCTGGCGGGACAAAGATGGAAGGATCGAAAATAGCACATTATGCAGGAGCTTGATAGTGTGCTGGCAGATTTGGCAACAGAGGAATGAAGTAGTGCATGAACATACC
ACAGTAGATATAGAGCATTTGCAGGAAAAAATCTATCGGTATCTAGAGGAATTTCACATTAAAACAGAAGGCGAAGAGACGAACCTGATCTGGAGACCTAATCATGAGGA
AGCTCAACGGAAGCAGTCGACGCCAAGCAACGCTCACCTGTTGCAACACGAGGAAACCCATGACGAGCGCTGCACCAGTGAATCCGTGAGAGAAGAAGTGGGTATCTCCT
TGCGATGGATGGAACCGCCGGTGGGGATGTGGAAGCTAAATAGCGATGCCTCATGGTGTGCGAAGATGAACCGTGGTGGAATAGGGTGGATTTTACGCCGATGGGATGGT
ACTCCTTTGACAGCAGGATACAAAGTGGTTAGACAAATCTGGAAACTCAGTTGGTTAGAGGCCTTAGCGTTGGTGGAAGGGATGAAATCCGTGAGTCGATCCTCTCCTAA
GTTGATTATCGAGCTTGACTCAGTGCAGGTGGTACATCAGCTTCAGGGGAAACACAAAGATCTCACCGGACTTTCTCTCTTTATCGCAGAAGTTAAGCGCCTTCTATCTG
AGTTTCAGGTTCACGAGATTAGGCATATCAATAGGAAGTACAATGGCATGGCACACTAG
Protein sequenceShow/hide protein sequence
MEKKKKNFVGGGRPSTAGGRSPEKKKKWEGEKGGRDESVLVKMDPKEDAEELAAQLTSLKVTADEKASVFHLQENDIDKSEKKLINALLCKSLTHKKINLEVFRGMMPRI
WGQEQTIIDHVGANIFLCKFKNARIKGFIQEAEPWFYNKSLLVFEEPRGDINAEDMDFRNAATEIGSLLGKVEQVDIDEETEPKMGCSLWIKIQIDVKKPLKRGIFMKSG
TNGGEKWIGITYEKLPDFCYGCGWLGHTIRECEVCVNSKEEDLPYGPRLREHVNLIGREINFFPRYVNYFAGRGRGRMGDSWRNNIYVDEDDGTNGMQNKESETNKVQAY
RKKPLSPEVHEDPVGRGQRDQTVKNSEKEREKPEIDEGNNVSINADGISGDIMTEKETENMGINSTNGQNDIGMNSTNGQKEGFAMMEVDLNGPEQLKDSTSSNSQDVSK
PIVERDKNKAKMGQEETDNTKQKKRKQHGRGRRGNMIHKRAIARKEREIQELSNGNDQDSMLENSWATQDYMEYFWRDKDGRIENSTLCRSLIVCWQIWQQRNEVVHEHT
TVDIEHLQEKIYRYLEEFHIKTEGEETNLIWRPNHEEAQRKQSTPSNAHLLQHEETHDERCTSESVREEVGISLRWMEPPVGMWKLNSDASWCAKMNRGGIGWILRRWDG
TPLTAGYKVVRQIWKLSWLEALALVEGMKSVSRSSPKLIIELDSVQVVHQLQGKHKDLTGLSLFIAEVKRLLSEFQVHEIRHINRKYNGMAH