; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021552 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021552
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold9:7650639..7658546
RNA-Seq ExpressionSpg021552
SyntenySpg021552
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCH88312.1 zinc CCHC-type-like protein [Trifolium medium]1.1e-3735.25Show/hide
Query:  EWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPWLFDKSLLV
        EWR+  L +EE+EG   +E++E    E   + +LVGKL +    +    K  +  AW+ +    V+ L+KN+F F+F +  D   ++ NGPW FD++L+V
Subjt:  EWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPWLFDKSLLV

Query:  LESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWISIRYERLH
        L+  +   + +D+E    + W RI +LP+  R+E +A+K+GD +G+F+E D  K  N  G  +RV+  ID+ KPL+RG +  +K   ++  +  ++ERL 
Subjt:  LESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWISIRYERLH

Query:  DLCFYCGRIGHTAKECTEMKNGDKA-----KEREFEFGSWLK---FQGFYQQAKKQEYQGN
          CF CGRIGH  K+C E+++ D A     +E+E  FG WL+      F  + KK+   G+
Subjt:  DLCFYCGRIGHTAKECTEMKNGDKA-----KEREFEFGSWLK---FQGFYQQAKKQEYQGN

TXG72009.1 hypothetical protein EZV62_000588 [Acer yangbiense]4.9e-3837.14Show/hide
Query:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW
        M  DE+    +  SL + E      L+ +  K+   ++   L+GKLLSN+ +++ A  +     W+T  DF +E++S N FSF F++  D+  +L  GPW
Subjt:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW

Query:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI
         FDK+LLVLE PT    +  M+F KV  W++I N+P+      I + +G+ +G   E D  KS    G  IRVRV I + KPLRR     +     E  +
Subjt:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI

Query:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWLK
         ++YERL + CF CG +GHT + C + K GD  +  +  FGSWLK
Subjt:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWLK

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.2e-4135.34Show/hide
Query:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTR-GDFNVEILSKNIFSFKFENTEDKKWILNNGP
        M    L+ EW+ F L  EE +    +++  ++     LE +L+ KLLS R IS   +KN +  AWK     F+V+I+  NIF F F  + D+  IL  GP
Subjt:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTR-GDFNVEILSKNIFSFKFENTEDKKWILNNGP

Query:  WLFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECW
        W FD++L+++++P +  +  DM+F+ V LW+   +L +   N+ +A ++G+ +G F + + + +N  WG+ +RVRV+ D+ KPL RG    +      CW
Subjt:  WLFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECW

Query:  ISIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWLKFQG
        I I+YERL D  ++CGR+ H  K+C++      +K    ++G WL+FQG
Subjt:  ISIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWLKFQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]4.4e-4736.45Show/hide
Query:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW
        MD + L+ +W++F L  EE E    ++ D +K  E  L ++LVGKLL+ R IS   +   ++ AWK      VE + KN+F F F    D   ++  GPW
Subjt:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW

Query:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI
         FDK+L+VL+ P +S  ++++EF +V  W+ + +LPM + N+ +A ++G+ +G F++ D ++   SWG S+R+RV IDI+KPLRRG    I      CWI
Subjt:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI

Query:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKER-EFEFGSWLKFQGFYQQAKKQEYQGNKNQN-SQEDNQDSNQHMEK---TKKPRAGLDVDLNQD
         I+YERL D C++CG IGH++ +C       +   R   E+G WL+F G    A+K    G K ++ ++ED+  S+    K    ++ +  L    NQD
Subjt:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKER-EFEFGSWLKFQGFYQQAKKQEYQGNKNQN-SQEDNQDSNQHMEK---TKKPRAGLDVDLNQD

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis]8.3e-3831.89Show/hide
Query:  DELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPWLFD
        D LV+  R  SL  EE +    +  D    + G+ +  LVGKLL+ R  +  A+KN ++  W+      V ++  N+F F F +  DK+ +L+NGPW FD
Subjt:  DELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPWLFD

Query:  KSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWISIR
        K LL+L     + + +D++   V  W+ + NLP+   N+ + + +G+ +G F++ D +    +WG ++R+RV ID+ KPLRRG    +  S++  W+  +
Subjt:  KSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWISIR

Query:  YERLHDLCFYCGRIGHTAKECTE-MKNGDKAKEREFEFGSWLKFQGFYQQAKKQ
        YERL   C++CGR+GH+ +EC + +   D  +    ++G+WL+      +  ++
Subjt:  YERLHDLCFYCGRIGHTAKECTE-MKNGDKAKEREFEFGSWLKFQGFYQQAKKQ

TrEMBL top hitse value%identityAlignment
A0A5C7ISE3 CCHC-type domain-containing protein2.4e-3837.14Show/hide
Query:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW
        M  DE+    +  SL + E      L+ +  K+   ++   L+GKLLSN+ +++ A  +     W+T  DF +E++S N FSF F++  D+  +L  GPW
Subjt:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW

Query:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI
         FDK+LLVLE PT    +  M+F KV  W++I N+P+      I + +G+ +G   E D  KS    G  IRVRV I + KPLRR     +     E  +
Subjt:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI

Query:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWLK
         ++YERL + CF CG +GHT + C + K GD  +  +  FGSWLK
Subjt:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWLK

A0A6J1BSZ1 uncharacterized protein LOC1110054816.0e-4235.34Show/hide
Query:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTR-GDFNVEILSKNIFSFKFENTEDKKWILNNGP
        M    L+ EW+ F L  EE +    +++  ++     LE +L+ KLLS R IS   +KN +  AWK     F+V+I+  NIF F F  + D+  IL  GP
Subjt:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTR-GDFNVEILSKNIFSFKFENTEDKKWILNNGP

Query:  WLFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECW
        W FD++L+++++P +  +  DM+F+ V LW+   +L +   N+ +A ++G+ +G F + + + +N  WG+ +RVRV+ D+ KPL RG    +      CW
Subjt:  WLFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECW

Query:  ISIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWLKFQG
        I I+YERL D  ++CGR+ H  K+C++      +K    ++G WL+FQG
Subjt:  ISIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWLKFQG

A0A6J1DU55 uncharacterized protein LOC1110231352.1e-4736.45Show/hide
Query:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW
        MD + L+ +W++F L  EE E    ++ D +K  E  L ++LVGKLL+ R IS   +   ++ AWK      VE + KN+F F F    D   ++  GPW
Subjt:  MDIDELVNEWRRFSLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW

Query:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI
         FDK+L+VL+ P +S  ++++EF +V  W+ + +LPM + N+ +A ++G+ +G F++ D ++   SWG S+R+RV IDI+KPLRRG    I      CWI
Subjt:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI

Query:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKER-EFEFGSWLKFQGFYQQAKKQEYQGNKNQN-SQEDNQDSNQHMEK---TKKPRAGLDVDLNQD
         I+YERL D C++CG IGH++ +C       +   R   E+G WL+F G    A+K    G K ++ ++ED+  S+    K    ++ +  L    NQD
Subjt:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKER-EFEFGSWLKFQGFYQQAKKQEYQGNKNQN-SQEDNQDSNQHMEK---TKKPRAGLDVDLNQD

A0A803LK12 Uncharacterized protein2.4e-3834.43Show/hide
Query:  DELVNEWRRFSLMEEEKE--GFFTLETDEIKEIEGQLEHNLVGKLLSNR-FISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW
        +EL+ +W  F L +EE    G  ++E D + +   +LE +LVGK+L+ + +     +K  + G W+   D  V ++  N+F F+F N EDK  +L+  PW
Subjt:  DELVNEWRRFSLMEEEKE--GFFTLETDEIKEIEGQLEHNLVGKLLSNR-FISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPW

Query:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI
         FDK +L+L++     + +++ F+    W+R++++  G RNE  AR++GD +G FLE+D ++    W   +R++V +DI+KPLRRG       S  + W+
Subjt:  LFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWI

Query:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWL
        S +YERL D  ++CGR+GH  K+C E +  D+ +   F++G ++
Subjt:  SIRYERLHDLCFYCGRIGHTAKECTEMKNGDKAKEREFEFGSWL

A0A803N409 Uncharacterized protein4.0e-3834.43Show/hide
Query:  DELVNEWRRFSLMEEEKEGFFTL-ETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPWLF
        D+++ EW + SL EEE +    L E D++ + E Q++  LVGKL + +  +  A+K  +   WK   +  + ++  N+F F+F N EDK+W+++  PW F
Subjt:  DELVNEWRRFSLMEEEKEGFFTL-ETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPWLF

Query:  DKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWISI
        D  L++L       + +D++  +  +W+R+ ++P   R   + R++GD LG F+++D +     WG  +R++V +DI KPL RG +F     S+  WI I
Subjt:  DKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWISI

Query:  RYERLHDLCFYCGRIGHTAKECTEMKNGD-KAKEREFEFGSWLK
        +YE+L D CFYCG + HT K C   +  D K  E  +++G WL+
Subjt:  RYERLHDLCFYCGRIGHTAKECTEMKNGD-KAKEREFEFGSWLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein8.1e-0726.17Show/hide
Query:  ICWSIWSDRNKIVHGE--EIPSIQYRSRWIKNYLEEFMRANARRPPSVVSSLQSEQDSRMAIHWRPPPRGFWKINSDAACFSSPPSSSIRAVCRDSEGAV
        + W IW  RN +V  +  E PS    S   + +       + ++ PS      + Q +   I WR PP  + K N DA        ++   + R+  G  
Subjt:  ICWSIWSDRNKIVHGE--EIPSIQYRSRWIKNYLEEFMRANARRPPSVVSSLQSEQDSRMAIHWRPPPRGFWKINSDAACFSSPPSSSIRAVCRDSEGAV

Query:  MFALSNSTNINMEAHLAELWALSEGVKMALSMGCTTRKVESDNLIATKFLNREIPVWRDVEAIIVSIWSLANSFKDISFSFIPRECNKVAHELARYGAFS
        +   S           AE  AL   ++     G T   +E D       +N  I     +   +  I   AN F  I F FI R+ NK+AH LA+YG   
Subjt:  MFALSNSTNINMEAHLAELWALSEGVKMALSMGCTTRKVESDNLIATKFLNREIPVWRDVEAIIVSIWSLANSFKDISFSFIPRECNKVAHELARYGAFS

Query:  FHQETWVGNYPSWL
            +  G+ P WL
Subjt:  FHQETWVGNYPSWL

AT3G42140.1 zinc ion binding;nucleic acid binding4.2e-1126.45Show/hide
Query:  ILSKNIFSFKFENTEDKKWILNNGPWLFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVR
        IL  +   F F++ E    IL  GPW F+  + V++  T  +  +D EFK++  W++I  +P+ F    I   IG+++G FLE        + G  + V 
Subjt:  ILSKNIFSFKFENTEDKKWILNNGPWLFDKSLLVLESPTTSYRVTDMEFKKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVR

Query:  VKIDISKPLRRGFMFKIKESSEECWISIRYERLHDLCFYCGRIGHTAKECTEMKN
                                 +  +YE+L + C  CG + H A EC    N
Subjt:  VKIDISKPLRRGFMFKIKESSEECWISIRYERLHDLCFYCGRIGHTAKECTEMKN

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0537.21Show/hide
Query:  LAELWALSEGVKMALSMGCTTRKVESDNLIATKFLNREIPVWRDVEAIIVSIWSLANSFKDISFSFIPRECNKVAHELARYGAFSF
        +AE  AL   ++ A S+G T   + SD+      +  E P   +   II  I +L+  F D+SFSF+PR  N+VA ELA+    SF
Subjt:  LAELWALSEGVKMALSMGCTTRKVESDNLIATKFLNREIPVWRDVEAIIVSIWSLANSFKDISFSFIPRECNKVAHELARYGAFSF

AT4G29090.1 Ribonuclease H-like superfamily protein1.3e-0925Show/hide
Query:  ICWSIWSDRNKIVHGEEIPSIQYRSRWIKNYLEEFMRANARRPPSVVSSLQSEQDSRMAIHWRPPPRGFWKINSDAACFSSPPSSSIRAVCRDSEGAVMF
        + W +W +RN++V      + Q   R  ++ LEE+      R  +     + + +      WRPPP  + K N+DA          I  V R+ +G V +
Subjt:  ICWSIWSDRNKIVHGEEIPSIQYRSRWIKNYLEEFMRANARRPPSVVSSLQSEQDSRMAIHWRPPPRGFWKINSDAACFSSPPSSSIRAVCRDSEGAVMF

Query:  ALSNSTNINMEAHLAELWALSEGVKMALSMGCTTRKVESDNLIATKFLNREIPVWRDVEAIIVSIWSLANSFKDISFSFIPRECNKVAHELARYG-AFSF
          + +         AEL A+   V             ESD+ +  + LN +  +W  ++  I  +  L + F ++ F FIPRE N +A  +AR   +F  
Subjt:  ALSNSTNINMEAHLAELWALSEGVKMALSMGCTTRKVESDNLIATKFLNREIPVWRDVEAIIVSIWSLANSFKDISFSFIPRECNKVAHELARYG-AFSF

Query:  HQETWVGNYPSW
        +        PSW
Subjt:  HQETWVGNYPSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCGGAGAGAAGGCGATCGAGAAAGTCACGATTGGGAGAGTCTGAGAAGGAAATCGTTGGCGGTGACCTAACGAAAGGGGAGCGGCGAGTAGGAAGGGAGAGATT
TAGTGGGAGGCGATGGAGGAGCCTGGAGAGAGCATCAGCACGTGGCGGAGGTGGAGAGATGTTGATTGCATGCCTCCTACACCTTGTCCCCGAGTTGAAATCACTGGAAT
CGAACCCCCCATGTGCACTGGATGAAGAAGGGGACACAGCAGATGGTCTTGTGCTGGCTCGCAAGTTGCAAATGGACATTGATGAACTAGTTAATGAATGGAGAAGGTTC
AGTTTAATGGAAGAAGAAAAAGAAGGTTTCTTTACCTTGGAAACGGATGAGATCAAGGAGATTGAAGGGCAACTTGAGCATAACCTTGTGGGCAAACTCCTGTCAAACAG
ATTCATATCAAAGATGGCTATCAAGAACGCAATGATGGGAGCCTGGAAAACGAGGGGTGATTTCAATGTAGAAATCCTTAGCAAAAATATTTTCTCCTTCAAGTTTGAAA
ATACTGAAGACAAGAAGTGGATACTCAACAATGGCCCTTGGCTTTTTGACAAAAGCCTCCTCGTTCTCGAGTCCCCAACAACCAGCTATAGAGTCACAGATATGGAATTC
AAGAAAGTAGATCTTTGGTTAAGGATAATAAATCTTCCAATGGGTTTTCGGAATGAACTAATAGCCAGAAAGATTGGAGACAAGTTGGGCACTTTCCTTGAATGGGATAA
AGATAAATCAAACAACTCATGGGGCAACAGCATAAGAGTTCGGGTCAAGATAGACATTTCGAAACCTCTTCGAAGAGGTTTCATGTTTAAAATAAAAGAATCAAGCGAAG
AATGTTGGATATCCATAAGATACGAAAGATTACATGATCTATGCTTTTATTGTGGGAGAATCGGTCATACTGCAAAGGAATGCACTGAAATGAAAAACGGTGATAAAGCA
AAAGAAAGGGAGTTTGAGTTTGGATCGTGGCTAAAGTTTCAAGGCTTCTATCAACAAGCTAAGAAACAAGAATATCAGGGAAACAAAAATCAGAATTCACAAGAAGACAA
CCAGGATAGTAACCAACATATGGAAAAGACAAAAAAACCAAGAGCGGGCCTGGATGTAGATTTAAACCAAGATAGCCCAATTTATGATGTTCCAGTACATACAATCAACA
GGGAAGGGGAAGAGGAAGGAGAAATAAATGCATTACTAAGAATGGAAATTACTGAAGAAGAAACATGCAATGATCCTTTACAGATAAAGGAGATAAGAAAAGGGTTGGTT
CAATTTGATATGATACAAGAGGATGAGATACATAGTTCAGCAGTAGATACCTCTTCAGTAAGGAAGAAATCAAGCTGGAAAAGAAAGACAAACTCAGGACAAATGGTAGC
AGATTTTATACTTCCTTCAGGTGGTTGGGATGTGCCAAAGCTTAGAGAAGCAGTCCTTGATTTAGATTTTGATATTATCAGAGTAATCTGCTGGTCCATCTGGTCGGACA
GAAACAAAATCGTTCATGGTGAAGAGATTCCGTCTATCCAATACAGGAGCCGATGGATTAAGAACTACCTTGAAGAGTTTATGCGCGCAAATGCAAGGAGACCTCCTTCA
GTTGTATCTAGTCTTCAGTCGGAGCAAGATTCGAGAATGGCGATCCATTGGCGTCCTCCACCAAGAGGGTTCTGGAAAATCAACTCAGATGCCGCATGTTTCTCCTCGCC
TCCGTCTTCGAGCATCAGAGCAGTTTGTAGAGATTCAGAAGGAGCTGTGATGTTTGCCCTATCAAATTCCACTAATATCAATATGGAGGCCCATTTAGCCGAATTATGGG
CTTTGTCTGAAGGTGTTAAGATGGCTTTATCCATGGGCTGCACAACCCGGAAGGTGGAATCAGACAATCTTATAGCAACCAAGTTCCTTAATAGAGAGATCCCTGTTTGG
AGAGATGTTGAAGCTATCATAGTTTCTATTTGGAGCTTAGCCAATTCTTTTAAAGATATTTCATTTAGTTTTATTCCAAGAGAATGTAATAAGGTAGCCCATGAGTTAGC
TAGATATGGAGCTTTTTCATTCCATCAGGAAACTTGGGTCGGTAATTACCCGAGTTGGCTTTGTAATTGGTCGAGAATGACGTCTTTTCTTGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCGGAGAGAAGGCGATCGAGAAAGTCACGATTGGGAGAGTCTGAGAAGGAAATCGTTGGCGGTGACCTAACGAAAGGGGAGCGGCGAGTAGGAAGGGAGAGATT
TAGTGGGAGGCGATGGAGGAGCCTGGAGAGAGCATCAGCACGTGGCGGAGGTGGAGAGATGTTGATTGCATGCCTCCTACACCTTGTCCCCGAGTTGAAATCACTGGAAT
CGAACCCCCCATGTGCACTGGATGAAGAAGGGGACACAGCAGATGGTCTTGTGCTGGCTCGCAAGTTGCAAATGGACATTGATGAACTAGTTAATGAATGGAGAAGGTTC
AGTTTAATGGAAGAAGAAAAAGAAGGTTTCTTTACCTTGGAAACGGATGAGATCAAGGAGATTGAAGGGCAACTTGAGCATAACCTTGTGGGCAAACTCCTGTCAAACAG
ATTCATATCAAAGATGGCTATCAAGAACGCAATGATGGGAGCCTGGAAAACGAGGGGTGATTTCAATGTAGAAATCCTTAGCAAAAATATTTTCTCCTTCAAGTTTGAAA
ATACTGAAGACAAGAAGTGGATACTCAACAATGGCCCTTGGCTTTTTGACAAAAGCCTCCTCGTTCTCGAGTCCCCAACAACCAGCTATAGAGTCACAGATATGGAATTC
AAGAAAGTAGATCTTTGGTTAAGGATAATAAATCTTCCAATGGGTTTTCGGAATGAACTAATAGCCAGAAAGATTGGAGACAAGTTGGGCACTTTCCTTGAATGGGATAA
AGATAAATCAAACAACTCATGGGGCAACAGCATAAGAGTTCGGGTCAAGATAGACATTTCGAAACCTCTTCGAAGAGGTTTCATGTTTAAAATAAAAGAATCAAGCGAAG
AATGTTGGATATCCATAAGATACGAAAGATTACATGATCTATGCTTTTATTGTGGGAGAATCGGTCATACTGCAAAGGAATGCACTGAAATGAAAAACGGTGATAAAGCA
AAAGAAAGGGAGTTTGAGTTTGGATCGTGGCTAAAGTTTCAAGGCTTCTATCAACAAGCTAAGAAACAAGAATATCAGGGAAACAAAAATCAGAATTCACAAGAAGACAA
CCAGGATAGTAACCAACATATGGAAAAGACAAAAAAACCAAGAGCGGGCCTGGATGTAGATTTAAACCAAGATAGCCCAATTTATGATGTTCCAGTACATACAATCAACA
GGGAAGGGGAAGAGGAAGGAGAAATAAATGCATTACTAAGAATGGAAATTACTGAAGAAGAAACATGCAATGATCCTTTACAGATAAAGGAGATAAGAAAAGGGTTGGTT
CAATTTGATATGATACAAGAGGATGAGATACATAGTTCAGCAGTAGATACCTCTTCAGTAAGGAAGAAATCAAGCTGGAAAAGAAAGACAAACTCAGGACAAATGGTAGC
AGATTTTATACTTCCTTCAGGTGGTTGGGATGTGCCAAAGCTTAGAGAAGCAGTCCTTGATTTAGATTTTGATATTATCAGAGTAATCTGCTGGTCCATCTGGTCGGACA
GAAACAAAATCGTTCATGGTGAAGAGATTCCGTCTATCCAATACAGGAGCCGATGGATTAAGAACTACCTTGAAGAGTTTATGCGCGCAAATGCAAGGAGACCTCCTTCA
GTTGTATCTAGTCTTCAGTCGGAGCAAGATTCGAGAATGGCGATCCATTGGCGTCCTCCACCAAGAGGGTTCTGGAAAATCAACTCAGATGCCGCATGTTTCTCCTCGCC
TCCGTCTTCGAGCATCAGAGCAGTTTGTAGAGATTCAGAAGGAGCTGTGATGTTTGCCCTATCAAATTCCACTAATATCAATATGGAGGCCCATTTAGCCGAATTATGGG
CTTTGTCTGAAGGTGTTAAGATGGCTTTATCCATGGGCTGCACAACCCGGAAGGTGGAATCAGACAATCTTATAGCAACCAAGTTCCTTAATAGAGAGATCCCTGTTTGG
AGAGATGTTGAAGCTATCATAGTTTCTATTTGGAGCTTAGCCAATTCTTTTAAAGATATTTCATTTAGTTTTATTCCAAGAGAATGTAATAAGGTAGCCCATGAGTTAGC
TAGATATGGAGCTTTTTCATTCCATCAGGAAACTTGGGTCGGTAATTACCCGAGTTGGCTTTGTAATTGGTCGAGAATGACGTCTTTTCTTGTTTAG
Protein sequenceShow/hide protein sequence
MAAERRRSRKSRLGESEKEIVGGDLTKGERRVGRERFSGRRWRSLERASARGGGGEMLIACLLHLVPELKSLESNPPCALDEEGDTADGLVLARKLQMDIDELVNEWRRF
SLMEEEKEGFFTLETDEIKEIEGQLEHNLVGKLLSNRFISKMAIKNAMMGAWKTRGDFNVEILSKNIFSFKFENTEDKKWILNNGPWLFDKSLLVLESPTTSYRVTDMEF
KKVDLWLRIINLPMGFRNELIARKIGDKLGTFLEWDKDKSNNSWGNSIRVRVKIDISKPLRRGFMFKIKESSEECWISIRYERLHDLCFYCGRIGHTAKECTEMKNGDKA
KEREFEFGSWLKFQGFYQQAKKQEYQGNKNQNSQEDNQDSNQHMEKTKKPRAGLDVDLNQDSPIYDVPVHTINREGEEEGEINALLRMEITEEETCNDPLQIKEIRKGLV
QFDMIQEDEIHSSAVDTSSVRKKSSWKRKTNSGQMVADFILPSGGWDVPKLREAVLDLDFDIIRVICWSIWSDRNKIVHGEEIPSIQYRSRWIKNYLEEFMRANARRPPS
VVSSLQSEQDSRMAIHWRPPPRGFWKINSDAACFSSPPSSSIRAVCRDSEGAVMFALSNSTNINMEAHLAELWALSEGVKMALSMGCTTRKVESDNLIATKFLNREIPVW
RDVEAIIVSIWSLANSFKDISFSFIPRECNKVAHELARYGAFSFHQETWVGNYPSWLCNWSRMTSFLV