; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025913 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025913
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold7:864597..870367
RNA-Seq ExpressionSpg025913
SyntenySpg025913
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]3.1e-3730.34Show/hide
Query:  ELLVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQ-EQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFD
        E  + QL E    E+E A V ++  + I   ++ ++  L+ K+LT KK+N E FK  + +IW Q  Q  +  VG N ++  F N   ++ +   GPW F 
Subjt:  ELLVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQ-EQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFD

Query:  KALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPI
        K+LI+L++PKG      + F    FW+  H +P  C ++     +   +G+V +I  E    + WG  +RVK+Q+D++KPLKR + +      E   + +
Subjt:  KALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPI

Query:  TYEKLPDFCYGCGYLGHTIKEC--ESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDG----LDGTI
         YE+LPDFC+ CG +GH+++EC  E   +A    ++ ++G+W+R      + ++++   S+   +   RGR  EG R        E E DG      G +
Subjt:  TYEKLPDFCYGCGYLGHTIKEC--ESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDG----LDGTI

Query:  QKERGDDRSDSVLAGGKLKTPAN
          ++    S S +A  K  T  N
Subjt:  QKERGDDRSDSVLAGGKLKTPAN

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]2.5e-3936.24Show/hide
Query:  EEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ-VGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGN
        +++   + ++KG    + E+ L  +LI K +T K IN E FKS +  IW  +  +  + +G N++  +F+N   +  I E GPW FDK L++L+E  G  
Subjt:  EEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ-VGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGN

Query:  YGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFCYGCG
           D+ FRYV FWI  H LP AC +R     +GGL+G+V++ID  E   +  G  +R+++ IDV  PLKRG+ +   D  +   + I YE+LP+FCY CG
Subjt:  YGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFCYGCG

Query:  YLGHTIKECESIDQAGSSEEELEYGAWLR
         +GH +++C    +  +S    ++G W+R
Subjt:  YLGHTIKECESIDQAGSSEEELEYGAWLR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.4e-3736.51Show/hide
Query:  LVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ--VGFNLYLCKFKNGRIKSLIKETGPWFFDK
        L+++    K+T EE      +    +  + K LE +LICK+L+++ I+  + K+ +   W  +    S   +GFN++L  F     ++ I   GPW FD+
Subjt:  LVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ--VGFNLYLCKFKNGRIKSLIKETGPWFFDK

Query:  ALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPIT
        ALI++  P       DMDFR VS W+HF  L  AC ++  AT +G  +G  E ++     +  WG  LRV+++ DV KPL RGI L  +      WIPI 
Subjt:  ALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPIT

Query:  YEKLPDFCYGCGYLGHTIKECESIDQAGSSEEELEYGAWLR
        YE+LPDF Y CG L H +K+C       S  + L+YG WLR
Subjt:  YEKLPDFCYGCGYLGHTIKECESIDQAGSSEEELEYGAWLR

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]1.0e-4036.39Show/hide
Query:  KVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-QEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPK
        K T +E   V   +G+ I  ++  ++  ++ K+ T K+I+ E  +S M  +W     T    +G N+Y+  FK+   KS +  +GPW F+K+L++L  P 
Subjt:  KVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-QEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPK

Query:  GGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGS-LRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFC
          N   DM+F + +FWI  H +PF C S   A  +G  LG VE+I  E +    W G  +RV+++IDVSKPL+RGI L   D  ++ W P+ YEKLPDFC
Subjt:  GGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGS-LRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFC

Query:  YGCGYLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLP-RSIVNQAGR-GRGRIWEGGRGG---WRSSIQEEEEDGLDGTIQKER
        Y CG +GH+ +ECE   +  ++    +YG WLR  +   ++++ S P   +  + GR GRG    GGRGG   WR   ++E    +DG     R
Subjt:  YGCGYLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLP-RSIVNQAGR-GRGRIWEGGRGG---WRSSIQEEEEDGLDGTIQKER

XP_035541689.1 uncharacterized protein LOC118344688 [Juglans regia]3.1e-3737.45Show/hide
Query:  LKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEP
        LK+TEEE+   + L  EEI  S+    + L+  +   +++N   FK+ M R+W  E  I    +G N +L KF N  +++ +  + PW FD+ LI ++E 
Subjt:  LKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEP

Query:  KGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFC
        KG    +D+DF    FW+  H LPFA  ++    ++G   GKV  +D++E+  ++WGG LRVK+ I++SKPL RG  +   D+    WIP  YE+LP FC
Subjt:  KGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFC

Query:  YGCGYLGHTIKECESIDQAGSSEEEL--EYGAWLR
        Y CG + H+   C      GS+  E   +YG WLR
Subjt:  YGCGYLGHTIKECESIDQAGSSEEEL--EYGAWLR

TrEMBL top hitse value%identityAlignment
A0A5C7GU64 CCHC-type domain-containing protein1.5e-3730.34Show/hide
Query:  ELLVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQ-EQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFD
        E  + QL E    E+E A V ++  + I   ++ ++  L+ K+LT KK+N E FK  + +IW Q  Q  +  VG N ++  F N   ++ +   GPW F 
Subjt:  ELLVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQ-EQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFD

Query:  KALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPI
        K+LI+L++PKG      + F    FW+  H +P  C ++     +   +G+V +I  E    + WG  +RVK+Q+D++KPLKR + +      E   + +
Subjt:  KALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPI

Query:  TYEKLPDFCYGCGYLGHTIKEC--ESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDG----LDGTI
         YE+LPDFC+ CG +GH+++EC  E   +A    ++ ++G+W+R      + ++++   S+   +   RGR  EG R        E E DG      G +
Subjt:  TYEKLPDFCYGCGYLGHTIKEC--ESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDG----LDGTI

Query:  QKERGDDRSDSVLAGGKLKTPAN
          ++    S S +A  K  T  N
Subjt:  QKERGDDRSDSVLAGGKLKTPAN

A0A5C7H9Y2 CCHC-type domain-containing protein1.2e-3936.24Show/hide
Query:  EEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ-VGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGN
        +++   + ++KG    + E+ L  +LI K +T K IN E FKS +  IW  +  +  + +G N++  +F+N   +  I E GPW FDK L++L+E  G  
Subjt:  EEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ-VGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGN

Query:  YGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFCYGCG
           D+ FRYV FWI  H LP AC +R     +GGL+G+V++ID  E   +  G  +R+++ IDV  PLKRG+ +   D  +   + I YE+LP+FCY CG
Subjt:  YGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFCYGCG

Query:  YLGHTIKECESIDQAGSSEEELEYGAWLR
         +GH +++C    +  +S    ++G W+R
Subjt:  YLGHTIKECESIDQAGSSEEELEYGAWLR

A0A6J1BSZ1 uncharacterized protein LOC1110054816.7e-3836.51Show/hide
Query:  LVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ--VGFNLYLCKFKNGRIKSLIKETGPWFFDK
        L+++    K+T EE      +    +  + K LE +LICK+L+++ I+  + K+ +   W  +    S   +GFN++L  F     ++ I   GPW FD+
Subjt:  LVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ--VGFNLYLCKFKNGRIKSLIKETGPWFFDK

Query:  ALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPIT
        ALI++  P       DMDFR VS W+HF  L  AC ++  AT +G  +G  E ++     +  WG  LRV+++ DV KPL RGI L  +      WIPI 
Subjt:  ALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPIT

Query:  YEKLPDFCYGCGYLGHTIKECESIDQAGSSEEELEYGAWLR
        YE+LPDF Y CG L H +K+C       S  + L+YG WLR
Subjt:  YEKLPDFCYGCGYLGHTIKECESIDQAGSSEEELEYGAWLR

A0A6J1D765 uncharacterized protein LOC1110179025.0e-4136.39Show/hide
Query:  KVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-QEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPK
        K T +E   V   +G+ I  ++  ++  ++ K+ T K+I+ E  +S M  +W     T    +G N+Y+  FK+   KS +  +GPW F+K+L++L  P 
Subjt:  KVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-QEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPK

Query:  GGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGS-LRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFC
          N   DM+F + +FWI  H +PF C S   A  +G  LG VE+I  E +    W G  +RV+++IDVSKPL+RGI L   D  ++ W P+ YEKLPDFC
Subjt:  GGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGS-LRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFC

Query:  YGCGYLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLP-RSIVNQAGR-GRGRIWEGGRGG---WRSSIQEEEEDGLDGTIQKER
        Y CG +GH+ +ECE   +  ++    +YG WLR  +   ++++ S P   +  + GR GRG    GGRGG   WR   ++E    +DG     R
Subjt:  YGCGYLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLP-RSIVNQAGR-GRGRIWEGGRGG---WRSSIQEEEEDGLDGTIQKER

A0A6P9E2G0 uncharacterized protein LOC1183446881.5e-3737.45Show/hide
Query:  LKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEP
        LK+TEEE+   + L  EEI  S+    + L+  +   +++N   FK+ M R+W  E  I    +G N +L KF N  +++ +  + PW FD+ LI ++E 
Subjt:  LKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEP

Query:  KGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFC
        KG    +D+DF    FW+  H LPFA  ++    ++G   GKV  +D++E+  ++WGG LRVK+ I++SKPL RG  +   D+    WIP  YE+LP FC
Subjt:  KGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFC

Query:  YGCGYLGHTIKECESIDQAGSSEEEL--EYGAWLR
        Y CG + H+   C      GS+  E   +YG WLR
Subjt:  YGCGYLGHTIKECESIDQAGSSEEEL--EYGAWLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.6e-0727.95Show/hide
Query:  WIPIPEGCWKLSCDASWSEDLQRGGLGWILRDWGGKPIMAGYRSICRRWKISWLETMAITEGLRATSFAISPATQSFAPQIRVESDCLQVVRLINGEDVD
        W   P    K + DA+W  +  R G+GWILR+  G  +  G R++ R   +   E       L A  +A+   ++    +I  ESD   +V L+N +D  
Subjt:  WIPIPEGCWKLSCDASWSEDLQRGGLGWILRDWGGKPIMAGYRSICRRWKISWLETMAITEGLRATSFAISPATQSFAPQIRVESDCLQVVRLINGEDVD

Query:  GTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAHMACD-ENESKLWVHSFPSWLLS
         T L   ++++QQL+           PR  N++A R+A  +    N         P WL S
Subjt:  GTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAHMACD-ENESKLWVHSFPSWLLS

AT3G42140.1 zinc ion binding;nucleic acid binding1.9e-0520.83Show/hide
Query:  IKETGPWFFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPE
        I   GPW F+  + +++  +      D +F+ + FWI    +P    +    T IG                                   + G+FL   
Subjt:  IKETGPWFFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPE

Query:  DSNENRWIPITYEKLPDFCYGCGYLGHTIKECESIDQAGSSEEE
           +   +   YEKL +FC  CG L H   EC +    G   ++
Subjt:  DSNENRWIPITYEKLPDFCYGCGYLGHTIKECESIDQAGSSEEE

AT4G29090.1 Ribonuclease H-like superfamily protein6.1e-0726.54Show/hide
Query:  WIPIPEGCWKLSCDASWSEDLQRGGLGWILRDWGGKPIMAGYRSICRRWKISWLETMAITEGLRATSFAISPATQSFAPQIRVESDCLQVVRLINGEDVD
        W P P    K + DA+W+ D +R G+GW+LR+  G+    G R++ +      L+++   E L A  +A+   ++     +  ESD   ++ ++N +++ 
Subjt:  WIPIPEGCWKLSCDASWSEDLQRGGLGWILRDWGGKPIMAGYRSICRRWKISWLETMAITEGLRATSFAISPATQSFAPQIRVESDCLQVVRLINGEDVD

Query:  GTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAHMACD--ENESKLWVHSFPSWLLS
           L   I+++Q+L++         IPR  N +A R+A  +      + KL+    PSW  S
Subjt:  GTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAHMACD--ENESKLWVHSFPSWLLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGCCGACAAGAAGAAGAGCTCCTAGTCAAACAGTTAACGGAGTTGAAAGTCACAGAAGAGGAAAAAGCATGTGTCTTCAAGCTGAAAGGAGAGGAAATAAATAA
ATCAGAAAAGAAGTTGGAAAATGCCCTCATTTGCAAGATTCTGACACAAAAGAAAATTAACCCAGAGATGTTTAAGTCCAAAATGCCACGGATTTGGGGTCAGGAACAGA
CTATCATTAGTCAAGTGGGATTCAATTTGTATCTTTGCAAGTTCAAGAACGGGCGTATCAAGAGCTTAATCAAAGAAACCGGACCATGGTTTTTCGACAAGGCTCTCATC
CTGTTAAAGGAACCAAAGGGAGGCAATTACGGCGAGGATATGGATTTCAGGTACGTATCGTTTTGGATTCATTTCCATAAACTTCCTTTTGCTTGTTTTTCCAGGAATGC
AGCAACTGAAATAGGAGGTCTTCTTGGAAAAGTGGAACAAATCGATTTGGAGGAGGAGATAGATCAAAATTGGGGAGGATCTTTACGTGTGAAGATCCAAATTGATGTAT
CCAAACCTTTGAAGCGGGGAATTTTTTTGACACCGGAGGATTCTAATGAAAACCGATGGATTCCGATTACGTATGAAAAGTTACCCGACTTTTGTTATGGTTGTGGATAT
CTTGGTCACACTATAAAGGAATGTGAAAGTATAGATCAAGCTGGGTCATCTGAGGAGGAATTGGAGTATGGTGCTTGGCTTCGTGAACCTATCTTTCTAAAGATGAGAGA
GGCAGAGTCATTGCCCAGATCCATAGTCAATCAAGCTGGTCGGGGTAGAGGTAGAATCTGGGAAGGAGGAAGAGGAGGATGGAGAAGTTCCATCCAAGAAGAAGAAGAAG
ACGGGCTCGACGGTACAATTCAGAAAGAAAGAGGAGATGACAGGTCTGATTCAGTATTGGCCGGCGGGAAGTTGAAGACTCCGGCGAACAGTCCGGTGAGTCCATGTCCA
GAAACGGCAGCAACGGCTACTAAGCCAGAAAAGGAAAATTCGGAAAGCAAGGGTAACGGTAACGTATCTGCCATTAATGAAAATTTCTCAGAAAAAGAGGAATTAAATTC
AAATATTACAGACGAAAATTCTGGTGCAAATGGTCAATCAAAGATTCATATTTATGACTCTAATATGGAGGTGGATCAAGACAATGATGGGACAAATCGAAGGGGACAGC
TGGCGCAAGGGGAGAGTGTTAATGTCATTGAGGATATTGATTTACAACCTGCTATGGATACTATTCAGAAAGCTAAGTTGGAAGGCAAGCTTAAAGTATGGAAAAGGATT
CCACGGTCCCAACAGCAGGATGATTCTAAAATCATTATGGAAACAACTAGCAAGACCATTTCAGGGGCTAAACATTCAATTAATCACAGTCAAGCACAAATAAAAGCAAT
GGTTCAGCAGCAGATTGAAAGCTCCATCAACGAATTAATCAGAGAAGAAGGTACTTACCAGTCGAGTCCCTACCCGAATGTCGAGCACAACGCGAACCCAGGCTCTCCCT
TGCAGACGGCAGAACAGAGGCATTGGATCCCGATTCCTGAAGGCTGTTGGAAGCTGAGTTGTGATGCGTCATGGAGTGAAGATCTTCAACGGGGTGGATTAGGATGGATT
CTTCGAGACTGGGGAGGGAAACCGATAATGGCGGGTTACAGAAGTATTTGCCGGAGGTGGAAGATTAGTTGGCTTGAGACTATGGCGATCACTGAGGGACTGCGTGCGAC
TTCATTTGCGATCTCCCCTGCAACCCAGAGCTTCGCCCCTCAAATTCGCGTGGAAAGTGACTGTCTCCAGGTGGTTCGGCTGATTAATGGTGAAGATGTAGATGGTACTG
AACTTGACTTTTTCATAAAGGAAGTCCAACAACTTATTGCCATGAGAAGGATAGATCTGACTTCTCATATCCCCAGGGCTTATAACCAAATGGCCCATAGGTTGGCCCAT
ATGGCTTGTGATGAAAATGAGTCAAAATTATGGGTTCACTCCTTTCCAAGTTGGCTTTTATCTTACAATGAAGCTGATGTTGGCTATTTCCTTGACAATAGTGGGGGTTC
ATGTCCCACTAATGTCAATCTTTTGAACTTTTTGCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCGCCGACAAGAAGAAGAGCTCCTAGTCAAACAGTTAACGGAGTTGAAAGTCACAGAAGAGGAAAAAGCATGTGTCTTCAAGCTGAAAGGAGAGGAAATAAATAA
ATCAGAAAAGAAGTTGGAAAATGCCCTCATTTGCAAGATTCTGACACAAAAGAAAATTAACCCAGAGATGTTTAAGTCCAAAATGCCACGGATTTGGGGTCAGGAACAGA
CTATCATTAGTCAAGTGGGATTCAATTTGTATCTTTGCAAGTTCAAGAACGGGCGTATCAAGAGCTTAATCAAAGAAACCGGACCATGGTTTTTCGACAAGGCTCTCATC
CTGTTAAAGGAACCAAAGGGAGGCAATTACGGCGAGGATATGGATTTCAGGTACGTATCGTTTTGGATTCATTTCCATAAACTTCCTTTTGCTTGTTTTTCCAGGAATGC
AGCAACTGAAATAGGAGGTCTTCTTGGAAAAGTGGAACAAATCGATTTGGAGGAGGAGATAGATCAAAATTGGGGAGGATCTTTACGTGTGAAGATCCAAATTGATGTAT
CCAAACCTTTGAAGCGGGGAATTTTTTTGACACCGGAGGATTCTAATGAAAACCGATGGATTCCGATTACGTATGAAAAGTTACCCGACTTTTGTTATGGTTGTGGATAT
CTTGGTCACACTATAAAGGAATGTGAAAGTATAGATCAAGCTGGGTCATCTGAGGAGGAATTGGAGTATGGTGCTTGGCTTCGTGAACCTATCTTTCTAAAGATGAGAGA
GGCAGAGTCATTGCCCAGATCCATAGTCAATCAAGCTGGTCGGGGTAGAGGTAGAATCTGGGAAGGAGGAAGAGGAGGATGGAGAAGTTCCATCCAAGAAGAAGAAGAAG
ACGGGCTCGACGGTACAATTCAGAAAGAAAGAGGAGATGACAGGTCTGATTCAGTATTGGCCGGCGGGAAGTTGAAGACTCCGGCGAACAGTCCGGTGAGTCCATGTCCA
GAAACGGCAGCAACGGCTACTAAGCCAGAAAAGGAAAATTCGGAAAGCAAGGGTAACGGTAACGTATCTGCCATTAATGAAAATTTCTCAGAAAAAGAGGAATTAAATTC
AAATATTACAGACGAAAATTCTGGTGCAAATGGTCAATCAAAGATTCATATTTATGACTCTAATATGGAGGTGGATCAAGACAATGATGGGACAAATCGAAGGGGACAGC
TGGCGCAAGGGGAGAGTGTTAATGTCATTGAGGATATTGATTTACAACCTGCTATGGATACTATTCAGAAAGCTAAGTTGGAAGGCAAGCTTAAAGTATGGAAAAGGATT
CCACGGTCCCAACAGCAGGATGATTCTAAAATCATTATGGAAACAACTAGCAAGACCATTTCAGGGGCTAAACATTCAATTAATCACAGTCAAGCACAAATAAAAGCAAT
GGTTCAGCAGCAGATTGAAAGCTCCATCAACGAATTAATCAGAGAAGAAGGTACTTACCAGTCGAGTCCCTACCCGAATGTCGAGCACAACGCGAACCCAGGCTCTCCCT
TGCAGACGGCAGAACAGAGGCATTGGATCCCGATTCCTGAAGGCTGTTGGAAGCTGAGTTGTGATGCGTCATGGAGTGAAGATCTTCAACGGGGTGGATTAGGATGGATT
CTTCGAGACTGGGGAGGGAAACCGATAATGGCGGGTTACAGAAGTATTTGCCGGAGGTGGAAGATTAGTTGGCTTGAGACTATGGCGATCACTGAGGGACTGCGTGCGAC
TTCATTTGCGATCTCCCCTGCAACCCAGAGCTTCGCCCCTCAAATTCGCGTGGAAAGTGACTGTCTCCAGGTGGTTCGGCTGATTAATGGTGAAGATGTAGATGGTACTG
AACTTGACTTTTTCATAAAGGAAGTCCAACAACTTATTGCCATGAGAAGGATAGATCTGACTTCTCATATCCCCAGGGCTTATAACCAAATGGCCCATAGGTTGGCCCAT
ATGGCTTGTGATGAAAATGAGTCAAAATTATGGGTTCACTCCTTTCCAAGTTGGCTTTTATCTTACAATGAAGCTGATGTTGGCTATTTCCTTGACAATAGTGGGGGTTC
ATGTCCCACTAATGTCAATCTTTTGAACTTTTTGCATTGA
Protein sequenceShow/hide protein sequence
MARRQEEELLVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALI
LLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRWIPITYEKLPDFCYGCGY
LGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCP
ETAATATKPEKENSESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIHIYDSNMEVDQDNDGTNRRGQLAQGESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRI
PRSQQQDDSKIIMETTSKTISGAKHSINHSQAQIKAMVQQQIESSINELIREEGTYQSSPYPNVEHNANPGSPLQTAEQRHWIPIPEGCWKLSCDASWSEDLQRGGLGWI
LRDWGGKPIMAGYRSICRRWKISWLETMAITEGLRATSFAISPATQSFAPQIRVESDCLQVVRLINGEDVDGTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAH
MACDENESKLWVHSFPSWLLSYNEADVGYFLDNSGGSCPTNVNLLNFLH