CuGenDBv2

Spg001896 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg001896
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: RNase H domain-containing protein
Genome location: scaffold10:4431840..4439533
RNA-Seq Expression: Spg001896
Synteny: Spg001896
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
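The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts as shown here. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record does not confirm this convention.
    """
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("scaffold10:4431840..4439533")
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 7694 positions on scaffold10.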


Homology
GenBank top hits | e value | % identity
XP_015381032.1 uncharacterized protein LOC107174551 [Citrus sinensis] | 9.4e-19 | 31.42
Query:  LNMPTGNKDSKDEIVWHLDKKGIFIVENWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNR---AEVHNHPAAVKIIHQSIEASLK---EWETSYLKT
        + +P   +  +D+++WH DKKG  IV  WL +    W +++  +E  A+   L+W +W+ RN+        +P  V    ++I  S K   + E +Y KT
Subjt:  LNMPTGNKDSKDEIVWHLDKKGIFIVENWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNR---AEVHNHPAAVKIIHQSIEASLK---EWETSYLKT

Query:  HSPIR--------PRNPTSQVLWEKPPPNSW-KLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPL
           IR          N   Q  W  PPPN W K+N DAA + +    GLG +VRDSDG+     +K +    ++   EA AM  G+ QVA   N   I  
Subjt:  HSPIR--------PRNPTSQVLWEKPPPNSW-KLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPL

Query:  EV-ESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSA
         + ESD++EV++++N KS  ++++   I +I++      +   +HC R  N AA+ +A+ A
Subjt:  EV-ESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSA

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia] | 1.6e-18 | 32.11
Query:  NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRA--------EVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPRNPTS----QVLWEKPPPN
        +W  KD W W+   LS EE+A S+++ W++W+ RNR+        E     + V  I+ +I+      +T   + +    PR   +     V W  PP N
Subjt:  NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRA--------EVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPRNPTS----QVLWEKPPPN

Query:  SWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDI
         WKLN DA+W+E+   GG+GW++ D  G ++  G  +I +K  I  LE   ++ G+ Q  N  +R   P+ +ESD++EV+ ++  K ED+
Subjt:  SWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDI

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia] | 2.1e-18 | 28.92
Query:  WHLDKKGIFIVE--------------NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRAEVHNHPAAVKIIHQSIEASL--KEWETSYLKTHS---
        W  DK  + + E              NW  K+YW W+ +    EE  +S+I+  ++W+ RN++      +  + I  +I+  +     + + LK  S   
Subjt:  WHLDKKGIFIVE--------------NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRAEVHNHPAAVKIIHQSIEASL--KEWETSYLKTHS---

Query:  -PIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVV
         PIR     ++  W+ P  NSWKLN DAAW       G+GW++RD  G +I  G + I  +  I  LE  A+ EG++ +     R   P+ +ESD++E +
Subjt:  -PIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVV

Query:  NVLN
        ++L+
Subjt:  NVLN

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] | 2.2e-20 | 30.63
Query:  MEENLSMEELAKSIILMWKLWDFRN----RAEVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGG
        M +  S E+L   +I  W +W+ RN    R E  +    ++ + + +  S  + ETS    H  +      +++ WE PP + W LN DA+W++   RGG
Subjt:  MEENLSMEELAKSIILMWKLWDFRN----RAEVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGG

Query:  LGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGI--PLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCN
        +GW++R  DG ++  G + +     +K LEA A+LEG++ + N    LG+  PL +E+D+ EV ++LN K ED++     ++EI +L      + F    
Subjt:  LGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGI--PLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCN

Query:  RLLNIAAYCVARSAASYSPSRI
        R  N  A+ +A+ A+    S I
Subjt:  RLLNIAAYCVARSAASYSPSRI

XP_024041966.1 uncharacterized protein LOC112099096 [Citrus clementina] | 3.2e-19 | 32.02
Query:  LNMPTGNKDSKDEIVWHLDKKGIFIVENWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNR---AEVHNHPAAVKIIHQSIEASLK---EWETSYLKT
        + +P   +  +D+++WH DKKG  IV  WL +    W +++  +E  A+   L+W +W+ RN+        +P  V    ++I  S K   + E +Y KT
Subjt:  LNMPTGNKDSKDEIVWHLDKKGIFIVENWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNR---AEVHNHPAAVKIIHQSIEASLK---EWETSYLKT

Query:  HSPIRPRNPTSQVLWEKPPPNSW-KLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEV-ESDAI
              RN   Q  W  PPPN W K+N DAA + +    GLG +VRDSDG+     +K +    ++   EA AM  G+ QVA   N   I   + ESD++
Subjt:  HSPIRPRNPTSQVLWEKPPPNSW-KLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEV-ESDAI

Query:  EVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSA
        EV++++N KS  ++++   I +I++      +   +HC R  N AA+ +A+ A
Subjt:  EVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSA

TrEMBL top hits | e value | % identity
A0A6J1CP26 uncharacterized protein LOC111013412 | 3.6e-16 | 28.44
Query:  EELAKSIILMWKLWDFRNRAEVHNHPAAVKIIHQSIE----------ASLKEWETSYLKTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGL
        EE  +S+I+ W++W+ RN++         + I  +I+           +LK   T+  K    IR     +   W+ P  NSWKLN +AAW      GG+
Subjt:  EELAKSIILMWKLWDFRNRAEVHNHPAAVKIIHQSIE----------ASLKEWETSYLKTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGL

Query:  GWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLL
        GW++RD  G +I    + I  +  I  LE  A+ EG++ +     R   P+ +ESD++E +++L+ + +D +++   ++EI  +      +  RH +R  
Subjt:  GWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLL

Query:  NIAAYCVARSA
        N  A+ +AR A
Subjt:  NIAAYCVARSA

A0A6J1CQG0 uncharacterized protein LOC111013216 | 7.7e-19 | 32.11
Query:  NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRA--------EVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPRNPTS----QVLWEKPPPN
        +W  KD W W+   LS EE+A S+++ W++W+ RNR+        E     + V  I+ +I+      +T   + +    PR   +     V W  PP N
Subjt:  NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRA--------EVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPRNPTS----QVLWEKPPPN

Query:  SWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDI
         WKLN DA+W+E+   GG+GW++ D  G ++  G  +I +K  I  LE   ++ G+ Q  N  +R   P+ +ESD++EV+ ++  K ED+
Subjt:  SWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDI

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 | 2.3e-18 | 30.9
Query:  NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRAEVHNHPAAVKIIHQSIEASL--KEWETSYLKTHS----PIRPRNPTSQVLWEKPPPNSWKLNF
        NW  K+YW W+ +    EE  +S+I+  ++W+ RN++      +  + I  +I+  +     + + LK  S    PIR     ++  W+ P  NSWKLN 
Subjt:  NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRAEVHNHPAAVKIIHQSIEASL--KEWETSYLKTHS----PIRPRNPTSQVLWEKPPPNSWKLNF

Query:  DAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLN
        DAAW       G+GW++RD  G +I  G + I  +  I  LE  A+ EG++ +     R   P+ +ESD++E +++L+
Subjt:  DAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLN

A0A6J1DNV9 uncharacterized protein LOC111022403 | 1.1e-20 | 30.63
Query:  MEENLSMEELAKSIILMWKLWDFRN----RAEVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGG
        M +  S E+L   +I  W +W+ RN    R E  +    ++ + + +  S  + ETS    H  +      +++ WE PP + W LN DA+W++   RGG
Subjt:  MEENLSMEELAKSIILMWKLWDFRN----RAEVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGG

Query:  LGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGI--PLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCN
        +GW++R  DG ++  G + +     +K LEA A+LEG++ + N    LG+  PL +E+D+ EV ++LN K ED++     ++EI +L      + F    
Subjt:  LGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGI--PLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCN

Query:  RLLNIAAYCVARSAASYSPSRI
        R  N  A+ +A+ A+    S I
Subjt:  RLLNIAAYCVARSAASYSPSRI

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X2 | 1.0e-18 | 28.92
Query:  WHLDKKGIFIVE--------------NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRAEVHNHPAAVKIIHQSIEASL--KEWETSYLKTHS---
        W  DK  + + E              NW  K+YW W+ +    EE  +S+I+  ++W+ RN++      +  + I  +I+  +     + + LK  S   
Subjt:  WHLDKKGIFIVE--------------NWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRAEVHNHPAAVKIIHQSIEASL--KEWETSYLKTHS---

Query:  -PIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVV
         PIR     ++  W+ P  NSWKLN DAAW       G+GW++RD  G +I  G + I  +  I  LE  A+ EG++ +     R   P+ +ESD++E +
Subjt:  -PIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVV

Query:  NVLN
        ++L+
Subjt:  NVLN

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G52990.1 thioredoxin family protein | 3.1e-04 | 27.82
Query:  KLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEV-ESDAIEVVNVLNGKSEDISDLQIFIDEIK
        K N+DA+ +E +   GLGWL+R+S G+++  GM +   +   +  E  A++  I+      +  G    + E D   V  ++N KS D   L+ ++D IK
Subjt:  KLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEV-ESDAIEVVNVLNGKSEDISDLQIFIDEIK

Query:  DLPARAHSMYFRHCNRLLNIAAYCVARSAASYS
               S  F   +R  N  A  + + A   S
Subjt:  DLPARAHSMYFRHCNRLLNIAAYCVARSAASYS

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 3.1e-12 | 28.64
Query:  LMWKLWDFRNRAEVHNHP-AAVKIIHQSIEASLKEWETSYL---KTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICF
        L+W+LW  RN          A +++ +++E   +EW T      K   P   RN + Q  W+ PP    K N DA W  +  R G+GW++R+  G ++  
Subjt:  LMWKLWDFRNRAEVHNHP-AAVKIIHQSIEASLKEWETSYL---KTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICF

Query:  GMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIP-LEVESDAIEVVNVLNGKSEDI-SDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSAA
        G + + +   +   E    LE ++    T +R     +  ESDA  +VN+LN  S+D    LQ  +++I+ L      + F    R  N  A  +AR + 
Subjt:  GMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIP-LEVESDAIEVVNVLNGKSEDI-SDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSAA

Query:  SYSPSRIDPRR----PSWVR
        S+  S  DP+     P W+R
Subjt:  SYSPSRIDPRR----PSWVR

AT3G09510.1 Ribonuclease H-like superfamily protein | 5.0e-10 | 25.5
Query:  LMWKLWDFRNRAEVHNHPAAVKIIHQSIEASLKEW---ETSYLKTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFG
        L+W++W  RN    +    +      S +A   +W     S+ KT SP R +   +++ W  PP    K NFDA ++ ++     GW++R+  G+ I +G
Subjt:  LMWKLWDFRNRAEVHNHPAAVKIIHQSIEASLKEW---ETSYLKTHSPIRPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFG

Query:  MKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSAASYS
          +++        E  A+L  ++Q   T  R    + +E D   ++N++NG S   S L   +++I     +  S+ F    R  N  A+ +A+   +YS
Subjt:  MKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSAASYS

AT4G29090.1 Ribonuclease H-like superfamily protein | 5.3e-12 | 27.19
Query:  LMWKLWDFRNRAEVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPR-NPTSQVLWEKPPPNSW-KLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGM
        L+W+LW  RN           + + +  E  L+EW           +P+ N +S   W +PPP+ W K N DA WN    R G+GW++R+  G +   G 
Subjt:  LMWKLWDFRNRAEVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPIRPR-NPTSQVLWEKPPPNSW-KLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGM

Query:  KQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEV-ESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSAASYS
        + + K  ++   E    LE ++    + +R      + ESD+  ++ +LN   E    L+  I +++ L ++   + F    R  N  A  VAR + S+ 
Subjt:  KQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEV-ESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSAASYS

Query:  PSRIDPRR----PSWVR
            DP+     PSW R
Subjt:  PSRIDPRR----PSWVR

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 8.5e-10 | 26.77
Query:  LMWKLWDFRNRAEVHNHPAAVKIIHQSIEASL---KEWETSYLKTHSPIRPRN--PTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLIC
        LMW++W   N   V NH         ++E +L   KEW  + +        RN  P+    W  P  +  K N+DA+ +E+    GLGW++R+S G++I 
Subjt:  LMWKLWDFRNRAEVHNHPAAVKIIHQSIEASL---KEWETSYLKTHSPIRPRN--PTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLIC

Query:  FGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSA
         GM +   +   +  E   ++  I+      ++  I    E D   +  ++N KS +   LQ F+D I+       S+ F   +R  N  A  +A+ A
Subjt:  FGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDISDLQIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSA
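The Arabidopsis hits listed above can be ranked by e value with a few lines of code. The sketch below copies the five identifiers, e values, and identity percentages from this record. The cutoff of 1e-10 is an arbitrary illustrative threshold, not one used by the database, and the variable names are hypothetical.

```python
# (identifier, e value, % identity) copied from the Arabidopsis hits above
arabidopsis_hits = [
    ("AT1G52990.1", 3.1e-04, 27.82),
    ("AT2G34320.1", 3.1e-12, 28.64),
    ("AT3G09510.1", 5.0e-10, 25.5),
    ("AT4G29090.1", 5.3e-12, 27.19),
    ("AT5G65005.1", 8.5e-10, 26.77),
]

# Assumption: 1e-10 is only an example cutoff chosen for illustration.
E_VALUE_CUTOFF = 1e-10

# A lower e value means a less likely chance match, so sort ascending.
ranked = sorted(arabidopsis_hits, key=lambda hit: hit[1])
best_hit = ranked[0][0]
below_cutoff = [hit[0] for hit in ranked if hit[1] <= E_VALUE_CUTOFF]

print("best hit:", best_hit)
print("hits at or below cutoff:", below_cutoff)
```

With this cutoff, only the two hits with e values near 1e-12 remain; the thioredoxin family hit (3.1e-04) ranks last.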


Sequences
CDS sequence
ATGGAAGAATTCTCCCTTCAGGACACTTGTGACAGCCTCAATATGCCGACAGGAAACAAAGACTCCAAGGATGAAATAGTCTGGCACCTCGACAAGAAAGGCATATTTAT
CGTGGAGAATTGGTTGCCTAAAGATTATTGGGGATGGATGGAAGAAAATCTAAGCATGGAGGAATTGGCTAAAAGCATCATTCTCATGTGGAAGTTATGGGACTTCAGAA
ACAGAGCAGAGGTTCACAATCATCCAGCAGCAGTTAAGATCATCCATCAAAGCATTGAAGCAAGTCTAAAAGAATGGGAAACTTCTTACCTCAAGACCCATTCTCCGATT
AGGCCGAGGAACCCTACGAGTCAAGTGCTTTGGGAGAAACCGCCGCCAAACTCCTGGAAATTAAATTTTGACGCTGCCTGGAATGAGAAAGAAGGTCGAGGAGGTCTTGG
CTGGCTCGTGCGTGACTCAGATGGATCCTTGATCTGTTTCGGCATGAAACAAATTAGCAAAAAATGGGCTATAAAGAATCTGGAAGCGTGTGCCATGCTAGAAGGGATCA
AGCAAGTGGCTAATACCTGTAATCGGCTTGGCATCCCTCTGGAAGTGGAATCGGACGCGATTGAGGTCGTCAACGTCCTCAACGGGAAGTCGGAAGACATCTCTGATCTG
CAGATATTCATCGATGAAATCAAAGATCTTCCTGCCCGTGCTCATTCCATGTATTTCCGTCATTGTAATCGTCTTTTGAACATAGCCGCCTACTGTGTTGCGAGAAGCGC
TGCCAGCTACAGCCCTTCACGTATTGATCCAAGGCGTCCATCATGGGTGAGAGTGGCCAATACGCCGACTCAATATGCCTTCCTTTTTGGGGACAAGACCGAGTGGGAGG
CTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACTTCTAGGAGAAGTAGATGAGTGTCCATTATGTCCCCCTGCTAGCTCAGTTATAGCGGGGAAGATCAAA
TGGTGGTGTTCTTCGTTGTTGCAGAGCAATCAAGAATCATGTCCGTTGAAGTATTCAGAGTCGTCAACGGAGCATCGAAGGCAGTGTGGCAAACACCACTCTGGTGTGCA
GTTTTTGCTGGTTTTGCAAGTCCCGTCTTCCCCGCTTTCTATAAATTCATTGTTGGTGTCATGTGAAGGTCAGAGTCTTGAAGGTTTGTTCACCTTTGAAGGATTCTCCT
GGATGATTGAAGTAGACTTCTGGCTTCAGAGGCTTCAGTCTTCAGAGTCTTGCAGGAAGCTTCAAGCTTTTGAATTTTGCAGAAGCTTCAATCTTCGGAGTCTTGCAGCA
GGATCTGGACTTCAATCTTCAAGATTTTTGCAGGAGGAACGAGGCTTCAATCTTCAAGTCTTGTAG
mRNA sequence
ATGGAAGAATTCTCCCTTCAGGACACTTGTGACAGCCTCAATATGCCGACAGGAAACAAAGACTCCAAGGATGAAATAGTCTGGCACCTCGACAAGAAAGGCATATTTAT
CGTGGAGAATTGGTTGCCTAAAGATTATTGGGGATGGATGGAAGAAAATCTAAGCATGGAGGAATTGGCTAAAAGCATCATTCTCATGTGGAAGTTATGGGACTTCAGAA
ACAGAGCAGAGGTTCACAATCATCCAGCAGCAGTTAAGATCATCCATCAAAGCATTGAAGCAAGTCTAAAAGAATGGGAAACTTCTTACCTCAAGACCCATTCTCCGATT
AGGCCGAGGAACCCTACGAGTCAAGTGCTTTGGGAGAAACCGCCGCCAAACTCCTGGAAATTAAATTTTGACGCTGCCTGGAATGAGAAAGAAGGTCGAGGAGGTCTTGG
CTGGCTCGTGCGTGACTCAGATGGATCCTTGATCTGTTTCGGCATGAAACAAATTAGCAAAAAATGGGCTATAAAGAATCTGGAAGCGTGTGCCATGCTAGAAGGGATCA
AGCAAGTGGCTAATACCTGTAATCGGCTTGGCATCCCTCTGGAAGTGGAATCGGACGCGATTGAGGTCGTCAACGTCCTCAACGGGAAGTCGGAAGACATCTCTGATCTG
CAGATATTCATCGATGAAATCAAAGATCTTCCTGCCCGTGCTCATTCCATGTATTTCCGTCATTGTAATCGTCTTTTGAACATAGCCGCCTACTGTGTTGCGAGAAGCGC
TGCCAGCTACAGCCCTTCACGTATTGATCCAAGGCGTCCATCATGGGTGAGAGTGGCCAATACGCCGACTCAATATGCCTTCCTTTTTGGGGACAAGACCGAGTGGGAGG
CTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACTTCTAGGAGAAGTAGATGAGTGTCCATTATGTCCCCCTGCTAGCTCAGTTATAGCGGGGAAGATCAAA
TGGTGGTGTTCTTCGTTGTTGCAGAGCAATCAAGAATCATGTCCGTTGAAGTATTCAGAGTCGTCAACGGAGCATCGAAGGCAGTGTGGCAAACACCACTCTGGTGTGCA
GTTTTTGCTGGTTTTGCAAGTCCCGTCTTCCCCGCTTTCTATAAATTCATTGTTGGTGTCATGTGAAGGTCAGAGTCTTGAAGGTTTGTTCACCTTTGAAGGATTCTCCT
GGATGATTGAAGTAGACTTCTGGCTTCAGAGGCTTCAGTCTTCAGAGTCTTGCAGGAAGCTTCAAGCTTTTGAATTTTGCAGAAGCTTCAATCTTCGGAGTCTTGCAGCA
GGATCTGGACTTCAATCTTCAAGATTTTTGCAGGAGGAACGAGGCTTCAATCTTCAAGTCTTGTAG
Protein sequence
MEEFSLQDTCDSLNMPTGNKDSKDEIVWHLDKKGIFIVENWLPKDYWGWMEENLSMEELAKSIILMWKLWDFRNRAEVHNHPAAVKIIHQSIEASLKEWETSYLKTHSPI
RPRNPTSQVLWEKPPPNSWKLNFDAAWNEKEGRGGLGWLVRDSDGSLICFGMKQISKKWAIKNLEACAMLEGIKQVANTCNRLGIPLEVESDAIEVVNVLNGKSEDISDL
QIFIDEIKDLPARAHSMYFRHCNRLLNIAAYCVARSAASYSPSRIDPRRPSWVRVANTPTQYAFLFGDKTEWEAGDMTTQEGIHSFPLLGEVDECPLCPPASSVIAGKIK
WWCSSLLQSNQESCPLKYSESSTEHRRQCGKHHSGVQFLLVLQVPSSPLSINSLLVSCEGQSLEGLFTFEGFSWMIEVDFWLQRLQSSESCRKLQAFEFCRSFNLRSLAA
GSGLQSSRFLQEERGFNLQVL
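The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a rough sketch, the code below translates the first 36 bases of the CDS using the standard genetic code and compares the result with the start of the listed protein. The function name `translate` is hypothetical, and the use of the standard nuclear code is an assumption, since the record does not name a translation table.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS and first 12 residues of the protein from this record.
cds_prefix = "ATGGAAGAATTCTCCCTTCAGGACACTTGTGACAGC"
protein_prefix = "MEEFSLQDTCDS"

print(translate(cds_prefix))
print(translate(cds_prefix) == protein_prefix)
```

The same function could be applied to the full CDS after joining its lines into one string, to check the complete protein sequence.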