; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012997 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012997
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationscaffold1:15545918..15548612
RNA-Seq ExpressionSpg012997
SyntenySpg012997
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]3.0e-1831.94Show/hide
Query:  SLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIA----NAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHT
        S++I WQIWE RN  I     P    ++  I RYI   I S  ++ + +   +T+ D   I     N  A + +P   NSW        +L+ +A+W   
Subjt:  SLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIA----NAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHT

Query:  KERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEGLKSL-PAMSSPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIH
           GGIGW+LRD  G  ++A  + +     I+++E  AICEGL+++      P+ +E+D+L  + LL +  +D++E+   + E   +M   +I S+ HI 
Subjt:  KERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEGLKSL-PAMSSPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIH

Query:  RIYNGVAHSLAQTACD
        R  N VAH LA+ A +
Subjt:  RIYNGVAHSLAQTACD

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]4.9e-2128.81Show/hide
Query:  RRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIE
        R++ ET  H+ WECK +++IW    P+       +R  W+T ++  W W+ ++     E +   S++I  QIWE RN  I          ++  I RYI 
Subjt:  RRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIE

Query:  EFIKSEEKDQDQISIHTTDSDCPPIANAAASYR---RPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAH
          I S  +D    ++     D  PI     + R   +P   NSW        +L+ DA+W       GIGW+LRD  G  ++ G + +     I+++E  
Subjt:  EFIKSEEKDQDQISIHTTDSDCPPIANAAASYR---RPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAH

Query:  AICEGLKSL-PAMSSPLRIETDALRVVKLLTKTEEDESELSNF
        AICEGL+++      P+ +E+D+L  + LL +  + + +L  F
Subjt:  AICEGLKSL-PAMSSPLRIETDALRVVKLLTKTEEDESELSNF

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.2e-1930.57Show/hide
Query:  VSLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKER
        V L+  W IW  RN+VI   +  +   +  Q+ +++ E   S + +     +H T ++                   W   P+ ++ L+ DASWS +  R
Subjt:  VSLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKER

Query:  GGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEGLKSLPAMS--SPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRI
        GGIGW++R   G  + AG + V     +  +EA AI EGL++L  +    PL IETD+  V  LL +  ED ++    + E   L  S +I +   + R 
Subjt:  GGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEGLKSLPAMS--SPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRI

Query:  YNGVAHSLAQTACDLNDNKCWAQDIPFWV
         NG AHSLAQ A  L ++  W    P W+
Subjt:  YNGVAHSLAQTACDLNDNKCWAQDIPFWV

XP_030483269.1 uncharacterized protein LOC115699866 [Cannabis sativa]5.1e-1825Show/hide
Query:  LIDTIPSNFCLQTFENYLCRTLRRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVIC--WQIWEQR
        +ID+  S+ C   +E+              H  + C + R++W     + D +++NN       D+L+         TL   K    L++C  W IW +R
Subjt:  LIDTIPSNFCLQTFENYLCRTLRRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVIC--WQIWEQR

Query:  NHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGI
        N VIH         +     +++++F +S  K+Q Q ++  T +   P+ +  A+ ++     SW    L   +L+ DA+ +  ++  G+G ++R+  G 
Subjt:  NHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGI

Query:  PLQAGYKHVNRIWKISWMEAHAICEGLKSLPAMSSPL-RIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACD
         + A  K V   ++   +EA A+   L     +   +  +ETDALRV   +    ++ S   + IM+   L+  F   +V H+ R  N  AH LA+ A +
Subjt:  PLQAGYKHVNRIWKISWMEAHAICEGLKSLPAMSSPL-RIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACD

Query:  LNDNKCWAQDIP
        L+++ CW  +IP
Subjt:  LNDNKCWAQDIP

XP_030496634.1 uncharacterized protein LOC115712492 [Cannabis sativa]2.3e-1823.73Show/hide
Query:  QTFENYLCRTLRRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVIC--WQIWEQRNHVIHSRQQPN
        +  ++  C       E+  H  + C   R++W       D  +++N       D+L+     +        K  + L+IC  W IW +RN VIH     +
Subjt:  QTFENYLCRTLRRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVIC--WQIWEQRNHVIHSRQQPN

Query:  MVKLRFQIQRYIEEFIKSEEKDQDQI---SIHTTDS-DCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYK
           +      ++ ++ ++  K+Q Q    + HT+ S       +A+ S +R     SW    +   +++ DA+ +  K+  G+G ++RD +G  + A  K
Subjt:  MVKLRFQIQRYIEEFIKSEEKDQDQI---SIHTTDS-DCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYK

Query:  HVNRIWKISWMEAHAICEGLK-SLPAMSSPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACDLNDNKCW
         V   ++   MEA A+   L  ++        +ETDALRV   L     + S  ++ IM+   L++ F   ++ H+ R  N  AH LA+ A +L+++  W
Subjt:  HVNRIWKISWMEAHAICEGLK-SLPAMSSPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACDLNDNKCW

Query:  AQDIPFWVLDLCSKDV
          +IP  +  +   D+
Subjt:  AQDIPFWVLDLCSKDV

TrEMBL top hitse value%identityAlignment
A0A6J1DL64 uncharacterized protein LOC111022134 isoform X12.4e-2128.81Show/hide
Query:  RRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIE
        R++ ET  H+ WECK +++IW    P+       +R  W+T ++  W W+ ++     E +   S++I  QIWE RN  I          ++  I RYI 
Subjt:  RRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIE

Query:  EFIKSEEKDQDQISIHTTDSDCPPIANAAASYR---RPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAH
          I S  +D    ++     D  PI     + R   +P   NSW        +L+ DA+W       GIGW+LRD  G  ++ G + +     I+++E  
Subjt:  EFIKSEEKDQDQISIHTTDSDCPPIANAAASYR---RPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAH

Query:  AICEGLKSL-PAMSSPLRIETDALRVVKLLTKTEEDESELSNF
        AICEGL+++      P+ +E+D+L  + LL +  + + +L  F
Subjt:  AICEGLKSL-PAMSSPLRIETDALRVVKLLTKTEEDESELSNF

A0A6J1DNV9 uncharacterized protein LOC1110224035.9e-2030.57Show/hide
Query:  VSLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKER
        V L+  W IW  RN+VI   +  +   +  Q+ +++ E   S + +     +H T ++                   W   P+ ++ L+ DASWS +  R
Subjt:  VSLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKER

Query:  GGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEGLKSLPAMS--SPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRI
        GGIGW++R   G  + AG + V     +  +EA AI EGL++L  +    PL IETD+  V  LL +  ED ++    + E   L  S +I +   + R 
Subjt:  GGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEGLKSLPAMS--SPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRI

Query:  YNGVAHSLAQTACDLNDNKCWAQDIPFWV
         NG AHSLAQ A  L ++  W    P W+
Subjt:  YNGVAHSLAQTACDLNDNKCWAQDIPFWV

A0A803PVM0 Uncharacterized protein7.7e-2029.58Show/hide
Query:  VIC--WQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERG
        ++C  W IW +RN V+H     N   +     +Y++++++S+ ++Q Q  + T  +   P+    AS + PK P  W       F+L+ DA+ +H  +  
Subjt:  VIC--WQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERG

Query:  GIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEGLK-SLPAMSSPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYN
        G+G VLRD  G  + A  K V   ++   MEA A+   L  +L        IETDALRV   +    ++ S  ++ IM+   L++ F   +V H+ R  N
Subjt:  GIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEGLK-SLPAMSSPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYN

Query:  GVAHSLAQTACDLNDNKCWAQDIPFWVLDLCSKDVSNFSH
          AH LA+ A +L+++ CW  +IP  +  +    V+  +H
Subjt:  GVAHSLAQTACDLNDNKCWAQDIPFWVLDLCSKDVSNFSH

A0A803Q8E0 Uncharacterized protein3.7e-2223.73Show/hide
Query:  QTFENYLCRTLRRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVIC--WQIWEQRNHVIHSRQQPN
        +  ++  C       E+  H  + C   R++W     + D  +++N       D+L++    +        K  + L+IC  W IW +RN VIH     +
Subjt:  QTFENYLCRTLRRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVIC--WQIWEQRNHVIHSRQQPN

Query:  MVKLRFQIQRYIEEFIKSEEKDQDQI---SIHTTDSDCPPIANAAAS-YRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYK
           +      Y+ ++ ++  K+Q Q    ++HT+ +    + N+A +  +RP   +SW    +   +++ DA+ +  ++  G+G ++R  +G  + A  K
Subjt:  MVKLRFQIQRYIEEFIKSEEKDQDQI---SIHTTDSDCPPIANAAAS-YRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYK

Query:  HVNRIWKISWMEAHAICEGLK-SLPAMSSPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACDLNDNKCW
         V   ++   MEA A+   L  ++        +ETDALRV   L     + S  ++ IM+   L++ F   ++ H+ R  N  AH LA+ A +L+++ CW
Subjt:  HVNRIWKISWMEAHAICEGLK-SLPAMSSPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACDLNDNKCW

Query:  AQDIPFWVLDLCSKDV
          +IP  +  +   D+
Subjt:  AQDIPFWVLDLCSKDV

A0A803QHJ8 Uncharacterized protein1.3e-1929.8Show/hide
Query:  CRTLRRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVIC--WQIWEQRNHVIHSR-QQPNMVKLRF
        C       E+  H  + C   R++W       D +N++N       D+L+         TL   K    L++C  W IW  RN V H    +P+     F
Subjt:  CRTLRRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLVIC--WQIWEQRNHVIHSR-QQPNMVKLRF

Query:  QIQRYIEEFIKSEEKD-QDQISIHT-----TDSDCPPIANAAASYRRPKIPNSW-PSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVN
         +   +E FI S+ K  Q   SI T     T S  P IA      +     +SW P  P G+ +L+ DA+ +H  +  GIG V+RD +G  + A  K V 
Subjt:  QIQRYIEEFIKSEEKD-QDQISIHT-----TDSDCPPIANAAASYRRPKIPNSW-PSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVN

Query:  RIWKISWMEAHAICEGLKSLPAMSSPL-RIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACDLNDNKCWAQD
          ++   MEA A+   L  +  M   L  I+TDALRV   L  +  + S  ++ I++   L++ F   ++ H+ R  N  AH LA+ A +L+++ CW  +
Subjt:  RIWKISWMEAHAICEGLKSLPAMSSPL-RIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACDLNDNKCWAQD

Query:  IP
        IP
Subjt:  IP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.3e-0720.09Show/hide
Query:  ETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLV--ICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEF
        ET  HL ++C   R +WA + P+           W+ + +    W+ N    + ++    +LV  + W++W+ RN ++   ++ +  ++  +     EE+
Subjt:  ETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLV--ICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEF

Query:  IKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEG
            E +                  A+       +   W + P    + + DA+W     R GIGW+LR+  G  L  G + + R   +   E  A+   
Subjt:  IKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEG

Query:  LKSLPAMS-SPLRIETDALRVVKLLTKTE
        + ++   +   +  E+DA  +V LL   +
Subjt:  LKSLPAMS-SPLRIETDALRVVKLLTKTE

AT4G29090.1 Ribonuclease H-like superfamily protein5.8e-1222.95Show/hide
Query:  ETGTHLFWECKEVREIWA-TVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLV--ICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEE
        ET  HL ++C   R  WA +  P+           W+ + ++   W+ N G    + + A  LV  + W++W+ RN ++   ++ N            +E
Subjt:  ETGTHLFWECKEVREIWA-TVCPLTDRNNSNNRVGWSTADFLLWSWMKNEGRTLDEVKMAVSLV--ICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEE

Query:  FIKSEEKDQDQISIHTTDSDC--PPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAI
         ++  E D ++  I T    C   P  N ++  R    P+ W        + + DA+W+   ER GIGWVLR+  G     G + + ++  +   E  A+
Subjt:  FIKSEEKDQDQISIHTTDSDC--PPIANAAASYRRPKIPNSWPSVPLGVFRLSCDASWSHTKERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAI

Query:  CEGLKSLPAMS-SPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACD-LNDNKCWAQDIPFW
           + SL     + +  E+D+  ++++L   +E    L   I +   L++ F     + I R  N +A  +A+ +   LN +      +P W
Subjt:  CEGLKSLPAMS-SPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGVAHSLAQTACD-LNDNKCWAQDIPFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTGGAAGCTAAGAATTAGACAGTTGTGGCTGTTGGGTTGCTGTCTTTCCCCTGTATTTTCTATTATTTTAGAAGCATATGAATTAGCTATACGTGGAAGTGTTTA
TTCCACAATTCTTCCCTGTTTGATTGACACTATTCCTTCCAACTTTTGTCTCCAAACTTTTGAAAATTACCTTTGTCGTACTCTCAGGAGACAACCTGAAACGGGTACTC
ATTTGTTCTGGGAGTGTAAAGAGGTAAGGGAAATTTGGGCTACTGTTTGTCCTCTTACTGATAGAAATAACTCGAATAACAGGGTGGGATGGTCAACGGCTGATTTCCTC
CTATGGTCGTGGATGAAGAATGAGGGAAGGACGCTGGACGAGGTCAAAATGGCGGTTAGCTTGGTGATTTGTTGGCAAATTTGGGAGCAAAGGAATCATGTAATACACAG
CAGACAACAGCCAAACATGGTGAAATTAAGGTTTCAGATCCAGAGATATATTGAGGAATTCATAAAGTCTGAGGAAAAAGACCAAGATCAGATTTCTATCCATACAACGG
ATAGTGATTGCCCTCCCATCGCAAACGCTGCTGCTTCTTACAGACGCCCTAAAATTCCAAATTCGTGGCCGTCGGTTCCGTTGGGTGTTTTCCGATTGAGTTGCGATGCC
TCCTGGAGTCATACAAAAGAAAGGGGTGGAATTGGTTGGGTGTTGAGAGATTGTTATGGAATACCTTTACAAGCAGGGTACAAGCACGTAAATCGCATCTGGAAAATTAG
TTGGATGGAAGCTCATGCGATTTGTGAGGGCCTGAAATCCTTGCCGGCGATGTCTTCTCCACTTCGGATTGAGACGGACGCTTTGCGTGTGGTTAAGCTGCTGACTAAGA
CAGAAGAAGATGAATCAGAGTTAAGTAACTTCATAATGGAAGCCCATGCCTTAATGACCTCTTTTCAAATAGATTCTGTGATCCACATTCACAGAATTTACAATGGGGTA
GCACATTCTTTGGCCCAAACAGCTTGCGATCTTAATGATAATAAGTGTTGGGCTCAAGATATTCCTTTTTGGGTTCTAGATTTATGTTCAAAGGATGTAAGCAATTTTTC
TCACACTTGTGGGGGATCTTGTCCCACAGGTGTCTCTTTTTTGGGAGCTATTTCTAGCTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATTGGAAGCTAAGAATTAGACAGTTGTGGCTGTTGGGTTGCTGTCTTTCCCCTGTATTTTCTATTATTTTAGAAGCATATGAATTAGCTATACGTGGAAGTGTTTA
TTCCACAATTCTTCCCTGTTTGATTGACACTATTCCTTCCAACTTTTGTCTCCAAACTTTTGAAAATTACCTTTGTCGTACTCTCAGGAGACAACCTGAAACGGGTACTC
ATTTGTTCTGGGAGTGTAAAGAGGTAAGGGAAATTTGGGCTACTGTTTGTCCTCTTACTGATAGAAATAACTCGAATAACAGGGTGGGATGGTCAACGGCTGATTTCCTC
CTATGGTCGTGGATGAAGAATGAGGGAAGGACGCTGGACGAGGTCAAAATGGCGGTTAGCTTGGTGATTTGTTGGCAAATTTGGGAGCAAAGGAATCATGTAATACACAG
CAGACAACAGCCAAACATGGTGAAATTAAGGTTTCAGATCCAGAGATATATTGAGGAATTCATAAAGTCTGAGGAAAAAGACCAAGATCAGATTTCTATCCATACAACGG
ATAGTGATTGCCCTCCCATCGCAAACGCTGCTGCTTCTTACAGACGCCCTAAAATTCCAAATTCGTGGCCGTCGGTTCCGTTGGGTGTTTTCCGATTGAGTTGCGATGCC
TCCTGGAGTCATACAAAAGAAAGGGGTGGAATTGGTTGGGTGTTGAGAGATTGTTATGGAATACCTTTACAAGCAGGGTACAAGCACGTAAATCGCATCTGGAAAATTAG
TTGGATGGAAGCTCATGCGATTTGTGAGGGCCTGAAATCCTTGCCGGCGATGTCTTCTCCACTTCGGATTGAGACGGACGCTTTGCGTGTGGTTAAGCTGCTGACTAAGA
CAGAAGAAGATGAATCAGAGTTAAGTAACTTCATAATGGAAGCCCATGCCTTAATGACCTCTTTTCAAATAGATTCTGTGATCCACATTCACAGAATTTACAATGGGGTA
GCACATTCTTTGGCCCAAACAGCTTGCGATCTTAATGATAATAAGTGTTGGGCTCAAGATATTCCTTTTTGGGTTCTAGATTTATGTTCAAAGGATGTAAGCAATTTTTC
TCACACTTGTGGGGGATCTTGTCCCACAGGTGTCTCTTTTTTGGGAGCTATTTCTAGCTCTTAA
Protein sequenceShow/hide protein sequence
MHWKLRIRQLWLLGCCLSPVFSIILEAYELAIRGSVYSTILPCLIDTIPSNFCLQTFENYLCRTLRRQPETGTHLFWECKEVREIWATVCPLTDRNNSNNRVGWSTADFL
LWSWMKNEGRTLDEVKMAVSLVICWQIWEQRNHVIHSRQQPNMVKLRFQIQRYIEEFIKSEEKDQDQISIHTTDSDCPPIANAAASYRRPKIPNSWPSVPLGVFRLSCDA
SWSHTKERGGIGWVLRDCYGIPLQAGYKHVNRIWKISWMEAHAICEGLKSLPAMSSPLRIETDALRVVKLLTKTEEDESELSNFIMEAHALMTSFQIDSVIHIHRIYNGV
AHSLAQTACDLNDNKCWAQDIPFWVLDLCSKDVSNFSHTCGGSCPTGVSFLGAISSS