; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027555 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027555
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPolynucleotidyl transferase, ribonuclease H-like superfamily protein
Genome locationscaffold11:3421580..3434525
RNA-Seq ExpressionSpg027555
SyntenySpg027555
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG50387.1 hypothetical protein EZV62_022911 [Acer yangbiense]4.1e-1727.24Show/hide
Query:  ETAIHLFWDCKVTRGVWHHYFPHSNL--MLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY
        E+  H+ W C     VW        +  ++V+D      P +    W+      +  +   L +I +W++WT+RNS++H         ++  I+ + NE+
Subjt:  ETAIHLFWDCKVTRGVWHHYFPHSNL--MLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY

Query:  NKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLK-SIPV
            E   N    LS +             SW     G +KINCDAS+  +  + GVG I+R + GS + A    +     +  LEA A +EG+  +I +
Subjt:  NKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLK-SIPV

Query:  PSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQMATESNSSKCWANHFP
            VI+E+D+   ++LLS  +   TEL   I  + +L ++ N+ S   V R  N +AH +AQ A   +S   W    P
Subjt:  PSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQMATESNSSKCWANHFP

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]1.3e-1833.18Show/hide
Query:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKY------RNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDA
        DK  E   + S+II WQIW  RN  +     P+   I  +I++Y      RN   KG+    N    L     D   A       W P     WK+N +A
Subjt:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKY------RNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDA

Query:  SWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIPVPSPMVI-VETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISS
        +W +  N GG+GWILR   G  + A  R I  +  I +LE +A+ EGL++I       I +E+DSL  + LL     D TE+   +EE   ++ +  I S
Subjt:  SWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIPVPSPMVI-VETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISS

Query:  ISHVPRIHNFMAHKLAQMATESN
        + H+ R  N +AH LA+ A E++
Subjt:  ISHVPRIHNFMAHKLAQMATESN

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]7.6e-1932.59Show/hide
Query:  FKSKPETAIHLFWDCKVTRGVWHHYFPHSNLMLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYR
        F+ K ET  H+ W+CKV + +W +  P        DR  WTT +  E        DK  E   + S+II  QIW  RN  +      +   I  +I++Y 
Subjt:  FKSKPETAIHLFWDCKVTRGVWHHYFPHSNLMLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYR

Query:  NEYNKGEEDYQNDGRRLSEESPDGIPAN--PPNSRS-WLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGL
           N   +D       L  +S D  P      N+R+ W P     WK+N DA+W +  N  G+GWILR   G  +  G R I  +  I +LE +A+ EGL
Subjt:  NEYNKGEEDYQNDGRRLSEESPDGIPAN--PPNSRS-WLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGL

Query:  KSIPVPSPMVI-VETDSLVTVRLL
        ++I       I +E+DSL  + LL
Subjt:  KSIPVPSPMVI-VETDSLVTVRLL

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]5.1e-2334.63Show/hide
Query:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKM
        DK  + +L + LI  W IW HRN ++   +    + +IQ + K+       E  YQ      SE S   +     N   W P     W +N DASW+   
Subjt:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKM

Query:  NRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIP---VPSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHV
        +RGG+GWI+R WDG  V AG R +     +  LEA A++EGL+++    V  P+ I ETDS     LL+    DLT+    +EE  +L  +  I + + V
Subjt:  NRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIP---VPSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHV

Query:  PRIHNFMAHKLAQMATESNSSKCWANHFPTW
         R  N  AH LAQ A+    S  W + FP W
Subjt:  PRIHNFMAHKLAQMATESNSSKCWANHFPTW

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]7.1e-1730.67Show/hide
Query:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKM
        DK  E   + S+II WQIW  RN  +      +   I   I++Y    N    D    G+  +++              W P     WK+N DA+W +  
Subjt:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKM

Query:  NRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSI------PVPSP---MVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNI
        N GG+GWILR   G  + A  R I  +  I +LE +A+ EGL++I      P+       + +E+DSL  + LL     D TE+   +EE   ++ +  I
Subjt:  NRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSI------PVPSP---MVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNI

Query:  SSISHVPRIHNFMAHKLAQMATESN
         S+ H+ R  N +AH LA+ A E++
Subjt:  SSISHVPRIHNFMAHKLAQMATESN

TrEMBL top hitse value%identityAlignment
A0A5C7H0P0 Uncharacterized protein2.0e-1727.24Show/hide
Query:  ETAIHLFWDCKVTRGVWHHYFPHSNL--MLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY
        E+  H+ W C     VW        +  ++V+D      P +    W+      +  +   L +I +W++WT+RNS++H         ++  I+ + NE+
Subjt:  ETAIHLFWDCKVTRGVWHHYFPHSNL--MLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY

Query:  NKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLK-SIPV
            E   N    LS +             SW     G +KINCDAS+  +  + GVG I+R + GS + A    +     +  LEA A +EG+  +I +
Subjt:  NKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLK-SIPV

Query:  PSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQMATESNSSKCWANHFP
            VI+E+D+   ++LLS  +   TEL   I  + +L ++ N+ S   V R  N +AH +AQ A   +S   W    P
Subjt:  PSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQMATESNSSKCWANHFP

A0A6J1CP26 uncharacterized protein LOC1110134126.2e-1933.18Show/hide
Query:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKY------RNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDA
        DK  E   + S+II WQIW  RN  +     P+   I  +I++Y      RN   KG+    N    L     D   A       W P     WK+N +A
Subjt:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKY------RNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDA

Query:  SWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIPVPSPMVI-VETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISS
        +W +  N GG+GWILR   G  + A  R I  +  I +LE +A+ EGL++I       I +E+DSL  + LL     D TE+   +EE   ++ +  I S
Subjt:  SWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIPVPSPMVI-VETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISS

Query:  ISHVPRIHNFMAHKLAQMATESN
        + H+ R  N +AH LA+ A E++
Subjt:  ISHVPRIHNFMAHKLAQMATESN

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X13.7e-1932.59Show/hide
Query:  FKSKPETAIHLFWDCKVTRGVWHHYFPHSNLMLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYR
        F+ K ET  H+ W+CKV + +W +  P        DR  WTT +  E        DK  E   + S+II  QIW  RN  +      +   I  +I++Y 
Subjt:  FKSKPETAIHLFWDCKVTRGVWHHYFPHSNLMLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYR

Query:  NEYNKGEEDYQNDGRRLSEESPDGIPAN--PPNSRS-WLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGL
           N   +D       L  +S D  P      N+R+ W P     WK+N DA+W +  N  G+GWILR   G  +  G R I  +  I +LE +A+ EGL
Subjt:  NEYNKGEEDYQNDGRRLSEESPDGIPAN--PPNSRS-WLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGL

Query:  KSIPVPSPMVI-VETDSLVTVRLL
        ++I       I +E+DSL  + LL
Subjt:  KSIPVPSPMVI-VETDSLVTVRLL

A0A6J1DNV9 uncharacterized protein LOC1110224032.5e-2334.63Show/hide
Query:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKM
        DK  + +L + LI  W IW HRN ++   +    + +IQ + K+       E  YQ      SE S   +     N   W P     W +N DASW+   
Subjt:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKM

Query:  NRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIP---VPSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHV
        +RGG+GWI+R WDG  V AG R +     +  LEA A++EGL+++    V  P+ I ETDS     LL+    DLT+    +EE  +L  +  I + + V
Subjt:  NRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIP---VPSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHV

Query:  PRIHNFMAHKLAQMATESNSSKCWANHFPTW
         R  N  AH LAQ A+    S  W + FP W
Subjt:  PRIHNFMAHKLAQMATESNSSKCWANHFPTW

A0A6J1DSV1 uncharacterized protein LOC1110236083.4e-1730.67Show/hide
Query:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKM
        DK  E   + S+II WQIW  RN  +      +   I   I++Y    N    D    G+  +++              W P     WK+N DA+W +  
Subjt:  DKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKM

Query:  NRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSI------PVPSP---MVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNI
        N GG+GWILR   G  + A  R I  +  I +LE +A+ EGL++I      P+       + +E+DSL  + LL     D TE+   +EE   ++ +  I
Subjt:  NRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSI------PVPSP---MVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNI

Query:  SSISHVPRIHNFMAHKLAQMATESN
         S+ H+ R  N +AH LA+ A E++
Subjt:  SSISHVPRIHNFMAHKLAQMATESN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0723.4Show/hide
Query:  ETAIHLFWDCKVTRGVWHHYFPHSNLMLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIW--WQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY
        ET  HL + C   R VW       + +      EWT      L W      ++ ++    +L+ W  W++W  RN ++   K+ D  ++++         
Subjt:  ETAIHLFWDCKVTRGVWHHYFPHSNLMLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIW--WQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY

Query:  NKGEEDYQN-DGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIP-
         +  ED++    RR  E    G       S  W        K N DA+W  +  R G+GWILR   G  +  G R +     +   E  A+   + ++  
Subjt:  NKGEEDYQN-DGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIP-

Query:  VPSPMVIVETDSLVTVRLLSGVSTDL-TELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQ
             +I E+D+   V LL+  S D    L   +E+ + L+ +         PR  N +A ++A+
Subjt:  VPSPMVIVETDSLVTVRLLSGVSTDL-TELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQ

AT3G09510.1 Ribonuclease H-like superfamily protein1.3e-0520Show/hide
Query:  MKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNR
        M + +  L + + W+IW  RN+++ NK +   ++ + S +   +++    + +        +++P        N   W        K N DA ++ +   
Subjt:  MKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNR

Query:  GGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIPVPS-PMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIH
           GWI+R   G+P++ G   +AH       E  A++  L+   +     V +E D    + L++G+S   + L   +E+     +         + R  
Subjt:  GGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIPVPS-PMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIH

Query:  NFMAHKLAQMATESNSSKCWANHFPTWF--VYCNE
        N +AH LA+     ++    +   P W    +CN+
Subjt:  NFMAHKLAQMATESNSSKCWANHFPTWF--VYCNE

AT3G53690.1 RING/U-box superfamily protein3.0e-0535Show/hide
Query:  ALVPRILKGNLAVKCPEPKCATAL---ESKI---------CGSFVLKECG-----EMECGKFQNLRKG-KGRADRRFDDNEVADQKKWKRCPDCKIYVEK
        AL   ++  +    CP   C+  +   ES++         C   V  ECG     EM C +FQ L    +GR D       +A QKKWKRCP CK Y+EK
Subjt:  ALVPRILKGNLAVKCPEPKCATAL---ESKI---------CGSFVLKECG-----EMECGKFQNLRKG-KGRADRRFDDNEVADQKKWKRCPDCKIYVEK

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-0923.11Show/hide
Query:  ETAIHLFWDCKVTRGVWHHYFPHSNLMLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIW--WQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY
        ET  HL + C   R  W       + + +    EW       L W    G+   +      L+ W  W++W +RN ++   ++ +  ++++         
Subjt:  ETAIHLFWDCKVTRGVWHHYFPHSNLMLVNDRREWTTPDLCELTWKGTGGDKMKEINLKLSLIIW--WQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY

Query:  NKGEEDYQNDGRRLSEESPDGIP-ANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIP-
         + E+D +    R   ES    P  N  +   W P      K N DA+WN    R G+GW+LR   G     G R +     +   E  A+   + S+  
Subjt:  NKGEEDYQNDGRRLSEESPDGIP-ANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIP-

Query:  VPSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQ
             VI E+DS V + +L+        L   I++ + L+S         +PR  N +A ++A+
Subjt:  VPSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQ

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.9e-0522.71Show/hide
Query:  IWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY--NKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRR
        + W+IW   N ++ N  +      ++       E+  N    + QN  R           A+P  +  W P      K N DAS + +    G+GWILR 
Subjt:  IWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEY--NKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKMNRGGVGWILRR

Query:  WDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLK-SIPVPSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQ
          G+ +  G      +      E   ++  ++ S       VI E D+    R+++  S++   L  F++  +S I +      S   R  N  A  LA+
Subjt:  WDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLK-SIPVPSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQ

Query:  MATESNS
         A + N+
Subjt:  MATESNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGATCCCCTCTGACATCTGCTCTACTGGTAATCTCTCTCAGACCTTCAACGATTGCTCTGGTCCCAAGGATTTTGAAGGGAAACTTGGCGGTGAAGTGCCCGGA
GCCGAAGTGTGCAACGGCGCTGGAGTCGAAGATCTGCGGTTCTTTCGTTCTGAAGGAGTGTGGGGAAATGGAATGCGGTAAATTTCAGAATTTGCGCAAGGGGAAAGGAA
GAGCAGATAGACGATTTGATGACAATGAAGTTGCTGATCAAAAGAAATGGAAGAGATGTCCTGATTGCAAAATTTATGTCGAGAAAGTTGCCTCCATATTACCTGCAGGT
AAACGATATAGTGGATTTTCTCCCTCGCCGATAGACCTCCACTCAGTTTCTTTTTTCTCACTTGTGTGGCCGTTCATCGCTTGCCGTCATTCTGTGGCCGTTCATCGCTT
GTTCTTCGTCTATGGCGTCTCTTGGCTCAACCGCGAGTACCGCTGTGGAGGATCCGACCACCGCTCTTCTTCTTCTCGATCTCCGATGACTTTCATCTTTCGTGACTCGG
ATAGTTCTCAGACGAACGTACCGGCTACTTTAAGATTACTGCAGACTTCTAGACCCTGGGTTTACAGGCCCAAAATACACTTGCTATCCCCTGAGATGCAAGAGAGATGT
AATGGAATCCAGGAGGAGCCTCCTGACCATAGGGGTCAAATGAGTGGTCGGGTTCGCAGATTCGAGGAAGGATGGACTAAGTATGAGGAATGCAGGGATATAGTGGAACA
GGTTAAGCCATTAGAGCAGAGTGAAATATGGGGGGTCAATCAGAGGGGCCATAGCAAGGAAAGAAAAGGAGCTAAACCACATTCTTTCAAGTCCAAACCGGAGACGGCTA
TACATCTTTTCTGGGACTGTAAAGTCACTAGAGGTGTATGGCATCACTATTTTCCTCATTCTAACTTAATGTTGGTTAATGACAGACGGGAATGGACGACACCGGACTTA
TGTGAGCTAACTTGGAAGGGAACAGGCGGAGACAAGATGAAGGAGATCAATTTGAAGCTTAGTCTGATTATTTGGTGGCAAATTTGGACACACAGGAACAGTATTCTCCA
CAATAAGAAGCAGCCAGATCTTAACCAAATCATTCAGTCTATTGAAAAATACCGGAATGAATACAACAAAGGAGAAGAGGATTACCAAAACGATGGTAGACGGCTGTCCG
AGGAAAGCCCCGACGGTATCCCTGCGAACCCTCCAAATTCTAGGAGTTGGCTTCCTATTAACGAAGGCTGTTGGAAGATAAATTGTGACGCATCCTGGAACTCAAAGATG
AACAGGGGCGGTGTTGGGTGGATTCTTCGCAGGTGGGATGGATCGCCGGTCACGGCTGGATATAGGACGATCGCTCATCAGTGGCCGATTCACTGGTTGGAGGCCATTGC
AGTGGTGGAGGGCTTGAAATCGATACCTGTTCCATCGCCAATGGTGATTGTGGAGACCGATTCTCTGGTGACTGTTCGTCTCCTCTCGGGTGTGAGCACAGATCTGACCG
AACTAAACGGTTTCATTGAAGAAGCTAAATCTTTAATCTCAAATAGGAATATCTCTTCAATATCACATGTGCCTAGGATACACAATTTTATGGCCCATAAACTGGCCCAG
ATGGCCACTGAGTCCAATTCGTCTAAATGCTGGGCCAATCACTTTCCAACTTGGTTTGTTTATTGTAATGAGATTGATACGGGTTTTGTCAACTACACTGGTGGGGGTTC
CTGTCCCACAAACTCTTCTCAAATTCAGGTATTGGAAGGTATCTTTTATACCTTTTCCATAGCTAGGGTTTTGTTTTTACCTGTATTAGTCGACTACAGGCTCAAGCCTG
GCGATTTGGGGCTACAGGAATCAATGGTGACGAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCGATCCCCTCTGACATCTGCTCTACTGGTAATCTCTCTCAGACCTTCAACGATTGCTCTGGTCCCAAGGATTTTGAAGGGAAACTTGGCGGTGAAGTGCCCGGA
GCCGAAGTGTGCAACGGCGCTGGAGTCGAAGATCTGCGGTTCTTTCGTTCTGAAGGAGTGTGGGGAAATGGAATGCGGTAAATTTCAGAATTTGCGCAAGGGGAAAGGAA
GAGCAGATAGACGATTTGATGACAATGAAGTTGCTGATCAAAAGAAATGGAAGAGATGTCCTGATTGCAAAATTTATGTCGAGAAAGTTGCCTCCATATTACCTGCAGGT
AAACGATATAGTGGATTTTCTCCCTCGCCGATAGACCTCCACTCAGTTTCTTTTTTCTCACTTGTGTGGCCGTTCATCGCTTGCCGTCATTCTGTGGCCGTTCATCGCTT
GTTCTTCGTCTATGGCGTCTCTTGGCTCAACCGCGAGTACCGCTGTGGAGGATCCGACCACCGCTCTTCTTCTTCTCGATCTCCGATGACTTTCATCTTTCGTGACTCGG
ATAGTTCTCAGACGAACGTACCGGCTACTTTAAGATTACTGCAGACTTCTAGACCCTGGGTTTACAGGCCCAAAATACACTTGCTATCCCCTGAGATGCAAGAGAGATGT
AATGGAATCCAGGAGGAGCCTCCTGACCATAGGGGTCAAATGAGTGGTCGGGTTCGCAGATTCGAGGAAGGATGGACTAAGTATGAGGAATGCAGGGATATAGTGGAACA
GGTTAAGCCATTAGAGCAGAGTGAAATATGGGGGGTCAATCAGAGGGGCCATAGCAAGGAAAGAAAAGGAGCTAAACCACATTCTTTCAAGTCCAAACCGGAGACGGCTA
TACATCTTTTCTGGGACTGTAAAGTCACTAGAGGTGTATGGCATCACTATTTTCCTCATTCTAACTTAATGTTGGTTAATGACAGACGGGAATGGACGACACCGGACTTA
TGTGAGCTAACTTGGAAGGGAACAGGCGGAGACAAGATGAAGGAGATCAATTTGAAGCTTAGTCTGATTATTTGGTGGCAAATTTGGACACACAGGAACAGTATTCTCCA
CAATAAGAAGCAGCCAGATCTTAACCAAATCATTCAGTCTATTGAAAAATACCGGAATGAATACAACAAAGGAGAAGAGGATTACCAAAACGATGGTAGACGGCTGTCCG
AGGAAAGCCCCGACGGTATCCCTGCGAACCCTCCAAATTCTAGGAGTTGGCTTCCTATTAACGAAGGCTGTTGGAAGATAAATTGTGACGCATCCTGGAACTCAAAGATG
AACAGGGGCGGTGTTGGGTGGATTCTTCGCAGGTGGGATGGATCGCCGGTCACGGCTGGATATAGGACGATCGCTCATCAGTGGCCGATTCACTGGTTGGAGGCCATTGC
AGTGGTGGAGGGCTTGAAATCGATACCTGTTCCATCGCCAATGGTGATTGTGGAGACCGATTCTCTGGTGACTGTTCGTCTCCTCTCGGGTGTGAGCACAGATCTGACCG
AACTAAACGGTTTCATTGAAGAAGCTAAATCTTTAATCTCAAATAGGAATATCTCTTCAATATCACATGTGCCTAGGATACACAATTTTATGGCCCATAAACTGGCCCAG
ATGGCCACTGAGTCCAATTCGTCTAAATGCTGGGCCAATCACTTTCCAACTTGGTTTGTTTATTGTAATGAGATTGATACGGGTTTTGTCAACTACACTGGTGGGGGTTC
CTGTCCCACAAACTCTTCTCAAATTCAGGTATTGGAAGGTATCTTTTATACCTTTTCCATAGCTAGGGTTTTGTTTTTACCTGTATTAGTCGACTACAGGCTCAAGCCTG
GCGATTTGGGGCTACAGGAATCAATGGTGACGAGCTAA
Protein sequenceShow/hide protein sequence
MNRSPLTSALLVISLRPSTIALVPRILKGNLAVKCPEPKCATALESKICGSFVLKECGEMECGKFQNLRKGKGRADRRFDDNEVADQKKWKRCPDCKIYVEKVASILPAG
KRYSGFSPSPIDLHSVSFFSLVWPFIACRHSVAVHRLFFVYGVSWLNREYRCGGSDHRSSSSRSPMTFIFRDSDSSQTNVPATLRLLQTSRPWVYRPKIHLLSPEMQERC
NGIQEEPPDHRGQMSGRVRRFEEGWTKYEECRDIVEQVKPLEQSEIWGVNQRGHSKERKGAKPHSFKSKPETAIHLFWDCKVTRGVWHHYFPHSNLMLVNDRREWTTPDL
CELTWKGTGGDKMKEINLKLSLIIWWQIWTHRNSILHNKKQPDLNQIIQSIEKYRNEYNKGEEDYQNDGRRLSEESPDGIPANPPNSRSWLPINEGCWKINCDASWNSKM
NRGGVGWILRRWDGSPVTAGYRTIAHQWPIHWLEAIAVVEGLKSIPVPSPMVIVETDSLVTVRLLSGVSTDLTELNGFIEEAKSLISNRNISSISHVPRIHNFMAHKLAQ
MATESNSSKCWANHFPTWFVYCNEIDTGFVNYTGGGSCPTNSSQIQVLEGIFYTFSIARVLFLPVLVDYRLKPGDLGLQESMVTS