; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027028 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027028
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold8:3406598..3411503
RNA-Seq ExpressionSpg027028
SyntenySpg027028
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3475057.1 reverse transcriptase [Gossypium australe]2.0e-3338.81Show/hide
Query:  LSTAQCDASNCDRIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKN--GNWRFSGFYGNPETEKRFHSWNLLERLSEYEHED
        L   + D    +++R    + +   +  +G +  GL L WKEE++V  +SFS+ HID +++ +N    WRF+GFYG+P T  +  SWNLL RL+E + + 
Subjt:  LSTAQCDASNCDRIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKN--GNWRFSGFYGNPETEKRFHSWNLLERLSEYEHED

Query:  IQWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVL
          W+V GDFNEI+ + EK GG+ R   +ME FR  +  C+L D+G+ G+ F W RG    N I ERLD+ + N    L     +++HL+ + SDH PI+L
Subjt:  IQWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVL

Query:  S
        +
Subjt:  S

KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]3.7e-3532.8Show/hide
Query:  DRIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRR--KNGNWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNE
        ++I+  L + ++  V   G+S GGL LLW+++LD+  +S+S+ HID ++    +   WR +G YG+PE  K++ +W L+  LS      + W+  GDFNE
Subjt:  DRIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRR--KNGNWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNE

Query:  IISNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVLSW---------
        I   EEKSG   +   +M  F+ AI  C LI +G++G+ F W   +     + ERLD+ +             V HLS  +SDH P++L++         
Subjt:  IISNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVLSW---------

Query:  ---NFR---------------DEVSPGRRIVKEELKGR--RVSTLI-KANGEWDESLIKRSFIPADAEDILAIPLGNPLARDEIIWALDSKGTFQVKSAY
           +FR               D    G   V  ++  R  +VS LI K    W+ +L+   F+P +AE I +IPL   L  D+ +W   SKG F V+SAY
Subjt:  ---NFR---------------DEVSPGRRIVKEELKGR--RVSTLI-KANGEWDESLIKRSFIPADAEDILAIPLGNPLARDEIIWALDSKGTFQVKSAY

Query:  HLASNIQGPTEASS
        HL S ++    A+S
Subjt:  HLASNIQGPTEASS

TXG60811.1 hypothetical protein EZV62_012174 [Acer yangbiense]1.5e-3346.39Show/hide
Query:  GLLLLWKEELDVCNKSFSEGHIDSLIRRKNG-NWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNA
        G LLLWK+ LDV   SFS GHID+ I+ ++G  WRFSGFYG+P  +KR +SW LL RL E +   + W+  GDFNE++S  E  GG+ +    M +FR A
Subjt:  GLLLLWKEELDVCNKSFSEGHIDSLIRRKNG-NWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNA

Query:  INSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLV-NQAMDLDSVKMDVNHLSFASSDHRPIVL
        +  C LID+GY G +++W   +   + I ERLD++L  NQ  D+   ++ V+HL F +SDHRP++L
Subjt:  INSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLV-NQAMDLDSVKMDVNHLSFASSDHRPIVL

TXG63522.1 hypothetical protein EZV62_010516 [Acer yangbiense]2.1e-3829.18Show/hide
Query:  RIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNG-NWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEII
        +IR  L Y+  F V   G+S GGLLLLWK    V   S+S GHID+ I   NG  WRFSG YG+P    R ++W L+ RL E   +++ W+ GGDFNE++
Subjt:  RIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNG-NWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEII

Query:  SNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVLSWNFRDEVSPGRR
        S  EK GG+ +    +  FR  ++ C+ ID+G+ G KF W   +   + + ERLD++L +        ++ + HL F +SDHRP+++ W+        R 
Subjt:  SNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVLSWNFRDEVSPGRR

Query:  IVKEELKGRRVSTLIKANGEWDESLIKRSFIPADAEDILAIPLGNPLARDEIIWALDSKGTFQVKSAYHLASNIQGPTEASSLGERSKKGRMQLMRSGNV
        +V++  K   +  L+   G      IK+      A       L      D ++    +    ++     +   I    E  +L   + KG   L+    V
Subjt:  IVKEELKGRRVSTLIKANGEWDESLIKRSFIPADAEDILAIPLGNPLARDEIIWALDSKGTFQVKSAYHLASNIQGPTEASSLGERSKKGRMQLMRSGNV

Query:  KRNKTTMNNDKSNSSSLIRLIKKNLAEQKRRMETYLKLCSEESLLNQECWSPPENCLKLNSDASWNSVKARGGLGWIILDSTGSPIAFGCKKINRPWSIK
         + +  ++  +++ S++I + +                 S   L   +  +PP   LKLNS    N+      LG +I D  G  IA   +K    +S +
Subjt:  KRNKTTMNNDKSNSSSLIRLIKKNLAEQKRRMETYLKLCSEESLLNQECWSPPENCLKLNSDASWNSVKARGGLGWIILDSTGSPIAFGCKKINRPWSIK

Query:  TMKLSAIKEG--LKKYLELNRERKVSLIVEADATEVIKALNFECEELSDSKLVMTEVESLAIAAGV
        T  L A+ EG  L K+L L     V  I E D   V+  LN     L D+  V+ ++++L    GV
Subjt:  TMKLSAIKEG--LKKYLELNRERKVSLIVEADATEVIKALNFECEELSDSKLVMTEVESLAIAAGV

XP_023911327.1 uncharacterized protein LOC112022938 [Quercus suber]1.8e-3445.74Show/hide
Query:  IRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNGN--WRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEII
        I++ LNY     VP  G+S GGL LLWKE +DV  KS S  HID ++R    +  WR +GFYG PE+EKR+ SW LLE L   +  ++ WIV GDFNEI+
Subjt:  IRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNGN--WRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEII

Query:  SNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVLS
         + EKSGG+ R+  QME FR+ ++ C L D+GY G +F W  G+  N R   RLD+M+ N        +  V+H S + SDH  +VL+
Subjt:  SNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVLS

TrEMBL top hitse value%identityAlignment
A0A5B6W1U2 Reverse transcriptase9.7e-3438.81Show/hide
Query:  LSTAQCDASNCDRIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKN--GNWRFSGFYGNPETEKRFHSWNLLERLSEYEHED
        L   + D    +++R    + +   +  +G +  GL L WKEE++V  +SFS+ HID +++ +N    WRF+GFYG+P T  +  SWNLL RL+E + + 
Subjt:  LSTAQCDASNCDRIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKN--GNWRFSGFYGNPETEKRFHSWNLLERLSEYEHED

Query:  IQWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVL
          W+V GDFNEI+ + EK GG+ R   +ME FR  +  C+L D+G+ G+ F W RG    N I ERLD+ + N    L     +++HL+ + SDH PI+L
Subjt:  IQWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVL

Query:  S
        +
Subjt:  S

A0A5C7HUN0 Uncharacterized protein7.4e-3446.39Show/hide
Query:  GLLLLWKEELDVCNKSFSEGHIDSLIRRKNG-NWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNA
        G LLLWK+ LDV   SFS GHID+ I+ ++G  WRFSGFYG+P  +KR +SW LL RL E +   + W+  GDFNE++S  E  GG+ +    M +FR A
Subjt:  GLLLLWKEELDVCNKSFSEGHIDSLIRRKNG-NWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNA

Query:  INSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLV-NQAMDLDSVKMDVNHLSFASSDHRPIVL
        +  C LID+GY G +++W   +   + I ERLD++L  NQ  D+   ++ V+HL F +SDHRP++L
Subjt:  INSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLV-NQAMDLDSVKMDVNHLSFASSDHRPIVL

A0A5C7I4W9 RNase H domain-containing protein1.0e-3829.18Show/hide
Query:  RIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNG-NWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEII
        +IR  L Y+  F V   G+S GGLLLLWK    V   S+S GHID+ I   NG  WRFSG YG+P    R ++W L+ RL E   +++ W+ GGDFNE++
Subjt:  RIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNG-NWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEII

Query:  SNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVLSWNFRDEVSPGRR
        S  EK GG+ +    +  FR  ++ C+ ID+G+ G KF W   +   + + ERLD++L +        ++ + HL F +SDHRP+++ W+        R 
Subjt:  SNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVLSWNFRDEVSPGRR

Query:  IVKEELKGRRVSTLIKANGEWDESLIKRSFIPADAEDILAIPLGNPLARDEIIWALDSKGTFQVKSAYHLASNIQGPTEASSLGERSKKGRMQLMRSGNV
        +V++  K   +  L+   G      IK+      A       L      D ++    +    ++     +   I    E  +L   + KG   L+    V
Subjt:  IVKEELKGRRVSTLIKANGEWDESLIKRSFIPADAEDILAIPLGNPLARDEIIWALDSKGTFQVKSAYHLASNIQGPTEASSLGERSKKGRMQLMRSGNV

Query:  KRNKTTMNNDKSNSSSLIRLIKKNLAEQKRRMETYLKLCSEESLLNQECWSPPENCLKLNSDASWNSVKARGGLGWIILDSTGSPIAFGCKKINRPWSIK
         + +  ++  +++ S++I + +                 S   L   +  +PP   LKLNS    N+      LG +I D  G  IA   +K    +S +
Subjt:  KRNKTTMNNDKSNSSSLIRLIKKNLAEQKRRMETYLKLCSEESLLNQECWSPPENCLKLNSDASWNSVKARGGLGWIILDSTGSPIAFGCKKINRPWSIK

Query:  TMKLSAIKEG--LKKYLELNRERKVSLIVEADATEVIKALNFECEELSDSKLVMTEVESLAIAAGV
        T  L A+ EG  L K+L L     V  I E D   V+  LN     L D+  V+ ++++L    GV
Subjt:  TMKLSAIKEG--LKKYLELNRERKVSLIVEADATEVIKALNFECEELSDSKLVMTEVESLAIAAGV

A0A7J6DPD3 CCHC-type domain-containing protein2.2e-3335.09Show/hide
Query:  RSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNGN-WRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEIISN
        RS  + K    V    +S GGLLLLW ++ +V  KSF+ GHID+L++      WRF+GFYGNP+   R  SW LL RL +    D+ WI GGDFNEI+S 
Subjt:  RSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNGN-WRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEIISN

Query:  EEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIV-----LSWNFRDEVSP
         EK GG+ R+   M  F+NA++ C L D+G++G  F W   +     + ERLD+   NQ        + V +  F +SDHRPIV     +S   R  +S 
Subjt:  EEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIV-----LSWNFRDEVSP

Query:  GRRIVKEELKGRRVSTLIKANGEWDESLIKRSFIPADAED----------------ILAIPLGNPLARDEIIWALDSKGTFQVKS
           +  ++             G W++S  K   +P    +                IL IPLG+    D   W   S G++ VKS
Subjt:  GRRIVKEELKGRRVSTLIKANGEWDESLIKRSFIPADAED----------------ILAIPLGNPLARDEIIWALDSKGTFQVKS

A0A803PRV5 Uncharacterized protein1.7e-3334.78Show/hide
Query:  ELGRQSLVMGEKDKIPSLLNLVVSKEKEQRV--------VRDESGNNVLESHGPQTTKGKNKTMKINEKENEQKVVWDPISGPKAIVDQSMQSSP--SGV
        +L + + ++ ++ +   L N+ VS  ++  V        V   S N   E+      K   K  K  + + E KV      G  A+     +++P     
Subjt:  ELGRQSLVMGEKDKIPSLLNLVVSKEKEQRV--------VRDESGNNVLESHGPQTTKGKNKTMKINEKENEQKVVWDPISGPKAIVDQSMQSSP--SGV

Query:  LSTAQCDASNCDRIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNGN-WRFSGFYGNPETEKRFHSWNLLERLSEYEHEDI
        +S ++ +A   + +R  L +   F V  +GKS GGL+LLW  +++    S+S+ HIDS IR +NG  WRF+GFYG+P+  +RFHSW LL+RL+       
Subjt:  LSTAQCDASNCDRIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDVCNKSFSEGHIDSLIRRKNGN-WRFSGFYGNPETEKRFHSWNLLERLSEYEHEDI

Query:  QWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVL
         W+VGGDFNEI+S +EK GG  +    +  FR A++ C+L DVGY+G+ + W  G+ KN+ I ERLD++  N         M V HL   +SDH P++L
Subjt:  QWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKGKNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.0e-0724.02Show/hide
Query:  RNKTTMNNDKSNSSSLIRLIKKNLAEQKRRMETYLKLCSE--ESLLNQECWSPPENCLKLNSDASWNSVKARGGLGWIILDSTGSPIAFGCKKINRPWSI
        RN+      + ++  ++R   ++  E   R E   K      E  L+ +  +PP   +K N+DA+W     R G+GWI+ + +G  +  G + + R  ++
Subjt:  RNKTTMNNDKSNSSSLIRLIKKNLAEQKRRMETYLKLCSE--ESLLNQECWSPPENCLKLNSDASWNSVKARGGLGWIILDSTGSPIAFGCKKINRPWSI

Query:  KTMKLSAIKEGLKKYLELNRERKVSLIVEADATEVIKALNFECEELSDSKLVMTEVESLAIAAGVIKFSKWPRGDNKVS
           +L A++  +      N +R   +I E+DA  ++  LN + +     +  + +++ L      +KF   PRG NKV+
Subjt:  KTMKLSAIKEGLKKYLELNRERKVSLIVEADATEVIKALNFECEELSDSKLVMTEVESLAIAAGVIKFSKWPRGDNKVS

AT4G29090.1 Ribonuclease H-like superfamily protein2.9e-0624.86Show/hide
Query:  RNKTTMNNDKSNSSSLIRLIKKNLAEQKRRMETYLKLCSEESLLNQEC---W-SPPENCLKLNSDASWNSVKARGGLGWIILDSTGSPIAFGCKKINRPW
        RN+      + N+  ++R  + +L E   R+ T  + C  +  +N+     W  PP   +K N+DA+WN    R G+GW++ +  G     G + + +  
Subjt:  RNKTTMNNDKSNSSSLIRLIKKNLAEQKRRMETYLKLCSEESLLNQEC---W-SPPENCLKLNSDASWNSVKARGGLGWIILDSTGSPIAFGCKKINRPW

Query:  SIKTMKLSAIKEGLKKYLELNRERKVSLIVEADATEVIKALNFECEELSDSKLVMTEVESLAIAAGVIKFSKWPRGDNKVS
        S+   +L A++  +   L L+R +   +I E+D+  +I+ LN + E     K  + +++ L      +KF   PR  N ++
Subjt:  SIKTMKLSAIKEGLKKYLELNRERKVSLIVEADATEVIKALNFECEELSDSKLVMTEVESLAIAAGVIKFSKWPRGDNKVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGGGCACGTCATGAAGGACTGTGAGGAAGAGGAATGTGAGGAAGATGAAGACGACGAAGAGAGGCAATATGGACTATGGTTACGAGAGAGCTACACTGAGAGGAG
AAGCCTCAAACTGGAAAAAGGGAATGAGAACAATCGAGCGAAAGGAAAAAACATTGGGGAAAGTTGTAAGGAACAAAGCAGTAACCTTTGGTCGGAGGAGAAAGATGAAA
GGACAGTGGCGGGATGCTCCCAGTCGCCGGAGGAGTTGGGGCGGCAATCTTTGGTGATGGGCGAAAAAGACAAAATTCCATCATTGCTGAATCTGGTTGTGTCAAAAGAA
AAAGAACAAAGAGTCGTTCGGGATGAAAGTGGAAATAATGTGCTGGAAAGTCACGGGCCCCAGACTACAAAAGGGAAGAACAAAACTATGAAAATCAATGAAAAAGAAAA
TGAACAAAAAGTAGTGTGGGACCCAATTAGTGGACCTAAAGCAATAGTTGATCAGTCTATGCAAAGCAGCCCATCTGGGGTGTTGTCGACAGCCCAATGTGATGCGTCCA
ACTGTGATAGAATTAGAAGTTGCTTGAATTACAAGTCTTCCTTTTGCGTGCCTTGCAAAGGGAAGAGTGGGGGTGGTTTGTTGCTCCTTTGGAAGGAGGAGCTGGATGTG
TGCAACAAATCTTTCTCTGAGGGGCATATAGACTCCTTAATTAGGAGGAAGAATGGGAATTGGAGATTTTCTGGATTTTATGGTAATCCGGAGACAGAGAAGCGGTTCCA
CTCTTGGAACCTCTTGGAGAGGCTGAGCGAGTATGAGCACGAGGACATTCAGTGGATTGTGGGAGGAGACTTTAATGAGATCATCTCGAATGAGGAGAAGAGTGGCGGGG
CTAGAAGAAATCCTTGCCAGATGGAGATCTTTAGGAATGCTATTAACAGCTGCAAGCTGATCGATGTGGGGTACAAAGGTAGTAAGTTCATCTGGAGGAGAGGCAAAGGT
AAAAACAACAGAATTCTAGAGAGACTGGACAAAATGCTAGTGAACCAGGCGATGGACCTGGATTCGGTTAAGATGGACGTGAATCATCTCAGCTTTGCAAGCTCGGATCA
TAGACCAATCGTCCTTAGTTGGAATTTCAGGGACGAGGTGTCTCCGGGCAGAAGAATTGTCAAGGAGGAGCTAAAAGGCAGGAGGGTGAGCACCCTGATTAAGGCAAACG
GGGAGTGGGATGAGAGCCTGATAAAAAGGAGCTTTATTCCGGCTGATGCTGAAGACATTTTGGCAATTCCCCTAGGAAATCCCCTAGCAAGAGATGAGATAATTTGGGCC
CTAGACTCCAAAGGCACCTTCCAGGTCAAGAGTGCTTATCACCTTGCGTCTAACATTCAAGGACCCACGGAAGCCTCTAGTTTGGGAGAGAGAAGCAAGAAAGGGAGGAT
GCAGCTCATGCGATCTGGCAATGTAAAAAGAAACAAAACAACAATGAACAATGACAAGTCAAATTCCAGCAGCCTCATCAGATTAATTAAGAAGAACCTGGCAGAGCAGA
AGAGAAGGATGGAAACATACCTGAAGTTGTGCAGCGAGGAGAGCCTTCTGAATCAAGAATGTTGGTCCCCCCCTGAAAATTGCTTGAAACTGAATTCGGATGCCTCTTGG
AACTCAGTTAAAGCGAGAGGTGGGTTAGGCTGGATCATCCTTGATTCAACAGGATCTCCCATCGCATTCGGCTGCAAGAAAATTAACAGACCTTGGTCGATCAAAACAAT
GAAGTTGTCAGCAATCAAAGAAGGGTTGAAGAAATACCTGGAGCTGAATCGTGAGAGAAAGGTGTCGTTGATCGTTGAAGCAGACGCGACCGAAGTGATCAAAGCGCTCA
ATTTCGAGTGTGAAGAACTCTCTGACTCCAAGCTTGTGATGACTGAGGTTGAATCCCTAGCGATTGCAGCAGGTGTTATAAAATTCTCAAAATGGCCGAGGGGCGACAAC
AAAGTGTCGCACTCCCTGCGCGAGCGGCGACCGGATTGTTGCCGGAGATGGATCCTGAATCTCTTGGTGTGGCCGATGAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCGGGCACGTCATGAAGGACTGTGAGGAAGAGGAATGTGAGGAAGATGAAGACGACGAAGAGAGGCAATATGGACTATGGTTACGAGAGAGCTACACTGAGAGGAG
AAGCCTCAAACTGGAAAAAGGGAATGAGAACAATCGAGCGAAAGGAAAAAACATTGGGGAAAGTTGTAAGGAACAAAGCAGTAACCTTTGGTCGGAGGAGAAAGATGAAA
GGACAGTGGCGGGATGCTCCCAGTCGCCGGAGGAGTTGGGGCGGCAATCTTTGGTGATGGGCGAAAAAGACAAAATTCCATCATTGCTGAATCTGGTTGTGTCAAAAGAA
AAAGAACAAAGAGTCGTTCGGGATGAAAGTGGAAATAATGTGCTGGAAAGTCACGGGCCCCAGACTACAAAAGGGAAGAACAAAACTATGAAAATCAATGAAAAAGAAAA
TGAACAAAAAGTAGTGTGGGACCCAATTAGTGGACCTAAAGCAATAGTTGATCAGTCTATGCAAAGCAGCCCATCTGGGGTGTTGTCGACAGCCCAATGTGATGCGTCCA
ACTGTGATAGAATTAGAAGTTGCTTGAATTACAAGTCTTCCTTTTGCGTGCCTTGCAAAGGGAAGAGTGGGGGTGGTTTGTTGCTCCTTTGGAAGGAGGAGCTGGATGTG
TGCAACAAATCTTTCTCTGAGGGGCATATAGACTCCTTAATTAGGAGGAAGAATGGGAATTGGAGATTTTCTGGATTTTATGGTAATCCGGAGACAGAGAAGCGGTTCCA
CTCTTGGAACCTCTTGGAGAGGCTGAGCGAGTATGAGCACGAGGACATTCAGTGGATTGTGGGAGGAGACTTTAATGAGATCATCTCGAATGAGGAGAAGAGTGGCGGGG
CTAGAAGAAATCCTTGCCAGATGGAGATCTTTAGGAATGCTATTAACAGCTGCAAGCTGATCGATGTGGGGTACAAAGGTAGTAAGTTCATCTGGAGGAGAGGCAAAGGT
AAAAACAACAGAATTCTAGAGAGACTGGACAAAATGCTAGTGAACCAGGCGATGGACCTGGATTCGGTTAAGATGGACGTGAATCATCTCAGCTTTGCAAGCTCGGATCA
TAGACCAATCGTCCTTAGTTGGAATTTCAGGGACGAGGTGTCTCCGGGCAGAAGAATTGTCAAGGAGGAGCTAAAAGGCAGGAGGGTGAGCACCCTGATTAAGGCAAACG
GGGAGTGGGATGAGAGCCTGATAAAAAGGAGCTTTATTCCGGCTGATGCTGAAGACATTTTGGCAATTCCCCTAGGAAATCCCCTAGCAAGAGATGAGATAATTTGGGCC
CTAGACTCCAAAGGCACCTTCCAGGTCAAGAGTGCTTATCACCTTGCGTCTAACATTCAAGGACCCACGGAAGCCTCTAGTTTGGGAGAGAGAAGCAAGAAAGGGAGGAT
GCAGCTCATGCGATCTGGCAATGTAAAAAGAAACAAAACAACAATGAACAATGACAAGTCAAATTCCAGCAGCCTCATCAGATTAATTAAGAAGAACCTGGCAGAGCAGA
AGAGAAGGATGGAAACATACCTGAAGTTGTGCAGCGAGGAGAGCCTTCTGAATCAAGAATGTTGGTCCCCCCCTGAAAATTGCTTGAAACTGAATTCGGATGCCTCTTGG
AACTCAGTTAAAGCGAGAGGTGGGTTAGGCTGGATCATCCTTGATTCAACAGGATCTCCCATCGCATTCGGCTGCAAGAAAATTAACAGACCTTGGTCGATCAAAACAAT
GAAGTTGTCAGCAATCAAAGAAGGGTTGAAGAAATACCTGGAGCTGAATCGTGAGAGAAAGGTGTCGTTGATCGTTGAAGCAGACGCGACCGAAGTGATCAAAGCGCTCA
ATTTCGAGTGTGAAGAACTCTCTGACTCCAAGCTTGTGATGACTGAGGTTGAATCCCTAGCGATTGCAGCAGGTGTTATAAAATTCTCAAAATGGCCGAGGGGCGACAAC
AAAGTGTCGCACTCCCTGCGCGAGCGGCGACCGGATTGTTGCCGGAGATGGATCCTGAATCTCTTGGTGTGGCCGATGAACTAG
Protein sequenceShow/hide protein sequence
MLGHVMKDCEEEECEEDEDDEERQYGLWLRESYTERRSLKLEKGNENNRAKGKNIGESCKEQSSNLWSEEKDERTVAGCSQSPEELGRQSLVMGEKDKIPSLLNLVVSKE
KEQRVVRDESGNNVLESHGPQTTKGKNKTMKINEKENEQKVVWDPISGPKAIVDQSMQSSPSGVLSTAQCDASNCDRIRSCLNYKSSFCVPCKGKSGGGLLLLWKEELDV
CNKSFSEGHIDSLIRRKNGNWRFSGFYGNPETEKRFHSWNLLERLSEYEHEDIQWIVGGDFNEIISNEEKSGGARRNPCQMEIFRNAINSCKLIDVGYKGSKFIWRRGKG
KNNRILERLDKMLVNQAMDLDSVKMDVNHLSFASSDHRPIVLSWNFRDEVSPGRRIVKEELKGRRVSTLIKANGEWDESLIKRSFIPADAEDILAIPLGNPLARDEIIWA
LDSKGTFQVKSAYHLASNIQGPTEASSLGERSKKGRMQLMRSGNVKRNKTTMNNDKSNSSSLIRLIKKNLAEQKRRMETYLKLCSEESLLNQECWSPPENCLKLNSDASW
NSVKARGGLGWIILDSTGSPIAFGCKKINRPWSIKTMKLSAIKEGLKKYLELNRERKVSLIVEADATEVIKALNFECEELSDSKLVMTEVESLAIAAGVIKFSKWPRGDN
KVSHSLRERRPDCCRRWILNLLVWPMN