CuGenDBv2

Spg035787 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg035787
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold5:38381059..38385627
RNA-Seq Expression: Spg035787
Synteny: Spg035787
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
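The genome location field can be split into a sequence name and a coordinate pair for downstream use. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this `seqid:start..end` notation, but not stated on this page); `Location` and `parse_location` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold5:38381059..38385627'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(match.group(1), start, end)


loc = parse_location("scaffold5:38381059..38385627")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that assumption the gene spans 4,569 bp on scaffold5.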


Homology
GenBank top hits | e value | %identity
EEC73134.1 hypothetical protein OsI_07152 [Oryza sativa Indica Group] | 1.0e-17 | 30.14
Query:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEII-WSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSL
        V  ++  +  W E  V   F P D ++IL   +     ++ + W PD+ G+FSV+S Y L M  A   E S S    IKK    +W  +   + K  G  
Subjt:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEII-WSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSL

Query:  VDSELLGLDETKFQ--FGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPI---CAGTKVIKDKWSIKIL
          +  L   E K +  FG VG   N     LA+      +  LWS P     KLNID S+   + RGG+G  +R+S G  I   C            +++
Subjt:  VDSELLGLDETKFQ--FGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPI---CAGTKVIKDKWSIKIL

Query:  EAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANT-MGELSFEHCPREQNYIDHSIAHE
          +      ++   LP     I V SDCLE+I LLN   E LL  + F+   +    T   E+SF    R QN + H +A++
Subjt:  EAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANT-MGELSFEHCPREQNYIDHSIAHE

XP_015384077.1 uncharacterized protein LOC107176301 [Citrus sinensis] | 3.1e-17 | 25.63
Query:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNAR-DEIIWSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTI------------------------
        V  LI  +S W EAK++  F  +D + +L   L P+ + D ++W  DKKG+++VKS Y + +      +AS+S +                         
Subjt:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNAR-DEIIWSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTI------------------------

Query:  -NIKKSGKSLWRLSTI--PRAKNCGSLVDSELLGLDETK-----------------FQF--------------------GRVGVNNNYNVAHLASRGENH
         N+  + ++LW+   +  P  K C   V++    L E K                 FQ                      R+       V     R  NH
Subjt:  -NIKKSGKSLWRLSTI--PRAKNCGSLVDSELLGLDETK-----------------FQF--------------------GRVGVNNNYNVAHLASRGENH

Query:  PS----------QSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAI-WE-ALIRIASLPEKPPKIIVSSDCL
         +          Q  W PPP + +K+N+DA+ N      G+G  +RDSE + +  G      K S+   EA+AI W   L R A+L      +I+ SDCL
Subjt:  PS----------QSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAI-WE-ALIRIASLPEKPPKIIVSSDCL

Query:  ELITLLNRGDEDLLEVQTFVDAILNMANTMGELSFEHCPREQNYIDHSIAHESVG
        E++ L+N    +   +   ++ I N   T   +   H PR  N   HS+A  +VG
Subjt:  ELITLLNRGDEDLLEVQTFVDAILNMANTMGELSFEHCPREQNYIDHSIAHESVG

XP_015388020.1 uncharacterized protein LOC107177951 [Citrus sinensis] | 1.6e-21 | 28.62
Query:  WLSSIETIFRHMNCPEDPKGSKVCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPN-ARDEIIWSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINI
        WL   ET F+ ++ P  P  + V  LID++ CW +  +R+ FH +DA  IL  PL    + D+++W  DKKG +SVKS Y L +        S+S     
Subjt:  WLSSIETIFRHMNCPEDPKGSKVCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPN-ARDEIIWSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINI

Query:  KKSGKSLWRLSTIPRAKNCGSLVDSEL--LGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEG
         K GKS W      +        DS+L   G +     + R+ +     ++      +N  SQ  W+ PPA  +K+N+DA+      R G+G  +R+SEG
Subjt:  KKSGKSLWRLSTIPRAKNCGSLVDSEL--LGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEG

Query:  SPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANTMGELSFEHCPREQNYIDHSIAHE
          I A  K  K    +   EA+A    L  +A +    P +IV +D  E+  L++       E+   V  +      +  +  +H PR  N   H++A  
Subjt:  SPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANTMGELSFEHCPREQNYIDHSIAHE

Query:  SVGF
        ++ +
Subjt:  SVGF

XP_019190293.1 PREDICTED: uncharacterized protein LOC109184710 [Ipomoea nil] | 6.1e-18 | 27.24
Query:  GSARPFDGASKNPTVAELWLSSIETIFRHMNCPEDPKGSKVCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEIIWSPDKKGKFSVKSTYHLV
        G AR  D           WL+    I  H  C E  + +KV SL+++   W    +R+ F  +    IL TP+  + RD   W  D +G ++V+  Y L+
Subjt:  GSARPFDGASKNPTVAELWLSSIETIFRHMNCPEDPKGSKVCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEIIWSPDKKGKFSVKSTYHLV

Query:  MAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSLVDSELLGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDS
        +      E +  D  +   + K+LW L   P+               DE    +GR   N     A              WSPP  H +K N+DA+   S
Subjt:  MAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSLVDSELLGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDS

Query:  EDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANTMGELSFEH
         D    G  VRDSEG  +   +  +KD     + E   + EAL+ + S   +   IIV SDCL      N  + D   V   V   L++A+ M  +S   
Subjt:  EDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANTMGELSFEH

Query:  CPREQNYIDHSIAHESVGFTRHG
          R  N + H +A  +V     G
Subjt:  CPREQNYIDHSIAHESVGFTRHG

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] | 1.6e-18 | 39.71
Query:  WSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQ
        W PPP H W LN DASW+DS  RGG+GW +R  +G  + AG + ++   ++K+LEA AI E L  + +L    P + + +D  E+ +LLNR  EDL +  
Subjt:  WSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQ

Query:  TFVDAILNMANTMGELSFEHCPREQNYIDHSIAHES
          V+ ILN+ ++   L+F    RE N   HS+A  +
Subjt:  TFVDAILNMANTMGELSFEHCPREQNYIDHSIAHES
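The %identity column can be related to the alignment text through the midline (the unlabeled row between Query and Subjt). The sketch below assumes the usual BLAST-style convention: a letter on the midline marks an identical residue, `+` marks a conservative substitution, and percent identity is identical columns divided by total alignment columns, gaps included. Whether CuGenDB computes its value exactly this way is an assumption, and `percent_identity` is a hypothetical helper name.

```python
from typing import Optional


def percent_identity(midline: str, aligned_length: Optional[int] = None) -> float:
    """Percent identity from a BLAST-style midline.

    midline        -- the row between Query and Subjt; letters mark identities
    aligned_length -- number of alignment columns; pass it explicitly when
                      trailing spaces of the midline may have been stripped
    """
    columns = len(midline) if aligned_length is None else aligned_length
    if columns <= 0:
        raise ValueError("alignment must have at least one column")
    # Only letters count as identities; '+' and spaces do not.
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / columns


# First ten columns of the Momordica charantia alignment above
# (a fragment only, so this is not the value reported for the full hit).
query_fragment = "WSPPPAHCWK"
midline_fragment = "W PPP H W "
print(percent_identity(midline_fragment, len(query_fragment)))
```

For an alignment printed in several blocks, the midlines would be concatenated and the block lengths summed before calling the function.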

TrEMBL top hits | e value | %identity
A0A6J1DNV9 uncharacterized protein LOC111022403 | 7.8e-19 | 39.71
Query:  WSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQ
        W PPP H W LN DASW+DS  RGG+GW +R  +G  + AG + ++   ++K+LEA AI E L  + +L    P + + +D  E+ +LLNR  EDL +  
Subjt:  WSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQ

Query:  TFVDAILNMANTMGELSFEHCPREQNYIDHSIAHES
          V+ ILN+ ++   L+F    RE N   HS+A  +
Subjt:  TFVDAILNMANTMGELSFEHCPREQNYIDHSIAHES

A0A803QFF3 Uncharacterized protein | 3.1e-15 | 24.86
Query:  CPEDPKGSKVCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLG-PNARDEIIWSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTI--------------N
        C   P    V + I D   W    +   F   D   I+  PL    ++D +IW  +  G++SVKS +HL  + + + + S SD                N
Subjt:  CPEDPKGSKVCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLG-PNARDEIIWSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTI--------------N

Query:  IKKSG-KSLWRLSTIPRAKNCG-SLVDSELLGLDETKFQFGR------VGVNNNYNVAHLASR----------GENHPSQSL--WSPPPAHCWKLNIDAS
         K  G  +L   + +P+  + G S +DS    ++    +FG       V   N  N+A  +S+          G N P  ++  WSPP  +  K+N+DA+
Subjt:  IKKSG-KSLWRLSTIPRAKNCG-SLVDSELLGLDETKFQFGR------VGVNNNYNVAHLASR----------GENHPSQSL--WSPPPAHCWKLNIDAS

Query:  WNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANTMGEL
         N ++ + G+G  VR+++G  I A +K  +  +    +EAKA++ +L  I +   + P  +V +D L + + LN    DL      +D +  + ++   +
Subjt:  WNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANTMGEL

Query:  SFEHCPREQNYIDHSIAHESVGFTRHGFLFVSSQRRPSTLEGDE-LFWTSELPHFISSICNFERFR
        +  H  R+ N   H +A  +                   LE DE + W  E+P+ I S+   ER +
Subjt:  SFEHCPREQNYIDHSIAHESVGFTRHGFLFVSSQRRPSTLEGDE-LFWTSELPHFISSICNFERFR

B8AHI8 Uncharacterized protein | 5.1e-18 | 30.14
Query:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEII-WSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSL
        V  ++  +  W E  V   F P D ++IL   +     ++ + W PD+ G+FSV+S Y L M  A   E S S    IKK    +W  +   + K  G  
Subjt:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEII-WSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSL

Query:  VDSELLGLDETKFQ--FGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPI---CAGTKVIKDKWSIKIL
          +  L   E K +  FG VG   N     LA+      +  LWS P     KLNID S+   + RGG+G  +R+S G  I   C            +++
Subjt:  VDSELLGLDETKFQ--FGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPI---CAGTKVIKDKWSIKIL

Query:  EAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANT-MGELSFEHCPREQNYIDHSIAHE
          +      ++   LP     I V SDCLE+I LLN   E LL  + F+   +    T   E+SF    R QN + H +A++
Subjt:  EAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANT-MGELSFEHCPREQNYIDHSIAHE

C7J8D0 Os11g0106066 protein | 1.6e-16 | 27.98
Query:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEII-WSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSL
        V  LI  +  W  A++R CF   D + IL+  L P   ++ + W PDK G+FSV+S Y+L    A    +S+S  ++ +KS   +W+     + K     
Subjt:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEII-WSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSL

Query:  VDSELLGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCW---KLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEA
        V S  L   + K +   V    N ++  + +R   +   +L+  P A       +  D S++    +GG+   +RD+ GS + A  K +    +    E 
Subjt:  VDSELLGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCW---KLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEA

Query:  KAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEV
        +A  E LI       +P  I++ +DC+ L+ LL  G+ DL EV
Subjt:  KAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEV

Q2RBM8 Retrotransposon protein, putative, unclassified | 1.6e-16 | 27.98
Query:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEII-WSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSL
        V  LI  +  W  A++R CF   D + IL+  L P   ++ + W PDK G+FSV+S Y+L    A    +S+S  ++ +KS   +W+     + K     
Subjt:  VCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEII-WSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINIKKSGKSLWRLSTIPRAKNCGSL

Query:  VDSELLGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCW---KLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEA
        V S  L   + K +   V    N ++  + +R   +   +L+  P A       +  D S++    +GG+   +RD+ GS + A  K +    +    E 
Subjt:  VDSELLGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCW---KLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEA

Query:  KAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEV
        +A  E LI       +P  I++ +DC+ L+ LL  G+ DL EV
Subjt:  KAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEV

SwissProt top hits | e value | %identity
No hits found

Arabidopsis top hits | e value | %identity
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 2.7e-11 | 28.87
Query:  WSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQ
        W  PP    K N DA+W     R G+GW +R+  G  +  G + +    ++   E +A+  A++ ++    K  +II  SD   L+ LLN  D+    +Q
Subjt:  WSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQ

Query:  TFVDAILNMANTMGELSFEHCPREQNYIDHSIAHESVGFTRH
          ++ I  + +   E+ FE  PR  N +   IA ES+ F+ +
Subjt:  TFVDAILNMANTMGELSFEHCPREQNYIDHSIAHESVGFTRH

AT4G29090.1 Ribonuclease H-like superfamily protein | 1.2e-11 | 28.19
Query:  NHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGD
        N  S   W PPP    K N DA+WN   +R G+GW +R+ +G     G + +    S+   E +A+  A++ ++        +I  SD   LI +LN  D
Subjt:  NHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGD

Query:  EDLLEVQTFVDAILNMANTMGELSFEHCPREQNYIDHSIAHESVGFTRH
        E    ++  +  +  + +   E+ F   PRE N +   +A ES+ F  +
Subjt:  EDLLEVQTFVDAILNMANTMGELSFEHCPREQNYIDHSIAHESVGFTRH

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 2.6e-06 | 23.37
Query:  LGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALI
        + L++TK        N   N     +R  +    + WSPP     K N DAS ++     G+GW +R+S+G+ I  G    + + + +  E   +  A+ 
Subjt:  LGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWSIKILEAKAIWEALI

Query:  RIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANTMGELSFEHCPREQNYIDHSIAHESVGFTRHGFLFVS
               K  K+I   D   +  ++N    +   +Q F+D I +   +   + F    REQN     +A +++       LF S
Subjt:  RIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANTMGELSFEHCPREQNYIDHSIAHESVGFTRHGFLFVS


Sequences
CDS sequence
ATGGTCTTCTTAGATCTCCATTCTGCCAGTTATAGTGCCCTTGTGGTGGTGTTCGTTGAAGAATTTAGAGTCGTCAGACGGAGAGGTCGTGGAAGTGGCCGTGGAAGGGG
TCGCACGGCCCTTGAGGTATTTGTACCGCCAGTGGAGCAAGAAAATAATTTGACAGAAGACCCACAAGTAGAGCAGTCAGATGAGCAGCTAACACCTGCAACAGAACTTG
TCACGGTGGATGCTATTCAAGCAATTGTGCAGTCAGCAGTTGTCGAGACAGTACAAGGCAAAGTGTTTAAGGGACTTGGAGGAAGTGCCCGTCCATTCGATGGAGCATCA
AAGAACCCCACAGTGGCGGAGTTGTGGCTGTCTTCCATTGAAACCATCTTTCGTCACATGAACTGTCCAGAAGACCCGAAGGGAAGTAAAGTGTGCAGCCTTATTGATGA
CAACAGCTGCTGGATTGAGGCCAAAGTGCGAGAATGCTTTCATCCTCAAGATGCCAAAGATATTCTTAACACCCCTTTAGGCCCGAACGCTAGAGACGAGATCATATGGA
GCCCGGACAAAAAGGGAAAATTCTCGGTTAAGAGTACTTATCACCTTGTTATGGCTAAAGCAGTGGAGATGGAAGCCTCAACATCAGACACCATCAACATTAAGAAAAGC
GGGAAGAGTCTATGGAGACTATCAACCATTCCAAGGGCCAAGAACTGTGGATCACTGGTCGACTCTGAATTACTGGGACTGGATGAGACAAAATTTCAATTCGGAAGAGT
TGGAGTCAACAATAATTATAATGTGGCACATTTGGCCTCAAGAGGCGAGAACCACCCGAGTCAGTCTCTTTGGTCTCCTCCGCCAGCGCATTGTTGGAAGCTCAACATCG
ATGCCTCTTGGAATGATTCGGAAGATAGAGGCGGAGTGGGTTGGACAGTTCGCGACTCCGAAGGCTCTCCCATCTGTGCAGGAACGAAAGTGATCAAAGATAAATGGTCG
ATAAAGATTTTGGAAGCCAAAGCGATTTGGGAAGCCCTTATTAGAATTGCATCCTTACCAGAAAAGCCGCCGAAGATTATAGTGAGCTCTGACTGCCTGGAGTTGATCAC
CCTTCTGAATCGCGGTGATGAAGACCTCTTAGAAGTTCAGACCTTCGTTGATGCCATCCTCAACATGGCCAATACTATGGGGGAGCTTTCTTTTGAGCATTGCCCCAGAG
AGCAAAACTACATTGATCATTCCATCGCGCACGAGAGTGTTGGTTTTACCCGCCATGGTTTTTTGTTCGTTTCGAGCCAGAGGCGTCCTTCCACGCTAGAAGGTGATGAG
TTGTTTTGGACCTCTGAGCTCCCTCATTTTATTTCGTCTATCTGTAATTTTGAACGATTTAGGCTTTTAGGCCTTTCTCAGTAA
mRNA sequence
ATGGTCTTCTTAGATCTCCATTCTGCCAGTTATAGTGCCCTTGTGGTGGTGTTCGTTGAAGAATTTAGAGTCGTCAGACGGAGAGGTCGTGGAAGTGGCCGTGGAAGGGG
TCGCACGGCCCTTGAGGTATTTGTACCGCCAGTGGAGCAAGAAAATAATTTGACAGAAGACCCACAAGTAGAGCAGTCAGATGAGCAGCTAACACCTGCAACAGAACTTG
TCACGGTGGATGCTATTCAAGCAATTGTGCAGTCAGCAGTTGTCGAGACAGTACAAGGCAAAGTGTTTAAGGGACTTGGAGGAAGTGCCCGTCCATTCGATGGAGCATCA
AAGAACCCCACAGTGGCGGAGTTGTGGCTGTCTTCCATTGAAACCATCTTTCGTCACATGAACTGTCCAGAAGACCCGAAGGGAAGTAAAGTGTGCAGCCTTATTGATGA
CAACAGCTGCTGGATTGAGGCCAAAGTGCGAGAATGCTTTCATCCTCAAGATGCCAAAGATATTCTTAACACCCCTTTAGGCCCGAACGCTAGAGACGAGATCATATGGA
GCCCGGACAAAAAGGGAAAATTCTCGGTTAAGAGTACTTATCACCTTGTTATGGCTAAAGCAGTGGAGATGGAAGCCTCAACATCAGACACCATCAACATTAAGAAAAGC
GGGAAGAGTCTATGGAGACTATCAACCATTCCAAGGGCCAAGAACTGTGGATCACTGGTCGACTCTGAATTACTGGGACTGGATGAGACAAAATTTCAATTCGGAAGAGT
TGGAGTCAACAATAATTATAATGTGGCACATTTGGCCTCAAGAGGCGAGAACCACCCGAGTCAGTCTCTTTGGTCTCCTCCGCCAGCGCATTGTTGGAAGCTCAACATCG
ATGCCTCTTGGAATGATTCGGAAGATAGAGGCGGAGTGGGTTGGACAGTTCGCGACTCCGAAGGCTCTCCCATCTGTGCAGGAACGAAAGTGATCAAAGATAAATGGTCG
ATAAAGATTTTGGAAGCCAAAGCGATTTGGGAAGCCCTTATTAGAATTGCATCCTTACCAGAAAAGCCGCCGAAGATTATAGTGAGCTCTGACTGCCTGGAGTTGATCAC
CCTTCTGAATCGCGGTGATGAAGACCTCTTAGAAGTTCAGACCTTCGTTGATGCCATCCTCAACATGGCCAATACTATGGGGGAGCTTTCTTTTGAGCATTGCCCCAGAG
AGCAAAACTACATTGATCATTCCATCGCGCACGAGAGTGTTGGTTTTACCCGCCATGGTTTTTTGTTCGTTTCGAGCCAGAGGCGTCCTTCCACGCTAGAAGGTGATGAG
TTGTTTTGGACCTCTGAGCTCCCTCATTTTATTTCGTCTATCTGTAATTTTGAACGATTTAGGCTTTTAGGCCTTTCTCAGTAA
Protein sequence
MVFLDLHSASYSALVVVFVEEFRVVRRRGRGSGRGRGRTALEVFVPPVEQENNLTEDPQVEQSDEQLTPATELVTVDAIQAIVQSAVVETVQGKVFKGLGGSARPFDGAS
KNPTVAELWLSSIETIFRHMNCPEDPKGSKVCSLIDDNSCWIEAKVRECFHPQDAKDILNTPLGPNARDEIIWSPDKKGKFSVKSTYHLVMAKAVEMEASTSDTINIKKS
GKSLWRLSTIPRAKNCGSLVDSELLGLDETKFQFGRVGVNNNYNVAHLASRGENHPSQSLWSPPPAHCWKLNIDASWNDSEDRGGVGWTVRDSEGSPICAGTKVIKDKWS
IKILEAKAIWEALIRIASLPEKPPKIIVSSDCLELITLLNRGDEDLLEVQTFVDAILNMANTMGELSFEHCPREQNYIDHSIAHESVGFTRHGFLFVSSQRRPSTLEGDE
LFWTSELPHFISSICNFERFRLLGLSQ
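
The relationship between the CDS and protein sequences above can be checked by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper name, and the two short inputs are the first 21 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon.

    Whitespace and line breaks are removed first. Codons containing
    characters other than A, C, G, T are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


print(translate("ATGGTCTTCTTAGATCTCCAT"))  # start of the CDS above
print(translate("CTTTCTCAGTAA"))           # end of the CDS above
```

Pasting the full CDS into `translate` and stripping the trailing `*` should give a string that can be compared directly with the listed protein sequence.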