CuGenDBv2

Spg030547 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030547
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: zf-RVT domain-containing protein
Genome location: scaffold6:31723583..31727075
RNA-Seq Expression: Spg030547
Synteny: Spg030547
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036397 - Ribonuclease H superfamily
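
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("scaffold6:31723583..31727075")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, start, end, span)
```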


Homology
GenBank top hits    e value    % identity    Alignment
KAF7825238.1 ribonuclease H [Senna tora]    4.6e-29    26.71
Query:  RFESNWLNLKETKNIIADCWNTVDGSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREI----QILYSDEVNQNWDSILRAELELDDLLEEEEE
        RFE +W N      ++   W    GS+  +   K+  C  + +  N     G+I+  I   E  I    QI  +D V +N   I  A+ ELD LL+ EE 
Subjt:  RFESNWLNLKETKNIIADCWNTVDGSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREI----QILYSDEVNQNWDSILRAELELDDLLEEEEE

Query:  YWRLRSREEWLK-------------NEEK---------------------ISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSE
         WR RSR  WLK             N+ +                     I+   ++YF  LF+++  +L  ++ +   +   IS+     L +PYS  E
Subjt:  YWRLRSREEWLK-------------NEEK---------------------ISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSE

Query:  IEVAMKSLSPSKAPGND-------------------------------GTHASFYQSYW-------KVLILANGE----------------WDERLVKNL
        ++ A+ S+ PSKAPG D                               G + + +   W       KV+  ++ +                W+   +  L
Subjt:  IEVAMKSLSPSKAPGND-------------------------------GTHASFYQSYW-------KVLILANGE----------------WDERLVKNL

Query:  FIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDF
        F+P +A+ I +IPL   NV D ++W LE    +SV+SAYH   NS+ +  +S S     +SRW  +W LS+ PK K+ +WR+    I +  N+ R+G+  
Subjt:  FIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDF

Query:  NPLCDLCRGKKEDSDHIFWRFSK
        +  C  C  K E   H+F R  K
Subjt:  NPLCDLCRGKKEDSDHIFWRFSK

KAF7831272.1 ribonuclease H [Senna tora]    4.2e-22    21.7
Query:  RFESNWLNLKETKNIIADCWNTVDGSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQIL-YSDEVNQNWDSILRAELELDDLLEEEEEYWR
        RFE +W N      ++A  W+   GSN       +     + + +N S ++  IK      E  I IL + D  +     I  A+ ELD+LL+ EE  WR
Subjt:  RFESNWLNLKETKNIIADCWNTVDGSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQIL-YSDEVNQNWDSILRAELELDDLLEEEEEYWR

Query:  LRSREEWLK----------------------------------NEEKISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIEV
         RSR  WLK                                  + + I+   ++YF+ LF+++  +L  ++++   +   +SD   E L +P+S  ++++
Subjt:  LRSREEWLK----------------------------------NEEKISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIEV

Query:  AMKSLSPSKAPGNDGTHASFYQSYWK---------VLILANGE---------------------------------------------------------
         + S+ PSKAPG +G  A F+Q++W           L + N E                                                         
Subjt:  AMKSLSPSKAPGNDGTHASFYQSYWK---------VLILANGE---------------------------------------------------------

Query:  ------------------------------------------------------------------WDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVW
                                                                          W+   V  LF+P +A+ I++IPL   N+ D ++W
Subjt:  ------------------------------------------------------------------WDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVW

Query:  SLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIF
          E    +SV+SAYH  ++ +  + +S    S  +SRWR +WDL++ PK K+  WR+    I +  N+ R+G+  +  C  C  K E   H+F
Subjt:  SLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIF

KAF7844569.1 protein BPS1, chloroplastic-like [Senna tora]    3.8e-31    29.32
Query:  RFESNWLNLKETKNIIADCWNTVDGSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQIL----YSDEVNQNWDSILRAELELDDLLEEEEE
        RFE  W       +I+A  W      +A N     I  +  L    K    G+I+  I   E  + IL     +D V +N   I   +LELD L + EE 
Subjt:  RFESNWLNLKETKNIIADCWNTVDGSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQIL----YSDEVNQNWDSILRAELELDDLLEEEEE

Query:  YWRLRSREEWLK--------------------------NEEKISEIASN--------YFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSE
         WR RSR  WLK                          +E  I    SN        YF  LF+++  +L  ++++   I   ++++    L +PY + E
Subjt:  YWRLRSREEWLK--------------------------NEEKISEIASN--------YFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSE

Query:  IEVAMKSLSPSKAPGNDGTHASFYQSYWK---------VLILANGE--------WDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKS
        ++ A+ S+ PSKAPG DG  A F+Q +W          VL + N +        W+   V+ LF+P +A+ IL+IPL   NV D ++W LE    +SV+S
Subjt:  IEVAMKSLSPSKAPGNDGTHASFYQSYWK---------VLILANGE--------WDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKS

Query:  AYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTK-SNLIRKGLDFNPLCDLCRGKKEDSDHIF
        AYHL ++   +  +S   V+  + RW+ +W L++ PK K+  W++    I +   N+ R+G+  +  C  C  K E   H+F
Subjt:  AYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTK-SNLIRKGLDFNPLCDLCRGKKEDSDHIF

XP_042965938.1 uncharacterized protein LOC122299618 [Carya illinoinensis]    2.7e-21    34.33
Query:  ILANGEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKN
        +++NGEWD +L+KN+F   + E I +IP+   N  D+++W   +   F+++SAY L ++     +   S     ++RW+SIWDL+I  K K+ +WR +K+
Subjt:  ILANGEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKN

Query:  LIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFW
        L+ T+SNL+ + +  N  C +C+ ++E + H  W
Subjt:  LIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFW

XP_042965938.1 uncharacterized protein LOC122299618 [Carya illinoinensis]    3.1e-09    23.25
Query:  FESNWLNLKETKNIIADCW--NTVDGSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQILYSDEVNQNWDSILRAELELDDLLEEEEEYWR
        +E+ W   ++ + +I   W     D  + + L   +      L  W+K  ++ + +  +  +   ++ L  DE   N   I + + E+   LE E+  WR
Subjt:  FESNWLNLKETKNIIADCW--NTVDGSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQILYSDEVNQNWDSILRAELELDDLLEEEEEYWR

Query:  LRSREEWLK----------------------------------NEEKISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIEV
         R++ +W K                                  N  +I  I ++YF+ LF S+ P    I++    + T ++ +M   L + ++R E+E 
Subjt:  LRSREEWLK----------------------------------NEEKISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIEV

Query:  AMKSLSPSKAPGNDGTHASFYQSYWKVL
        A+K ++P K+PG DG  A F+Q YW+V+
Subjt:  AMKSLSPSKAPGNDGTHASFYQSYWKVL

XP_042974642.1 uncharacterized protein LOC122306274 [Carya illinoinensis]    7.7e-24    26.62
Query:  ILANGEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKN
        +++NGEWD +L+KN+F   + E I +IP+   N  D ++W   +  +FS++SAY L ++   + +   S    ++ RW+SIWDL+I  K K+ +WR VK+
Subjt:  ILANGEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKN

Query:  LIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFWR--------------FSKWSVKSLE---------------------AVAIKEGLQSFVKSNRDQNL
        L+ T+SNL+ + +  N  C  C+ ++E   H  W                 KW  K  +                     A+     L+  ++   D N 
Subjt:  LIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFWR--------------FSKWSVKSLE---------------------AVAIKEGLQSFVKSNRDQNL

Query:  KLI-VEADAMEVVKVLNHDYLDISEAKSMLDDVEDLAKKAGVISFLKCPRAGNLVAHSLVRTAAGFPPVYLPLPDVVD
          +  E DA  +   +N +  D+S   S+++DV+++ K     S     R  N   H L + A  F    + + D  D
Subjt:  KLI-VEADAMEVVKVLNHDYLDISEAKSMLDDVEDLAKKAGVISFLKCPRAGNLVAHSLVRTAAGFPPVYLPLPDVVD

TrEMBL top hits    e value    % identity    Alignment
A0A6P9EAZ3 uncharacterized protein LOC118347987    5.0e-21    36.36
Query:  LILANGEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSR-------WRSIWDLSIMPKAKI
        L+   GEW+  L++N+F   +A+ I +IP+ S+ V D +VWS   K +FSV+SAYHL +  +         + G+NSR       W SIW+L +  K K+
Subjt:  LILANGEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSR-------WRSIWDLSIMPKAKI

Query:  GLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFWR
         LW+   + + TK NL  K +  NPLC +C+  +E   H+ W+
Subjt:  GLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFWR

A0A6P9EAZ3 uncharacterized protein LOC118347987    1.2e-09    24.02
Query:  RFESNWLNLKETKNIIADCW-NTVDGSN-AQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQILYSDEVNQNWDSILRAELELDDLLEEEEEYW
        RFE+ W   +  + +I   W   + G   +  L+ K+      L  W+K R+  + K  +  K   ++ L  +E + N  +I   + E+  LLE E+  W
Subjt:  RFESNWLNLKETKNIIADCW-NTVDGSN-AQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQILYSDEVNQNWDSILRAELELDDLLEEEEEYW

Query:  RLRSREEWLK----------------------------------NEEKISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIE
        R R++  W +                                  ++E +  +  ++F+ LF S+ P    I++  + +   ++  M ++L + ++R EIE
Subjt:  RLRSREEWLK----------------------------------NEEKISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIE

Query:  VAMKSLSPSKAPGNDGTHASFYQSYWKVL
        VA+KS++P K+PG DG  A F+Q +W  +
Subjt:  VAMKSLSPSKAPGNDGTHASFYQSYWKVL

A0A7N2MHC9 zf-RVT domain-containing protein    1.2e-22    23.3
Query:  RFESNWLNLKETKNIIADCWNTVD--GSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQILYSDEVNQNWDSILRAELELDDLLEEEEEYW
        RFE  W   +  + ++   W+     GS    L  KI RC   L  W+++ + GN KT I  ++  ++ L       N  +I   + E++ LL ++E YW
Subjt:  RFESNWLNLKETKNIIADCWNTVD--GSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQILYSDEVNQNWDSILRAELELDDLLEEEEEYW

Query:  RLRSREEWL-------------------KN---------------EEKISEIASNYFKGLFQSATP-DLRSIQKISDCITTGISDQMREELDQPYSRSEI
        + RSR  WL                   KN               +++I++ + NYFK LF S+ P D   + +  D + T     M   L Q Y+  E+
Subjt:  RLRSREEWL-------------------KN---------------EEKISEIASNYFKGLFQSATP-DLRSIQKISDCITTGISDQMREELDQPYSRSEI

Query:  EVAMKSLSPSKAPGNDGTHASFYQS------------------------------------------------------------------------YW-
        + A+  + PSK+PG DG  A   ++                                                                         W 
Subjt:  EVAMKSLSPSKAPGNDGTHASFYQS------------------------------------------------------------------------YW-

Query:  --------------KVLILAN---GEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRS
                      KV  L N    +WD   +   F P   + IL++PL  ++ +D +VW+    + FSVK+AY +A+  +    A    +      W  
Subjt:  --------------KVLILAN---GEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRS

Query:  IWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFWR
        IW L++ PK +  LWR   N +PT+ NL R+ +    +C  C  + E   HI W+
Subjt:  IWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFWR

A0A7N2N3A5 zf-RVT domain-containing protein    2.9e-29    30.2
Query:  LDDLLEEEEEYWRLRSREEWLKNEEKISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIEVAMKSLSPSKAPGNDGTHASFY
        ++  L +    W +      +++++ I +   NYFK +F S+ P   +I  I D I T ++  M  EL + ++  E+E A+K + P  APG DG    FY
Subjt:  LDDLLEEEEEYWRLRSREEWLKNEEKISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIEVAMKSLSPSKAPGNDGTHASFY

Query:  QSYWKVL----------ILANGE----WDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWR
        ++YW  +          +L + E    W E  ++   +P +A  IL IPL      D ++W      V + KSAY L  +S+  K+   S  +  N  W+
Subjt:  QSYWKVL----------ILANGE----WDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWR

Query:  SIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFW
         +W L I  K K  LWR     +PTK NL ++ +  N  C +C G+ ED+ H  W
Subjt:  SIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFW

A0A803PUL2 Uncharacterized protein    3.7e-24    21.7
Query:  TKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQIL--YSDEVNQNWDSILRAELELDDLLEEEEEYWRLRSREEWLKNEEK------------------
        + I  C  +L +W   +  G +K  I + +++++ L   S   + ++D + +AE  LD+LLE+EE YW+ RSR +WL   ++                  
Subjt:  TKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQIL--YSDEVNQNWDSILRAELELDDLLEEEEEYWRLRSREEWLKNEEK------------------

Query:  ----------------ISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIEVAMKSLSPSKAPGNDGTHASFYQSYWKV----
                        IS +  +++  LF +   D  ++    DCI T +++   + L  P++ +E++ A+K++S  K+PG DG  A FYQ +W +    
Subjt:  ----------------ISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIEVAMKSLSPSKAPGNDGTHASFYQSYWKV----

Query:  -------------------------------------------------------------------------------------LILANGEWDERLVKN
                                                                                             LI    +W+  L++ 
Subjt:  -------------------------------------------------------------------------------------LILANGEWDERLVKN

Query:  LFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLD
         F P D + IL +PL     RD+++W   S   F+V+SAYHLA + +   E   S  +   + W+  W L +  K KI  WR + + +P  ++L+R+ + 
Subjt:  LFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLD

Query:  FNPLCDLCRGKKEDSDHIF---------WRFSKWSVKSLEAVAIKEG
         +  C +C+   E + H           WR    S     A ++K G
Subjt:  FNPLCDLCRGKKEDSDHIF---------WRFSKWSVKSLEAVAIKEG

A0A803QDL0 Uncharacterized protein    3.8e-29    28.61
Query:  FESNWLNLKETKNIIADCWNTVDGSNA-----QNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREI----QILYSDEVNQNWDSILRAELELDDLLE
        FE+ WL      +++ D W+    S +     QN  +K   CI  L  WNK+ L  ++K+ I+  + EI     + ++D+ NQ     L++  +LD LL 
Subjt:  FESNWLNLKETKNIIADCWNTVDGSNA-----QNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREI----QILYSDEVNQNWDSILRAELELDDLLE

Query:  EEEEYWRLRSREEWLKNEEK----------------------------------ISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPY
        +EE YW+ RSR  WLK  +K                                  +S +  +YF  LF S   D  ++  I DC+   + D     LD P+
Subjt:  EEEEYWRLRSREEWLKNEEK----------------------------------ISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPY

Query:  SRSEIEVA--MKSLSPSKAPGN------DGTHASFYQSYWKVLILANGEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLA
        S  E+     ++++     P N      D    S + SY+   I A+G+WD   + N F     ++IL++P+     +D+I+W   S   F+VK+AYHLA
Subjt:  SRSEIEVA--MKSLSPSKAPGN------DGTHASFYQSYWKVLILANGEWDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLA

Query:  VNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDH
         +SQ     S S        W  IW+  I PK K+ +WR++ N +P   +L ++ +  +PLC LC+   E   H
Subjt:  VNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDH

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT3G09510.1 Ribonuclease H-like superfamily protein    8.8e-10    27.2
Query:  WDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKS
        WD+  +      SD   I  I L      D+I+W+  +   ++V+S Y L  +   +   + +   G       IW+L IMPK K  LWR +   + T  
Subjt:  WDERLVKNLFIPSDAEDILAIPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKS

Query:  NLIRKGLDFNPLCDLCRGKKEDSDH
         L  +G+  +P C  C  + E  +H
Subjt:  NLIRKGLDFNPLCDLCRGKKEDSDH


Sequences
CDS sequence
ATGAGGTTTGAGAGCAACTGGTTAAATTTGAAGGAAACAAAGAATATTATTGCTGATTGCTGGAATACAGTAGATGGGAGCAATGCTCAAAATCTTAGCACCAAAATTAT
TAGGTGCATTCATAAGCTCCATATTTGGAATAAGTCACGGCTGAAAGGCAACATTAAAACAACCATAACGAGGAAAGAACGAGAAATTCAGATTCTTTATTCTGATGAAG
TTAACCAGAATTGGGACAGCATTCTCAGGGCTGAGCTTGAACTTGATGATCTGCTTGAAGAAGAAGAGGAATATTGGAGGCTTCGGTCAAGGGAAGAATGGCTTAAAAAT
GAGGAAAAGATCAGTGAGATTGCTTCAAATTACTTCAAGGGCCTTTTTCAATCTGCTACCCCTGACTTAAGAAGCATTCAGAAGATCTCCGATTGTATCACTACTGGGAT
TTCAGATCAAATGAGAGAGGAATTGGACCAACCATATTCCAGAAGTGAGATAGAGGTTGCCATGAAAAGCCTTAGTCCAAGCAAGGCCCCGGGGAACGACGGGACTCATG
CCTCCTTCTATCAATCTTACTGGAAAGTGTTAATTCTTGCCAATGGTGAGTGGGATGAAAGGCTTGTCAAAAACCTTTTCATTCCCTCGGATGCAGAGGATATCTTGGCC
ATTCCTTTAGGGAGCGTGAATGTTAGGGATGAGATTGTATGGAGCCTTGAATCAAAAAGAGTTTTCAGTGTTAAGAGTGCATATCACTTGGCAGTCAACTCTCAGTGTTC
TAAGGAAGCCTCAGGATCCTGTGTCTCTGGCCAAAACAGCAGATGGAGGTCCATCTGGGACCTCAGCATTATGCCAAAAGCAAAAATAGGGCTGTGGAGAATTGTTAAAA
ATCTAATCCCTACTAAATCCAATCTCATCCGTAAAGGTCTTGATTTTAATCCTCTCTGTGATTTGTGCAGGGGCAAGAAAGAGGATTCGGACCATATTTTCTGGAGATTC
AGCAAGTGGAGCGTGAAATCGTTGGAGGCGGTTGCGATCAAAGAAGGACTTCAGTCCTTCGTGAAGAGCAACAGAGACCAAAATTTGAAGCTGATTGTCGAAGCAGACGC
TATGGAAGTGGTCAAAGTTCTGAACCACGATTACCTCGACATCTCAGAAGCGAAGTCTATGCTCGATGATGTTGAGGATTTGGCGAAGAAAGCGGGCGTCATCTCCTTCC
TCAAATGCCCAAGGGCGGGCAATCTTGTAGCGCACTCTCTTGTGCGTACAGCGGCGGGTTTCCCTCCGGTGTATCTGCCGTTGCCCGATGTCGTTGACGGTTTTTTTGTA
TCTTCTTCCTCTTCCACGCTGGAAGGCTTATTGTTTTGTAAGGGCGATTCTCTCCCCTTTTGGATCTCCTCCTTAATTTTGGAGGATGTTGGTATACCAAACTCTTTAGC
TTCTTAA
mRNA sequence
ATGAGGTTTGAGAGCAACTGGTTAAATTTGAAGGAAACAAAGAATATTATTGCTGATTGCTGGAATACAGTAGATGGGAGCAATGCTCAAAATCTTAGCACCAAAATTAT
TAGGTGCATTCATAAGCTCCATATTTGGAATAAGTCACGGCTGAAAGGCAACATTAAAACAACCATAACGAGGAAAGAACGAGAAATTCAGATTCTTTATTCTGATGAAG
TTAACCAGAATTGGGACAGCATTCTCAGGGCTGAGCTTGAACTTGATGATCTGCTTGAAGAAGAAGAGGAATATTGGAGGCTTCGGTCAAGGGAAGAATGGCTTAAAAAT
GAGGAAAAGATCAGTGAGATTGCTTCAAATTACTTCAAGGGCCTTTTTCAATCTGCTACCCCTGACTTAAGAAGCATTCAGAAGATCTCCGATTGTATCACTACTGGGAT
TTCAGATCAAATGAGAGAGGAATTGGACCAACCATATTCCAGAAGTGAGATAGAGGTTGCCATGAAAAGCCTTAGTCCAAGCAAGGCCCCGGGGAACGACGGGACTCATG
CCTCCTTCTATCAATCTTACTGGAAAGTGTTAATTCTTGCCAATGGTGAGTGGGATGAAAGGCTTGTCAAAAACCTTTTCATTCCCTCGGATGCAGAGGATATCTTGGCC
ATTCCTTTAGGGAGCGTGAATGTTAGGGATGAGATTGTATGGAGCCTTGAATCAAAAAGAGTTTTCAGTGTTAAGAGTGCATATCACTTGGCAGTCAACTCTCAGTGTTC
TAAGGAAGCCTCAGGATCCTGTGTCTCTGGCCAAAACAGCAGATGGAGGTCCATCTGGGACCTCAGCATTATGCCAAAAGCAAAAATAGGGCTGTGGAGAATTGTTAAAA
ATCTAATCCCTACTAAATCCAATCTCATCCGTAAAGGTCTTGATTTTAATCCTCTCTGTGATTTGTGCAGGGGCAAGAAAGAGGATTCGGACCATATTTTCTGGAGATTC
AGCAAGTGGAGCGTGAAATCGTTGGAGGCGGTTGCGATCAAAGAAGGACTTCAGTCCTTCGTGAAGAGCAACAGAGACCAAAATTTGAAGCTGATTGTCGAAGCAGACGC
TATGGAAGTGGTCAAAGTTCTGAACCACGATTACCTCGACATCTCAGAAGCGAAGTCTATGCTCGATGATGTTGAGGATTTGGCGAAGAAAGCGGGCGTCATCTCCTTCC
TCAAATGCCCAAGGGCGGGCAATCTTGTAGCGCACTCTCTTGTGCGTACAGCGGCGGGTTTCCCTCCGGTGTATCTGCCGTTGCCCGATGTCGTTGACGGTTTTTTTGTA
TCTTCTTCCTCTTCCACGCTGGAAGGCTTATTGTTTTGTAAGGGCGATTCTCTCCCCTTTTGGATCTCCTCCTTAATTTTGGAGGATGTTGGTATACCAAACTCTTTAGC
TTCTTAA
Protein sequence
MRFESNWLNLKETKNIIADCWNTVDGSNAQNLSTKIIRCIHKLHIWNKSRLKGNIKTTITRKEREIQILYSDEVNQNWDSILRAELELDDLLEEEEEYWRLRSREEWLKN
EEKISEIASNYFKGLFQSATPDLRSIQKISDCITTGISDQMREELDQPYSRSEIEVAMKSLSPSKAPGNDGTHASFYQSYWKVLILANGEWDERLVKNLFIPSDAEDILA
IPLGSVNVRDEIVWSLESKRVFSVKSAYHLAVNSQCSKEASGSCVSGQNSRWRSIWDLSIMPKAKIGLWRIVKNLIPTKSNLIRKGLDFNPLCDLCRGKKEDSDHIFWRF
SKWSVKSLEAVAIKEGLQSFVKSNRDQNLKLIVEADAMEVVKVLNHDYLDISEAKSMLDDVEDLAKKAGVISFLKCPRAGNLVAHSLVRTAAGFPPVYLPLPDVVDGFFV
SSSSSTLEGLLFCKGDSLPFWISSLILEDVGIPNSLAS
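
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (assumed), written in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the listing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGAGGTTTGAGAGCAACTGG"))  # MRFESNW
```

Passing the full CDS (all lines joined) to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.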