; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028608 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028608
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:26228207..26231095
RNA-Seq ExpressionLag0028608
SyntenyLag0028608
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EEC83100.1 hypothetical protein OsI_28249 [Oryza sativa Indica Group]6.2e-6128.95Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK----------------
        M  FKLP S+C+D++K    FWWG+ + KRR HW +W  ++  K  GG+ FRD  LFNQA+LA+QSWRI++ P SL ARVL+ K                
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK----------------

Query:  ---------------------IGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN
                             +G+ R   I +DPW+    SR P+  K + + K V +LLD  G W   K+ + F P   E+ILSI   +   +D + W+
Subjt:  ---------------------IGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN

Query:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP--ED
         D  G+FSVRSAY+LA++L++    S+SS       W +LWS ++  + +I +W+  SN+L T  N  ++ +   S C +C ++EE   H    CP  + 
Subjt:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP--ED

Query:  FWMIM-----------------------VDKLSKEDLGQVAIVLWVLWCYRDKCN----------SNRQISDVIQICRSIQNGFDVLAKSGKGYLVAAQA
         W +M                        +++SKE+   + ++LW +W  R++            S R IS  I     I+   D     GK  +  A A
Subjt:  FWMIM-----------------------VDKLSKEDLGQVAIVLWVLWCYRDKCN----------SNRQISDVIQICRSIQNGFDVLAKSGKGYLVAAQA

Query:  KSQMSH--------NWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGI-------SCP
         +Q++H         W+   +G  KLN D S+       G+G V+R+S   LI A    + +   +  +EA+ +          A K+GI         P
Subjt:  KSQMSH--------NWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGI-------SCP

Query:  LLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARG-VVISIGFCSRWDNILAHQVARAATS
        +++E+D   +V  + ++    S++  ++  I ++ +G   I I   +R  +I  H +A    S
Subjt:  LLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARG-VVISIGFCSRWDNILAHQVARAATS

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]2.2e-5830.23Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGKIGDDRRTVIDKDPWLI
        MS FKLP SIC+DI K  ARFWWGSS  +R  HW  W K+  +K +GG+ FRD   FNQA++AKQ WRII++P SL+ARVL        +  I K  WL 
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGKIGDDRRTVIDKDPWLI

Query:  NQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMW
           +  P           V DL+D    W +  + + F    A  I+ I   K+ + D+ +W+ D  G++SV+S Y +A+ L      S  S D++ S W
Subjt:  NQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMW

Query:  KELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSC--------------------PEDFWMIMVD---KLSKEDLGQ
          +W+  +  + KI +W+ + N LPT  N+ RR I     C  C  K E   H  + C                     +D   ++V+   K  K++L  
Subjt:  KELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSC--------------------PEDFWMIMVD---KLSKEDLGQ

Query:  VAIVLWVLWCYRDK-CNSNRQISDVIQICRSIQNGFDVLAKSGKGY-LVAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGA
        + ++ W +W  ++     N++    + I R+  N  D   +  K +  +  + +      W P  SG +K+N DA+     +  GLG +IR+S+  ++ A
Subjt:  VAIVLWVLWCYRDK-CNSNRQISDVIQICRSIQNGFDVLAKSGKGY-LVAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGA

Query:  GYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVALVVDAIAE-IARGVVISIGFCSRWDNILAHQVARAAT
           +   +     MEA+A++ G+K+    A + G S P++IE+DS  VV+    + V   E + ++  I E I      SI +  R  N+ AH +A+ A 
Subjt:  GYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVALVVDAIAE-IARGVVISIGFCSRWDNILAHQVARAAT

Query:  SHGNFLCFFGALSSSV
           N + + G+    +
Subjt:  SHGNFLCFFGALSSSV

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]2.0e-5927.62Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVL-------------------
        MS FKLPK +C+DI K  ARFWWG+ + K   HW  W  +S +K +GGL FRDL  FNQA++AKQ WR+++ PNSL+ARV+                   
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVL-------------------

Query:  ------------------RGKIGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN
                          R +IGD ++ ++ KD W+    +  P+  K       V DL+D+E  W   ++ + F     E IL I        D+++W+
Subjt:  ------------------RGKIGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN

Query:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP----
         D KG++SV+S Y LA+N N   +  +S  +++  +WK  W L++  + KI +W+ + N LPT  N+ +R    +  C  C+ + E+  H+ + C     
Subjt:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP----

Query:  ----------------EDFWMIMVDKLSKEDLGQVAIVL---WVLWCYRDKCNSNRQISD---VIQICRSIQNGFDVLAKSGKGYLVAAQAKSQMSHNWI
                        +DF+  + +  S+    +  +++   WV+W  R+K     + SD   +     S+   +  ++K G  +   A+ +      W 
Subjt:  ----------------EDFWMIMVDKLSKEDLGQVAIVL---WVLWCYRDKCNSNRQISD---VIQICRSIQNGFDVLAKSGKGYLVAAQAKSQMSHNWI

Query:  PLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEV
        P      KLN DA+     +K GLG ++RD+   ++  G  Q   +  + + EA+AI  GL+    ++     S  L++ESD   VV  +N+     +E+
Subjt:  PLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEV

Query:  ALVV-DAIAEIARGVVISIGFCSRWDNILAHQVARAATSHGNFLCFFGALSSSV
          ++ D   E      +   F  R  N  AH +A+ A  + +   + G   + V
Subjt:  ALVV-DAIAEIARGVVISIGFCSRWDNILAHQVARAATSHGNFLCFFGALSSSV

XP_024033483.1 uncharacterized protein LOC112095606 [Citrus clementina]2.0e-5930.3Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRG-----------KIGDDR
        MS FK+P  +C+DI K  A FWWGS   KR  HW  W K+S +K KG + FRD + FNQA++AKQ WRI++ P+SLVA+VL+            +IGD +
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRG-----------KIGDDR

Query:  RTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVS
        +  I KD W+    +  P+      +   V +L++ E  W E+++   F+   A++I+ IP  +S + D IIW+ D KG +SV+S Y  A++L  K    
Subjt:  RTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVS

Query:  ASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSC--PEDFWMIMVDKLSKEDLGQVAIVLWVLWC
          S  + K+ W  +W+L +  + +I  W+   N LP+  N+ +R I  +  C + +   E+  H  V C      W +    L + D+ Q A    +   
Subjt:  ASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSC--PEDFWMIMVDKLSKEDLGQVAIVLWVLWC

Query:  YRDKCNSNRQISDVIQICRSIQNGFDVLAKSGKGYLVAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDK-KWLI
         R+   +      ++    ++   +  + K  +  + + Q ++Q    W P  SG  K+N DA+        GLG +IRD   ++I A  I+I K    +
Subjt:  YRDKCNSNRQISDVIQICRSIQNGFDVLAKSGKGYLVAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDK-KWLI

Query:  KVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARGV-VISIGFCSRWDNILAHQVARAATSHGNFLCFFGA
           EA+A+  GL+  R+ + K      L++ESD+  VVN +N++    SE+  ++  I  + R   ++SI +  R  N +AH +A+ A      + + G+
Subjt:  KVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARGV-VISIGFCSRWDNILAHQVARAATSHGNFLCFFGA

Query:  LSSSV
          S +
Subjt:  LSSSV

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]8.1e-6130.06Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVL-------------------
        MS FKLP+  CDDI +  A+FWWGS   KR  HW  W K+S +K +GGL FR+ + FNQA++AKQ+WR+++ PNSLV+RVL                   
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVL-------------------

Query:  ------------------RGKIGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN
                          R +IG+ ++  I  D WL    +  P++         V DL+  +  W E K+ + F      EIL IP       D+++W+
Subjt:  ------------------RGKIGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN

Query:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSC--PED
         D +G +SV+S Y LA  L SK   S S  + +   W  LW+L +  + KI +W+  +N LP+  N+ +R +  +  C  C+   E+  H  + C     
Subjt:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSC--PED

Query:  FWM---------------------IMVDKLSKEDLGQVAIVLWVLWCYRDKC-NSNRQISDVIQICR--SIQNGFDVLAKSGKGYLVAAQAKSQMSHNWI
         W+                      M  +L K DL  +  + W  W  R+KC    R+++ +I   +  S+   F  + K  + ++  +  + Q    W+
Subjt:  FWM---------------------IMVDKLSKEDLGQVAIVLWVLWCYRDKC-NSNRQISDVIQICR--SIQNGFDVLAKSGKGYLVAAQAKSQMSHNWI

Query:  PLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEV
        P     +K+N DA++   +   G+G VIRDS+  ++ AG  Q   K    + EA+A+L GL+  R+ AD       L+IESD   VV  +N+     SE+
Subjt:  PLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEV

Query:  ALVVDAI---AEIARGVVISIGFCSRWDNILAHQVARAA
           + AI    +I + VV++     R  N  AH +A+ A
Subjt:  ALVVDAI---AEIARGVVISIGFCSRWDNILAHQVARAA

TrEMBL top hitse value%identityAlignment
A0A6J1DAR4 uncharacterized protein LOC1110189545.0e-5628.76Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRG-----------------
        MSCF+LPK +  + +   ARFWWGSS+  ++ HW++W  +   K +GG+ FRDL LFN+A+LAKQ WRI+ +PNS+++RVL+G                 
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRG-----------------

Query:  --------------------KIGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDN-EGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIW
                            +IG+     I  D W+ NQ +   +         RV  L+D+ EG W    V + F+P +A+ ILSIP  +    D++IW
Subjt:  --------------------KIGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDN-EGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIW

Query:  NPDPKGKFSVRSAYNLA-VNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSC--P
        N +  G +SVRS Y +A +N    +  S+SS +  +  W   W ++I  + K+ +W++  + LPT  N+++RG+ I + C  C    E + HL   C   
Subjt:  NPDPKGKFSVRSAYNLA-VNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSC--P

Query:  EDFWM---------IMV-----DKLSKEDLGQVAIVLWVLWCYRDKCNSNRQISDVIQICRSI---QNGFDVLAKSGKGYLVAAQAKSQMSHNWIPLDSG
        E  W+          ++     + LSK D  ++ +V+W LW  R+    N     V +I   +    N + +  +  K   +  +  +     W P D G
Subjt:  EDFWM---------IMV-----DKLSKEDLGQVAIVLWVLWCYRDKCNSNRQISDVIQICRSI---QNGFDVLAKSGKGYLVAAQAKSQMSHNWIPLDSG

Query:  RWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVA-LVV
         +K+N+DAS++ + +  GLG +I +    ++ A    ++    + + EA A +EGL+    LA + G+   L                  D SE   +V+
Subjt:  RWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVA-LVV

Query:  DAIAEIARGVVISIGFCSRWDNILAHQVARAA
         A     + +  S  F  R  N  AH +AR A
Subjt:  DAIAEIARGVVISIGFCSRWDNILAHQVARAA

B8BBX0 Reverse transcriptase domain-containing protein3.0e-6128.95Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK----------------
        M  FKLP S+C+D++K    FWWG+ + KRR HW +W  ++  K  GG+ FRD  LFNQA+LA+QSWRI++ P SL ARVL+ K                
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK----------------

Query:  ---------------------IGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN
                             +G+ R   I +DPW+    SR P+  K + + K V +LLD  G W   K+ + F P   E+ILSI   +   +D + W+
Subjt:  ---------------------IGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN

Query:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP--ED
         D  G+FSVRSAY+LA++L++    S+SS       W +LWS ++  + +I +W+  SN+L T  N  ++ +   S C +C ++EE   H    CP  + 
Subjt:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP--ED

Query:  FWMIM-----------------------VDKLSKEDLGQVAIVLWVLWCYRDKCN----------SNRQISDVIQICRSIQNGFDVLAKSGKGYLVAAQA
         W +M                        +++SKE+   + ++LW +W  R++            S R IS  I     I+   D     GK  +  A A
Subjt:  FWMIM-----------------------VDKLSKEDLGQVAIVLWVLWCYRDKCN----------SNRQISDVIQICRSIQNGFDVLAKSGKGYLVAAQA

Query:  KSQMSH--------NWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGI-------SCP
         +Q++H         W+   +G  KLN D S+       G+G V+R+S   LI A    + +   +  +EA+ +          A K+GI         P
Subjt:  KSQMSH--------NWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGI-------SCP

Query:  LLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARG-VVISIGFCSRWDNILAHQVARAATS
        +++E+D   +V  + ++    S++  ++  I ++ +G   I I   +R  +I  H +A    S
Subjt:  LLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARG-VVISIGFCSRWDNILAHQVARAATS

M5XSK0 Reverse transcriptase domain-containing protein1.8e-5829.16Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK----------------
        MSCF LPK +C+D+NK  A+FWW SS   ++ HWM+W ++   K +GGL FR+L+ FN A+LAKQ WR+++NP+SLV +VL+ K                
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK----------------

Query:  ------IGDDRRTVIDKDPWLINQGSRCPVW------VKDSFQ----------GKRVCDLLDNEGFWCEAKVLE-AFSPSKAEEILSIPRQKSIRHDKII
              + D R  +I    W +  G    +W        +SFQ            +V DL+  +     A +L+  F P +   I SIP    +  D ++
Subjt:  ------IGDDRRTVIDKDPWLINQGSRCPVW------VKDSFQ----------GKRVCDLLDNEGFWCEAKVLE-AFSPSKAEEILSIPRQKSIRHDKII

Query:  WNPDPKGKFSVRSAYNLAVNLNSKED-VSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHL------
        W+ D KG F+V+SAY++A +L+S     S+S+ D     W  LW   +  R K   W++IS  LPT  N+ R+ + +D  C+LC    +S  H+      
Subjt:  WNPDPKGKFSVRSAYNLAVNLNSKED-VSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHL------

Query:  --GVSCPEDFWMIMVDKLSKEDLGQVAIVLWVLWCYRDKC--NSNRQISDVIQICRSIQNGFDVLAKSGKGYLVAAQAKSQMSHNWIPLDSGRWKLNSDA
          G   P+D+     ++LS +D     +V W +W  R+    N+ +   + + +  S++   D L  S    L +   + Q+   W P      K+N D 
Subjt:  --GVSCPEDFWMIMVDKLSKEDLGQVAIVLWVLWCYRDKC--NSNRQISDVIQICRSIQNGFDVLAKSGKGYLVAAQAKSQMSHNWIPLDSGRWKLNSDA

Query:  SWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARG
        +W   + + G+G V+RDS+   +     ++   +    +EA A     +    LA + G    ++ ESD+  +V A+ +  +D S +  VV+    +   
Subjt:  SWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARG

Query:  VVISIGF--CSRWDNILAHQVARAATSHGNFLCFF
        +    GF    R  N +AH++AR A   G  L +F
Subjt:  VVISIGF--CSRWDNILAHQVARAATSHGNFLCFF

Q2QNX8 Retrotransposon protein, putative, unclassified2.2e-5632.91Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGKIGDDRRTVIDKDPWLI
        M  FKLP SICD++ K    FWWGS + KR+AHW SW  I+  K  GGL FRD  LFNQA+LA+Q+WR+I+NP+SL  ++ R             DPWL 
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGKIGDDRRTVIDKDPWLI

Query:  NQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMW
           SR P+  K + + K V DLLD  G W    + + F P   E I SI        D I W PD  G+FS+RSAY LAV L +  + S+SS + TK +W
Subjt:  NQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMW

Query:  KELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP--EDFWMIM-----------------------VDKLSKEDL
          +W  NI  + K+  W+ ISN LPT  N  +R   +   C  C  + E   H    CP    +W +M                       ++ +S++D 
Subjt:  KELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP--EDFWMIM-----------------------VDKLSKEDL

Query:  GQVAIVLWVLWCYRDKCNSNRQISDV---IQICRSIQNGFDVL-----AKSGKGYLVAAQ-----AKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLG
            + LW +W  R++   ++    V    +  RS  N    +     A  G G  V  +      K  +   W    SG  KLN D S+ E +    + 
Subjt:  GQVAIVLWVLWCYRDKCNSNRQISDV---IQICRSIQNGFDVL-----AKSGKGYLVAAQ-----AKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLG

Query:  WVIRDSSRSLIGA--GYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGIS-------CPLLIESDSA
         ++R+S+  +I A  G+++          + Q+ LE       LA K+GI         P++IESD A
Subjt:  WVIRDSSRSLIGA--GYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGIS-------CPLLIESDSA

Q2QUC2 Retrotransposon protein, putative, unclassified2.0e-5728.57Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK----------------
        MS F+LP+S+C+D+NK    FWWG+ + KR+ HW +W  ++  K  GGL FRD  LFNQA+LA+Q+WR++  P+SL ARV++ K                
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK----------------

Query:  ---------------------IGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN
                             IG+ R   I +DPW+    S  P+  K + + K V DLL  +G W   KV   F P  A+EIL I     +  D + W+
Subjt:  ---------------------IGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWN

Query:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCPE--D
        PD  G+FSVRSAY LA++L+  ++ S+SS    + +W  +W  N+  + K+  W+  +N L    N  +R +     C +C ++ E   H    CP    
Subjt:  PDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCPE--D

Query:  FWMIM-----------------------VDKLSKEDLGQVAIVLWVLWCYRDKCNSNRQISDVIQICRSIQNGFDVL--------AKSGKGYLVAAQ---
         W  M                       ++    E+   + ++LW +W  R++    +    +    R +++    L        A   +G  V  Q   
Subjt:  FWMIM-----------------------VDKLSKEDLGQVAIVLWVLWCYRDKCNSNRQISDVIQICRSIQNGFDVL--------AKSGKGYLVAAQ---

Query:  AKSQMSH-------NWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLI--GAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGIS-------
         +S+ +H        W     G  KLN D S+   S K G+G V+RDS  ++I    G+++             + LE       LA K+GI+       
Subjt:  AKSQMSH-------NWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLI--GAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGIS-------

Query:  CPLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARG
         P+++ESD    VN I     + S++A +V  I E+  G
Subjt:  CPLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657505.0e-2121.74Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK--IGD--DRRTVIDKD
        MS   LP+SI + +++    F WGS+  K++ H + W K+ + K +GGL  R     N+A+++K  WR+++  NSL   VL+ K  +G+  D R +I K 
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK--IGD--DRRTVIDKD

Query:  P---------------------WLINQGSRCPVWVKDSFQGKRV-----------CDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIR--------
                              W+   G +   W      GK +           CD +  +  W   +    +  +K +   +   +  +R        
Subjt:  P---------------------WLINQGSRCPVWVKDSFQGKRV-----------CDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIR--------

Query:  --HDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHL
           D++ W     G+FSVRSAY +         V         S +  LW + +  R K  +W + + A+ T     RR +   + C +C+   ES  H+
Subjt:  --HDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHL

Query:  GVSCPED--FWMIMV----------------------DKLSKEDLGQ---VAIVLWVLWCYRDKCNS----NRQISDVIQICRSIQNGFDVL-AKSGKGY
           CP     W+ +V                      D+   ED+      A+++W  W ++ +C +    N +  D ++  +  +   +V  A SG   
Subjt:  GVSCPED--FWMIMV----------------------DKLSKEDLGQ---VAIVLWVLWCYRDKCNS----NRQISDVIQICRSIQNGFDVL-AKSGKGY

Query:  LVAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDK
        +   Q + +    W+    G  K+N+D +          G V+RD + +  G   + I +
Subjt:  LVAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDK

P93295 Uncharacterized mitochondrial protein AtMg003105.2e-1850.59Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSK-FKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK
        MSCF+L K +C  +      FWW S E KR+  W++W+K+  SK   GGL FRDL  FNQA+LAKQS+RII  P++L++R+LR +
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSK-FKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK

Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.2e-0820.79Show/hide
Query:  RGIYIDSRCVLCRSKEESTDHLGVSC---------------PEDFWM--------------IMVDKLSKEDLGQ-VAIVLWVLWCYRDKCNSNRQISDVI
        R ++ ++ CV C    E+ +HL   C               PE  W               + + KL K  +G  V  +LW LW  R++     +  D  
Subjt:  RGIYIDSRCVLCRSKEESTDHLGVSC---------------PEDFWM--------------IMVDKLSKEDLGQ-VAIVLWVLWCYRDKCNSNRQISDVI

Query:  QICRSIQNGFDVLA--KSGKGYLVAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKA
        ++ R     F+  +  +  +G     Q +  +S  W        K N+DA+W   + + G+GW++R+ S  ++  G   + +   +   E +A+   +  
Subjt:  QICRSIQNGFDVLA--KSGKGYLVAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKA

Query:  YRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARGVVISIGFCSRWDNILAHQVARAATSHGNF
              K      ++ ESD+  +VN +N  D   +    + D    +     +   F  R  N +A ++AR + S  N+
Subjt:  YRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARGVVISIGFCSRWDNILAHQVARAATSHGNF

AT3G09510.1 Ribonuclease H-like superfamily protein2.5e-2322.15Show/hide
Query:  FRDLNLFNQAMLAKQS--WRIIKNPNSLVARVLRGKIGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEG---FWCEAKVLEAFSPSKAEE
        F+D+++ +  +  +QS  W  + +  +L+ +  R  IGD +   I  D  +++     P+  +++++   + +L + +G   FW ++K+ +    S    
Subjt:  FRDLNLFNQAMLAKQS--WRIIKNPNSLVARVLRGKIGDDRRTVIDKDPWLINQGSRCPVWVKDSFQGKRVCDLLDNEG---FWCEAKVLEAFSPSKAEE

Query:  ILSIPRQKSIRHDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCR
        I  I   KS + DKIIWN +  G+++VRS Y L  +  S    + + P  +  +   +W+L I+P+ K  +W+ +S AL T   +T RG+ ID  C  C 
Subjt:  ILSIPRQKSIRHDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCR

Query:  SKEESTDHLGVSCP--------EDFWMIMVDKLSKEDLGQVAIVL--------------------WVLWCYRDKCNSNRQISDVIQICRSIQNGFDVLAK
         + ES +H   +CP         D  +I    +S +    ++ +L                    W +W  R+    N+      +   S +        
Subjt:  SKEESTDHLGVSCP--------EDFWMIMVDKLSKEDLGQVAIVL--------------------WVLWCYRDKCNSNRQISDVIQICRSIQNGFDVLAK

Query:  SGKGYLVAAQAKSQMSHN---WIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKA--YRSLADKDGISC
        + + +        Q++ N   W    +   K N DA +     +   GW+IR+   + I  G +++         E +A+L  L+    R          
Subjt:  SGKGYLVAAQAKSQMSHN---WIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKA--YRSLADKDGISC

Query:  PLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARGVVISIGFCSRWDNILAHQVARAATSHGNFLCFFGAL
         + +E D   ++N IN      S    + D      +   I  GF  R  N LAH +A+   ++  F    G+L
Subjt:  PLLIESDSACVVNAINDRDVDFSEVALVVDAIAEIARGVVISIGFCSRWDNILAHQVARAATSHGNFLCFFGAL

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.4e-1626.37Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGKIGDDRRTVIDKDPWLI
        MS F+LP +   +I+  C+ F W   E   +   ++W  + T K +GGL  R L   N+       W I  + N+ +   +  KI   R          I
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGKIGDDRRTVIDKDPWLI

Query:  NQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWNPD----------PKGKFSVRSAYN---LAVNLNSKED
        + GS    W  +  +  R+ D+  + G       L A   S AE +++  R +  RHD ++   D            G+ +VR   N        N+KE 
Subjt:  NQGSRCPVWVKDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWNPD----------PKGKFSVRSAYN---LAVNLNSKED

Query:  VSASSPDTTKSMW-KELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP
         +A+     K  W K +W  +  P+  +  W  I N L T   +       DS CVLC    E+ DHL  +CP
Subjt:  VSASSPDTTKSMW-KELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCP

AT4G29090.1 Ribonuclease H-like superfamily protein6.6e-3723.02Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGKI---GDDRRTVIDKDP
        M+CF LPK++C  I    A FWW + +  +  HW +W  +S  K +GG+ F+D+  FN A+L KQ WR++  P SL+A+V + +     D     +   P
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGKI---GDDRRTVIDKDP

Query:  ---W--------LINQGSRCPV--------WVKDSFQGK-----------------------RVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSI
           W        ++ QG+R  V        W       K                       +V DL+D  G      V+E   P    +++   R    
Subjt:  ---W--------LINQGSRCPV--------WVKDSFQGK-----------------------RVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSI

Query:  R-HDKIIWNPDPKGKFSVRSAY-NLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDH
        R  D   W+    G ++V+S Y  L   +N +      S  +   +++++W     P+ +  +WK +SN+LP    +  R +  +S C+ C S +E+ +H
Subjt:  R-HDKIIWNPDPKGKFSVRSAY-NLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKIISNALPTNFNITRRGIYIDSRCVLCRSKEESTDH

Query:  LGVSCP-------------------------EDFWMIMVDKLSK--EDLGQ-VAIVLWVLWCYRDKCNSNRQISDVIQICRSIQNGFDV--LAKSGKGYL
        L   C                            +W+  +   +   E   Q V  +LW LW  R++     +  +  ++ R  ++  +   +    +   
Subjt:  LGVSCP-------------------------EDFWMIMVDKLSK--EDLGQ-VAIVLWVLWCYRDKCNSNRQISDVIQICRSIQNGFDV--LAKSGKGYL

Query:  VAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACV
           Q        W P      K N+DA+W   + + G+GWV+R+    +   G   + K  L  V+EA+  LE ++ +  L+        ++ ESDS  +
Subjt:  VAAQAKSQMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACV

Query:  VNAINDRDVDFSEVALVVDAIAEIARGVVISIGFCSRWDNILAHQVARAATSHGNF
        +  +N+ ++  S    + D    +++   +   F  R  N LA +VAR + S  N+
Subjt:  VNAINDRDVDFSEVALVVDAIAEIARGVVISIGFCSRWDNILAHQVARAATSHGNF

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.7e-1950.59Show/hide
Query:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSK-FKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK
        MSCF+L K +C  +      FWW S E KR+  W++W+K+  SK   GGL FRDL  FNQA+LAKQS+RII  P++L++R+LR +
Subjt:  MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSK-FKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTGCTTCAAGCTCCCTAAAAGTATATGTGATGATATCAACAAGCATTGCGCCAGGTTCTGGTGGGGGTCTTCTGAGGCAAAGCGAAGAGCTCACTGGATGAGTTG
GAGGAAAATAAGCACGAGCAAATTCAAAGGAGGGCTTGACTTCAGAGATCTTAATCTGTTTAATCAAGCCATGCTCGCAAAGCAAAGCTGGAGGATCATTAAAAATCCCA
ATAGTTTAGTGGCTAGAGTGTTAAGAGGCAAGATTGGAGATGATAGGCGAACTGTCATCGACAAAGATCCTTGGCTTATTAATCAAGGCAGCAGATGCCCGGTTTGGGTC
AAAGATAGCTTCCAAGGCAAGAGGGTTTGTGATTTGCTGGATAATGAAGGGTTCTGGTGTGAAGCAAAAGTTTTGGAAGCTTTTTCTCCTTCTAAGGCCGAAGAGATCCT
TAGTATCCCTCGCCAGAAATCTATCAGACATGATAAAATCATCTGGAATCCCGATCCAAAAGGGAAGTTTTCTGTTAGAAGCGCCTACAATTTAGCGGTGAATCTCAACT
CTAAGGAAGATGTGTCGGCTTCCTCTCCCGATACCACTAAATCCATGTGGAAGGAGCTATGGAGTCTGAACATCGTCCCTAGAGCCAAGATCACCGTCTGGAAAATTATT
AGCAATGCTCTGCCCACTAACTTCAATATTACTCGAAGAGGGATTTATATTGACTCTCGTTGTGTTCTTTGCAGGAGCAAAGAGGAGTCTACTGACCATCTGGGAGTTAG
CTGTCCTGAGGATTTCTGGATGATCATGGTGGATAAGCTTAGCAAAGAGGATTTGGGGCAAGTTGCTATTGTGTTGTGGGTGTTGTGGTGTTATAGGGATAAATGCAATT
CTAATCGCCAGATTTCAGATGTTATCCAGATCTGTAGATCGATTCAAAATGGATTCGATGTTTTAGCGAAAAGTGGGAAGGGTTACCTGGTAGCTGCGCAGGCGAAGAGC
CAAATGAGTCACAATTGGATTCCTCTGGATTCCGGTCGGTGGAAACTCAATTCTGATGCGTCGTGGATCGAGGCGAGCAGAAAAAGAGGGTTGGGCTGGGTGATTCGTGA
CTCCTCTAGATCTTTGATAGGAGCAGGCTACATTCAAATTGACAAGAAATGGCTAATCAAGGTCATGGAAGCCCAAGCAATTCTCGAAGGGCTAAAGGCTTACCGTTCGT
TGGCTGACAAAGATGGTATCAGTTGTCCCCTTCTGATTGAGTCTGATTCCGCTTGTGTCGTGAATGCCATCAACGACAGGGATGTTGATTTTTCAGAGGTGGCTCTGGTT
GTGGATGCTATCGCCGAGATTGCTAGAGGAGTTGTCATCTCCATTGGCTTTTGTAGCAGATGGGATAATATTCTGGCCCATCAGGTTGCTCGTGCCGCAACCAGTCATGG
GAATTTTTTGTGTTTTTTTGGTGCCCTTTCTTCCTCTGTTTCGGAAGATGGCACGAGGGGAGGGAATGAGTCAATTCCCCCTGGGTCGGAGCACTTTCCGGTCCACGATG
ATCACGCTCGGCCTCGGCCCATTGCCGAGGCCGAGGATGAGGTCGGCCTCGGCCCACTGTCGAGGCCGACCAGGGCCAAAAGCTCGATGGACGCCAAAAGTCCCCCAGTA
GCAGAAACCCTAGGGGAGCTTATAAAAGGGGAGACCACGCACGCACTAAGGGATGAAAAATCCAACCCTAAATACGCTCTACGTGTTCTTCGGCCAAAGACTAACTTAAG
CATCGGAGTGTGTGTGGTCTACACCACACCGGTGTGCAGCGTTTGCTCGTCTTGCAGGTCACGTCCTTCCCGCTTCCTTACAAATTCACCGTCGATCGTTGTGTGGAGCA
AGGGGCAAATTCCTAAGCGAAATTTGACCATCAAAAATATGCAAAATAAACGTATGTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTGCTTCAAGCTCCCTAAAAGTATATGTGATGATATCAACAAGCATTGCGCCAGGTTCTGGTGGGGGTCTTCTGAGGCAAAGCGAAGAGCTCACTGGATGAGTTG
GAGGAAAATAAGCACGAGCAAATTCAAAGGAGGGCTTGACTTCAGAGATCTTAATCTGTTTAATCAAGCCATGCTCGCAAAGCAAAGCTGGAGGATCATTAAAAATCCCA
ATAGTTTAGTGGCTAGAGTGTTAAGAGGCAAGATTGGAGATGATAGGCGAACTGTCATCGACAAAGATCCTTGGCTTATTAATCAAGGCAGCAGATGCCCGGTTTGGGTC
AAAGATAGCTTCCAAGGCAAGAGGGTTTGTGATTTGCTGGATAATGAAGGGTTCTGGTGTGAAGCAAAAGTTTTGGAAGCTTTTTCTCCTTCTAAGGCCGAAGAGATCCT
TAGTATCCCTCGCCAGAAATCTATCAGACATGATAAAATCATCTGGAATCCCGATCCAAAAGGGAAGTTTTCTGTTAGAAGCGCCTACAATTTAGCGGTGAATCTCAACT
CTAAGGAAGATGTGTCGGCTTCCTCTCCCGATACCACTAAATCCATGTGGAAGGAGCTATGGAGTCTGAACATCGTCCCTAGAGCCAAGATCACCGTCTGGAAAATTATT
AGCAATGCTCTGCCCACTAACTTCAATATTACTCGAAGAGGGATTTATATTGACTCTCGTTGTGTTCTTTGCAGGAGCAAAGAGGAGTCTACTGACCATCTGGGAGTTAG
CTGTCCTGAGGATTTCTGGATGATCATGGTGGATAAGCTTAGCAAAGAGGATTTGGGGCAAGTTGCTATTGTGTTGTGGGTGTTGTGGTGTTATAGGGATAAATGCAATT
CTAATCGCCAGATTTCAGATGTTATCCAGATCTGTAGATCGATTCAAAATGGATTCGATGTTTTAGCGAAAAGTGGGAAGGGTTACCTGGTAGCTGCGCAGGCGAAGAGC
CAAATGAGTCACAATTGGATTCCTCTGGATTCCGGTCGGTGGAAACTCAATTCTGATGCGTCGTGGATCGAGGCGAGCAGAAAAAGAGGGTTGGGCTGGGTGATTCGTGA
CTCCTCTAGATCTTTGATAGGAGCAGGCTACATTCAAATTGACAAGAAATGGCTAATCAAGGTCATGGAAGCCCAAGCAATTCTCGAAGGGCTAAAGGCTTACCGTTCGT
TGGCTGACAAAGATGGTATCAGTTGTCCCCTTCTGATTGAGTCTGATTCCGCTTGTGTCGTGAATGCCATCAACGACAGGGATGTTGATTTTTCAGAGGTGGCTCTGGTT
GTGGATGCTATCGCCGAGATTGCTAGAGGAGTTGTCATCTCCATTGGCTTTTGTAGCAGATGGGATAATATTCTGGCCCATCAGGTTGCTCGTGCCGCAACCAGTCATGG
GAATTTTTTGTGTTTTTTTGGTGCCCTTTCTTCCTCTGTTTCGGAAGATGGCACGAGGGGAGGGAATGAGTCAATTCCCCCTGGGTCGGAGCACTTTCCGGTCCACGATG
ATCACGCTCGGCCTCGGCCCATTGCCGAGGCCGAGGATGAGGTCGGCCTCGGCCCACTGTCGAGGCCGACCAGGGCCAAAAGCTCGATGGACGCCAAAAGTCCCCCAGTA
GCAGAAACCCTAGGGGAGCTTATAAAAGGGGAGACCACGCACGCACTAAGGGATGAAAAATCCAACCCTAAATACGCTCTACGTGTTCTTCGGCCAAAGACTAACTTAAG
CATCGGAGTGTGTGTGGTCTACACCACACCGGTGTGCAGCGTTTGCTCGTCTTGCAGGTCACGTCCTTCCCGCTTCCTTACAAATTCACCGTCGATCGTTGTGTGGAGCA
AGGGGCAAATTCCTAAGCGAAATTTGACCATCAAAAATATGCAAAATAAACGTATGTGTTAA
Protein sequenceShow/hide protein sequence
MSCFKLPKSICDDINKHCARFWWGSSEAKRRAHWMSWRKISTSKFKGGLDFRDLNLFNQAMLAKQSWRIIKNPNSLVARVLRGKIGDDRRTVIDKDPWLINQGSRCPVWV
KDSFQGKRVCDLLDNEGFWCEAKVLEAFSPSKAEEILSIPRQKSIRHDKIIWNPDPKGKFSVRSAYNLAVNLNSKEDVSASSPDTTKSMWKELWSLNIVPRAKITVWKII
SNALPTNFNITRRGIYIDSRCVLCRSKEESTDHLGVSCPEDFWMIMVDKLSKEDLGQVAIVLWVLWCYRDKCNSNRQISDVIQICRSIQNGFDVLAKSGKGYLVAAQAKS
QMSHNWIPLDSGRWKLNSDASWIEASRKRGLGWVIRDSSRSLIGAGYIQIDKKWLIKVMEAQAILEGLKAYRSLADKDGISCPLLIESDSACVVNAINDRDVDFSEVALV
VDAIAEIARGVVISIGFCSRWDNILAHQVARAATSHGNFLCFFGALSSSVSEDGTRGGNESIPPGSEHFPVHDDHARPRPIAEAEDEVGLGPLSRPTRAKSSMDAKSPPV
AETLGELIKGETTHALRDEKSNPKYALRVLRPKTNLSIGVCVVYTTPVCSVCSSCRSRPSRFLTNSPSIVVWSKGQIPKRNLTIKNMQNKRMC