; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032494 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032494
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:33494960..33498951
RNA-Seq ExpressionLag0032494
SyntenyLag0032494
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4263564.1 unnamed protein product [Prunus armeniaca]1.3e-6031.29Show/hide
Query:  MGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRANVME
        MGF  ++  LI+ CV++VS+S  + G   G +IPSRGLRQGDP+SPYLFL+ AE  S+LL+ AER + + G  +A S+P I+HLFFADDSLLF  A   E
Subjt:  MGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRANVME

Query:  AVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVLLKS
        A+ ++ +   YE AS                           +L+V   PCH++YLGLP+ + +++      +KDRVW ++ GW+GK  S  GKEVL+KS
Subjt:  AVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVLLKS

Query:  IIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSE---------------EVNRSIG----------------------------------YFPHSDFW
        + QAIP Y+M+ FRLP  L RE+   +A+FWW+ ++               + +  IG                                  YFPHSDF 
Subjt:  IIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSE---------------EVNRSIG----------------------------------YFPHSDFW

Query:  ----GHLWAIGHRSS--GAVCCGVGNCWTKGADGGLGMDGLP--PY-----MVRIGCRIIRLCVCSLLLHFLYLVGSVIYFLRRDSGMRSRFVPTFWGLS
            G L +   +S   G     +G  W  G    + + G P  PY     +  I    +   VC L            +       +R  F    +  +
Subjt:  ----GHLWAIGHRSS--GAVCCGVGNCWTKGADGGLGMDGLP--PY-----MVRIGCRIIRLCVCSLLLHFLYLVGSVIYFLRRDSGMRSRFVPTFWGLS

Query:  EAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGY--RLAYSLASQGCPSSSILSAGGFGGLVYGGLGF--QISTSSVVEDCLHLFWKCAVIREMWLCSKFL
        EAI+ IPL    L DR IW+F K+  +SVKSGY   L Y    +       LS GG  G     L     +    V +  LHL W+ A   +  L SK +
Subjt:  EAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGY--RLAYSLASQGCPSSSILSAGGFGGLVYGGLGF--QISTSSVVEDCLHLFWKCAVIREMWLCSKFL

Query:  SYTSRYT---IWILL-MSSGIKGEGESDGRDLWAWSEEYLRVYHDVGRQRSLAAVCSLAS-VSQSSSLPGIPSGRGFKLNTDASVRPDTGVAGGSCVLRD
         +  R T   +W  L   S       +   D+  W +    +     +Q   A   SL+S V      P  P+G  FKLN D +   +TG  G   ++RD
Subjt:  SYTSRYT---IWILL-MSSGIKGEGESDGRDLWAWSEEYLRVYHDVGRQRSLAAVCSLAS-VSQSSSLPGIPSGRGFKLNTDASVRPDTGVAGGSCVLRD

Query:  VSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQV--------ADSSEV--------------GLLMDDVRRLL--HPCVVVRCFYATERNRVAHALA
          G ++ A  +  P   SV   E +AL  G+  AL +        +DS +               G L+D VRRLL       VR     + N+ AH +A
Subjt:  VSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQV--------ADSSEV--------------GLLMDDVRRLL--HPCVVVRCFYATERNRVAHALA

Query:  CLTFSYSDR---VWLEEWP
           FS  D+   +WL+  P
Subjt:  CLTFSYSDR---VWLEEWP

CAB4316864.1 unnamed protein product [Prunus armeniaca]3.1e-5929.81Show/hide
Query:  MGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRANVME
        MGF  ++  LI+ CV++VS+S  + G   G +IPSRGLRQGDP+SPYLFL+ AE  S+LL+ AER + + G  +A S+P I+HLFFADDSLLF  A   E
Subjt:  MGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRANVME

Query:  AVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVLLKS
        A+ ++ +   YE AS                           +L+V   PCH++YLGLP+ + +++      +KDRVW ++ GW+GK  S  GKEVL+KS
Subjt:  AVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVLLKS

Query:  IIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPY----MVRIGCR
        + QAIP Y+M+ FRLP  L RE+   +A+FWW+ ++      G   H   W  +                 C  K +DGGLG   L  +    + + G R
Subjt:  IIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPY----MVRIGCR

Query:  IIRL--CVCSLLLHFLYLVGSVIYFLRRDSGMRSRFV--PTFWGL---------------------------------------SEAIMRIPLRSGMLED
        ++     + + +L   Y   S   FL   SG    F      WG                                        +EAI+ IPL    L D
Subjt:  IIRL--CVCSLLLHFLYLVGSVIYFLRRDSGMRSRFV--PTFWGL---------------------------------------SEAIMRIPLRSGMLED

Query:  RLIWHFEKHDMFSVKSGY--RLAYSLASQGCPSSSILSAGGFGGLVYGGLGFQISTSSVVEDCLHLFWKCAVIREMWLCSKFLSYTSRYTIWILLMS---
        R IW+F K+  +SVKSGY   L Y    +       LSA    G           +SS ++   HL WK  V ++ ++C         Y +W+ + S   
Subjt:  RLIWHFEKHDMFSVKSGY--RLAYSLASQGCPSSSILSAGGFGGLVYGGLGFQISTSSVVEDCLHLFWKCAVIREMWLCSKFLSYTSRYTIWILLMS---

Query:  SGIKGEGESDGRD-------------------------------LWA-WSEEYLRVYHD--------VGRQRSLAAVCSLASVSQSSSLPG--------I
            G G   GRD                               +W  W+E    ++          V R +   A     S +   SL           
Subjt:  SGIKGEGESDGRD-------------------------------LWA-WSEEYLRVYHD--------VGRQRSLAAVCSLASVSQSSSLPG--------I

Query:  PSGRGFKLNTDASVRPDTGVAGGSCVLRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQV--------ADSSEV--------------GLLMDD
        P+G  FKLN D +   +TG  G   ++RD  G ++ A  +  P   SV   E +AL  G+  AL V        +DS +               G L+D 
Subjt:  PSGRGFKLNTDASVRPDTGVAGGSCVLRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQV--------ADSSEV--------------GLLMDD

Query:  VRRLL--HPCVVVRCFYATERNRVAHALACLTFSYSDR---VWLEEWP
        VR LL      VVR     + N+ AH +A   FS  D+    WL+  P
Subjt:  VRRLL--HPCVVVRCFYATERNRVAHALACLTFSYSDR---VWLEEWP

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]1.1e-6130.67Show/hide
Query:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN
        M ++G++  W   I+ C++SV FSF +NGE  G V P RGLRQGDPLSP+LFLLCAE  SSL++ AE+R  + G    R    +SHLFFADDSL+F  A 
Subjt:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN

Query:  VMEAVAIRDLLIRY---------------------------ERASVLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL
          E    ++LL +Y                           + A+++ V     + +YLGLPSF+ R +     FIK+RVW +++GWKG FFS   KEVL
Subjt:  VMEAVAIRDLLIRY---------------------------ERASVLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL

Query:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNR---------SIGYFPHSD-----------------FW-------GHLWAIGHRSSGA
        +K+I+QAIP YTM+CFRLP+  I  +H   ARFWW  SE+  +            Y+P+S                   W       GH W IG+R+S  
Subjt:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNR---------SIGYFPHSD-----------------FW-------GHLWAIGHRSSGA

Query:  VCCGVGNCWTKGADGGLGMDGLP-PYMVRIGCRIIRLCVCSLLLHFLYLVGSVIYFLRRDSG------MRSRFVPTFWGLSEAIMRIPLRSGMLEDRLIW
        V   + + W            LP P   +I  +         L   LY++      L+   G      +R+ F PT    ++ I+ +P     +ED+++W
Subjt:  VCCGVGNCWTKGADGGLGMDGLP-PYMVRIGCRIIRLCVCSLLLHFLYLVGSVIYFLRRDSG------MRSRFVPTFWGLSEAIMRIPLRSGMLEDRLIW

Query:  HFEKHDMFSVKSGYRLAYSLASQGCPSSSILSAGGFGGL------------VYGGLGFQISTSSVV-------------------EDCLHLFWKCAVIRE
        H+ K   +SV+SGYR+A +L  +   S +  +   +  L            V+      I T++V+                   E   H  W C V RE
Subjt:  HFEKHDMFSVKSGYRLAYSLASQGCPSSSILSAGGFGGL------------VYGGLGFQISTSSVV-------------------EDCLHLFWKCAVIRE

Query:  MWLCSKFLSYTSRY----TIWILLMSSGIKGEGESDGRDLWAWSEEYLR--VYHDVGRQRSLAAV--CS--LASVSQSSSLPG-----------IPSGRG
        +W  + F     R      +  L+  S    + E +   + +W+  Y+R  V H   + ++ A +  CS  L+   QS++  G            P+   
Subjt:  MWLCSKFLSYTSRY----TIWILLMSSGIKGEGESDGRDLWAWSEEYLR--VYHDVGRQRSLAAV--CS--LASVSQSSSLPG-----------IPSGRG

Query:  FKLNTDASVRPDTGVAGGSCVLRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQVADSSEVGLLMDDVR
        FK+N D  V+   GV+  S V+RD  G V  AA  V+ +  +   AE  A + GVK   QV     +  L+  +R
Subjt:  FKLNTDASVRPDTGVAGGSCVLRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQVADSSEVGLLMDDVR

XP_030505068.1 uncharacterized protein LOC115720043 [Cannabis sativa]1.8e-5929.07Show/hide
Query:  MGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRANVME
        MGF     DLI+RC+SSV++SF++NG+  G+V P+RG+ QGDPLSPYLF++CAEGL  LL+  E R  + G +++R +P +SHLFFADDSL+  RAN   
Subjt:  MGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRANVME

Query:  AVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVLLKS
        A AI++ L  Y RAS                           +L +    CH++YLGLPS+  R++      IK+++W  +  W+ K FS+GGKEVLLK+
Subjt:  AVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVLLKS

Query:  IIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVN---------------------RSIGYFPHSDFWGHLWAIGHRSSGAVC-------------
        + QAIP Y M+CFRL + L+ ++   M +FWW  +   N                     +S  +F  +      W I    S  +C             
Subjt:  IIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVN---------------------RSIGYFPHSDFWGHLWAIGHRSSGAVC-------------

Query:  --CGVGN----CWT---------------KGADGGL----------GMDGLPPYMVRIGCRIIRLCVCSLLLHFLYLVGSVIYFLRRDSGMRSRFVPTFW
           G+ N     W                K  DG            G+    P + +     ++  V +L+L        ++ +L  DS +         
Subjt:  --CGVGN----CWT---------------KGADGGL----------GMDGLPPYMVRIGCRIIRLCVCSLLLHFLYLVGSVIYFLRRDSGMRSRFVPTFW

Query:  GLSEAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAYSLASQGCPSSSILSAGGFGGLVYGGLGFQISTSSVVEDCLHLFWKCAVIREMWLCSKF--
           + I+ IPL     +D L+WH E H  +SVKSGY LA SL  Q    S IL            +      +   E   H  + C   R++W  S F  
Subjt:  GLSEAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAYSLASQGCPSSSILSAGGFGGLVYGGLGFQISTSSVVEDCLHLFWKCAVIREMWLCSKF--

Query:  ----LSYTSRYTIWILLMSSGIKGEGESDGRDLWA-WSE---------------------EYLRVYHDVGRQ---RSLAAVCSLASVSQSSSLPGIPSGR
              + S   I + L ++    + E     LW+ W+E                      YL  +H   +     SL      +SVS+ S     PSGR
Subjt:  ----LSYTSRYTIWILLMSSGIKGEGESDGRDLWA-WSE---------------------EYLRVYHDVGRQ---RSLAAVCSLASVSQSSSLPGIPSGR

Query:  GFKLNTDASVRPDTGVAGGSCVLRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVK
          KLNTDA+V     ++G   +L+  +G ++         C+  ++ E  AL+  +K
Subjt:  GFKLNTDASVRPDTGVAGGSCVLRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVK

XP_030505962.1 uncharacterized protein LOC115720894 [Cannabis sativa]4.8e-6035.68Show/hide
Query:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN
        M +MGF  +W  LI+RC+ + SFSFNLNG  +G++ P RGLRQGDPLSPYLFL+C+EGLS LL+  ER   + G+ V+R +P ISHLFFADDSL+F++A+
Subjt:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN

Query:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL
              I+  L  Y RAS                           +L +    CH++YLGL ++  R++    N IK+R+WK +  W  K FS+GGKEVL
Subjt:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL

Query:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCG----VGNCWTKGADGGLGMDGLPPYMVRI
        LK+++Q+IP Y M+CF+LP     ++ + MA FWW   ++  ++     H   W  L     +  G +C G    V     K  DG        P++ R 
Subjt:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCG----VGNCWTKGADGGLGMDGLPPYMVRI

Query:  GCRIIRLCVCSLLLHFLYL---VGSVIYFLRRDSGMRSRFVPTFWGLSEA--IMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAYSLASQGCPSSS
                  S    F Y       V +++  D       +   +   +   I+ IPL S  + D+ IWHF     ++V++GY LA  L     P+SS
Subjt:  GCRIIRLCVCSLLLHFLYL---VGSVIYFLRRDSGMRSRFVPTFWGLSEA--IMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAYSLASQGCPSSS

TrEMBL top hitse value%identityAlignment
A0A2N9EQ08 Reverse transcriptase domain-containing protein5.9e-6429.41Show/hide
Query:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN
        M RMGF  +W  LI+ C+S+VS+S  +NGE  G++IP+RG+RQGDP+SPYLFLLCAEGL+ LL+ A  +  I G  + R  P I++LFFADDSLLF RA 
Subjt:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN

Query:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL
        + E   I+D+L  YE+AS                           +L V     +++YLGLPS + + +    + IK+RVW +I+GWK K  S  G+E+L
Subjt:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL

Query:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPYMVRIGC--
        +K+++QAIP YTMNCF+LP  L +++   + RFWW G +   R I    H   W  L                 C  KG  GGLG   L  + + +    
Subjt:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPYMVRIGC--

Query:  --RIIRLCVCSLLLHF---LYLVGSVIYFLRRDSG---MRSRFVPTFWGLSEAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAY--------SLAS
          R++ +    L   F    +  G+++    R SG    RS        L +AI +IPL      DRLIWH  +   ++V+SGY +            + 
Subjt:  --RIIRLCVCSLLLHF---LYLVGSVIYFLRRDSG---MRSRFVPTFWGLSEAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAY--------SLAS

Query:  QG-------------------------CPSSSILSAGGFGGLVYGGLGFQISTSSVVEDCLHLFWKCAVIREMW-------------------LCSKFLS
        QG                         C  S    AG F   V     +       +ED LH  W C V+ ++W                   L  + +S
Subjt:  QG-------------------------CPSSSILSAGGFGGLVYGGLGFQISTSSVVEDCLHLFWKCAVIREMW-------------------LCSKFLS

Query:  YTSRYTIWILLMSSGIKGEGESDGRDLWAWSEEYLRVYHDVGRQRSLAAVCSLASVSQSSSLPG---------IPSGRGFKLNTDASVRPDTGVAGGSCV
          S   I    M+  +     +  R L   SEEY  ++      R+   +    SVSQ+ ++            P    +K+N D +   +T   G   V
Subjt:  YTSRYTIWILLMSSGIKGEGESDGRDLWAWSEEYLRVYHDVGRQRSLAAVCSLASVSQSSSLPG---------IPSGRGFKLNTDASVRPDTGVAGGSCV

Query:  LRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQVA-----------------DSSEV-----GLLMDDVRRLLHPCVVVRCFYATER-NRVAHA
        +RD +G  +      LP   S+++ E  A  + +    +V                   SSE      GL+++D + +L         +     N VAHA
Subjt:  LRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQVA-----------------DSSEV-----GLLMDDVRRLLHPCVVVRCFYATER-NRVAHA

Query:  LACLTFSYSD-RVWLEEWPNEIAVVLAGDVA
        LA       +  VW+E+ P +I+ VL  D++
Subjt:  LACLTFSYSD-RVWLEEWPNEIAVVLAGDVA

A0A2N9F047 Reverse transcriptase domain-containing protein8.5e-6329.32Show/hide
Query:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN
        M RMGF  +W  LI+ C+S+VS+S  +NGE  G++IP+RG+RQGDP+SPYLFLLCAEGL+ LL+ A  +  I G  + R  P I++LFFADDSLLF RA 
Subjt:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN

Query:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL
        + E   I+D+L  YE+AS                           +L V     +++YLGLPS + + +    + IK+RVW +I+GWK K  S  G+E+L
Subjt:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL

Query:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPYMVRIGC--
        +K+++QAIP YTMNCF+LP  L +++   + RFWW G +   R I    H   W  L                 C  KG  GGLG   L  + + +    
Subjt:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPYMVRIGC--

Query:  --RIIRLCVCSLLLHF---LYLVGSVIYFLRRDSG---MRSRFVPTFWGLSEAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAY--------SLAS
          R++ +    L   F    +  G+++    R SG    RS        L +AI +IPL      DRLIWH  +   ++V+SGY +            + 
Subjt:  --RIIRLCVCSLLLHF---LYLVGSVIYFLRRDSG---MRSRFVPTFWGLSEAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAY--------SLAS

Query:  QG-------------------------CPSSSILSAGGFGGLVYGGLGFQISTSSVVEDCLHLFWKCAVIREMW-------------------LCSKFLS
        QG                         C  S    AG F   V     +       +ED LH  W C V+ ++W                   L  + +S
Subjt:  QG-------------------------CPSSSILSAGGFGGLVYGGLGFQISTSSVVEDCLHLFWKCAVIREMW-------------------LCSKFLS

Query:  YTSRYTIWILLMSSGIKGEGESDGRDLWAWSEEYLRVYHDVGRQRSLAAVCSLASVSQSSSLPG---------IPSGRGFKLNTDASVRPDTGVAGGSCV
          S   I    M+  +     +  R L   SEEY  ++      R+   +    SVSQ+ ++            P    +K+N D +   +T   G   V
Subjt:  YTSRYTIWILLMSSGIKGEGESDGRDLWAWSEEYLRVYHDVGRQRSLAAVCSLASVSQSSSLPG---------IPSGRGFKLNTDASVRPDTGVAGGSCV

Query:  LRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQVA-----------------DSSEV-----GLLMDDVRRLLHPCVVVRCFYATER-NRVAHA
        +RD +G  +      LP   S+++ E  A  + +    +V                   SSE      GL+++D + +L         +     N VAHA
Subjt:  LRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQVA-----------------DSSEV-----GLLMDDVRRLLHPCVVVRCFYATER-NRVAHA

Query:  LACLTFSYSD-RVWLEEWPNEIA
        LA       +  VW+E+ P +I+
Subjt:  LACLTFSYSD-RVWLEEWPNEIA

A0A2N9I9F4 Reverse transcriptase domain-containing protein2.9e-6328.67Show/hide
Query:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN
        M RMGF + WT +I+ C+S+VS+S  +NGE  G + P+RGLRQGDP+SPYLFLLCAEGL+ LL+ A  +  I G  + R  P I++LFFADDSLLF RA 
Subjt:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN

Query:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL
          E   I+ +L  YE+AS                           +L V     +++YLGLPS + + +    + IK+RVW +I+GWK K  S  G+E+L
Subjt:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL

Query:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPYMVR-IGCR
        +K+++QAIP YTMNCF+LP  L +E+   + RFWW G +   R I    H   W  L                 C  K   GGLG   L  + +  +  +
Subjt:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPYMVR-IGCR

Query:  IIRLCVCSLLLHF----------------LYLVGSVIYFLRRDSGMRSRF----------------------VPTFWGL-----------SEAIMRIPLR
          RL  C   L F                ++    VI   +   G RS                         P  W             +EAI++IP+ 
Subjt:  IIRLCVCSLLLHF----------------LYLVGSVIYFLRRDSGMRSRF----------------------VPTFWGL-----------SEAIMRIPLR

Query:  SGMLEDRLIWHFEKHDMFSVKSGYRLA---YSLASQGC-----PSS------------------------SILSAGGFGGLVYGGLGFQISTSSVVEDCL
        +    D+L+WH  +   FSV+SGY L    Y +++ GC     P S                        S+ +  G          F  + S+ VED L
Subjt:  SGMLEDRLIWHFEKHDMFSVKSGYRLA---YSLASQGC-----PSS------------------------SILSAGGFGGLVYGGLGFQISTSSVVEDCL

Query:  HLFWKCAVIREMWLCSKFLSYTSRYTIWILLMSSGIKGEGESDGRDLWAWSEEYLRVYHDVGRQRSLAAVCSLASVSQSSSLPGIPSGRGFKLNTDASVR
        H  W C ++ + W     +   S +        S +  +   + +++  +      V     R  S   + +      + +    P+   FK N D +  
Subjt:  HLFWKCAVIREMWLCSKFLSYTSRYTIWILLMSSGIKGEGESDGRDLWAWSEEYLRVYHDVGRQRSLAAVCSLASVSQSSSLPGIPSGRGFKLNTDASVR

Query:  PDTGVAGGSCVLRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQV----------------------ADSSEVGLLMDDVRRLLHPCVVVRCFY
          +   G   V+RD  G V+      +  C S ++ E  A  + V  A++V                      A  +  GL++DD++ +L     +RC+ 
Subjt:  PDTGVAGGSCVLRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQV----------------------ADSSEVGLLMDDVRRLLHPCVVVRCFY

Query:  ATER----NRVAHALACLTFSYSD-RVWLEEWPNEIAVVLAGD
         +      N VAHALA   F  ++  VWLEE P +I  VL  D
Subjt:  ATER----NRVAHALACLTFSYSD-RVWLEEWPNEIAVVLAGD

A0A803NW04 Uncharacterized protein9.7e-6730.2Show/hide
Query:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN
        M++MGFA  W  LI++C+SS SFSF+LNGE +GNV P RGLRQGDPLSPYLFL+C EGLS LL   E    ++G R+ R +P ISHL FADDSLLF  A 
Subjt:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN

Query:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL
           A+AI  +L  Y RAS                            L++    CH++YLGLP+++ R++    + IK+R+WK +  WK K FS+GGKEV 
Subjt:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL

Query:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIG---------------YFPHSDFWGHLWAIGHRSSGAVCCGVGNCW-----TKGA
        LK+++Q+IP Y M+CF+LP+    ++   MA FWW  ++ V + IG               YF  S FW     +GH  S       G CW      KG 
Subjt:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIG---------------YFPHSDFWGHLWAIGHRSSGAVCCGVGNCW-----TKGA

Query:  DGGLG-----MDGLPPYMVRIG--CRIIRLCVCSLLLHFLYLVGSVIYFLRRDSGMRSRFVPTFWGLSEA--IMRIPLRSGMLEDRLIWHFEKHDMFSVK
           +G          P++  I   C I      S+L         V YF+ +        +   +G  +   I+ IPL     +DRLIWH+    +++VK
Subjt:  DGGLG-----MDGLPPYMVRIG--CRIIRLCVCSLLLHFLYLVGSVIYFLRRDSGMRSRFVPTFWGLSEA--IMRIPLRSGMLEDRLIWHFEKHDMFSVK

Query:  SGYRLAYSLASQGCPSSSILSAGGFGGLVYGGLGFQISTSSV--VEDCLHLFWKCAVIR----EMWLCSKFLSYTSRYTIWILLMSSGIKGEGESDGRDL
        SG+ LA  L  +   S+S          ++    F I  +    + +  +LF    ++     E+ LC  ++ +  R  I        I G+   D   +
Subjt:  SGYRLAYSLASQGCPSSSILSAGGFGGLVYGGLGFQISTSSV--VEDCLHLFWKCAVIR----EMWLCSKFLSYTSRYTIWILLMSSGIKGEGESDGRDL

Query:  WAWSEEYLRVYHDVGRQRSLAAVCSLASVSQSS--------------SLPGIPSG----RGFKLNTDASVRPDTGVAGGSCVLRDVSGAVLLAACLVLPR
          ++  Y+  Y      +   A  S  S +  S                 G+P         KLN DA++     + G   ++R+  G V+ A    +  
Subjt:  WAWSEEYLRVYHDVGRQRSLAAVCSLASVSQSS--------------SLPGIPSG----RGFKLNTDASVRPDTGVAGGSCVLRDVSGAVLLAACLVLPR

Query:  CWSVDLAEGWALVKGVKLALQVA----------------------DSSEVGLLMDDVRRLL--HPCVVVRCFYATERNRVAHALACLTFSYS-DRVWLEE
        C+  D  E  AL   +    Q++                      D S    ++ DVR LL   P +VV        N+ AH LA        D  W  E
Subjt:  CWSVDLAEGWALVKGVKLALQVA----------------------DSSEVGLLMDDVRRLL--HPCVVVRCFYATERNRVAHALACLTFSYS-DRVWLEE

Query:  WPNEIAVVLAGD
         P+ I  V+  D
Subjt:  WPNEIAVVLAGD

A0A803PYI0 Uncharacterized protein1.6e-6138.3Show/hide
Query:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN
        M R+GFAQ W D I+RCV+S SFSF +NGE  G +IP RGLRQGDPLSP+LFL CAE LSSL++  E    + G R  R    +SHLFFADDSL+F  A+
Subjt:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRAN

Query:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL
        +   +  + +L +Y  AS                           ++ V     H +YLGLPS + RN+   L+ IK++VW +++GWK   FS+ GKEVL
Subjt:  VMEAVAIRDLLIRYERAS---------------------------VLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVL

Query:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPYMVRIGCRI
        +KSI+QAIP YTM CF+L +  I  +HR  +RFWW GS +  + I    H   W +L                 C  K   GGLG   L  +   +  + 
Subjt:  LKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGYFPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPYMVRIGCRI

Query:  IRLCV------CSLLLHFLYLVGSVIYFLRRDSGMRSRFVPTFWGLSEAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAYSLASQ
        I  C+      CS +L          YF R+  G+         G +E I+ IP      ED+++WH+ K+  ++VKS YR+A SL+++
Subjt:  IRLCV------CSLLLHFLYLVGSVIYFLRRDSGMRSRFVPTFWGLSEAIMRIPLRSGMLEDRLIWHFEKHDMFSVKSGYRLAYSLASQ

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.3e-0425Show/hide
Query:  LPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEE
        +P    R    T   I +RV  ++ GW+ K  S  G+  L K+++ ++P ++M+   LP+ ++  + +    F W  + E
Subjt:  LPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEE

P11369 LINE-1 retrotransposable element ORF2 protein2.0e-0521.9Show/hide
Query:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLF----
        ++R G    + ++I    S    +  +NGE+L  +    G RQG PLSPYLF +  E L+  +R   ++  I G ++ +    IS L  ADD +++    
Subjt:  MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLF----

Query:  -------------------FRANVMEAVAI---RDLLIRYERASVLSVARCPCHQQYLGLPSFMPRNRSGTLNF--IKDRVWKQIQGWKGKFFSLGGKEV
                           ++ N  +++A    ++     E       +    + +YLG+            NF  +K  + + ++ WK    S  G+  
Subjt:  -------------------FRANVMEAVAI---RDLLIRYERASVLSVARCPCHQQYLGLPSFMPRNRSGTLNF--IKDRVWKQIQGWKGKFFSLGGKEV

Query:  LLKSIIQAIPCYTMNC--FRLPRCLIREMHRAMARFWWNGSE
        ++K  I     Y  N    ++P     E+  A+ +F WN  +
Subjt:  LLKSIIQAIPCYTMNC--FRLPRCLIREMHRAMARFWWNGSE

P92555 Uncharacterized mitochondrial protein AtMg012501.3e-1559.42Show/hide
Query:  FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDS
        F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+ +  + G RV+ +SP I+HL FADD+
Subjt:  FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDS

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)9.1e-1759.42Show/hide
Query:  FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDS
        F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+ +  + G RV+ +SP I+HL FADD+
Subjt:  FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGAATGGGCTTCGCTCAACAGTGGACTGATTTGATTCTCCGGTGTGTTAGCTCGGTTTCCTTTTCGTTTAATCTGAATGGGGAGAGGTTGGGGAATGTGATTCC
TTCCCGTGGGCTCAGGCAAGGAGACCCGTTGTCTCCGTATTTGTTTTTGCTCTGTGCGGAGGGTTTGTCGAGCCTGTTGCGAGGAGCAGAACGTCGAGCTTTGATATCTG
GTTTTCGGGTTGCGCGGAGTAGCCCTCCGATTTCTCATCTGTTTTTTGCAGATGATAGCCTCCTTTTCTTCAGGGCGAACGTTATGGAAGCAGTGGCTATCCGGGATCTG
TTGATCCGTTATGAACGAGCTTCGGTGCTTTCGGTAGCTCGGTGTCCATGTCACCAGCAATACCTTGGGCTCCCCTCATTCATGCCTAGGAATCGCTCGGGGACGTTGAA
CTTTATTAAGGACCGTGTCTGGAAGCAGATCCAGGGTTGGAAGGGAAAGTTCTTTTCCTTGGGAGGTAAAGAAGTTCTTCTAAAGTCAATCATTCAGGCCATCCCTTGCT
ACACGATGAATTGCTTTCGTCTGCCCCGTTGCCTGATTAGAGAAATGCATCGGGCCATGGCCAGGTTCTGGTGGAATGGTTCTGAGGAGGTGAACAGATCCATTGGGTAT
TTTCCCCACTCTGATTTCTGGGGGCATCTTTGGGCCATAGGCCATCGTTCATCTGGCGCAGTTTGTTGTGGGGTCGGGAACTGCTGGACCAAGGGTGCAGATGGAGGATT
GGGAATGGACGGTCTACCCCCATATATGGTTCGAATTGGGTGCCGGATAATCCGTCTTTGCGTGTGCAGTCTGCTCCTTCACTTCCTTTATCTAGTAGGGTCTGTGATCT
ATTTTCTCCGTCGGGACAGTGGGATGAGGTCAAGATTCGTGCCCACTTTTTGGGGCCTGAGTGAGGCTATTATGAGGATTCCCTTGCGTTCTGGTATGCTGGAGGATCGT
CTTATTTGGCATTTTGAGAAGCATGACATGTTCTCTGTGAAGAGTGGGTATAGGCTGGCTTACTCCTTGGCGTCCCAGGGGTGTCCGTCTTCTTCGATTCTGAGCGCTGG
AGGATTTGGTGGGCTAGTCTATGGAGGCTTGGGGTTCCAAATAAGCACAAGCTCTGTTGTGGAGGACTGTCTCCATCTTTTCTGGAAGTGCGCAGTGATTAGGGAAATGT
GGCTCTGCTCGAAATTTCTCAGCTATACCAGTCGTTATACCATTTGGATCTTGTTGATGTCATCTGGCATTAAAGGAGAAGGGGAATCTGATGGTCGGGATTTGTGGGCA
TGGTCTGAAGAGTATTTGAGGGTGTATCATGATGTTGGCAGGCAACGGAGTCTCGCTGCAGTTTGCAGCCTTGCATCGGTAAGCCAGTCGAGCAGTCTTCCTGGAATCCC
CAGTGGGCGTGGTTTCAAGCTGAACACCGATGCCTCTGTCAGGCCTGATACGGGTGTAGCAGGTGGAAGTTGTGTTCTTCGGGATGTATCTGGGGCAGTGCTTCTAGCGG
CGTGCTTGGTCCTGCCTAGGTGCTGGAGTGTGGACCTGGCTGAGGGTTGGGCATTGGTGAAGGGCGTGAAGTTAGCATTACAGGTGGCTGACTCTTCAGAGGTTGGCCTG
TTGATGGATGATGTCCGACGTCTTCTCCACCCTTGTGTAGTGGTAAGGTGCTTTTACGCCACGGAACGGAATAGAGTGGCACATGCTCTAGCCTGTTTGACCTTCTCCTA
TTCTGATCGTGTTTGGCTGGAGGAGTGGCCTAATGAAATCGCTGTTGTGCTGGCTGGTGATGTCGCGGTGTGGCGGTCGAGTTTGGGTAGTTCGAGAACGGGAGCGATCC
AAGGAGCCATCGATACTGGGAATGCGATCAGACTGAGTAGCGGCAGCCACTACCCTATACGGCGTGCAGTTAGCAGGGGGGGCCTGTTCCTCTGTACTCTGAAAGTTATG
GTGAGGCTGGGTGCACCTGAGCATCGACCCTCTCTTATGTATGCTTTCTTTGTGCTGTCTGGGTATGATTGGCTTGTGGGCCTGCAACTGTGCAGTGACTGTGATGATGT
GTTGAGTTTGACCAGTTATAATATAAGGGCGGTGATTTTGGTTAAGGTTTACATGTTGAATATAGGTCCCGCACTGGAGGTGGTTTGGTCTGATCTCCTTGGTAGGATGA
CTTGGGGTGAAGGTGAGTGTGGAGCTTGGAGTCTAGCTGGATTGCGGGAAGTCATATGGTGTCTTCATGGATTGGGCTTGCACTTTGTGTGGGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATAGAATGGGCTTCGCTCAACAGTGGACTGATTTGATTCTCCGGTGTGTTAGCTCGGTTTCCTTTTCGTTTAATCTGAATGGGGAGAGGTTGGGGAATGTGATTCC
TTCCCGTGGGCTCAGGCAAGGAGACCCGTTGTCTCCGTATTTGTTTTTGCTCTGTGCGGAGGGTTTGTCGAGCCTGTTGCGAGGAGCAGAACGTCGAGCTTTGATATCTG
GTTTTCGGGTTGCGCGGAGTAGCCCTCCGATTTCTCATCTGTTTTTTGCAGATGATAGCCTCCTTTTCTTCAGGGCGAACGTTATGGAAGCAGTGGCTATCCGGGATCTG
TTGATCCGTTATGAACGAGCTTCGGTGCTTTCGGTAGCTCGGTGTCCATGTCACCAGCAATACCTTGGGCTCCCCTCATTCATGCCTAGGAATCGCTCGGGGACGTTGAA
CTTTATTAAGGACCGTGTCTGGAAGCAGATCCAGGGTTGGAAGGGAAAGTTCTTTTCCTTGGGAGGTAAAGAAGTTCTTCTAAAGTCAATCATTCAGGCCATCCCTTGCT
ACACGATGAATTGCTTTCGTCTGCCCCGTTGCCTGATTAGAGAAATGCATCGGGCCATGGCCAGGTTCTGGTGGAATGGTTCTGAGGAGGTGAACAGATCCATTGGGTAT
TTTCCCCACTCTGATTTCTGGGGGCATCTTTGGGCCATAGGCCATCGTTCATCTGGCGCAGTTTGTTGTGGGGTCGGGAACTGCTGGACCAAGGGTGCAGATGGAGGATT
GGGAATGGACGGTCTACCCCCATATATGGTTCGAATTGGGTGCCGGATAATCCGTCTTTGCGTGTGCAGTCTGCTCCTTCACTTCCTTTATCTAGTAGGGTCTGTGATCT
ATTTTCTCCGTCGGGACAGTGGGATGAGGTCAAGATTCGTGCCCACTTTTTGGGGCCTGAGTGAGGCTATTATGAGGATTCCCTTGCGTTCTGGTATGCTGGAGGATCGT
CTTATTTGGCATTTTGAGAAGCATGACATGTTCTCTGTGAAGAGTGGGTATAGGCTGGCTTACTCCTTGGCGTCCCAGGGGTGTCCGTCTTCTTCGATTCTGAGCGCTGG
AGGATTTGGTGGGCTAGTCTATGGAGGCTTGGGGTTCCAAATAAGCACAAGCTCTGTTGTGGAGGACTGTCTCCATCTTTTCTGGAAGTGCGCAGTGATTAGGGAAATGT
GGCTCTGCTCGAAATTTCTCAGCTATACCAGTCGTTATACCATTTGGATCTTGTTGATGTCATCTGGCATTAAAGGAGAAGGGGAATCTGATGGTCGGGATTTGTGGGCA
TGGTCTGAAGAGTATTTGAGGGTGTATCATGATGTTGGCAGGCAACGGAGTCTCGCTGCAGTTTGCAGCCTTGCATCGGTAAGCCAGTCGAGCAGTCTTCCTGGAATCCC
CAGTGGGCGTGGTTTCAAGCTGAACACCGATGCCTCTGTCAGGCCTGATACGGGTGTAGCAGGTGGAAGTTGTGTTCTTCGGGATGTATCTGGGGCAGTGCTTCTAGCGG
CGTGCTTGGTCCTGCCTAGGTGCTGGAGTGTGGACCTGGCTGAGGGTTGGGCATTGGTGAAGGGCGTGAAGTTAGCATTACAGGTGGCTGACTCTTCAGAGGTTGGCCTG
TTGATGGATGATGTCCGACGTCTTCTCCACCCTTGTGTAGTGGTAAGGTGCTTTTACGCCACGGAACGGAATAGAGTGGCACATGCTCTAGCCTGTTTGACCTTCTCCTA
TTCTGATCGTGTTTGGCTGGAGGAGTGGCCTAATGAAATCGCTGTTGTGCTGGCTGGTGATGTCGCGGTGTGGCGGTCGAGTTTGGGTAGTTCGAGAACGGGAGCGATCC
AAGGAGCCATCGATACTGGGAATGCGATCAGACTGAGTAGCGGCAGCCACTACCCTATACGGCGTGCAGTTAGCAGGGGGGGCCTGTTCCTCTGTACTCTGAAAGTTATG
GTGAGGCTGGGTGCACCTGAGCATCGACCCTCTCTTATGTATGCTTTCTTTGTGCTGTCTGGGTATGATTGGCTTGTGGGCCTGCAACTGTGCAGTGACTGTGATGATGT
GTTGAGTTTGACCAGTTATAATATAAGGGCGGTGATTTTGGTTAAGGTTTACATGTTGAATATAGGTCCCGCACTGGAGGTGGTTTGGTCTGATCTCCTTGGTAGGATGA
CTTGGGGTGAAGGTGAGTGTGGAGCTTGGAGTCTAGCTGGATTGCGGGAAGTCATATGGTGTCTTCATGGATTGGGCTTGCACTTTGTGTGGGGATGA
Protein sequenceShow/hide protein sequence
MDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFRANVMEAVAIRDL
LIRYERASVLSVARCPCHQQYLGLPSFMPRNRSGTLNFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLPRCLIREMHRAMARFWWNGSEEVNRSIGY
FPHSDFWGHLWAIGHRSSGAVCCGVGNCWTKGADGGLGMDGLPPYMVRIGCRIIRLCVCSLLLHFLYLVGSVIYFLRRDSGMRSRFVPTFWGLSEAIMRIPLRSGMLEDR
LIWHFEKHDMFSVKSGYRLAYSLASQGCPSSSILSAGGFGGLVYGGLGFQISTSSVVEDCLHLFWKCAVIREMWLCSKFLSYTSRYTIWILLMSSGIKGEGESDGRDLWA
WSEEYLRVYHDVGRQRSLAAVCSLASVSQSSSLPGIPSGRGFKLNTDASVRPDTGVAGGSCVLRDVSGAVLLAACLVLPRCWSVDLAEGWALVKGVKLALQVADSSEVGL
LMDDVRRLLHPCVVVRCFYATERNRVAHALACLTFSYSDRVWLEEWPNEIAVVLAGDVAVWRSSLGSSRTGAIQGAIDTGNAIRLSSGSHYPIRRAVSRGGLFLCTLKVM
VRLGAPEHRPSLMYAFFVLSGYDWLVGLQLCSDCDDVLSLTSYNIRAVILVKVYMLNIGPALEVVWSDLLGRMTWGEGECGAWSLAGLREVIWCLHGLGLHFVWG