; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021109 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021109
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:4733967..4739386
RNA-Seq ExpressionLag0021109
SyntenyLag0021109
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4268750.1 unnamed protein product [Prunus armeniaca]1.3e-14532.3Show/hide
Query:  MQNETDHLMMDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRR
        +  E + L +   WRFTGFYG P  + R  SW+LL RL   N    L  GD NEIL   EK+GG  R   Q+  FR+ +D+C   D+G++   FTW   R
Subjt:  MQNETDHLMMDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRR

Query:  STGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQC
        ++ E++ +RLDR LA  ++ +     +V  L+  K DH PI++TI        R RN   F+FEE W + E C   IR+      +   P  +   L + 
Subjt:  STGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQC

Query:  ASALKGW--GYDFSLIHALEK--------------------------QLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNG
           L GW  G   SL H ++                           +LD L+ + E+YW+QRSR  WLK GDRNT++FH KA+ RK+RN I G+ED NG
Subjt:  ASALKGW--GYDFSLIHALEK--------------------------QLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNG

Query:  NWQSKTENIQKAFENYFKEIFSSS-QPDQMDMT-----RVLNHVPRK--------EVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILN
         WQ+    ++    +YF+++FSS+   D  ++      RV   + RK        E+  A+   HP+KAPGPDGF  +FYQ+YWDVVG   VS  L  L+
Subjt:  NWQSKTENIQKAFENYFKEIFSSS-QPDQMDMT-----RVLNHVPRK--------EVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILN

Query:  SNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMIL-------------GGVKHM-----
        +   +  +N+T++ LIPK    + ++  RPISL NV YKI  KVLANRLK++L  +I + QSAF+PGR ISDN I+             G   +M     
Subjt:  SNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMIL-------------GGVKHM-----

Query:  -------IEW-----------------------------------------------------------------SG-----------SGISIARSCQKI
               +EW                                                                 SG            G+S+ R    +
Subjt:  -------IEW-----------------------------------------------------------------SG-----------SGISIARSCQKI

Query:  SHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVL
        SHLFFADDS +FLKA+       + +L  +E+ S Q +N  KS V FS+NVA++ ++     L +R       YLGLP+   R + + F YL D +W  +
Subjt:  SHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVL

Query:  QGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSN
        QGW ++  S  GKEVLLK + QA+P Y M CF IPK +  +I  + A +WWG+   +R+IHW  W++LC+PK  GGL FR+L  FN A++AKQVWR+  N
Subjt:  QGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSN

Query:  PNLTVSRYLGP---IDVDVIQGL----------SIC--------------------------------------------GST-----------------
        P+  V+R L      D  +++            S+C                                            G+T                 
Subjt:  PNLTVSRYLGP---IDVDVIQGL----------SIC--------------------------------------------GST-----------------

Query:  -----------------------PDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNF-------WWKRVWKMRIPSKVKVFIWKSFHNSIPTMV
                               PD  IWH++R G YSVK+GY+++     ++    V+ N          W+++W++R+P KV++F+W++  N +PT V
Subjt:  -----------------------PDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNF-------WWKRVWKMRIPSKVKVFIWKSFHNSIPTMV

Query:  NLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW
        NL    V  +G C  C +E ET  H   QC  A  +W
Subjt:  NLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW

ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]4.2e-14732.45Show/hide
Query:  DKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRL
        D  +  TGFYG PD   R+HSW LLRRL      A ++ GD NEIL+  +K GG ER  GQ+  F+  L+DC L    +T   FTW  R   G  V  RL
Subjt:  DKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRL

Query:  DRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNR-HSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQCASALKGWGY
        DR +AN  F+    +   + L     DH PI   ++A    P  G  R   F FEE WT++ E  ++I ++  W     V   + ++LS CA  LK W +
Subjt:  DRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNR-HSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQCASALKGWGY

Query:  DF------SLIHA---------------------LEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQ
                 L HA                     +E+ +  LL ++E+ W+QRSR  WLK GD+NT +FH +A+ R KRN + G+ D+N  WQ++ + I 
Subjt:  DF------SLIHA---------------------LEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQ

Query:  KAFENYFKEIFSSSQPDQMDMTRVLNHVPRKEVTAAIKD-----------------FHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHE
          F +YFK +FSSS   QM+  R+LN V R  +T+A+ D                   PTKAPG DG PA+F+QKYW +VGD     CL+ILN  G + E
Subjt:  KAFENYFKEIFSSSQPDQMDMTRVLNHVPRKEVTAAIKD-----------------FHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHE

Query:  LNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI--------LGGVK-------------------
         NHT I LIPK +    VS++RPISLC   YK+I K +ANRLK VL  +I E QSAF+P R I DN++        + GVK                   
Subjt:  LNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI--------LGGVK-------------------

Query:  -------------------------------HMIEWSGS-------------------------------------------GISIARSCQKISHLFFAD
                                         + W G+                                           G+ +AR    ++HL FAD
Subjt:  -------------------------------HMIEWSGS-------------------------------------------GISIARSCQKISHLFFAD

Query:  DSLIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQF
        DS++F+KA  K       +   +E+ + Q +NY+KS +  S N        +   L++ V      YLGLP+   +G+ + F++L DK+W  + GW  + 
Subjt:  DSLIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQF

Query:  FSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSR
         S+ GKE+L+K+++QAIPTY+M CF+IPKG+  +++ + ARFWW    D+R IHW +WE LCK K  GGL FRDL  FNQA+LAKQ WR+L  P   V+R
Subjt:  FSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSR

Query:  YLGP----------------------------------IDVDVIQGLSICGSTPDKW-------------------------------------------
                                              +   V  G+SI   T DKW                                           
Subjt:  YLGP----------------------------------IDVDVIQGLSICGSTPDKW-------------------------------------------

Query:  -------------------IWHYDRKGEYSVKNGYKLSMLKSQEVF---LSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGT
                           IWHY+R G YSVK+GY+L+ L+  ++     + V+ N+ +WK++W ++IP+K+K F+W+   + +P    L+N  +     
Subjt:  -------------------IWHYDRKGEYSVKNGYKLSMLKSQEVF---LSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGT

Query:  CHVCHDEMETTDHALFQCSRARE
        C  CH + E+  HA++ C  A+E
Subjt:  CHVCHDEMETTDHALFQCSRARE

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]6.7e-16128.6Show/hide
Query:  SLVDDWSRLSLTSEEEEVLVVAD---REAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWL
        ++ D   ++ LT+EEEE + ++D    EA++  NLS    L+ K L  +     A       AW L+   Q+  +G NLF F+F +EFD  R++R GPW 
Subjt:  SLVDDWSRLSLTSEEEEVLVVAD---REAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWL

Query:  FEKYLLILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAE-------------------------------------RLDGPLSG-----LWVPI
        F+  LL+L    +G+   +  F S + WI I++ P D  +  +A                                      R  G ++G      WV  
Subjt:  FEKYLLILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAE-------------------------------------RLDGPLSG-----LWVPI

Query:  RYERLSEICTHCGVIGHVARDC---ELLLRSNRRPNFPNQYGDWLRFIGKGMIVSPTLKSSSKTHGHRLSLDVAPTIIHVPPGTELPRPRYGGIQINEPG
        +YERL   C +CG++GH  R C     + +   R  +  QYGD+LR IG G     + K+S ++       D A                          
Subjt:  RYERLSEICTHCGVIGHVARDC---ELLLRSNRRPNFPNQYGDWLRFIGKGMIVSPTLKSSSKTHGHRLSLDVAPTIIHVPPGTELPRPRYGGIQINEPG

Query:  EDRVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNPFQWRRFAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQPGINSRPHPNSMHLM
        +D +Q+    ++  + +L D  N                 NP   RR    E +     ++           T+N  E         I+S  + +   + 
Subjt:  EDRVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNPFQWRRFAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQPGINSRPHPNSMHLM

Query:  CLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMGHDSKLIHNEGSPKDIESDPEMLEEFGPMSPEGANRPQTVDEDDEDKSVKVLESIKDDCVET
          + G    D  +  +E N         + +     F  +++ L+      +    D  +L      SP G N P  V        V  ++    D   T
Subjt:  CLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMGHDSKLIHNEGSPKDIESDPEMLEEFGPMSPEGANRPQTVDEDDEDKSVKVLESIKDDCVET

Query:  --VPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKH---WRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALL
          +P +          E A + QT    D    +  L    D G+E +    N     + ++    W  TGFYG P+   + +SW LL+ L        +
Subjt:  --VPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKH---WRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALL

Query:  IGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQA
        + GD N  L  SEK    +    QI AFR  L  C L D+G+    +TW N+R       +RLDR +AN  +    +   V  L     DH P+ L +Q+
Subjt:  IGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQA

Query:  HDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGS---NQIVPHQLHHALSQCASALKGWGY-----DFSLIHALEKQLDK----------------
          Q PR+   R SFKFEE W  Q+EC  +I+++  WG+   N+     +   +  C   L  WG      D   I  ++KQLD+                
Subjt:  HDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGS---NQIVPHQLHHALSQCASALKGWGY-----DFSLIHALEKQLDK----------------

Query:  -------LLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQM-------------DM
               LL ++E+YW QRSR  WL+ GDRNT++FH KA+ R+++N+I G+ +S G W    E + +   +YF  +F +   DQM             DM
Subjt:  -------LLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQM-------------DM

Query:  TRVL-NHVPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKV
           L N    +EV AA+    PTKAPGPDG  A+FYQK+W +VGD+ VS  L+ LN+   + E+NHTNIVLIPK +    +S++RPISLCNV YKII+KV
Subjt:  TRVL-NHVPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKV

Query:  LANRLKIVLNEIIDECQSAFLPGRSISDNMILG------------GVK-------------HMIEW----------------------------------
        LANRLK VL +II   QSAF+PGR I+DN+++             G K               +EW                                  
Subjt:  LANRLKIVLNEIIDECQSAFLPGRSISDNMILG------------GVK-------------HMIEW----------------------------------

Query:  ----------------------------------------SG--SGISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSV
                                                +G  +G+SI R   KI++L FADDSL+F +A   E      +L  +E+AS QS+N  KS 
Subjt:  ----------------------------------------SG--SGISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSV

Query:  VIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISA
          FS N +   +  +  IL ++  +    YLGLP+   R K   F  L D+VW  LQGW     S+ GKE+L+K++ QAIPTY M  F+IP  +  ++ A
Subjt:  VIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISA

Query:  LCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSN--------------------------------------------
        LCARFWWG  G++R+IHW+ W+ L  PK+ GG+ FRDL  FN AMLAKQ WR++                                              
Subjt:  LCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSN--------------------------------------------

Query:  --------------------PNLTVSRYLGPIDVDVIQGL---------------------------SIC------GSTPDKWIWHYDRKGEYSVKNGYK
                            PN   ++ L  +  D  + L                           +IC         PD   W Y  +G +SVK+ Y 
Subjt:  --------------------PNLTVSRYLGPIDVDVIQGL---------------------------SIC------GSTPDKWIWHYDRKGEYSVKNGYK

Query:  -----LSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW
             L+        +  V KN   W  +WK+R+P+KVKVF W++ H  +PT VNL    +     C +C  E E+T HAL+ C+  ++IW
Subjt:  -----LSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata]1.8e-15032.57Show/hide
Query:  MDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLR
        +D  WRFTGFYG PD + R +SW+LLR L+       L  GD NEI    EK G  +R   Q+  FR+ LDDC L D+G+    FTWCNRR+    V +R
Subjt:  MDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLR

Query:  LDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQS--------SDWG-SNQIVPHQ--------
        LDR +A   +        +  LD    DH+PI L   +  +  R  R    F+FE  W + + C ++I+ S        S WG +N++ P Q        
Subjt:  LDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQS--------SDWG-SNQIVPHQ--------

Query:  --LHHALSQCASALK---------GWGYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENI
            H  +  A  LK         G+  D + ++ L  ++++L  +EE  WKQRSR  WLK GDRNT +FH +AT R KRN I G+ED  G W  K E++
Subjt:  --LHHALSQCASALK---------GWGYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENI

Query:  QKAFENYFKEIFSSSQPDQMD--------------MTRVLNHVPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHE-L
         +  E+YF+ +F+SS P Q +                 +      +EV  A+K   P  APGPDG   IFY+ YW +VG+  +S  L  LNS G +HE L
Subjt:  QKAFENYFKEIFSSSQPDQMD--------------MTRVLNHVPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHE-L

Query:  NHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI-------------------------LGGVKHMI
        N T I LIPK +  + V+++RPISLCNV YK+I KV+ NRLK +L  +ID+ QSAFL GR I+DN++                         +      +
Subjt:  NHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI-------------------------LGGVKHMI

Query:  EW---------------------------------------------SGS--GISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQS
        EW                                             +G   G+SI R+  ++SHLFFADDS++F +A  +E     ++L  +EK S Q 
Subjt:  EW---------------------------------------------SGS--GISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQS

Query:  VNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKG
        +N  K+ + FS N   D +  +  +L +        YLGLP+   R K + F Y+ ++VW  LQGW  +  SQ G+EVL+KS++QAIPTY M CF++PKG
Subjt:  VNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKG

Query:  ILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSR---------------------------YLG
        ++ ++  L  +FWW   G+ R++HW +WE LC+ KE GG+ F+D+  FN ++LAKQVWR+++NP+    R                            +G
Subjt:  ILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSR---------------------------YLG

Query:  PIDV-----------------------------DVIQGLS-----------------------------------ICGS------TPDKWIWHYDRKGEY
          DV                              VI  L                                    ICG        PD+ IW     G +
Subjt:  PIDV-----------------------------DVIQGLS-----------------------------------ICGS------TPDKWIWHYDRKGEY

Query:  SVKNGYKLSMLK--SQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALL
        + K+ YKL + +  +     S       +W+++W++R+P+K+K F W++ ++++PT+ NL   H+     C  C  + E T HAL+ CS    +W+ L
Subjt:  SVKNGYKLSMLK--SQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALL

XP_030940268.1 uncharacterized protein LOC115965235 [Quercus lobata]2.5e-14732.26Show/hide
Query:  MDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLR
        +D  WRFTGFYG P+ + R  SW+LLR L    D   +  GD NEI    EK GG  R   Q+  FR+ LD C   DIG+T   FTWCN R  G  V +R
Subjt:  MDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLR

Query:  LDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQCASALKGW--
        LDR LA+A +     +  +  L     DH+PI L   + D   R  R +  F+FEE W + E C  ++  + D         ++ H ++ C S LK W  
Subjt:  LDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQCASALKGW--

Query:  --------------------------GYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENI
                                  G +   +  L+++++KLL  EE  W QR++  WL++GDRN+++FH +A+ R KRN+I G+ED  G W  + E I
Subjt:  --------------------------GYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENI

Query:  QKAFENYFKEIFSSSQPDQMD---------MTRVLNH-----VPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELN
         +    Y+  +FSSS P + D         +TR +N          EV  A+       +PGPDGFP +FY++YW  +G       + +LN+    HELN
Subjt:  QKAFENYFKEIFSSSQPDQMD---------MTRVLNH-----VPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELN

Query:  HTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI----------------------------------
        HT + LIPK +  R VSD+RPISL NV YK+I KVLANRLK  L E+I E QSAF+PGR I+DN++                                  
Subjt:  HTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI----------------------------------

Query:  -----------------------------------------------------------------LGGVKHMIEWSGS--GISIARSCQKISHLFFADDS
                                                                         L G+    E SGS  G+S+  +  +ISHL FADDS
Subjt:  -----------------------------------------------------------------LGGVKHMIEWSGS--GISIARSCQKISHLFFADDS

Query:  LIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFS
        LIF +A+  +    +++L  +E AS Q++N  K+ + FS N    T++ +  +L++ V +    YLGLPS   R K R F  + +++W  L+GW  +  S
Subjt:  LIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFS

Query:  QGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPN-------
        Q G+E+L+K++VQAIPTY M CF++PKG++ +I  L  +FWWG  G+Q+RIHW  WE LC PK  GG+ FR+L  FN ++LAKQVWR+ +N +       
Subjt:  QGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPN-------

Query:  --------------------------LTVSR----------------------YLGP----------------------IDVD-----------------
                                  L  SR                      +L P                      IDVD                 
Subjt:  --------------------------LTVSR----------------------YLGP----------------------IDVD-----------------

Query:  --VIQGLSI-CGSTPDKWIWHYDRKGEYSVKNGYKL--SMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCH
          +I G+ +   + PD+ +W    KG Y+ ++ Y+L  S  +      SD  K    WK +W +++P K+K  +WK+ +N+IPT+ NLW  +V     C 
Subjt:  --VIQGLSI-CGSTPDKWIWHYDRKGEYSVKNGYKL--SMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCH

Query:  VCHDEMETTDHALFQCSRAREIW
         C  E E T HAL+ C   R +W
Subjt:  VCHDEMETTDHALFQCSRAREIW

TrEMBL top hitse value%identityAlignment
A0A2N9ELB0 Uncharacterized protein4.8e-15728.19Show/hide
Query:  VDDWSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEKYL
        +++  R    S+ E+  +   RE V RS       +L KLL  RP   EA+  +  A W    G  V  +  NLF+  F ++ D  R+    PW F+K L
Subjt:  VDDWSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEKYL

Query:  LILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAERLDGPLSGL-------------------------------------------WVPIRYER
        + +V  I  L+P    F+  AFWI I  LP+      + E +   +  L                                           WV  RYE 
Subjt:  LILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAERLDGPLSGL-------------------------------------------WVPIRYER

Query:  LSEICTHCGVIGHVARDCELLLRSNRRPNFP-NQYGDWLRFI-----GKGMIVSPTLKSSSKTHGHRLSLDVAPTII---HVPPGTELPRPRYG-GIQIN
        L   C  CG+IGH   +C      +    F  +QYG WLR +       G         SS  H  +       T+    +   G  L    Y   I   
Subjt:  LSEICTHCGVIGHVARDCELLLRSNRRPNFP-NQYGDWLRFI-----GKGMIVSPTLKSSSKTHGHRLSLDVAPTII---HVPPGTELPRPRYG-GIQIN

Query:  EPGEDRVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNPFQWRRFAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQPGINSRPHPNSM
         PG      P    +PVS  L            KV E A+   +   +R    K  I  AG          +  PT+ E  +  MP  P IN   +P  M
Subjt:  EPGEDRVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNPFQWRRFAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQPGINSRPHPNSM

Query:  HLMCLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMGHDSKLIHNEGSPKDIESDPEMLEEFGPMSPEGANRPQTVDE-----DDEDKSVKVLES
             S    P + A  ++         L P   + +P  MG D+   +NE      E+ P  ++ F  ++  G   P+TV E       ED S+  L  
Subjt:  HLMCLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMGHDSKLIHNEGSPKDIESDPEMLEEFGPMSPEGANRPQTVDE-----DDEDKSVKVLES

Query:  IKDDCVETVPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVL-ESIKDDGVETVPEMQNETDHLMMD-KHWRFTGFYGQPDHSLRSHSWNLLRRLHDCN
             +ET  E++N  + L           V+       + +L  S  +  +++      + D +  D   WR TGFYG P+ +LR HSW+LLR LH   
Subjt:  IKDDCVETVPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVL-ESIKDDGVETVPEMQNETDHLMMD-KHWRFTGFYGQPDHSLRSHSWNLLRRLHDCN

Query:  DSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQ
            L+ GD NEI    EK G  +R + Q+ AFR  L DC+L D+GY    FTW NRR  G  V +RLDR +AN  + SL    +V  +  A  DH  + 
Subjt:  DSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQ

Query:  LTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIR---QSSDWGSNQIVPHQLHHALSQCASALKGWG----------------------------YDF
        +      Q P  GR +  F+FE  W ++  C   I+    SS  GS   V  Q    + QC   L  W                             Y+ 
Subjt:  LTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIR---QSSDWGSNQIVPHQLHHALSQCASALKGWG----------------------------YDF

Query:  SLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMT-------
        S ++ L ++L+ L  +EE++W+QRSR  WLK GDRNT++FH+ A+ R+K N I G+ D +G W S    I     +YF  +F+SS P+ +D         
Subjt:  SLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMT-------

Query:  -------RVLNHVPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYK
                ++     +E+  A+    P+KAPGPDG  A+F+QKYW +VG       L+ LNS   +  +N TNI LIPK +    ++++RPISLCNV YK
Subjt:  -------RVLNHVPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYK

Query:  IITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMILG-GVKH------------------------MIEW-------------------------SGS-
        I++KVL NR+K +L  +I + QSAF+PGR I+DN+I+   V H                         +EW                         S S 
Subjt:  IITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMILG-GVKH------------------------MIEW-------------------------SGS-

Query:  --------------------------------------------------GISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVN
                                                          G++I R   +ISHLFFADDS+IF +A+T +    +N+L  +EKAS Q VN
Subjt:  --------------------------------------------------GISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVN

Query:  YTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGIL
          K+ + FSKN  +  +  +  +     T     YLGLP    R K   F  + +++W  LQGW  +  SQ G+E L+K++VQAIPTYAM CF+ P G+ 
Subjt:  YTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGIL

Query:  LKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSRYLG-----------------------------
         +I ++  RFWWG  G +R+IHW R + L +PK+ GG+ FRDL  FN+A+LA+Q WR+L +P   VSR+L                              
Subjt:  LKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSRYLG-----------------------------

Query:  -------------------------------------------------------------------PIDVDVIQGLSICGSTP-DKWIWHYDRKGEYSV
                                                                           P DV++I+ + +    P D+ IW     G ++V
Subjt:  -------------------------------------------------------------------PIDVDVIQGLSICGSTP-DKWIWHYDRKGEYSV

Query:  KNGYKLSMLKSQEVFLSD---VEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHPP
        ++ Y L +     +  S    ++++   W  +W  ++  K++VF+W++  + +PT   L++  +    +C  C ++ ETT H L+QC  A+ +W+    P
Subjt:  KNGYKLSMLKSQEVFLSD---VEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHPP

A0A2N9EV43 Reverse transcriptase domain-containing protein2.8e-15728.43Show/hide
Query:  WSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEKYLLIL
        W  LSLT +E  ++V+   EA D S L+         L PR +  E++ R     W+   GF+   +G N  +F F NE D  RV+ +GPW F+K+L+IL
Subjt:  WSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEKYLLIL

Query:  VFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAERLDGPLSGL------------------------------------------WVPIRYERLSEI
              +      F  ++FW+ I +LP+     +   R+   L  L                                          WV  RYERL+  
Subjt:  VFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAERLDGPLSGL------------------------------------------WVPIRYERLSEI

Query:  CTHCGVIGHVARDCELLLRSNRRPNFPNQ-YGDWLRFIGKGMIVSPTL---KSSSKTHG-----HRLSLDVAPTIIHVPPGTELPRPRYGGIQINEPGED
        C  CG I H  +DCE+ LRS    +   Q YG W++   + M   P+    K   ++ G      R S      +   PP  + P PR  G  I    E 
Subjt:  CTHCGVIGHVARDCELLLRSNRRPNFPNQ-YGDWLRFIGKGMIVSPTL---KSSSKTHG-----HRLSLDVAPTIIHVPPGTELPRPRYGGIQINEPGED

Query:  RVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNPFQWRRFAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQPGI----NSRPHPNSMH
         +      S     +L  Q N  ++ ++K  E                 E I E G      +  N   PTV E     + + P +      +  PNS  
Subjt:  RVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNPFQWRRFAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQPGI----NSRPHPNSMH

Query:  LMCLSFGNDPSDHAEAIQEDNNRFSK-NLLPLFQEATPFFMGHDSKLIHNE-----GSPKDIESDPEMLEEFGPMSPEGANRPQTVDEDDEDKSVKVLES
           +     P D    +       SK     +  EA  +  G   K + NE     G+PK + +   M++  GP                  K + +LE+
Subjt:  LMCLSFGNDPSDHAEAIQEDNNRFSK-NLLPLFQEATPFFMGHDSKLIHNE-----GSPKDIESDPEMLEEFGPMSPEGANRPQTVDEDDEDKSVKVLES

Query:  IKDDCVETVPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDS
          +        ++ G +++ S     R   +    ++++  V+++     ++   E Q         K WR TGFYG+P+H  R  SW LL+ L   + +
Subjt:  IKDDCVETVPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDS

Query:  ALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLT
          L  GD NE+L   EK GG  R + QI+ F++ ++ CNL D+GY    +TW N R     V  RLDR LA + +      + V     +  DH  + + 
Subjt:  ALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLT

Query:  IQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIR----QSSDWGSNQIVPHQLHHALSQCASALKGW----------------------------GYDFS
        +    QG +R + +   +FEE W     C  +IR    Q  D GS      QL   +S+C  AL  W                            G    
Subjt:  IQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIR----QSSDWGSNQIVPHQLHHALSQCASALKGW----------------------------GYDFS

Query:  LIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQM-DMTRVLNHV-
         I  L+ +++ LL ++E++W+QRSRE WLK GDRNT++FHQKA  R+ +N I G+ DS GNW      I      YFKEIFSSS   ++ D  RV++ V 
Subjt:  LIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQM-DMTRVLNHV-

Query:  ------------PRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKI
                       E+  A    HP+K+PGPDG  + F+QKYW +VG   VS  L +L S   + + NH++IVLIPK +  +L+SDYRPISL NV YKI
Subjt:  ------------PRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKI

Query:  ITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMIL-------------GGVKHM------------IEWSG----------------------------
        ++KVLANRLK VL  II E QSAF+PGR I+DN+ +             G V  M            +EW+                             
Subjt:  ITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMIL-------------GGVKHM------------IEWSG----------------------------

Query:  ------------------------------------------------SGISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVNY
                                                        +G+SI R   KISHL FADDSL+F  A+  E      VL  +EKAS Q VN 
Subjt:  ------------------------------------------------SGISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVNY

Query:  TKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILL
         K+ + FSKN  +  R  +  +  +R   +   YLGLP+   + K + F  L + +   LQGW  +  S+ G+ +L+K+I QAIPTY M CF++PK    
Subjt:  TKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILL

Query:  KISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSRYL-------------------------------
         +++L +++WWG   ++ +IHW  W  LC  K  GG+ FRDL  FN A+LAKQ WR+L+N     +R                                 
Subjt:  KISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSRYL-------------------------------

Query:  ------------------------------------------------GPIDVDVIQGLSICG----------STP---DKWIWHYDRKGEYSVKNGYKL
                                                        G  DV V+Q +   G          S P   D + W  +  G +SVK+ Y+L
Subjt:  ------------------------------------------------GPIDVDVIQGLSICG----------STP---DKWIWHYDRKGEYSVKNGYKL

Query:  SMLKSQ---EVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWAL
        +        E   S++ +   +WK +WK+++P+K+K+ +W++  N+IPT + L    V     C +C   +ET  HAL+ C  A  +WAL
Subjt:  SMLKSQ---EVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWAL

A0A2N9HLP3 Uncharacterized protein3.2e-16128.14Show/hide
Query:  SLVDDWSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEK
        SL   W + SL+  E  ++ +       +S      CL  K L  R L AEA+ R F   W+ D GF +  +  N+ +F F +E DR RV+   PW ++K
Subjt:  SLVDDWSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEK

Query:  YLLILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAERLDGPLSGLWVPIRYERLSEICTHCGVIGHVARDCELLLRSNRRPNF-PNQYGDWLRF
        +L++L          +  F   +FW+ +  +P+   NH +A  L G   G  V +  +  S      G    + RDC   LR+         Q+G W+R 
Subjt:  YLLILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAERLDGPLSGLWVPIRYERLSEICTHCGVIGHVARDCELLLRSNRRPNF-PNQYGDWLRF

Query:  IGKGMIVSPTLKSSSKTHGHRLSLDVAPTIIHVPPGTELPRPRYGGIQINEPGEDRVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNPFQWRR
                P  K+  K  G  +     P   H PP +  P                 Q    PS P S++    P  I+               P     
Subjt:  IGKGMIVSPTLKSSSKTHGHRLSLDVAPTIIHVPPGTELPRPRYGGIQINEPGEDRVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNPFQWRR

Query:  FAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQ-----PGINSRPHPNSMHLMCLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMG--H
             +   A   +         LP+   R+      Q       +N  P P +  +M L            ++ +N   S+    +F + T        
Subjt:  FAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQ-----PGINSRPHPNSMHLMCLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMG--H

Query:  DSKLIHNEGSPKDIESDPEMLEEFGPM--SPEGANRPQTVDE------DDEDKSVKVLESIKDDCVETVPEMQNGTDHLMSPEEANR--------PQTVD
        + K+   + S K +   P        +  +  G   P+TV E        +   V ++E+ +D+        +   D+    +  N+          +++
Subjt:  DSKLIHNEGSPKDIESDPEMLEEFGPM--SPEGANRPQTVDE------DDEDKSVKVLESIKDDCVETVPEMQNGTDHLMSPEEANR--------PQTVD

Query:  EDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFR
              S   ++++ DD                    WRFTGFYG P+ S R  SW LLRRL+  +       GD NE++   EK G   R   Q+  FR
Subjt:  EDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFR

Query:  NVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRI
        +VLDDC   D+G+    FTW N R  G+    RLDR +A   +     +  V  LD    DH+P+ ++       P + + R  F+FEE WT  + C   
Subjt:  NVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRI

Query:  IRQSSDWGSNQIVPHQLHHALSQCASALKGW----------------------------GYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNT
        +  +     N +    +   + +C   L+ W                            GY    +  L ++L  LL +EE  W+QRSR  WLK GDRNT
Subjt:  IRQSSDWGSNQIVPHQLHHALSQCASALKGW----------------------------GYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNT

Query:  RWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMD-------------MTRVLNHV-PRKEVTAAIKDFHPTKAPGPDGFP
        R+FH +AT R++RN++  +++S G W +    +   F +++  +F +  PDQ+D             M   LN      EV AA+K   P KAPGPDG P
Subjt:  RWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMD-------------MTRVLNHV-PRKEVTAAIKDFHPTKAPGPDGFP

Query:  AIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI-
         +F+QKYW  +GD      L  LN+   +  +NHT I LIPK +    V D+RPI LCNV YKII+KVLANRLKI+L +II E QSAF+PGR I+DN++ 
Subjt:  AIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI-

Query:  ---------------LGGVKHMIEWSG--------------SGISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSVVIF
                       +G +   ++  G               G+SI+R   KI+HLFFADDSL+F KA   +    + +L  +E+AS Q +N  K+ + F
Subjt:  ---------------LGGVKHMIEWSG--------------SGISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSVVIF

Query:  SKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISALCA
        SK+     +  +  +L +   +    YLGLPS   R K   F  + ++VWS L+GW  +  SQ G+E+L+KS+ QAIP YAM CFR+P  ++ +I  L  
Subjt:  SKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISALCA

Query:  RFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPN----------------------LTVSRYLG---------------
        RFWWG  GD+ ++HW  W  LCK K  GG+ FR+L  FN+A+LAKQVWR++ NP+                      ++ S Y                 
Subjt:  RFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPN----------------------LTVSRYLG---------------

Query:  -----------------------------------------------------------PIDVDVIQGLSICG-STPDKWIWHYDRKGEYSVKNGYKLSM
                                                                   P D   I G+ +C     D  +W   R G+Y+V++GY L +
Subjt:  -----------------------------------------------------------PIDVDVIQGLSICG-STPDKWIWHYDRKGEYSVKNGYKLSM

Query:  LKSQEV--FLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHPPKMRVLWDPM
         +  +     SD  +    W  +W ++IP K + F+W++ H S+PT  NL + H+    TC  C +++ETT HAL+ C   + +W  L   +     +  
Subjt:  LKSQEV--FLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHPPKMRVLWDPM

Query:  DIKDRWHG----VSEEPMQIF
        D  D WH     +S   +Q+F
Subjt:  DIKDRWHG----VSEEPMQIF

A0A2N9HYE3 Reverse transcriptase domain-containing protein1.8e-15927.51Show/hide
Query:  SLVDDWSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEK
        SL   W + SL+  E + + +       +S L+       K L  R L  EA+ R F   W+ D GF +  +  N  +F F +E DR RV+   PW ++K
Subjt:  SLVDDWSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEK

Query:  YLLILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAERLDGPLSGL------------------------------------------WVPIRYE
        YL+IL          +  F+  +FW+ +  +P+   NH +AE L   L  +                                          W+  +YE
Subjt:  YLLILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAERLDGPLSGL------------------------------------------WVPIRYE

Query:  RLSEICTHCGVIGHVARDCELLLRSNRRPNFPN-QYGDWLRFIGK-----------GMIVSPTLKSSSKTHGHRLS---------LDVAPTIIHVPPGTE
        RL   C  CG++ H  +DC   LR+       + Q+G WLR   +           G+I  P  K  +K     +S          +  PT   +PP T 
Subjt:  RLSEICTHCGVIGHVARDCELLLRSNRRPNFPN-QYGDWLRFIGK-----------GMIVSPTLKSSSKTHGHRLS---------LDVAPTIIHVPPGTE

Query:  LPRPRYGGIQINEPGEDRVQRPRAPS-SPVSYSLA---DQPNRIVKGKEKVEEVADKYRNPFQWRRFAGKEKIGEAGASQWRSKSSNRGL---PTVNERE
           P +     N P  ++   P  P+ +PV++S     D P  +     ++EE      NP +   F   E+ G+    + + +  +  +   P   E  
Subjt:  LPRPRYGGIQINEPGEDRVQRPRAPS-SPVSYSLA---DQPNRIVKGKEKVEEVADKYRNPFQWRRFAGKEKIGEAGASQWRSKSSNRGL---PTVNERE

Query:  VKRMPRQPGINSRPHPNSMHLMCLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMGHDSKLIHNE----GSPKDIESDPEMLEEFGPMSPEGANR
         K +P      + P   S   +      D ++ A A   D    S          T F    +S+ I  E    G        P M       +  G   
Subjt:  VKRMPRQPGINSRPHPNSMHLMCLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMGHDSKLIHNE----GSPKDIESDPEMLEEFGPMSPEGANR

Query:  PQTVDE-----DDEDKSVKVLESIKDDCVETVPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKH---WRFTG
        P+TV +       +D SV  L       +ET  + +   + L    + +    V+  ++   + +    KD  +       +  D L+ D     WRFTG
Subjt:  PQTVDE-----DDEDKSVKVLESIKDDCVETVPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKH---WRFTG

Query:  FYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANAN
        FYG P+   R  SW+LLRRL+          GD NE++   EK G   R   Q+  FR+VLD+C   D+G+T   FTW N R  G+    RLDR++A  +
Subjt:  FYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANAN

Query:  FYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQCASALKGW-----------
        +     +  V+ L+    DH+PI ++ +      R+      F+FEE WT  + C  +I  S       +  + +   +  C   L+ W           
Subjt:  FYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQCASALKGW-----------

Query:  -----------------GYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFK
                         G D   ++ L+++L  LL +EE  W+QRSR  WL  GDRNTR+FH +AT RK++N +  +   +G W +    +   F  Y+K
Subjt:  -----------------GYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFK

Query:  EIFSSSQPDQMDM--------------TRVLNHVPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPK
         +F ++ PDQ++               +++++     EV  A+K   P KAPGPD  P IFYQKYW ++G    +  L  LNS   +  +NHT+I LIPK
Subjt:  EIFSSSQPDQMDM--------------TRVLNHVPRKEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPK

Query:  SRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMILG--GVKHM-----------------------IEW--------
         +    V ++RPISLCNV YK+I+KVLANRLKI+L  I+ E QSAF+PGR I+DN+++    + HM                       +EW        
Subjt:  SRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMILG--GVKHM-----------------------IEW--------

Query:  ------------------------------------------------------------------SGS--GISIARSCQKISHLFFADDSLIFLKAATK
                                                                          SG+  G+SI+RS  KI+HLFFADDSL+F KA T 
Subjt:  ------------------------------------------------------------------SGS--GISIARSCQKISHLFFADDSLIFLKAATK

Query:  EFGHFRNVLLDFEKASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLK
        +    + +L  +E+AS Q VN  K+ + FSK+     +  +  +L +   +    YLGLPS   R K   F  + ++VWS L+GW  +  SQ G+E+L+K
Subjt:  EFGHFRNVLLDFEKASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLK

Query:  SIVQAIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNL---------------
        S+ QAIP YAM CFR+P  ++ +I  L  RFWWG GGD+ ++HW  W  LCK K  GG+  RDL  FN+A+LAKQVWR+L NP+                
Subjt:  SIVQAIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNL---------------

Query:  ---------------------------------------------------------------------------------TVSRYLGPIDVDVIQGLSI
                                                                                          V R   P +  VI G+ +
Subjt:  ---------------------------------------------------------------------------------TVSRYLGPIDVDVIQGLSI

Query:  CGSTP-DKWIWHYDRKGEYSVKNGYKLSM--LKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETT
            P D  +W   + G Y+V++GY L +      E   SD  K    W  +W + +P K++ F+W++ HNS+PT  NL + H+    +C  C +++E+T
Subjt:  CGSTP-DKWIWHYDRKGEYSVKNGYKLSM--LKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETT

Query:  DHALFQCSRAREIW
         HAL+QC   + +W
Subjt:  DHALFQCSRAREIW

A0A2N9IIR5 Uncharacterized protein2.6e-16328.12Show/hide
Query:  SLVDDWSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEK
        S+   W + SLT  E       D   +D S+ S    L  K    R + AE++ R F   WK   GF V  +G N  +F F +  D  RV+ + PW ++K
Subjt:  SLVDDWSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLCPRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEK

Query:  YLLILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAE-------------------------------------------RLDGPLSGLWVPIRY
        +L+IL      +      FS+I FW+ I  LP+      +AE                                           RL G   G WV  ++
Subjt:  YLLILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAE-------------------------------------------RLDGPLSGLWVPIRY

Query:  ERLSEICTHCGVIGHVARDCELLLRSNR-RPNFPNQYGDWLR----FIGKGMIVS------PTLKSSSKTHGHRLSLDVAPTIIHVPPGT---ELPRPRY
        ERL   C +CG + H  +DC+  +RS        +QYGDWLR    FIGK  +VS       T ++  KT+  + S      + H  P T   +   P  
Subjt:  ERLSEICTHCGVIGHVARDCELLLRSNR-RPNFPNQYGDWLR----FIGKGMIVS------PTLKSSSKTHGHRLSLDVAPTIIHVPPGT---ELPRPRY

Query:  GGIQINEPGEDRVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNP-FQWRRFAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQPGINS
               PG+++       +  VS      P   V+   +++    K++N   +  + +  + +     S    K   +  PT +  + K          
Subjt:  GGIQINEPGEDRVQRPRAPSSPVSYSLADQPNRIVKGKEKVEEVADKYRNP-FQWRRFAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQPGINS

Query:  RPHPNSM-------HLMCLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMGHDSKLIHNEGSPKDIESDPEMLEEFGPMSPEGANRPQTVDEDDE
           PN +           L FGN P                        A P  M          G+P+ ++   E+ E      PE     +T  + D+
Subjt:  RPHPNSM-------HLMCLSFGNDPSDHAEAIQEDNNRFSKNLLPLFQEATPFFMGHDSKLIHNEGSPKDIESDPEMLEEFGPMSPEGANRPQTVDEDDE

Query:  DKSVKVLESIKDDCVETVPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKHWRFTGFYGQPDHSLRSHSWNLL
         + ++   S  +  V  VP    G   ++  +       +    +  S   ++SI ++G                   WRFTGFYG P+   R HSW++L
Subjt:  DKSVKVLESIKDDCVETVPEMQNGTDHLMSPEEANRPQTVDEDDEDKSVKVLESIKDDGVETVPEMQNETDHLMMDKHWRFTGFYGQPDHSLRSHSWNLL

Query:  RRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAK
        R L           GD NE++  SEK GG  R   Q+ AFR+VLD+C   D+G+    FTWCN R  G  V  RLDR++ N+ +    +   V  +    
Subjt:  RRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAK

Query:  FDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPH--QLHHALSQCASALKGW--------------------------
         DH P+ L+  +    PR    +  F+FE  W   E CR  +  +  W SN   P   Q+ + ++ C   L  W                          
Subjt:  FDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPH--QLHHALSQCASALKGW--------------------------

Query:  --GYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMTR
          G D + +  L  +L +LL +EE  W QRSR  WLK GDRNTR+FH +A+ R++RN I G+ D +G+W  + + I +    YF+++F ++ P  +++ +
Subjt:  --GYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMTR

Query:  VLNHVPR----------------KEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPI
        V  HV R                +EV  AIK   P  APGPDG P +FYQ +W +VG+   +  L  LNS   +  +NHT I LIPK +    ++++RPI
Subjt:  VLNHVPR----------------KEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPI

Query:  SLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI-----LGGVKH--------------------MIEWSG-------------------
        SLCNV YKII+KV+ANRLK ++ ++I + QSAF+PGR I+DN++     L  +KH                     +EW                     
Subjt:  SLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPGRSISDNMI-----LGGVKH--------------------MIEWSG-------------------

Query:  ---------------------------------------------------------SGISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFE
                                                                  G+SI+RS  K+SHLFFADDSL+F +A   E     ++LL +E
Subjt:  ---------------------------------------------------------SGISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFE

Query:  KASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGC
         AS Q +N  K+ + FSK+ +   ++ +   L +    S   YLGLPS   R K + F Y+ ++VWS LQGW  +  SQ G+EVL+K++VQA+PT++M C
Subjt:  KASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGLPSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGC

Query:  FRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRV----------------------------------
        FR+P  +   +  L  RFWWG G D+R+I W  W+ LCKPK  GG+ FR++  FN A+L KQVWR+                                  
Subjt:  FRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRV----------------------------------

Query:  --------------------------------LSNPNL--------------TVSR----------------YLGPIDVDVIQG--LSICGSTPDKWIWH
                                        LS+PNL              TVS+                +L P +V+ I+   LS+C   PD   W 
Subjt:  --------------------------------LSNPNL--------------TVSR----------------YLGPIDVDVIQG--LSICGSTPDKWIWH

Query:  YDRKGEYSVKNGYKLSMLKSQEVFL---SDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAR
            G +SV++ Y+L +L+ ++  L   S   +    WK VW + IP  V+ F+WK+   ++PT + LW   V     C  C D +E T HAL+ CS   
Subjt:  YDRKGEYSVKNGYKLSMLKSQEVFL---SDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAR

Query:  EIWA
         IW+
Subjt:  EIWA

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.3e-1327.96Show/hide
Query:  RKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMTRVLN--HVPR---------------KEVTAAIKDFHPTKAPGPDGFPAIFYQK
        ++++N ID +++  G+  +    IQ     Y+K ++++   +  +M   L+   +PR                E+ A I      K+PGPDGF A FYQ+
Subjt:  RKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMTRVLN--HVPR---------------KEVTAAIKDFHPTKAPGPDGFPAIFYQK

Query:  YWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPK-SRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPG
        Y + +    +     I       +     +I+LIPK  R      ++RPISL N+  KI+ K+LANR++  + ++I   Q  F+PG
Subjt:  YWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPK-SRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPG

P08548 LINE-1 reverse transcriptase homolog1.7e-1327.49Show/hide
Query:  DRNTRWFHQKAT---------MRKKR--NWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMTRVLN--HVPR---------------KEVT
        +++  WF +K            RKKR  + I  + + N    +    IQK    Y+K+++S    +  ++ + L   H+PR                E+ 
Subjt:  DRNTRWFHQKAT---------MRKKR--NWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMTRVLN--HVPR---------------KEVT

Query:  AAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRI-HELNHTNIVLIPK-SRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEI
        + I++    K+PGPDGF + FYQ + + +    + N  + +   G + +     NI LIPK  +      +YRPISL N+  KI+ K+L NR++  + +I
Subjt:  AAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRI-HELNHTNIVLIPK-SRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEI

Query:  IDECQSAFLPG
        I   Q  F+PG
Subjt:  IDECQSAFLPG

P11369 LINE-1 retrotransposable element ORF2 protein7.0e-1228.33Show/hide
Query:  IDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMTRVLN--HVPR---------------KEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVG
        I+ + +  G+  +  E IQ    +++K ++S+   +  +M + L+   VP+               KE+ A I      K+PGPDGF A FYQ + + + 
Subjt:  IDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMTRVLN--HVPR---------------KEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVG

Query:  DTTVSNCLEILNSNGRIHELNHTNIVLIPK-SRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPG
                +I       +      I LIPK  +    + ++RPISL N+  KI+ K+LANR++  +  II   Q  F+PG
Subjt:  DTTVSNCLEILNSNGRIHELNHTNIVLIPK-SRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLPG

P14381 Transposon TX1 uncharacterized 149 kDa protein5.7e-1427.04Show/hide
Query:  RSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSS--QPDQMD-------------MTRVLNHVPRKEVTAAI
        RSR   L   DR +R+F+     +  R  I  +   +G      E I+    ++++ +FS     PD  +               R+   +   E++ A+
Subjt:  RSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSS--QPDQMD-------------MTRVLNHVPRKEVTAAI

Query:  KDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQ
        +     K+PG DG    F+Q +WD +G        E               + L+PK    RL+ ++RP+SL +  YKI+ K ++ RLK VL E+I   Q
Subjt:  KDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQ

Query:  SAFLPGRSISDNMILGGVKHMIEWS-GSGISIA
        S  +PGR+I DN+ L  ++ ++ ++  +G+S+A
Subjt:  SAFLPGRSISDNMILGGVKHMIEWS-GSGISIA

P93295 Uncharacterized mitochondrial protein AtMg003105.0e-1848.86Show/hide
Query:  AIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKE-VGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSRYL
        A+P YAM CFR+ K +  K+++    FWW S  ++R+I W  W+ LCK KE  GGL FRDL  FNQA+LAKQ +R++  P+  +SR L
Subjt:  AIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKE-VGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSRYL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.4e-2325.42Show/hide
Query:  DSALLIGGDMNEILWQSEKSGGPERDIGQ--IMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLAN----ANFYSLIENFEVAKLDWAKF
        D  +++ GD ++I   S+     +  I    +  F+N L D +L DI      +TW N +     +  +LDR +AN    ++F S I  FE++ +     
Subjt:  DSALLIGGDMNEILWQSEKSGGPERDIGQ--IMAFRNVLDDCNLWDIGYTQDVFTWCNRRSTGEKVSLRLDRLLAN----ANFYSLIENFEVAKLDWAKF

Query:  DHRP-----------------------------IQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQCASALKGW
        DH P                             + LT+   +Q P      H F   E     ++C +++ +   +G+ Q    +   +L    S L   
Subjt:  DHRP-----------------------------IQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQCASALKGW

Query:  GYD--FSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQ----PDQM
          D  F + H   K+ +      E +++Q+SR  WL+ GD NTR+FH+     + +N I  +   +         +++    Y+  +  S      PD +
Subjt:  GYD--FSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQ----PDQM

Query:  D-------------MTRVLNHVPR-KEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYR
                      +   L+ +P  KE+TAA+      KAPGPD F A F+ + W VV D+T++   E   +   +   N T I LIPK      +S +R
Subjt:  D-------------MTRVLNHVPR-KEVTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYR

Query:  PISLCNVSYKIIT
        P+S C V YKIIT
Subjt:  PISLCNVSYKIIT

AT3G09510.1 Ribonuclease H-like superfamily protein1.2e-1130.6Show/hide
Query:  VSRYLGPIDVDVIQGLSICGS-TPDKWIWHYDRKGEYSVKNGY-KLSMLKSQEVFLSDVEKNNFWWK-RVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNH
        +S+++   D   I  + +  S  PDK IW+Y+  GEY+V++GY  L+   S  +   +    +   K R+W + I  K+K F+W++   ++ T   L   
Subjt:  VSRYLGPIDVDVIQGLSICGS-TPDKWIWHYDRKGEYSVKNGY-KLSMLKSQEVFLSDVEKNNFWWK-RVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNH

Query:  HVPVMGTCHVCHDEMETTDHALFQCSRAREIWAL
         + +  +C  CH E E+ +HALF C  A   W L
Subjt:  HVPVMGTCHVCHDEMETTDHALFQCSRAREIWAL

AT3G25270.1 Ribonuclease H-like superfamily protein1.8e-0734.38Show/hide
Query:  RVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW
        ++WK++   K+K F+WK    ++ T  NL   H+     CH C  E ET+ H  F C  A+++W
Subjt:  RVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW

AT4G29090.1 Ribonuclease H-like superfamily protein8.7e-1841.18Show/hide
Query:  AIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSR
        A+PTY M CF +PK +  +I ++ A FWW +  + + +HW+ W+HL   K  GG+ F+D+  FN A+L KQ+WR+LS P   +++
Subjt:  AIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.6e-1948.86Show/hide
Query:  AIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKE-VGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSRYL
        A+P YAM CFR+ K +  K+++    FWW S  ++R+I W  W+ LCK KE  GGL FRDL  FNQA+LAKQ +R++  P+  +SR L
Subjt:  AIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKE-VGGLNFRDLVNFNQAMLAKQVWRVLSNPNLTVSRYL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCGTTTAATTTTTTCTCTCTCAGGGCGTTTTCACTGCTGCTGGATTCTCGCACCGGATTTTCAGCTGTCAAGCTGCTATATCAGTGTAGACGATGGTGTGTTGT
TTCATCATCGCTACTGGTGGTAACTTCATCCTTTCTCTACTCTTGCTTCTATTCTTTGTGTTTTTATCTCGAAACTATGGATCCTTCCTCTTTAGTTGATGACTGGTCCC
GTTTGAGCCTGACTTCGGAGGAGGAAGAAGTTTTGGTTGTAGCGGATAGAGAAGCTGTGGATAGATCTAATCTTTCCTTGGATCTCTGTTTGCTCGAGAAATTGCTTTGC
CCTCGTCCTCTTGGTGCTGAGGCCATGGGGAGAAATTTTTGTGCTGCCTGGAAGCTTGATATTGGGTTCCAGGTGGATCGGCTTGGGCGTAATCTTTTCATCTTTCGGTT
TGTGAACGAATTTGACCGACTGCGTGTGATTCGACATGGACCATGGCTATTCGAGAAATATCTCCTGATTTTGGTCTTTTCGATTCGGGGGCTAAAGCCGTTGGACCATC
ATTTTTCTTCGATTGCATTTTGGATTCATATTTTTGAGCTACCACTGGACTGGTTTAACCATAGCATGGCGGAGAGATTGGATGGACCGCTCAGTGGGTTATGGGTTCCT
ATTCGTTATGAGAGACTGTCGGAGATTTGTACACACTGTGGAGTTATTGGCCATGTTGCACGAGACTGTGAATTGCTCCTACGCTCAAATCGCCGTCCAAATTTCCCTAA
TCAGTACGGTGATTGGCTAAGGTTTATTGGAAAAGGAATGATAGTCTCACCTACGCTCAAATCGTCGTCCAAAACCCATGGCCACCGGTTATCGCTTGATGTAGCGCCGA
CGATAATCCATGTCCCTCCTGGAACCGAGCTCCCTCGGCCACGCTACGGTGGTATTCAGATTAATGAGCCTGGCGAAGATCGTGTGCAAAGGCCTCGGGCTCCATCATCA
CCGGTATCCTACTCTCTCGCTGATCAACCGAATCGAATTGTTAAAGGGAAAGAGAAGGTGGAGGAGGTTGCAGACAAATACAGAAATCCATTCCAATGGCGGCGGTTTGC
TGGAAAGGAGAAGATAGGTGAGGCTGGTGCATCTCAGTGGCGGTCGAAGTCTTCCAACCGTGGGTTGCCGACGGTGAACGAAAGGGAAGTGAAACGGATGCCTCGTCAGC
CAGGCATTAATTCGAGGCCCCATCCAAATTCAATGCATTTAATGTGCCTTAGTTTTGGTAACGACCCAAGTGACCATGCTGAGGCTATTCAAGAGGATAACAACCGTTTT
TCAAAGAATTTATTACCTTTGTTTCAAGAGGCTACGCCGTTTTTCATGGGCCATGATTCTAAATTAATCCACAATGAAGGAAGCCCAAAAGACATTGAAAGCGATCCGGA
AATGCTTGAGGAGTTTGGGCCCATGAGCCCAGAAGGAGCAAATAGGCCGCAGACCGTTGATGAAGATGACGAGGACAAATCTGTGAAAGTGCTGGAATCCATCAAGGATG
ATTGTGTGGAGACGGTTCCTGAGATGCAAAATGGAACAGACCACCTTATGAGCCCAGAAGAAGCAAATAGGCCACAAACCGTTGATGAAGATGACGAGGACAAATCTGTG
AAAGTGCTGGAATCCATCAAGGATGATGGTGTGGAGACGGTTCCTGAGATGCAAAATGAAACAGACCACCTTATGATGGATAAACATTGGCGATTTACGGGTTTCTATGG
TCAACCTGATCATTCTCTTCGTTCTCACTCATGGAACCTTCTTCGGAGATTGCATGATTGTAATGATTCAGCCTTGTTGATTGGGGGAGATATGAATGAAATTCTATGGC
AGTCTGAGAAATCGGGTGGTCCAGAAAGGGATATTGGGCAAATAATGGCTTTTCGGAATGTGTTAGATGATTGCAATCTTTGGGATATTGGATATACTCAGGATGTGTTT
ACTTGGTGTAACAGGCGAAGCACGGGGGAAAAGGTTAGTCTACGGCTAGACCGTTTGCTGGCAAATGCAAATTTCTATAGCCTTATAGAGAACTTCGAGGTAGCGAAATT
GGATTGGGCTAAATTCGATCATAGGCCGATTCAGCTCACCATTCAGGCACATGACCAGGGTCCTCGAAGGGGAAGAAATAGACATTCGTTTAAATTTGAAGAATGGTGGA
CACGTCAGGAGGAATGTCGAAGGATTATCCGGCAAAGTAGTGATTGGGGCAGTAATCAAATTGTGCCTCATCAACTTCATCATGCATTATCCCAATGTGCGTCGGCTTTG
AAGGGTTGGGGATATGATTTCTCGCTAATTCATGCTTTGGAGAAACAACTTGATAAGTTACTGTTGGAGGAGGAAATGTATTGGAAACAAAGATCAAGAGAATGTTGGCT
GAAATGGGGTGATCGTAATACGAGATGGTTCCACCAAAAGGCCACAATGAGGAAAAAAAGGAATTGGATTGATGGCGTTGAAGACTCGAATGGGAATTGGCAATCCAAAA
CGGAGAACATACAGAAGGCGTTCGAGAATTATTTCAAAGAAATTTTTTCATCATCCCAGCCTGATCAGATGGACATGACTAGAGTGCTCAACCATGTTCCACGGAAGGAG
GTCACGGCTGCCATTAAGGATTTCCATCCAACCAAAGCTCCTGGACCTGATGGCTTCCCTGCAATTTTCTATCAGAAATATTGGGATGTAGTAGGTGACACTACTGTTTC
GAATTGTTTGGAGATTTTGAATTCAAATGGAAGGATTCATGAGTTGAATCATACCAATATTGTTCTAATACCAAAGAGTCGTCAGGCGAGGTTAGTATCCGATTATCGGC
CTATTAGCTTGTGTAATGTTTCTTATAAGATTATTACTAAGGTATTGGCTAATAGACTTAAGATTGTTTTGAATGAGATTATTGATGAATGTCAATCAGCGTTTCTTCCG
GGTAGATCAATATCTGATAACATGATTTTGGGTGGAGTAAAGCATATGATAGAGTGGAGTGGTTCTGGAATATCCATTGCCAGATCCTGCCAAAAAATCTCCCACTTATT
TTTTGCGGATGACAGTCTGATCTTTCTCAAAGCTGCGACTAAGGAATTTGGTCATTTTCGGAATGTATTATTGGACTTTGAAAAGGCGTCCAGTCAATCTGTCAACTATA
CAAAATCAGTGGTGATATTTTCCAAAAATGTGGCCAATGATACAAGGCAATATCTGGGTCGCATTTTGTCTATGAGAGTGACTGAATCATTGGGAGCTTACCTTGGTTTA
CCATCAACCTTTCATAGAGGTAAATCTCGAGATTTCAAATATCTGCTCGATAAAGTTTGGTCTGTTCTGCAAGGGTGGAATAGTCAATTTTTTTCCCAAGGCGGAAAGGA
GGTGCTTCTCAAGAGTATAGTTCAGGCTATTCCAACATATGCCATGGGGTGTTTCCGAATTCCAAAAGGCATTTTGCTTAAGATCTCCGCCCTATGTGCAAGGTTTTGGT
GGGGCTCAGGTGGAGACCAACGTCGGATTCATTGGCAGCGATGGGAGCATCTGTGCAAGCCAAAAGAAGTTGGAGGTCTAAACTTTAGAGATCTTGTCAACTTCAATCAG
GCAATGCTTGCAAAACAAGTATGGAGAGTGTTATCTAATCCGAACCTTACTGTTTCTAGGTACTTGGGTCCAATCGATGTGGACGTTATTCAAGGCTTATCGATCTGTGG
TTCGACGCCTGATAAATGGATATGGCATTATGATAGAAAAGGGGAGTATTCTGTTAAAAATGGATACAAGCTCTCGATGCTAAAGAGCCAAGAGGTATTTCTATCAGACG
TGGAAAAAAATAATTTCTGGTGGAAGAGAGTGTGGAAGATGAGAATTCCTAGTAAAGTTAAAGTATTCATTTGGAAATCATTCCACAACTCAATCCCAACCATGGTAAAC
CTATGGAACCATCATGTGCCAGTTATGGGAACCTGTCATGTATGCCATGATGAGATGGAGACTACAGATCATGCTTTGTTTCAGTGTTCGAGGGCTCGAGAGATATGGGC
CCTTCTTCACCCGCCCAAGATGAGAGTGCTTTGGGATCCTATGGACATTAAAGACCGATGGCATGGCGTCTCTGAAGAACCCATGCAGATTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCGTTTAATTTTTTCTCTCTCAGGGCGTTTTCACTGCTGCTGGATTCTCGCACCGGATTTTCAGCTGTCAAGCTGCTATATCAGTGTAGACGATGGTGTGTTGT
TTCATCATCGCTACTGGTGGTAACTTCATCCTTTCTCTACTCTTGCTTCTATTCTTTGTGTTTTTATCTCGAAACTATGGATCCTTCCTCTTTAGTTGATGACTGGTCCC
GTTTGAGCCTGACTTCGGAGGAGGAAGAAGTTTTGGTTGTAGCGGATAGAGAAGCTGTGGATAGATCTAATCTTTCCTTGGATCTCTGTTTGCTCGAGAAATTGCTTTGC
CCTCGTCCTCTTGGTGCTGAGGCCATGGGGAGAAATTTTTGTGCTGCCTGGAAGCTTGATATTGGGTTCCAGGTGGATCGGCTTGGGCGTAATCTTTTCATCTTTCGGTT
TGTGAACGAATTTGACCGACTGCGTGTGATTCGACATGGACCATGGCTATTCGAGAAATATCTCCTGATTTTGGTCTTTTCGATTCGGGGGCTAAAGCCGTTGGACCATC
ATTTTTCTTCGATTGCATTTTGGATTCATATTTTTGAGCTACCACTGGACTGGTTTAACCATAGCATGGCGGAGAGATTGGATGGACCGCTCAGTGGGTTATGGGTTCCT
ATTCGTTATGAGAGACTGTCGGAGATTTGTACACACTGTGGAGTTATTGGCCATGTTGCACGAGACTGTGAATTGCTCCTACGCTCAAATCGCCGTCCAAATTTCCCTAA
TCAGTACGGTGATTGGCTAAGGTTTATTGGAAAAGGAATGATAGTCTCACCTACGCTCAAATCGTCGTCCAAAACCCATGGCCACCGGTTATCGCTTGATGTAGCGCCGA
CGATAATCCATGTCCCTCCTGGAACCGAGCTCCCTCGGCCACGCTACGGTGGTATTCAGATTAATGAGCCTGGCGAAGATCGTGTGCAAAGGCCTCGGGCTCCATCATCA
CCGGTATCCTACTCTCTCGCTGATCAACCGAATCGAATTGTTAAAGGGAAAGAGAAGGTGGAGGAGGTTGCAGACAAATACAGAAATCCATTCCAATGGCGGCGGTTTGC
TGGAAAGGAGAAGATAGGTGAGGCTGGTGCATCTCAGTGGCGGTCGAAGTCTTCCAACCGTGGGTTGCCGACGGTGAACGAAAGGGAAGTGAAACGGATGCCTCGTCAGC
CAGGCATTAATTCGAGGCCCCATCCAAATTCAATGCATTTAATGTGCCTTAGTTTTGGTAACGACCCAAGTGACCATGCTGAGGCTATTCAAGAGGATAACAACCGTTTT
TCAAAGAATTTATTACCTTTGTTTCAAGAGGCTACGCCGTTTTTCATGGGCCATGATTCTAAATTAATCCACAATGAAGGAAGCCCAAAAGACATTGAAAGCGATCCGGA
AATGCTTGAGGAGTTTGGGCCCATGAGCCCAGAAGGAGCAAATAGGCCGCAGACCGTTGATGAAGATGACGAGGACAAATCTGTGAAAGTGCTGGAATCCATCAAGGATG
ATTGTGTGGAGACGGTTCCTGAGATGCAAAATGGAACAGACCACCTTATGAGCCCAGAAGAAGCAAATAGGCCACAAACCGTTGATGAAGATGACGAGGACAAATCTGTG
AAAGTGCTGGAATCCATCAAGGATGATGGTGTGGAGACGGTTCCTGAGATGCAAAATGAAACAGACCACCTTATGATGGATAAACATTGGCGATTTACGGGTTTCTATGG
TCAACCTGATCATTCTCTTCGTTCTCACTCATGGAACCTTCTTCGGAGATTGCATGATTGTAATGATTCAGCCTTGTTGATTGGGGGAGATATGAATGAAATTCTATGGC
AGTCTGAGAAATCGGGTGGTCCAGAAAGGGATATTGGGCAAATAATGGCTTTTCGGAATGTGTTAGATGATTGCAATCTTTGGGATATTGGATATACTCAGGATGTGTTT
ACTTGGTGTAACAGGCGAAGCACGGGGGAAAAGGTTAGTCTACGGCTAGACCGTTTGCTGGCAAATGCAAATTTCTATAGCCTTATAGAGAACTTCGAGGTAGCGAAATT
GGATTGGGCTAAATTCGATCATAGGCCGATTCAGCTCACCATTCAGGCACATGACCAGGGTCCTCGAAGGGGAAGAAATAGACATTCGTTTAAATTTGAAGAATGGTGGA
CACGTCAGGAGGAATGTCGAAGGATTATCCGGCAAAGTAGTGATTGGGGCAGTAATCAAATTGTGCCTCATCAACTTCATCATGCATTATCCCAATGTGCGTCGGCTTTG
AAGGGTTGGGGATATGATTTCTCGCTAATTCATGCTTTGGAGAAACAACTTGATAAGTTACTGTTGGAGGAGGAAATGTATTGGAAACAAAGATCAAGAGAATGTTGGCT
GAAATGGGGTGATCGTAATACGAGATGGTTCCACCAAAAGGCCACAATGAGGAAAAAAAGGAATTGGATTGATGGCGTTGAAGACTCGAATGGGAATTGGCAATCCAAAA
CGGAGAACATACAGAAGGCGTTCGAGAATTATTTCAAAGAAATTTTTTCATCATCCCAGCCTGATCAGATGGACATGACTAGAGTGCTCAACCATGTTCCACGGAAGGAG
GTCACGGCTGCCATTAAGGATTTCCATCCAACCAAAGCTCCTGGACCTGATGGCTTCCCTGCAATTTTCTATCAGAAATATTGGGATGTAGTAGGTGACACTACTGTTTC
GAATTGTTTGGAGATTTTGAATTCAAATGGAAGGATTCATGAGTTGAATCATACCAATATTGTTCTAATACCAAAGAGTCGTCAGGCGAGGTTAGTATCCGATTATCGGC
CTATTAGCTTGTGTAATGTTTCTTATAAGATTATTACTAAGGTATTGGCTAATAGACTTAAGATTGTTTTGAATGAGATTATTGATGAATGTCAATCAGCGTTTCTTCCG
GGTAGATCAATATCTGATAACATGATTTTGGGTGGAGTAAAGCATATGATAGAGTGGAGTGGTTCTGGAATATCCATTGCCAGATCCTGCCAAAAAATCTCCCACTTATT
TTTTGCGGATGACAGTCTGATCTTTCTCAAAGCTGCGACTAAGGAATTTGGTCATTTTCGGAATGTATTATTGGACTTTGAAAAGGCGTCCAGTCAATCTGTCAACTATA
CAAAATCAGTGGTGATATTTTCCAAAAATGTGGCCAATGATACAAGGCAATATCTGGGTCGCATTTTGTCTATGAGAGTGACTGAATCATTGGGAGCTTACCTTGGTTTA
CCATCAACCTTTCATAGAGGTAAATCTCGAGATTTCAAATATCTGCTCGATAAAGTTTGGTCTGTTCTGCAAGGGTGGAATAGTCAATTTTTTTCCCAAGGCGGAAAGGA
GGTGCTTCTCAAGAGTATAGTTCAGGCTATTCCAACATATGCCATGGGGTGTTTCCGAATTCCAAAAGGCATTTTGCTTAAGATCTCCGCCCTATGTGCAAGGTTTTGGT
GGGGCTCAGGTGGAGACCAACGTCGGATTCATTGGCAGCGATGGGAGCATCTGTGCAAGCCAAAAGAAGTTGGAGGTCTAAACTTTAGAGATCTTGTCAACTTCAATCAG
GCAATGCTTGCAAAACAAGTATGGAGAGTGTTATCTAATCCGAACCTTACTGTTTCTAGGTACTTGGGTCCAATCGATGTGGACGTTATTCAAGGCTTATCGATCTGTGG
TTCGACGCCTGATAAATGGATATGGCATTATGATAGAAAAGGGGAGTATTCTGTTAAAAATGGATACAAGCTCTCGATGCTAAAGAGCCAAGAGGTATTTCTATCAGACG
TGGAAAAAAATAATTTCTGGTGGAAGAGAGTGTGGAAGATGAGAATTCCTAGTAAAGTTAAAGTATTCATTTGGAAATCATTCCACAACTCAATCCCAACCATGGTAAAC
CTATGGAACCATCATGTGCCAGTTATGGGAACCTGTCATGTATGCCATGATGAGATGGAGACTACAGATCATGCTTTGTTTCAGTGTTCGAGGGCTCGAGAGATATGGGC
CCTTCTTCACCCGCCCAAGATGAGAGTGCTTTGGGATCCTATGGACATTAAAGACCGATGGCATGGCGTCTCTGAAGAACCCATGCAGATTTTTTAG
Protein sequenceShow/hide protein sequence
MEPFNFFSLRAFSLLLDSRTGFSAVKLLYQCRRWCVVSSSLLVVTSSFLYSCFYSLCFYLETMDPSSLVDDWSRLSLTSEEEEVLVVADREAVDRSNLSLDLCLLEKLLC
PRPLGAEAMGRNFCAAWKLDIGFQVDRLGRNLFIFRFVNEFDRLRVIRHGPWLFEKYLLILVFSIRGLKPLDHHFSSIAFWIHIFELPLDWFNHSMAERLDGPLSGLWVP
IRYERLSEICTHCGVIGHVARDCELLLRSNRRPNFPNQYGDWLRFIGKGMIVSPTLKSSSKTHGHRLSLDVAPTIIHVPPGTELPRPRYGGIQINEPGEDRVQRPRAPSS
PVSYSLADQPNRIVKGKEKVEEVADKYRNPFQWRRFAGKEKIGEAGASQWRSKSSNRGLPTVNEREVKRMPRQPGINSRPHPNSMHLMCLSFGNDPSDHAEAIQEDNNRF
SKNLLPLFQEATPFFMGHDSKLIHNEGSPKDIESDPEMLEEFGPMSPEGANRPQTVDEDDEDKSVKVLESIKDDCVETVPEMQNGTDHLMSPEEANRPQTVDEDDEDKSV
KVLESIKDDGVETVPEMQNETDHLMMDKHWRFTGFYGQPDHSLRSHSWNLLRRLHDCNDSALLIGGDMNEILWQSEKSGGPERDIGQIMAFRNVLDDCNLWDIGYTQDVF
TWCNRRSTGEKVSLRLDRLLANANFYSLIENFEVAKLDWAKFDHRPIQLTIQAHDQGPRRGRNRHSFKFEEWWTRQEECRRIIRQSSDWGSNQIVPHQLHHALSQCASAL
KGWGYDFSLIHALEKQLDKLLLEEEMYWKQRSRECWLKWGDRNTRWFHQKATMRKKRNWIDGVEDSNGNWQSKTENIQKAFENYFKEIFSSSQPDQMDMTRVLNHVPRKE
VTAAIKDFHPTKAPGPDGFPAIFYQKYWDVVGDTTVSNCLEILNSNGRIHELNHTNIVLIPKSRQARLVSDYRPISLCNVSYKIITKVLANRLKIVLNEIIDECQSAFLP
GRSISDNMILGGVKHMIEWSGSGISIARSCQKISHLFFADDSLIFLKAATKEFGHFRNVLLDFEKASSQSVNYTKSVVIFSKNVANDTRQYLGRILSMRVTESLGAYLGL
PSTFHRGKSRDFKYLLDKVWSVLQGWNSQFFSQGGKEVLLKSIVQAIPTYAMGCFRIPKGILLKISALCARFWWGSGGDQRRIHWQRWEHLCKPKEVGGLNFRDLVNFNQ
AMLAKQVWRVLSNPNLTVSRYLGPIDVDVIQGLSICGSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVN
LWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHPPKMRVLWDPMDIKDRWHGVSEEPMQIF