CuGenDBv2

Lag0001161 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001161
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr4:25785189..25789503
RNA-Seq Expression: Lag0001161
Synteny: Lag0001161
Gene Ontology terms: NA
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity
RVW64278.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 1.6e-101 | 30.79
Query:  PRPQFGMYHPQLFSFVQPSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR-------
        P  Q    HP++      ++ F     P++ QA +V+L  SNYL W+ Q+LN++ A GL+  I+G IP P ++L +  + +NPE+SIWQ+ NR       
Subjt:  PRPQFGMYHPQLFSFVQPSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR-------

Query:  SSEVSALAIKKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEG---LGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQ
        SS    +  +  GL     +    +     +S    + L+  +    +G   LG+EYN+FV ++    E   +E++ ++LL++E RLE+Q + ++ N +Q
Subjt:  SSEVSALAIKKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEG---LGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQ

Query:  ANFARLQVGSSNNLRQGS-RFPHQSANRDRQTQQVQSAGILGKPSSSPSN------RWPSRTSPNNPP---------------RILCQICNKYGHTAFVC
        AN   + +   N   Q S +F  Q      Q Q        G+     +N        P R S NN                 +  CQ+C KYGH A  C
Subjt:  ANFARLQVGSSNNLRQGS-RFPHQSANRDRQTQQVQSAGILGKPSSSPSN------RWPSRTSPNNPP---------------RILCQICNKYGHTAFVC

Query:  HHRANLQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLL
        +HR +  +QP+ N + +                       + T S + D++WYMD+GATHH+TP+LN L++  P+ G +KV+VGNG  L IS+IG S + 
Subjt:  HHRANLQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLL

Query:  SMSKFPIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSF----------SNMSARSSLP------RAAV
        S S+  ++L ++LH P ++  LIS+++LC DN  +VEF+ + F+VKD  +K+ LL+GNL  GLYKLSSS           + ++ R+SL        + +
Subjt:  SMSKFPIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSF----------SNMSARSSLP------RAAV

Query:  LLSKTTDVWHSRLGHPSSLVLKQILEFY----------------------------------YSSVHA---------------------------TYAYL
         LS   D+WH RLGHP+  ++ ++L                                     ++ VH+                           ++ YL
Subjt:  LLSKTTDVWHSRLGHPSSLVLKQILEFY----------------------------------YSSVHA---------------------------TYAYL

Query:  -----TCISNYRQFYSYI--------------------------------------SLSPQPYRFTRN---------LPFPFLASLSSSPS---------
               IS + QF   I                                      S S Q  R  R           P   L   SSSPS         
Subjt:  -----TCISNYRQFYSYI--------------------------------------SLSPQPYRFTRN---------LPFPFLASLSSSPS---------

Query:  ----------------PLSHHRPPPPTNTHPMLTRSKQGIFQPKVWTTLT-SLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP-----
                        P S H P P    HPM TR+K GIF+PK++ + T ++   + EP S+K A+  P+W+ AM  E  AL  N TW LVPPP     
Subjt:  ----------------PLSHHRPPPPTNTHPMLTRSKQGIFQPKVWTTLT-SLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP-----

Query:  VGCR----------------------------PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSS
        +GCR                            P  D    F    + +   +  PPGFVD +   +VC L KA+YGLKQ+PRAWF KLSS LV WGFS S
Subjt:  VGCR----------------------------PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSS

Query:  KSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDLGKLS
        ++D+S+F +  +  +L++LVYVDDII TG++  L+  LIS LNS F LKDLG L+
Subjt:  KSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDLGKLS

RVW64436.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 1.5e-102 | 31.59
Query:  NITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------SSEVSALAI-------KKD
        ++  AL +KL  SNY+ WK Q+ NVVYA G + +I G+   PP+ L      LNP+F  W++ +R                SS+   + +       KK 
Subjt:  NITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------SSEVSALAI-------KKD

Query:  GLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQANFARLQVGSSNNLR
        G ++  Y+ K+K ++D L+++GEPV  +DHI  +L GLG +YN+ V S+  R +   +  V ++LL++E RL  Q S    +    +FA   + S  + R
Subjt:  GLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQANFARLQVGSSNNLR

Query:  QGSRFPHQSANRDRQTQ-QVQSAGILGKPSS-----SPSNRWPSRTSPNNP----PRILCQICNKYGHTAFVCHHRANLQFQPSFNQNPSPQALFSQFSA
        Q +R PHQ  + +  ++ Q Q +    +P +      P N  P  ++ N P     R  CQ+C K+GHTA  C+HR ++ +Q + N  P  QA FS    
Subjt:  QGSRFPHQSANRDRQTQ-QVQSAGILGKPSS-----SPSNRWPSRTSPNNP----PRILCQICNKYGHTAFVCHHRANLQFQPSFNQNPSPQALFSQFSA

Query:  PNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKFPIHLSHVLHTPSISHKLISISK
                     +  ++    D+W+ D+GATHH++    +LS   PY+G ++V +G+G +LPI + G+      SK    L+ VLH P +S  LIS+SK
Subjt:  PNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKFPIHLSHVLHTPSISHKLISISK

Query:  LCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSKTTD--VWHSRLGHPSSLVLKQILEFYYSSVHA-------
         C DN    EF+ S+F VKD +TK++LL+G L  GLY+ SS        SS PRA V     +D  +WHSRLGHP+  +L + L  ++ S +        
Subjt:  LCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSKTTD--VWHSRLGHPSSLVLKQILEFYYSSVHA-------

Query:  ------------------------TYAYLT--------------------------------------CISNYRQFYS----------------------
                                TYA+ T                                      C  + R +                        
Subjt:  ------------------------TYAYLT--------------------------------------CISNYRQFYS----------------------

Query:  ----------YIS------------------LSPQPYR-------------------------FTRN-----LPFPFLASLSSSPSPLSHHRPPPPTNTH
                  YIS                   SP P+                           T N     +P PF  S  ++PSP     PP P NTH
Subjt:  ----------YIS------------------LSPQPYR-------------------------FTRN-----LPFPFLASLSSSPSPLSHHRPPPPTNTH

Query:  PMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP-----VGCR---------------------------
        PM+TR+K GI + +     + +     EP +Y +A     W  AM  E  AL RN TWSLVPPP     VGCR                           
Subjt:  PMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP-----VGCR---------------------------

Query:  ---------------------------------PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSS
                                          QLDV NAFLNGDL EE+ M QP GFV+     YVC L+KA+YGLKQ+PRAWF KL   L+ +GF S
Subjt:  ---------------------------------PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSS

Query:  SKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDLGKLS
        S++D+S+F F  + DILI+LVYVDDI+ TG+NPTL+S  IS L+++F L+DLG LS
Subjt:  SKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDLGKLS

RVW65725.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 9.6e-102 | 32.47
Query:  QPSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNRS----------------------
        Q ST+   P Y  +   L VKL  +NY+ W++Q+ NV++A G + FI+G+   P +  D     +NP F  W++ +R+                      
Subjt:  QPSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNRS----------------------

Query:  --SEVSAL--------------------AIKKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLS
          S  +AL                    + KK  +++  Y+ KIK  AD L++IGEP+S +D +  +L GLGS+YNA VT+I  R +   +E + ++LL+
Subjt:  --SEVSAL--------------------AIKKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLS

Query:  YEYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSR----------FPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYG
        +E+RLE+QSSI+Q++   AN+A     SS+N R G R          +P+ +    R   +    G  G+ +SSPS +          P+  CQ+C K+G
Subjt:  YEYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSR----------FPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYG

Query:  HTAFVCHHRANLQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSI
        HTA +C+HR ++ FQ    Q     +L +     N  P+  +S    P      D++WY+DSGA+HH+T +L +L++  PYTG ++V +GNG +L IS+I
Subjt:  HTAFVCHHRANLQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSI

Query:  GSSYLLSMSKFPIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSK------
        GS  L S +     L  V H P IS  LIS++K C +N A +EF+ + F VKDL TK VL +G LE GLYK    FSN+   SS+  A+   S+      
Subjt:  GSSYLLSMSKFPIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSK------

Query:  -TTDVWHSRLGHPSSLVLKQIL-----------EFYYSSVHATYAY--------------------------LTCISN-----YRQFYSYISLSPQPYRF
           ++WH+RLGH S  ++ +++            F  S    T+ Y                          + C+ +     +R F S++      +RF
Subjt:  -TTDVWHSRLGHPSSLVLKQIL-----------EFYYSSVHATYAY--------------------------LTCISN-----YRQFYSYISLSPQPYRF

Query:  T---------------------------------------------------------RNLPFPF---LASLSSSPSPLSH-----------HRPPPPTN
        +                                                         R  P P+     S+SS  S +SH               P T+
Subjt:  T---------------------------------------------------------RNLPFPF---LASLSSSPSPLSH-----------HRPPPPTN

Query:  THP----------------------------MLTRSKQGIFQPKVWTTLTSLDLSVI---EPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP---
        + P                            M TRS +GI + K     T LDLS I   EP++ K+A   P+W  AME E++AL RN TW LV  P   
Subjt:  THP----------------------------MLTRSKQGIFQPKVWTTLTSLDLSVI---EPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP---

Query:  --VGCR-PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDI
          +G    QLDV+NAFLNG+L E+++M QPPG+ D+     VC L KA+YGLKQ+PRAWF +LSS L+ WGFS S++DSS+F        LI+LVYVDDI
Subjt:  --VGCR-PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDI

Query:  IATGNNPTLLSSLISRLNSQFMLKDLG
        + TG++ T +SSLI++L+S F L+  G
Subjt:  IATGNNPTLLSSLISRLNSQFMLKDLG

RVX01903.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 7.4e-102 | 29.77
Query:  PSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------------SS
        PS+   +     +  AL +KL  +NY+ W+ Q+ NVV+A G +  I G    PPQ   +   + NP+F +W++ +R                      SS
Subjt:  PSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------------SS

Query:  EVSALAI----------------------KKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSY
          +  A+                      +K  LT+ +Y+ K+K +AD L++IGEPV+ +D I  +L GLG++YN+ V S+  R +   +  V ++LL++
Subjt:  EVSALAI----------------------KKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSY

Query:  EYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRAN
        E RL  Q+S+ + N I AN A  Q    NN R        S+ ++RQ+         G  +   +N   S++S + P    CQ+C K+GHT   C+HR +
Subjt:  EYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRAN

Query:  LQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKF
        + FQ     NP+   +  Q + PN   + +  Q  + + S I D+AW+ D+GATHH++  ++ LS+  PY G +KVIVGNG +L I   G+++  S SK 
Subjt:  LQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKF

Query:  PIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSA---RSSLPRAAVL-LSKTTDVWHSRLGHPS
           L  VLH P I+  LIS+S+ C DN    EF+P  F VKD +TK++LL+G+LE GLY+  + F    A    SS  R++ L L+ TT +WHSRLGHP+
Subjt:  PIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSA---RSSLPRAAVL-LSKTTDVWHSRLGHPS

Query:  SLVLKQI--------------------------LEFYYSSVHATY-------------------------------------------------------
          +LK I                          L F  S   A++                                                       
Subjt:  SLVLKQI--------------------------LEFYYSSVHATY-------------------------------------------------------

Query:  -------AYLTCI-----SNYRQFYSYISLSPQPYRF----------------------------TRNLPFPF--------------------------L
               + + C+       ++ F SY++      +F                            T +LPF F                          +
Subjt:  -------AYLTCI-----SNYRQFYSYISLSPQPYRF----------------------------TRNLPFPF--------------------------L

Query:  ASLSSSPSPLSHHRP--------------------------PPPTNTHPMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSAL
        +SL S  +P +   P                          P PTN HPM+TR+K GI + KV+         + EPT++ +A+   +W LAME + SAL
Subjt:  ASLSSSPSPLSHHRP--------------------------PPPTNTHPMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSAL

Query:  RRNDTWSLVPPP-----VGCR------------------------------------------------------------PQLDVNNAFLNGDLTEEIH
        +RN+TW LVPPP     +GC+                                                             QLDV NAFL+GDL E + 
Subjt:  RRNDTWSLVPPP-----VGCR------------------------------------------------------------PQLDVNNAFLNGDLTEEIH

Query:  MEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDL
        M+QPPGF++S    +VC LNKA+YGLKQ+PRAW+ KL++ L+ WGF +S++DSS+F    + D+LI+L+YVDDI+  G++   +SS I+RLNS F L+DL
Subjt:  MEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDL

Query:  G
        G
Subjt:  G

RVX12711.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 1.5e-102 | 31.78
Query:  PSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------------SS
        PS+   +     +  AL +KL  +NY+ W+ Q+ NVV+A G +  I G    PPQ   +   + NP+F +W++ +R                      SS
Subjt:  PSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------------SS

Query:  EVSALAI----------------------KKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSY
          +  A+                      +K  LT+ +Y+ K+K +AD L++IGEPV+ +D I  +L GLG++YN+ V S+  R +   +  V ++LL++
Subjt:  EVSALAI----------------------KKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSY

Query:  EYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRAN
        E RL  Q+SI + N I AN A  Q    NN R        S+ ++RQ+         G  +   +N   S++S + P    CQ+C K+GHT   C+HR +
Subjt:  EYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRAN

Query:  LQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKF
        + FQ     NP+   +  Q + PN   + +  Q  + + S I D+AW+ D+GATHH++  ++ LS+  PY G +KVIVGNG +L I    +++  S SK 
Subjt:  LQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKF

Query:  PIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSA---RSSLPRAAVLLSKT-TDVW--------
           L  VLH P I+  LIS+S+ C DN    EF+P FF VKD +TK++LL+G+LE GLY+  + F    A    SS  RA+  L+    D+W        
Subjt:  PIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSA---RSSLPRAAVLLSKT-TDVW--------

Query:  ------------HSRLG-----HPSSLVLKQILEFYYSSVHATYAYLTCI-----SNYRQFYSYISLSPQPYRF--------------------------
                     SR       H     L   ++F     +   + + C+       ++ F SY++      +F                          
Subjt:  ------------HSRLG-----HPSSLVLKQILEFYYSSVHATYAYLTCI-----SNYRQFYSYISLSPQPYRF--------------------------

Query:  --TRNLP----------FPFLASLSSSPSPLSHHRP-------PPPTNTHPMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELS
          T +LP          FPF ++   S S ++   P       PP ++   M+TR+K GI + KV+         + EPT++ +A+   +W LAME E S
Subjt:  --TRNLP----------FPFLASLSSSPSPLSHHRP-------PPPTNTHPMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELS

Query:  ALRRNDTWSLVPPP-----VGCR------------------------------------------------------------PQLDVNNAFLNGDLTEE
        AL+RN+TW LVPPP     +GC+                                                             QLDV NAFL+GDL E 
Subjt:  ALRRNDTWSLVPPP-----VGCR------------------------------------------------------------PQLDVNNAFLNGDLTEE

Query:  IHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLK
        + M+QPPGF++S    +VC LNKA+YGLKQ+PRAW+ KLS+ L+ WGF +S++DSS+F    + D+LI+L+YVDDI+ TG++   +SS I+RLNS F L+
Subjt:  IHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLK

Query:  DLG
        DLG
Subjt:  DLG

TrEMBL top hits | e value | % identity
A0A438FWG3 Retrovirus-related Pol polyprotein from transposon RE1 | 8.0e-102 | 30.79
Query:  PRPQFGMYHPQLFSFVQPSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR-------
        P  Q    HP++      ++ F     P++ QA +V+L  SNYL W+ Q+LN++ A GL+  I+G IP P ++L +  + +NPE+SIWQ+ NR       
Subjt:  PRPQFGMYHPQLFSFVQPSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR-------

Query:  SSEVSALAIKKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEG---LGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQ
        SS    +  +  GL     +    +     +S    + L+  +    +G   LG+EYN+FV ++    E   +E++ ++LL++E RLE+Q + ++ N +Q
Subjt:  SSEVSALAIKKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEG---LGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQ

Query:  ANFARLQVGSSNNLRQGS-RFPHQSANRDRQTQQVQSAGILGKPSSSPSN------RWPSRTSPNNPP---------------RILCQICNKYGHTAFVC
        AN   + +   N   Q S +F  Q      Q Q        G+     +N        P R S NN                 +  CQ+C KYGH A  C
Subjt:  ANFARLQVGSSNNLRQGS-RFPHQSANRDRQTQQVQSAGILGKPSSSPSN------RWPSRTSPNNPP---------------RILCQICNKYGHTAFVC

Query:  HHRANLQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLL
        +HR +  +QP+ N + +                       + T S + D++WYMD+GATHH+TP+LN L++  P+ G +KV+VGNG  L IS+IG S + 
Subjt:  HHRANLQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLL

Query:  SMSKFPIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSF----------SNMSARSSLP------RAAV
        S S+  ++L ++LH P ++  LIS+++LC DN  +VEF+ + F+VKD  +K+ LL+GNL  GLYKLSSS           + ++ R+SL        + +
Subjt:  SMSKFPIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSF----------SNMSARSSLP------RAAV

Query:  LLSKTTDVWHSRLGHPSSLVLKQILEFY----------------------------------YSSVHA---------------------------TYAYL
         LS   D+WH RLGHP+  ++ ++L                                     ++ VH+                           ++ YL
Subjt:  LLSKTTDVWHSRLGHPSSLVLKQILEFY----------------------------------YSSVHA---------------------------TYAYL

Query:  -----TCISNYRQFYSYI--------------------------------------SLSPQPYRFTRN---------LPFPFLASLSSSPS---------
               IS + QF   I                                      S S Q  R  R           P   L   SSSPS         
Subjt:  -----TCISNYRQFYSYI--------------------------------------SLSPQPYRFTRN---------LPFPFLASLSSSPS---------

Query:  ----------------PLSHHRPPPPTNTHPMLTRSKQGIFQPKVWTTLT-SLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP-----
                        P S H P P    HPM TR+K GIF+PK++ + T ++   + EP S+K A+  P+W+ AM  E  AL  N TW LVPPP     
Subjt:  ----------------PLSHHRPPPPTNTHPMLTRSKQGIFQPKVWTTLT-SLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP-----

Query:  VGCR----------------------------PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSS
        +GCR                            P  D    F    + +   +  PPGFVD +   +VC L KA+YGLKQ+PRAWF KLSS LV WGFS S
Subjt:  VGCR----------------------------PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSS

Query:  KSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDLGKLS
        ++D+S+F +  +  +L++LVYVDDII TG++  L+  LIS LNS F LKDLG L+
Subjt:  KSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDLGKLS

A0A438FWU8 Retrovirus-related Pol polyprotein from transposon RE1 | 7.2e-103 | 31.59
Query:  NITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------SSEVSALAI-------KKD
        ++  AL +KL  SNY+ WK Q+ NVVYA G + +I G+   PP+ L      LNP+F  W++ +R                SS+   + +       KK 
Subjt:  NITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------SSEVSALAI-------KKD

Query:  GLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQANFARLQVGSSNNLR
        G ++  Y+ K+K ++D L+++GEPV  +DHI  +L GLG +YN+ V S+  R +   +  V ++LL++E RL  Q S    +    +FA   + S  + R
Subjt:  GLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQANFARLQVGSSNNLR

Query:  QGSRFPHQSANRDRQTQ-QVQSAGILGKPSS-----SPSNRWPSRTSPNNP----PRILCQICNKYGHTAFVCHHRANLQFQPSFNQNPSPQALFSQFSA
        Q +R PHQ  + +  ++ Q Q +    +P +      P N  P  ++ N P     R  CQ+C K+GHTA  C+HR ++ +Q + N  P  QA FS    
Subjt:  QGSRFPHQSANRDRQTQ-QVQSAGILGKPSS-----SPSNRWPSRTSPNNP----PRILCQICNKYGHTAFVCHHRANLQFQPSFNQNPSPQALFSQFSA

Query:  PNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKFPIHLSHVLHTPSISHKLISISK
                     +  ++    D+W+ D+GATHH++    +LS   PY+G ++V +G+G +LPI + G+      SK    L+ VLH P +S  LIS+SK
Subjt:  PNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKFPIHLSHVLHTPSISHKLISISK

Query:  LCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSKTTD--VWHSRLGHPSSLVLKQILEFYYSSVHA-------
         C DN    EF+ S+F VKD +TK++LL+G L  GLY+ SS        SS PRA V     +D  +WHSRLGHP+  +L + L  ++ S +        
Subjt:  LCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSKTTD--VWHSRLGHPSSLVLKQILEFYYSSVHA-------

Query:  ------------------------TYAYLT--------------------------------------CISNYRQFYS----------------------
                                TYA+ T                                      C  + R +                        
Subjt:  ------------------------TYAYLT--------------------------------------CISNYRQFYS----------------------

Query:  ----------YIS------------------LSPQPYR-------------------------FTRN-----LPFPFLASLSSSPSPLSHHRPPPPTNTH
                  YIS                   SP P+                           T N     +P PF  S  ++PSP     PP P NTH
Subjt:  ----------YIS------------------LSPQPYR-------------------------FTRN-----LPFPFLASLSSSPSPLSHHRPPPPTNTH

Query:  PMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP-----VGCR---------------------------
        PM+TR+K GI + +     + +     EP +Y +A     W  AM  E  AL RN TWSLVPPP     VGCR                           
Subjt:  PMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP-----VGCR---------------------------

Query:  ---------------------------------PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSS
                                          QLDV NAFLNGDL EE+ M QP GFV+     YVC L+KA+YGLKQ+PRAWF KL   L+ +GF S
Subjt:  ---------------------------------PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSS

Query:  SKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDLGKLS
        S++D+S+F F  + DILI+LVYVDDI+ TG+NPTL+S  IS L+++F L+DLG LS
Subjt:  SKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDLGKLS

A0A438G0Q9 Retrovirus-related Pol polyprotein from transposon RE1 | 4.7e-102 | 32.47
Query:  QPSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNRS----------------------
        Q ST+   P Y  +   L VKL  +NY+ W++Q+ NV++A G + FI+G+   P +  D     +NP F  W++ +R+                      
Subjt:  QPSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNRS----------------------

Query:  --SEVSAL--------------------AIKKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLS
          S  +AL                    + KK  +++  Y+ KIK  AD L++IGEP+S +D +  +L GLGS+YNA VT+I  R +   +E + ++LL+
Subjt:  --SEVSAL--------------------AIKKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLS

Query:  YEYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSR----------FPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYG
        +E+RLE+QSSI+Q++   AN+A     SS+N R G R          +P+ +    R   +    G  G+ +SSPS +          P+  CQ+C K+G
Subjt:  YEYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSR----------FPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYG

Query:  HTAFVCHHRANLQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSI
        HTA +C+HR ++ FQ    Q     +L +     N  P+  +S    P      D++WY+DSGA+HH+T +L +L++  PYTG ++V +GNG +L IS+I
Subjt:  HTAFVCHHRANLQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSI

Query:  GSSYLLSMSKFPIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSK------
        GS  L S +     L  V H P IS  LIS++K C +N A +EF+ + F VKDL TK VL +G LE GLYK    FSN+   SS+  A+   S+      
Subjt:  GSSYLLSMSKFPIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSK------

Query:  -TTDVWHSRLGHPSSLVLKQIL-----------EFYYSSVHATYAY--------------------------LTCISN-----YRQFYSYISLSPQPYRF
           ++WH+RLGH S  ++ +++            F  S    T+ Y                          + C+ +     +R F S++      +RF
Subjt:  -TTDVWHSRLGHPSSLVLKQIL-----------EFYYSSVHATYAY--------------------------LTCISN-----YRQFYSYISLSPQPYRF

Query:  T---------------------------------------------------------RNLPFPF---LASLSSSPSPLSH-----------HRPPPPTN
        +                                                         R  P P+     S+SS  S +SH               P T+
Subjt:  T---------------------------------------------------------RNLPFPF---LASLSSSPSPLSH-----------HRPPPPTN

Query:  THP----------------------------MLTRSKQGIFQPKVWTTLTSLDLSVI---EPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP---
        + P                            M TRS +GI + K     T LDLS I   EP++ K+A   P+W  AME E++AL RN TW LV  P   
Subjt:  THP----------------------------MLTRSKQGIFQPKVWTTLTSLDLSVI---EPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPP---

Query:  --VGCR-PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDI
          +G    QLDV+NAFLNG+L E+++M QPPG+ D+     VC L KA+YGLKQ+PRAWF +LSS L+ WGFS S++DSS+F        LI+LVYVDDI
Subjt:  --VGCR-PQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDI

Query:  IATGNNPTLLSSLISRLNSQFMLKDLG
        + TG++ T +SSLI++L+S F L+  G
Subjt:  IATGNNPTLLSSLISRLNSQFMLKDLG

A0A438IYY8 Retrovirus-related Pol polyprotein from transposon RE1 | 3.6e-102 | 29.77
Query:  PSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------------SS
        PS+   +     +  AL +KL  +NY+ W+ Q+ NVV+A G +  I G    PPQ   +   + NP+F +W++ +R                      SS
Subjt:  PSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------------SS

Query:  EVSALAI----------------------KKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSY
          +  A+                      +K  LT+ +Y+ K+K +AD L++IGEPV+ +D I  +L GLG++YN+ V S+  R +   +  V ++LL++
Subjt:  EVSALAI----------------------KKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSY

Query:  EYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRAN
        E RL  Q+S+ + N I AN A  Q    NN R        S+ ++RQ+         G  +   +N   S++S + P    CQ+C K+GHT   C+HR +
Subjt:  EYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRAN

Query:  LQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKF
        + FQ     NP+   +  Q + PN   + +  Q  + + S I D+AW+ D+GATHH++  ++ LS+  PY G +KVIVGNG +L I   G+++  S SK 
Subjt:  LQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKF

Query:  PIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSA---RSSLPRAAVL-LSKTTDVWHSRLGHPS
           L  VLH P I+  LIS+S+ C DN    EF+P  F VKD +TK++LL+G+LE GLY+  + F    A    SS  R++ L L+ TT +WHSRLGHP+
Subjt:  PIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSA---RSSLPRAAVL-LSKTTDVWHSRLGHPS

Query:  SLVLKQI--------------------------LEFYYSSVHATY-------------------------------------------------------
          +LK I                          L F  S   A++                                                       
Subjt:  SLVLKQI--------------------------LEFYYSSVHATY-------------------------------------------------------

Query:  -------AYLTCI-----SNYRQFYSYISLSPQPYRF----------------------------TRNLPFPF--------------------------L
               + + C+       ++ F SY++      +F                            T +LPF F                          +
Subjt:  -------AYLTCI-----SNYRQFYSYISLSPQPYRF----------------------------TRNLPFPF--------------------------L

Query:  ASLSSSPSPLSHHRP--------------------------PPPTNTHPMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSAL
        +SL S  +P +   P                          P PTN HPM+TR+K GI + KV+         + EPT++ +A+   +W LAME + SAL
Subjt:  ASLSSSPSPLSHHRP--------------------------PPPTNTHPMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSAL

Query:  RRNDTWSLVPPP-----VGCR------------------------------------------------------------PQLDVNNAFLNGDLTEEIH
        +RN+TW LVPPP     +GC+                                                             QLDV NAFL+GDL E + 
Subjt:  RRNDTWSLVPPP-----VGCR------------------------------------------------------------PQLDVNNAFLNGDLTEEIH

Query:  MEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDL
        M+QPPGF++S    +VC LNKA+YGLKQ+PRAW+ KL++ L+ WGF +S++DSS+F    + D+LI+L+YVDDI+  G++   +SS I+RLNS F L+DL
Subjt:  MEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDL

Query:  G
        G
Subjt:  G

A0A438JUQ9 Retrovirus-related Pol polyprotein from transposon RE1 | 7.2e-103 | 31.78
Query:  PSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------------SS
        PS+   +     +  AL +KL  +NY+ W+ Q+ NVV+A G +  I G    PPQ   +   + NP+F +W++ +R                      SS
Subjt:  PSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNR----------------------SS

Query:  EVSALAI----------------------KKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSY
          +  A+                      +K  LT+ +Y+ K+K +AD L++IGEPV+ +D I  +L GLG++YN+ V S+  R +   +  V ++LL++
Subjt:  EVSALAI----------------------KKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSY

Query:  EYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRAN
        E RL  Q+SI + N I AN A  Q    NN R        S+ ++RQ+         G  +   +N   S++S + P    CQ+C K+GHT   C+HR +
Subjt:  EYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRAN

Query:  LQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKF
        + FQ     NP+   +  Q + PN   + +  Q  + + S I D+AW+ D+GATHH++  ++ LS+  PY G +KVIVGNG +L I    +++  S SK 
Subjt:  LQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKF

Query:  PIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSA---RSSLPRAAVLLSKT-TDVW--------
           L  VLH P I+  LIS+S+ C DN    EF+P FF VKD +TK++LL+G+LE GLY+  + F    A    SS  RA+  L+    D+W        
Subjt:  PIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSA---RSSLPRAAVLLSKT-TDVW--------

Query:  ------------HSRLG-----HPSSLVLKQILEFYYSSVHATYAYLTCI-----SNYRQFYSYISLSPQPYRF--------------------------
                     SR       H     L   ++F     +   + + C+       ++ F SY++      +F                          
Subjt:  ------------HSRLG-----HPSSLVLKQILEFYYSSVHATYAYLTCI-----SNYRQFYSYISLSPQPYRF--------------------------

Query:  --TRNLP----------FPFLASLSSSPSPLSHHRP-------PPPTNTHPMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELS
          T +LP          FPF ++   S S ++   P       PP ++   M+TR+K GI + KV+         + EPT++ +A+   +W LAME E S
Subjt:  --TRNLP----------FPFLASLSSSPSPLSHHRP-------PPPTNTHPMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELS

Query:  ALRRNDTWSLVPPP-----VGCR------------------------------------------------------------PQLDVNNAFLNGDLTEE
        AL+RN+TW LVPPP     +GC+                                                             QLDV NAFL+GDL E 
Subjt:  ALRRNDTWSLVPPP-----VGCR------------------------------------------------------------PQLDVNNAFLNGDLTEE

Query:  IHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLK
        + M+QPPGF++S    +VC LNKA+YGLKQ+PRAW+ KLS+ L+ WGF +S++DSS+F    + D+LI+L+YVDDI+ TG++   +SS I+RLNS F L+
Subjt:  IHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLK

Query:  DLG
        DLG
Subjt:  DLG

SwissProt top hits | e value | %identity | Alignment
P04146 Copia protein | 2.4e-15 | 39.02
Query:  QLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRS--DDILIILVYVDDIIATGNN
        Q+DV  AFLNG L EEI+M  P G   S NS  VC LNKAIYGLKQ+ R WF      L    F +S  D  I+   +   ++ + +L+YVDD++    +
Subjt:  QLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRS--DDILIILVYVDDIIATGNN

Query:  PTLLSSLISRLNSQFMLKDLGKL
         T +++    L  +F + DL ++
Subjt:  PTLLSSLISRLNSQFMLKDLGKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 4.2e-23 | 45.83
Query:  QLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSD-DILIILVYVDDIIATGNNP
        QLDV  AFL+GDL EEI+MEQP GF  +     VC LNK++YGLKQ+PR W+MK  S + S  +  + SD  +++ R S+ + +I+L+YVDD++  G + 
Subjt:  QLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSD-DILIILVYVDDIIATGNNP

Query:  TLLSSLISRLNSQFMLKDLG
         L++ L   L+  F +KDLG
Subjt:  TLLSSLISRLNSQFMLKDLG

P25600 Putative transposon Ty5-1 protein YCL074W | 9.9e-17 | 35.83
Query:  LDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTL
        +DV+ AFLN  + E I+++QPPGFV+  N  YV  L   +YGLKQ+P  W   +++ L   GF   + +  +++   SD  + I VYVDD++    +P +
Subjt:  LDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTL

Query:  LSSLISRLNSQFMLKDLGKL
           +   L   + +KDLGK+
Subjt:  LSSLISRLNSQFMLKDLGKL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.5e-41 | 26.63
Query:  NITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYL-DNLQQQLNPEFSIWQKCNR---SSEVSALAIK---------------------
        N+  +   KLT +NYL W  Q+  +   Y L GF++GS   PP  +  +   ++NP+++ W++ ++   S+ + A+++                      
Subjt:  NITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYL-DNLQQQLNPEFSIWQKCNR---SSEVSALAIK---------------------

Query:  --------------------KDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSI
                            K   T+  Y+  +    D+L+ +G+P+   + +  +LE L  EY   +  I  +   P + ++   LL++E ++   SS 
Subjt:  --------------------KDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSI

Query:  DQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRANLQFQPSFNQN
          +       +     ++NN   G+R  ++  NR+             KP    S  +    + + P    CQIC   GH+A  C    +     +  Q 
Subjt:  DQLNTIQANFARLQVGSSNNLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRANLQFQPSFNQN

Query:  PSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKFPIHLSHVLHT
        PSP      F+   P       +  L   S    + W +DSGATHH+T D N+LS   PYTG + V+V +G  +PIS  GS+ L + S+ P++L ++L+ 
Subjt:  PSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKFPIHLSHVLHT

Query:  PSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSKTTDVWHSRLGHPSSLVLKQILEFYYSS
        P+I   LIS+ +LC  N  SVEF+P+ F VKDL T   LL+G  +  LY+   + S   +  + P +      T   WH+RLGHP+  +L  ++  Y  S
Subjt:  PSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSKTTDVWHSRLGHPSSLVLKQILEFYYSS

Query:  V-HATYAYLTCISNYRQFYSYISLSPQPYRFTRNLPFPFLASLSSSPSPLSH
        V + ++ +L+C        + +  S      TR L + + + + SSP  LSH
Subjt:  V-HATYAYLTCISNYRQFYSYISLSPQPYRFTRNLPFPFLASLSSSPSPLSH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.1e-31 | 53.85
Query:  QLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPT
        QLDVNNAFL G LT++++M QPPGF+D     YVC L KA+YGLKQ+PRAW+++L + L++ GF +S SD+S+F  +R   I+ +LVYVDDI+ TGN+PT
Subjt:  QLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPT

Query:  LLSSLISRLNSQFMLKD
        LL + +  L+ +F +KD
Subjt:  LLSSLISRLNSQFMLKD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 4.9e-40 | 28.54
Query:  NITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYL-DNLQQQLNPEFSIWQKCNR---SSEVSALAIK-----KDGLTVTQYLAKIKDV
        N+  +   KLT +NYL W  Q+  +   Y L GF++GS P PP  +  +   ++NP+++ W++ ++   S+ + A+++          T  Q    ++ +
Subjt:  NITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYL-DNLQQQLNPEFSIWQKCNR---SSEVSALAIK-----KDGLTVTQYLAKIKDV

Query:  -----------------ADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQANFARLQVGSSN
                          D+L+ +G+P+   + +  +LE L  +Y   +  I  +   P + ++   L++ E +L   +S + +  I AN    +  ++ 
Subjt:  -----------------ADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQANFARLQVGSSN

Query:  NLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRIL--CQICNKYGHTAFVCHHRANLQFQPSFNQNPSPQALFSQFSAPNPSP
        N  Q +R  +++ N +             +PSSS      SR+    P   L  CQIC+  GH+A  C      QFQ + NQ  S     S F+   P  
Subjt:  NLRQGSRFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRIL--CQICNKYGHTAFVCHHRANLQFQPSFNQNPSPQALFSQFSAPNPSP

Query:  SSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKFPIHLSHVLHTPSISHKLISISKLCYDN
             +  L  +S    + W +DSGATHH+T D N+LS   PYTG + V++ +G  +PI+  GS+ L + S+  + L+ VL+ P+I   LIS+ +LC  N
Subjt:  SSDSSQPTLPTSSFIPDDAWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKFPIHLSHVLHTPSISHKLISISKLCYDN

Query:  KASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSKTTDVWHSRLGHPSSLVLKQILEFYYSSV-HATYAYLTC
        + SVEF+P+ F VKDL T   LL+G  +  LY+   + S   +  + P +      T   WHSRLGHPS  +L  ++  +   V + ++  L+C
Subjt:  KASVEFYPSFFLVKDLITKRVLLRGNLEGGLYKLSSSFSNMSARSSLPRAAVLLSKTTDVWHSRLGHPSSLVLKQILEFYYSSV-HATYAYLTC

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 6.0e-30 | 53.85
Query:  QLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPT
        QLDVNNAFL G LT+E++M QPPGFVD     YVC L KAIYGLKQ+PRAW+++L + L++ GF +S SD+S+F  +R   I+ +LVYVDDI+ TGN+  
Subjt:  QLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPT

Query:  LLSSLISRLNSQFMLKD
        LL   +  L+ +F +K+
Subjt:  LLSSLISRLNSQFMLKD

Arabidopsis top hits | e value | %identity | Alignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 8.1e-06 | 31.51
Query:  LTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLER
        + V  Y  K+K +AD L ++  PV+ ++ + Y+L GL  +++  +  I+HR   P  +D   +L   E RL+R
Subjt:  LTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLER

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 2.4e-26 | 50.4
Query:  QLDVNNAFLNGDLTEEIHMEQPPGFV----DSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATG
        QLD++NAFLNGDL EEI+M+ PPG+     DS     VC+L K+IYGLKQ+ R WF+K S  L+ +GF  S SD + F    +   L +LVYVDDII   
Subjt:  QLDVNNAFLNGDLTEEIHMEQPPGFV----DSANSRYVCHLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATG

Query:  NNPTLLSSLISRLNSQFMLKDLGKL
        NN   +  L S+L S F L+DLG L
Subjt:  NNPTLLSSLISRLNSQFMLKDLGKL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 6.6e-08 | 50
Query:  MLTRSKQGI--FQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPPV-----GCR
        MLTRSK GI    PK   T+T+      EP S   AL  P W  AM+ EL AL RN TW LVPPPV     GC+
Subjt:  MLTRSKQGI--FQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPPV-----GCR


Sequences
CDS sequence
ATGCCATATCTGAACACCCGTTGCAGCCGTGGATCTCGGTTCAGACTATTCCTCGCCGTTGTGGGTCTTTTTTTACGTCGTCGTGAGTCTGTTTTTTGCGCCATCGTGGC
TCTGTTTTTTGCTCCGGCGTGGGTCTTGGGTTTTGCACCCTTTTATGGTATCAGAGAACTCAGGTTTATTCCTCCTCGAATGGCACAATTTCCCTCATCGTCAACCTCTG
GTCTTGGATCATCTTCCAACCAGTCAACAAATGCAAATGTTTCTACCACCCAAAACCCACCCCTTCATTCTCCTCAAACACCTTCACCAAATGGGTTACAGCCGCCGCCA
CCACCCTTCCAACCTACACTTCCTCCTTTCTCATCACCTCGGCCCCAGTTTGGGATGTATCATCCTCAGCTGTTTTCATTTGTTCAACCCTCTACCATTTTTCCTTCTCC
AATGTATCCAAATATCACTCAAGCTCTATCTGTTAAACTCACCGATTCAAACTATCTACCGTGGAAAAATCAACTTCTCAATGTTGTTTATGCTTATGGCCTTAAAGGAT
TCATCAATGGATCCATTCCTCCTCCTCCTCAGTATCTTGATAATCTTCAACAACAACTTAATCCTGAGTTTTCCATTTGGCAAAAATGCAACAGGTCATCTGAAGTCTCA
GCTTTAGCTATCAAGAAAGACGGTCTCACTGTAACCCAATACCTTGCAAAAATAAAAGATGTGGCCGACAAATTATCTTCTATTGGTGAACCGGTCTCGCTCAAGGATCA
CATATCCTATATATTAGAGGGTCTTGGCTCTGAGTATAATGCCTTTGTGACCTCAATCCAACACCGTGTTGAGTTACCACCGATTGAAGATGTTCGTGCCCTCTTGTTGA
GCTATGAATACCGTCTTGAGCGTCAGTCTTCTATAGACCAACTCAATACTATTCAAGCCAACTTTGCCAGGCTTCAAGTAGGCTCCTCCAACAATCTAAGGCAGGGCTCT
CGCTTTCCTCATCAATCTGCCAATCGTGATCGCCAAACCCAGCAAGTGCAGTCTGCAGGTATTCTAGGTAAGCCTTCTTCTTCCCCCTCTAACCGTTGGCCCTCTAGAAC
TTCTCCCAACAACCCACCCCGTATTCTATGCCAAATCTGTAACAAATATGGGCACACTGCCTTTGTTTGTCACCACCGAGCTAATTTACAATTTCAGCCATCTTTCAACC
AAAATCCTAGTCCTCAAGCCCTTTTTAGCCAATTTTCGGCACCTAATCCATCACCTTCTTCAGACTCATCCCAACCTACTTTGCCAACTTCATCTTTTATTCCAGATGAT
GCTTGGTATATGGACTCCGGAGCTACACATCATGTTACACCAGATTTGAACTCTTTATCTAACCCCTTACCTTACACTGGTATTGAGAAAGTGATTGTTGGCAATGGTAT
GAATCTTCCCATTTCTTCTATTGGTTCCTCTTACTTGTTATCCATGTCTAAGTTTCCTATCCATTTATCTCATGTTTTACACACTCCATCCATTTCTCACAAACTAATTA
GTATTTCAAAGTTATGTTATGATAATAAAGCCTCTGTTGAATTTTATCCCTCATTCTTTCTTGTGAAGGATCTCATTACCAAGAGGGTTCTACTTCGGGGCAATCTTGAA
GGGGGGCTTTACAAGCTTAGTTCTTCATTTTCCAACATGTCGGCTCGCTCATCTTTGCCTCGTGCTGCTGTCCTTTTGTCAAAGACCACTGATGTGTGGCATTCGCGTTT
GGGTCACCCATCGTCTCTAGTTCTCAAACAAATTTTGGAGTTTTACTATTCCTCTGTCCACGCCACCTATGCCTACCTCACCTGTATCTCCAACTATAGACAGTTCTACT
CCTACATCTCCCTCTCCCCTCAACCCTATCGTTTCACCCGCAACCTCCCCTTCCCCTTCCTTGCTTCTTTATCATCATCTCCCTCTCCTTTATCTCATCATCGGCCTCCC
CCTCCAACAAACACCCATCCCATGTTGACTCGTTCTAAACAAGGCATCTTTCAACCAAAGGTGTGGACCACTCTTACTTCCTTAGACCTTTCTGTCATTGAGCCTACCTC
CTACAAACGTGCTCTTACCTGTCCTGATTGGCGATTGGCTATGGAAAGTGAGCTCTCAGCTTTACGTAGAAATGATACATGGAGCCTCGTTCCACCTCCAGTTGGTTGTC
GTCCGCAGCTTGATGTCAACAATGCGTTTCTAAATGGAGACCTTACCGAGGAGATCCATATGGAACAGCCACCGGGCTTTGTAGATTCTGCCAATTCGAGGTATGTTTGT
CACTTGAACAAGGCTATTTATGGTCTGAAACAGAGTCCAAGGGCTTGGTTTATGAAATTGAGTTCTTGTTTAGTCAGTTGGGGTTTCTCTTCATCAAAGTCTGATTCGTC
CATATTTTACTTTCGACGCTCTGATGACATTTTGATCATATTAGTGTATGTTGATGATATCATTGCCACGGGCAACAACCCAACTCTCTTGTCTAGTCTCATCTCACGAT
TAAATAGTCAGTTTATGCTTAAAGATTTGGGGAAATTGTCCTAA
mRNA sequence
ATGCCATATCTGAACACCCGTTGCAGCCGTGGATCTCGGTTCAGACTATTCCTCGCCGTTGTGGGTCTTTTTTTACGTCGTCGTGAGTCTGTTTTTTGCGCCATCGTGGC
TCTGTTTTTTGCTCCGGCGTGGGTCTTGGGTTTTGCACCCTTTTATGGTATCAGAGAACTCAGGTTTATTCCTCCTCGAATGGCACAATTTCCCTCATCGTCAACCTCTG
GTCTTGGATCATCTTCCAACCAGTCAACAAATGCAAATGTTTCTACCACCCAAAACCCACCCCTTCATTCTCCTCAAACACCTTCACCAAATGGGTTACAGCCGCCGCCA
CCACCCTTCCAACCTACACTTCCTCCTTTCTCATCACCTCGGCCCCAGTTTGGGATGTATCATCCTCAGCTGTTTTCATTTGTTCAACCCTCTACCATTTTTCCTTCTCC
AATGTATCCAAATATCACTCAAGCTCTATCTGTTAAACTCACCGATTCAAACTATCTACCGTGGAAAAATCAACTTCTCAATGTTGTTTATGCTTATGGCCTTAAAGGAT
TCATCAATGGATCCATTCCTCCTCCTCCTCAGTATCTTGATAATCTTCAACAACAACTTAATCCTGAGTTTTCCATTTGGCAAAAATGCAACAGGTCATCTGAAGTCTCA
GCTTTAGCTATCAAGAAAGACGGTCTCACTGTAACCCAATACCTTGCAAAAATAAAAGATGTGGCCGACAAATTATCTTCTATTGGTGAACCGGTCTCGCTCAAGGATCA
CATATCCTATATATTAGAGGGTCTTGGCTCTGAGTATAATGCCTTTGTGACCTCAATCCAACACCGTGTTGAGTTACCACCGATTGAAGATGTTCGTGCCCTCTTGTTGA
GCTATGAATACCGTCTTGAGCGTCAGTCTTCTATAGACCAACTCAATACTATTCAAGCCAACTTTGCCAGGCTTCAAGTAGGCTCCTCCAACAATCTAAGGCAGGGCTCT
CGCTTTCCTCATCAATCTGCCAATCGTGATCGCCAAACCCAGCAAGTGCAGTCTGCAGGTATTCTAGGTAAGCCTTCTTCTTCCCCCTCTAACCGTTGGCCCTCTAGAAC
TTCTCCCAACAACCCACCCCGTATTCTATGCCAAATCTGTAACAAATATGGGCACACTGCCTTTGTTTGTCACCACCGAGCTAATTTACAATTTCAGCCATCTTTCAACC
AAAATCCTAGTCCTCAAGCCCTTTTTAGCCAATTTTCGGCACCTAATCCATCACCTTCTTCAGACTCATCCCAACCTACTTTGCCAACTTCATCTTTTATTCCAGATGAT
GCTTGGTATATGGACTCCGGAGCTACACATCATGTTACACCAGATTTGAACTCTTTATCTAACCCCTTACCTTACACTGGTATTGAGAAAGTGATTGTTGGCAATGGTAT
GAATCTTCCCATTTCTTCTATTGGTTCCTCTTACTTGTTATCCATGTCTAAGTTTCCTATCCATTTATCTCATGTTTTACACACTCCATCCATTTCTCACAAACTAATTA
GTATTTCAAAGTTATGTTATGATAATAAAGCCTCTGTTGAATTTTATCCCTCATTCTTTCTTGTGAAGGATCTCATTACCAAGAGGGTTCTACTTCGGGGCAATCTTGAA
GGGGGGCTTTACAAGCTTAGTTCTTCATTTTCCAACATGTCGGCTCGCTCATCTTTGCCTCGTGCTGCTGTCCTTTTGTCAAAGACCACTGATGTGTGGCATTCGCGTTT
GGGTCACCCATCGTCTCTAGTTCTCAAACAAATTTTGGAGTTTTACTATTCCTCTGTCCACGCCACCTATGCCTACCTCACCTGTATCTCCAACTATAGACAGTTCTACT
CCTACATCTCCCTCTCCCCTCAACCCTATCGTTTCACCCGCAACCTCCCCTTCCCCTTCCTTGCTTCTTTATCATCATCTCCCTCTCCTTTATCTCATCATCGGCCTCCC
CCTCCAACAAACACCCATCCCATGTTGACTCGTTCTAAACAAGGCATCTTTCAACCAAAGGTGTGGACCACTCTTACTTCCTTAGACCTTTCTGTCATTGAGCCTACCTC
CTACAAACGTGCTCTTACCTGTCCTGATTGGCGATTGGCTATGGAAAGTGAGCTCTCAGCTTTACGTAGAAATGATACATGGAGCCTCGTTCCACCTCCAGTTGGTTGTC
GTCCGCAGCTTGATGTCAACAATGCGTTTCTAAATGGAGACCTTACCGAGGAGATCCATATGGAACAGCCACCGGGCTTTGTAGATTCTGCCAATTCGAGGTATGTTTGT
CACTTGAACAAGGCTATTTATGGTCTGAAACAGAGTCCAAGGGCTTGGTTTATGAAATTGAGTTCTTGTTTAGTCAGTTGGGGTTTCTCTTCATCAAAGTCTGATTCGTC
CATATTTTACTTTCGACGCTCTGATGACATTTTGATCATATTAGTGTATGTTGATGATATCATTGCCACGGGCAACAACCCAACTCTCTTGTCTAGTCTCATCTCACGAT
TAAATAGTCAGTTTATGCTTAAAGATTTGGGGAAATTGTCCTAA
Protein sequence
MPYLNTRCSRGSRFRLFLAVVGLFLRRRESVFCAIVALFFAPAWVLGFAPFYGIRELRFIPPRMAQFPSSSTSGLGSSSNQSTNANVSTTQNPPLHSPQTPSPNGLQPPP
PPFQPTLPPFSSPRPQFGMYHPQLFSFVQPSTIFPSPMYPNITQALSVKLTDSNYLPWKNQLLNVVYAYGLKGFINGSIPPPPQYLDNLQQQLNPEFSIWQKCNRSSEVS
ALAIKKDGLTVTQYLAKIKDVADKLSSIGEPVSLKDHISYILEGLGSEYNAFVTSIQHRVELPPIEDVRALLLSYEYRLERQSSIDQLNTIQANFARLQVGSSNNLRQGS
RFPHQSANRDRQTQQVQSAGILGKPSSSPSNRWPSRTSPNNPPRILCQICNKYGHTAFVCHHRANLQFQPSFNQNPSPQALFSQFSAPNPSPSSDSSQPTLPTSSFIPDD
AWYMDSGATHHVTPDLNSLSNPLPYTGIEKVIVGNGMNLPISSIGSSYLLSMSKFPIHLSHVLHTPSISHKLISISKLCYDNKASVEFYPSFFLVKDLITKRVLLRGNLE
GGLYKLSSSFSNMSARSSLPRAAVLLSKTTDVWHSRLGHPSSLVLKQILEFYYSSVHATYAYLTCISNYRQFYSYISLSPQPYRFTRNLPFPFLASLSSSPSPLSHHRPP
PPTNTHPMLTRSKQGIFQPKVWTTLTSLDLSVIEPTSYKRALTCPDWRLAMESELSALRRNDTWSLVPPPVGCRPQLDVNNAFLNGDLTEEIHMEQPPGFVDSANSRYVC
HLNKAIYGLKQSPRAWFMKLSSCLVSWGFSSSKSDSSIFYFRRSDDILIILVYVDDIIATGNNPTLLSSLISRLNSQFMLKDLGKLS