; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032249 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032249
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr11:28521589..28524998
RNA-Seq ExpressionLag0032249
SyntenyLag0032249
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.1e-7826.63Show/hide
Query:  NSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISL
        +S   N  + EW   DQ L GW+  SMT  IA  ++  +TS++     + + G+ ++++I  L+ + H+ + G MKM +YL  MK  ++ L+LAGNP+S 
Subjt:  NSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISL

Query:  GDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTN-----TRELPDLATHLVYNLPNQSSGQKSFS-----------------
         DLI   L GL+S+Y  +   + D+   +W +L + L+ FE  +   +  TN     T  + + + H   +  N   G  S                   
Subjt:  GDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTN-----TRELPDLATHLVYNLPNQSSGQKSFS-----------------

Query:  ---SNRTNLESFNNPHGANGKDNQAGNGAGSASSSAYIATPKILCDPKWLADSGATNHIPADA----------ASNLMSCSN--------------KSIK
           SN   ++ F+       + N +       S +A++A+   + D  W  DSGA+NH+                N +   N              KS+ 
Subjt:  ---SNRTNLESFNNPHGANGKDNQAGNGAGSASSSAYIATPKILCDPKWLADSGATNHIPADA----------ASNLMSCSN--------------KSIK

Query:  LDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWSANDY
        L ++ +VP+I +NL+S++ L ADN++ VEF  + C VKDK +  V+L G LKDGLYQ+        S    + S+FV V  S            W     
Subjt:  LDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWSANDY

Query:  SSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELIFCDLWGS
                                          H RLGH +   ++ V+ +C +KV  ++ FSFC+ACQ GK H LPF +S +  + PLEL+  D+WG 
Subjt:  SSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELIFCDLWGS

Query:  SSV-------------------------------------------------------------------------------------------------
        + +                                                                                                 
Subjt:  SSV-------------------------------------------------------------------------------------------------

Query:  -------------------------------PS-----------------------TLGC----------------------------SHKGYKCLSSSE
                                       PS                       T GC                            SHKGYKCL+S  
Subjt:  -------------------------------PS-----------------------TLGC----------------------------SHKGYKCLSSSE

Query:  RVYISRHVIFNENDFPFQHDFL---------------SFISATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPSKSTPTAP
        R++ISRHVIFNE+ FPF   FL               SF   T  + I    +P   + +     T D+     +N  TE +   P+ D T  +   T  
Subjt:  RVYISRHVIFNENDFPFQHDFL---------------SFISATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPSKSTPTAP

Query:  SSASSTVGFPDIGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL---------
         +   +VG              +    H + TR+K G+ +PK     L  ++ D    +P + KEAL     K+AM +E  AL  N  W L         
Subjt:  SSASSTVGFPDIGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL---------

Query:  -----VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVC
             VFK K + DGS +R KARLVAKGF +  GID+ ETF+PV+K  T+R+IL+         RQLDINN  LNGHL E VFM QP+GFVD++  +++C
Subjt:  -----VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVC

Query:  KLDKVLYGLK
        KL K +YGLK
Subjt:  KLDKVLYGLK

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]6.7e-8127.93Show/hide
Query:  SLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLG
        S ++NP + +WI  DQAL GWL  SM   IA  ++  +TS++     + + G+ +K+RI  L+ + HNT+ G MKM EYL  MK   + L+LAG+PIS  
Subjt:  SLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLG

Query:  DLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSS--GQKSFSSNRTNLESFNNPHGANGKDN-
        DL+   L GL+++Y  +   + D+   +W ++ + L+ FE  L  ++ F+       L  +   N  N++   G K  S       +F    G  GK   
Subjt:  DLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSS--GQKSFSSNRTNLESFNNPHGANGKDN-

Query:  -----QAGNGAG-------------------------SASSSAYIATPKILCDPKWLADSGATNHIPADA-------------------ASNLMSCSNKS
             Q  NG G                           S SA+IA+P    D +W  DSGA NH+                          L   ++ S
Subjt:  -----QAGNGAG-------------------------SASSSAYIATPKILCDPKWLADSGATNHIPADA-------------------ASNLMSCSNKS

Query:  IKLDNM-----FHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIV
         KL+N+      +VP I +NL+S++ LTADN++ VEF ++ C VKDK +   +L G LKDGLYQ+                                   
Subjt:  IKLDNM-----FHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIV

Query:  LWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELI
                            S   P VY    +S+  K   H +LGH +   ++ V+  CN+K+S +++FSFC+ACQ GK H LPF  S +  + PL LI
Subjt:  LWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELI

Query:  FCDLWGSSSV------------------------------------------------------------------------------------------
          D+WG + +                                                                                          
Subjt:  FCDLWGSSSV------------------------------------------------------------------------------------------

Query:  --------------------------------------PSTL-----------------------GC----------------------------SHKGY
                                              PS++                       GC                            SHKGY
Subjt:  --------------------------------------PSTL-----------------------GC----------------------------SHKGY

Query:  KCLSSSERVYISRHVIFNENDFPFQHDFLSFIS--ATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLN-------HSTEFSTVSPTLDATPS------
        KC++S  R+++SRHVIFNEN FPF   FL   +   T++D+  I     PT S+  T  T DA+  P  N       HS E S  +   +   S      
Subjt:  KCLSSSERVYISRHVIFNENDFPFQHDFLSFIS--ATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLN-------HSTEFSTVSPTLDATPS------

Query:  -KSTPTAPSSASSTVGFPD------IGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRN
          ++ T    A ++V   D       G +      D+  + H ++TR+K G+ +PK    ++  +  D    +P SVKEAL     K+AMD+E  AL  N
Subjt:  -KSTPTAPSSASSTVGFPD------IGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRN

Query:  NIWRLV--------------FKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVIL---------ARQLDINNTLLNGHLIEDVFMKQ
        + W LV              FK K ++DGS +R KARLVAKGF +  G+DF ETF+PVVK  T+R+IL          RQLDINN  LNG L E VFM Q
Subjt:  NIWRLV--------------FKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVIL---------ARQLDINNTLLNGHLIEDVFMKQ

Query:  PDGFVDASHIDYVCKLDKVLYGLK
        P+G++DA+  +++CKL K +YGLK
Subjt:  PDGFVDASHIDYVCKLDKVLYGLK

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]2.3e-8127.3Show/hide
Query:  ISNSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPI
        I N+ ++NP Y++W   DQAL GWL  SMT  IA  V+  +TS++     + + G+ +++RI  L+ + HNT    MKM +YLA MK   + L+LAG+PI
Subjt:  ISNSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPI

Query:  SLGDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNNPHGANGKDN
        S  DL+   L GL+S+Y  +   + D+   +W +  + L+ FE  L   + F N     +L     +   N+S G K  S       +     G  G+  
Subjt:  SLGDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNNPHGANGKDN

Query:  QA-------------GNGA-------------------GSASSSAYIATPKILCDPKWLADSGATNHIP----------------------ADAASNLMS
         +             G+ A                   G  S SA++A+P    D +W  DSGA+NH+                        +    L S
Subjt:  QA-------------GNGA-------------------GSASSSAYIATPKILCDPKWLADSGATNHIP----------------------ADAASNLMS

Query:  CSNK--SIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKS
         S K   + L N+ +VP I +NL+S++ LT DN+  VEF  ++C VKDK +   +L G LKDGLYQ+                      ++NK    +K 
Subjt:  CSNK--SIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKS

Query:  IVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLE
                                  P  Y      I  K + H +LGH +   +  V+   N+K+S ++KF+FC+ACQ GK H LPF TS +  + PL+
Subjt:  IVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLE

Query:  LIFCDLWGSSSV----------------------------------------------------------------------------------------
        LI  D+WG + +                                                                                        
Subjt:  LIFCDLWGSSSV----------------------------------------------------------------------------------------

Query:  -------------------------------------------------PSTL--------------GC----------------------------SHK
                                                         P TL              GC                            SHK
Subjt:  -------------------------------------------------PSTL--------------GC----------------------------SHK

Query:  GYKCLSSSERVYISRHVIFNENDFPFQHDFLSFIS--ATVSDDIIIHWLPFPTSSSSMTPPTADALPH------PYLN-------HSTEFSTVSPTLDAT
        GYKC++S  RV++SRHV+FNEN FPFQ  FL   +    V++D  I +  FP   +  T  TA+A  +      P LN        S E  T   T +  
Subjt:  GYKCLSSSERVYISRHVIFNENDFPFQHDFLSFIS--ATVSDDIIIHWLPFPTSSSSMTPPTADALPH------PYLN-------HSTEFSTVSPTLDAT

Query:  PSKSTPTAPSSASSTVGFPDIGVLVADF---PPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNI
         S       + A+      +I   + +    P   + + H ++TR+K GV++PK    ++  +       +P SV EAL       AMD E  AL  N  
Subjt:  PSKSTPTAPSSASSTVGFPDIGVLVADF---PPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNI

Query:  WRLV--------------FKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPD
        W LV              FK K +ADG+ +R KARLVA+GF +  G+D+ ETF+PVVK  T+R+IL+         RQLDINN  LNG+L E VFM QP+
Subjt:  WRLV--------------FKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPD

Query:  GFVDASHIDYVCKLDKVLYGLK
        G++D +   ++C+L+K +YGLK
Subjt:  GFVDASHIDYVCKLDKVLYGLK

RVW22017.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.1e-7827.51Show/hide
Query:  ELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDL
        E+NP +  W   D+ +  W++ S+TP I A ++   +S     ALEKI+ S S+ARI QLR +  +TK G+M M++Y+  +K A +SL   G P+S  D 
Subjt:  ELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDL

Query:  ISYVLVGLNSDYVLIFCSIE-DKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNN---------PHGA
        I  +L GL SDY  +  +I   +D  + + + S+L+ FE  L       +  +LP ++ +   +  N+  G+K     R N    N+          +G 
Subjt:  ISYVLVGLNSDYVLIFCSIE-DKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNN---------PHGA

Query:  NGKDN--------------------------------------QAGNGAGSASSSAYIATPKILCDPKWLADSGATNHIPADAAS---------------
        NG+ N                                       A N   S S SA +A+   L +  W  DSGA++H+  + A+               
Subjt:  NGKDN--------------------------------------QAGNGAGSASSSAYIATPKILCDPKWLADSGATNHIPADAAS---------------

Query:  ---NLMSCSN----------KSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSF
             ++ SN           S +L  +FHVP I  NLIS+A   +DN+  +EFHS+   VKD  ++ V+  G L++GLY+           P+ S    
Subjt:  ---NLMSCSN----------KSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSF

Query:  VDVSTSNKSYDKSKSIVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHR
          V  +N                         S + CS                + L HHRLGHA+ + +  ++  CN+      K + C +CQL KSHR
Subjt:  VDVSTSNKSYDKSKSIVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHR

Query:  LPFPTSMTQTRHPLELIFCDLWGSSSVPSTLGCS------------------------------------------------------------------
        LP   S      PLEL++ D+WG +SV ST G                                                                    
Subjt:  LPFPTSMTQTRHPLELIFCDLWGSSSVPSTLGCS------------------------------------------------------------------

Query:  -------------------------------------------------HKGYKCLSS-SERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIHWLP
                                                         HKGY CL + + RVY+S HV+F+E  FPF  +  S  S   SD+ II    
Subjt:  -------------------------------------------------HKGYKCLSS-SERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIHWLP

Query:  FPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPSKSTPTAPSSASSTVGFPDIGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFAD
         P    S  P T          H +  S VSP L      ST T P+S + T                      P    ++  V + ++  +  A+    
Subjt:  FPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPSKSTPTAPSSASSTVGFPDIGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFAD

Query:  LPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVIL
           S+P ++K+A+K  +  +AM  E+AALH+N  W L              V+K+K + DGS DR KARLVA+GF++  G+D+ ETF+PVVK  TIR++L
Subjt:  LPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVIL

Query:  A---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK
                  RQLD+ N  LNG L+E V+M QP GF+  +H + VCKL K LYGLK
Subjt:  A---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK

RVX04593.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]5.1e-8128.29Show/hide
Query:  ELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDL
        E+NP +  W   D+ +  W++ S+TP I A ++   +S     ALEKI+ S S+ARI QLR +  +TK G+M M++Y+  +K A +SL   G P+S  D 
Subjt:  ELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDL

Query:  ISYVLVGLNSDYVLIFCSIE-DKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNN---------PHGA
        I  +L GL SDY  +  +I   +D  + + + S+L+ FE  L       +  +LP ++ +   +  N+  G+K       N    N+          +G 
Subjt:  ISYVLVGLNSDYVLIFCSIE-DKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNN---------PHGA

Query:  NGKDNQA--------------------------------------GNGAGSASSSAYIATPKILCDPKWLADSGAT-------NHIPADAASNLMSCSN-
        +G+ N +                                       N   S S  A +A+   L D  W  D GA         H+           SN 
Subjt:  NGKDNQA--------------------------------------GNGAGSASSSAYIATPKILCDPKWLADSGAT-------NHIPADAASNLMSCSN-

Query:  KSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWS
         S +L  +FHVP I  NLIS+A   +DN+  +EFHS+   VKD  ++ V+  G L++GLY  + P I N        +S+V ++                
Subjt:  KSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWS

Query:  ANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELIFCD
         ND +   +++++                     + L HHRLGHA+ + +  ++  CN+      K + C +CQL KSHRLP   S      PLEL++ D
Subjt:  ANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELIFCD

Query:  LWGSSSVPSTLGCS--------------------------------------------------------------------------------------
        +WG +S+ ST G                                                                                        
Subjt:  LWGSSSVPSTLGCS--------------------------------------------------------------------------------------

Query:  ----------HKGYKCLSS-SERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPS
                  HKGY CL + + RVY+S HV+F+E  FPF  +  S  S   SD+ +I     PT   S  P T          H +  S  SP L +  +
Subjt:  ----------HKGYKCLSS-SERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPS

Query:  KSTPTAPSSASSTVGFPDIGVLVADFPPDSVVDLHP--VQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL
          TP   +  + ++  P +   V   P    V + P  V TR+  G+ + K+       + A    S+P ++K+A+K  +  +AM  E+AALH+N  W L
Subjt:  KSTPTAPSSASSTVGFPDIGVLVADFPPDSVVDLHP--VQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL

Query:  --------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFV
                      V+K+K + DGS DR KARLVA+GF++  G+D+ ETF+PVVK  TIR++L          RQLD+ N  LNG L+E V+M QP GF+
Subjt:  --------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFV

Query:  DASHIDYVCKLDKVLYGLK
          +H + VCKL K LYGLK
Subjt:  DASHIDYVCKLDKVLYGLK

TrEMBL top hitse value%identityAlignment
A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)1.1e-8127.3Show/hide
Query:  ISNSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPI
        I N+ ++NP Y++W   DQAL GWL  SMT  IA  V+  +TS++     + + G+ +++RI  L+ + HNT    MKM +YLA MK   + L+LAG+PI
Subjt:  ISNSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPI

Query:  SLGDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNNPHGANGKDN
        S  DL+   L GL+S+Y  +   + D+   +W +  + L+ FE  L   + F N     +L     +   N+S G K  S       +     G  G+  
Subjt:  SLGDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNNPHGANGKDN

Query:  QA-------------GNGA-------------------GSASSSAYIATPKILCDPKWLADSGATNHIP----------------------ADAASNLMS
         +             G+ A                   G  S SA++A+P    D +W  DSGA+NH+                        +    L S
Subjt:  QA-------------GNGA-------------------GSASSSAYIATPKILCDPKWLADSGATNHIP----------------------ADAASNLMS

Query:  CSNK--SIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKS
         S K   + L N+ +VP I +NL+S++ LT DN+  VEF  ++C VKDK +   +L G LKDGLYQ+                      ++NK    +K 
Subjt:  CSNK--SIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKS

Query:  IVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLE
                                  P  Y      I  K + H +LGH +   +  V+   N+K+S ++KF+FC+ACQ GK H LPF TS +  + PL+
Subjt:  IVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLE

Query:  LIFCDLWGSSSV----------------------------------------------------------------------------------------
        LI  D+WG + +                                                                                        
Subjt:  LIFCDLWGSSSV----------------------------------------------------------------------------------------

Query:  -------------------------------------------------PSTL--------------GC----------------------------SHK
                                                         P TL              GC                            SHK
Subjt:  -------------------------------------------------PSTL--------------GC----------------------------SHK

Query:  GYKCLSSSERVYISRHVIFNENDFPFQHDFLSFIS--ATVSDDIIIHWLPFPTSSSSMTPPTADALPH------PYLN-------HSTEFSTVSPTLDAT
        GYKC++S  RV++SRHV+FNEN FPFQ  FL   +    V++D  I +  FP   +  T  TA+A  +      P LN        S E  T   T +  
Subjt:  GYKCLSSSERVYISRHVIFNENDFPFQHDFLSFIS--ATVSDDIIIHWLPFPTSSSSMTPPTADALPH------PYLN-------HSTEFSTVSPTLDAT

Query:  PSKSTPTAPSSASSTVGFPDIGVLVADF---PPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNI
         S       + A+      +I   + +    P   + + H ++TR+K GV++PK    ++  +       +P SV EAL       AMD E  AL  N  
Subjt:  PSKSTPTAPSSASSTVGFPDIGVLVADF---PPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNI

Query:  WRLV--------------FKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPD
        W LV              FK K +ADG+ +R KARLVA+GF +  G+D+ ETF+PVVK  T+R+IL+         RQLDINN  LNG+L E VFM QP+
Subjt:  WRLV--------------FKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPD

Query:  GFVDASHIDYVCKLDKVLYGLK
        G++D +   ++C+L+K +YGLK
Subjt:  GFVDASHIDYVCKLDKVLYGLK

A0A438J6L1 Retrovirus-related Pol polyprotein from transposon RE12.5e-8128.29Show/hide
Query:  ELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDL
        E+NP +  W   D+ +  W++ S+TP I A ++   +S     ALEKI+ S S+ARI QLR +  +TK G+M M++Y+  +K A +SL   G P+S  D 
Subjt:  ELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDL

Query:  ISYVLVGLNSDYVLIFCSIE-DKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNN---------PHGA
        I  +L GL SDY  +  +I   +D  + + + S+L+ FE  L       +  +LP ++ +   +  N+  G+K       N    N+          +G 
Subjt:  ISYVLVGLNSDYVLIFCSIE-DKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNN---------PHGA

Query:  NGKDNQA--------------------------------------GNGAGSASSSAYIATPKILCDPKWLADSGAT-------NHIPADAASNLMSCSN-
        +G+ N +                                       N   S S  A +A+   L D  W  D GA         H+           SN 
Subjt:  NGKDNQA--------------------------------------GNGAGSASSSAYIATPKILCDPKWLADSGAT-------NHIPADAASNLMSCSN-

Query:  KSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWS
         S +L  +FHVP I  NLIS+A   +DN+  +EFHS+   VKD  ++ V+  G L++GLY  + P I N        +S+V ++                
Subjt:  KSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWS

Query:  ANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELIFCD
         ND +   +++++                     + L HHRLGHA+ + +  ++  CN+      K + C +CQL KSHRLP   S      PLEL++ D
Subjt:  ANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELIFCD

Query:  LWGSSSVPSTLGCS--------------------------------------------------------------------------------------
        +WG +S+ ST G                                                                                        
Subjt:  LWGSSSVPSTLGCS--------------------------------------------------------------------------------------

Query:  ----------HKGYKCLSS-SERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPS
                  HKGY CL + + RVY+S HV+F+E  FPF  +  S  S   SD+ +I     PT   S  P T          H +  S  SP L +  +
Subjt:  ----------HKGYKCLSS-SERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPS

Query:  KSTPTAPSSASSTVGFPDIGVLVADFPPDSVVDLHP--VQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL
          TP   +  + ++  P +   V   P    V + P  V TR+  G+ + K+       + A    S+P ++K+A+K  +  +AM  E+AALH+N  W L
Subjt:  KSTPTAPSSASSTVGFPDIGVLVADFPPDSVVDLHP--VQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL

Query:  --------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFV
                      V+K+K + DGS DR KARLVA+GF++  G+D+ ETF+PVVK  TIR++L          RQLD+ N  LNG L+E V+M QP GF+
Subjt:  --------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFV

Query:  DASHIDYVCKLDKVLYGLK
          +H + VCKL K LYGLK
Subjt:  DASHIDYVCKLDKVLYGLK

A0A803NU85 Uncharacterized protein1.8e-8428.74Show/hide
Query:  LNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDLI
        +NP YE W+  DQ L GWL                       ALE++YG+ S+A ++ +R  +  T+ GT  M +YL   ++  +SL LAG+P     L+
Subjt:  LNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDLI

Query:  SYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADY-SVFTNTRELPDLATHLVYNLPNQSSGQ-------KSFSSNRTNLESFNNPHGANGK
        S VL GL+++Y+ I   IE ++ TTWQ+L ++L+ F+G L    +V TN+R L + + +     P+ S+ Q       + +S    N  + NN     G 
Subjt:  SYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADY-SVFTNTRELPDLATHLVYNLPNQSSGQ-------KSFSSNRTNLESFNNPHGANGK

Query:  DNQAGNGAGSASS------------------------------------------SAYIATPKILCDPKWLADSGATNHIPAD-----------------
          + G G G   +                                          SA +A P+++ D  W ADSGA+NH+ +D                 
Subjt:  DNQAGNGAGSASS------------------------------------------SAYIATPKILCDPKWLADSGATNHIPAD-----------------

Query:  -----------AASNLMSCSNKSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSS
                     ++ +   N  + L NM HVPSI +NLIS++ LT+DN+V +EF SD C+VK++ +  V+L GTLKDGLYQ+      ++S   S   +
Subjt:  -----------AASNLMSCSNKSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSS

Query:  FVDVSTSNKSYDKSKSIVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQL----
        FV                                      S+P    V   S+  K + H +LGH S   +N V+   N+KV  NE  SFCDACQ     
Subjt:  FVDVSTSNKSYDKSKSIVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQL----

Query:  ----------------------------------------GKSHR-----------------LP-----------------FPTSMTQTRHPLELI----
                                                G++ R                 +P                  PT++ Q + P   +    
Subjt:  ----------------------------------------GKSHR-----------------LP-----------------FPTSMTQTRHPLELI----

Query:  ----FCDLWGSSSVP------------STLGC-------SHKGYKCLSSSERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIH----WLPFPTSSS
            F   +G +  P             ++ C       SHKGYKCLS   R+YISRHV+FNE++FPF+  FL+  +      +++     W   P S++
Subjt:  ----FCDLWGSSSVP------------STLGC-------SHKGYKCLSSSERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIH----WLPFPTSSS

Query:  SMTPPTADALPHPYLNHSTEFSTVSPTLDATPSKSTPTAPSSASSTVGFPDIGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKP
        + +                  S+   + DA P++  P A S                                 + G+F+P+    FL  +   L   +P
Subjt:  SMTPPTADALPHPYLNHSTEFSTVSPTLDATPSKSTPTAPSSASSTVGFPDIGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKP

Query:  ISVKEALKSSHGKKAMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA-----
        +SV++AL       AM  E+ AL RN  W L              V+KVK  AD SF R KARLVAKGFH+ PGIDF ETF+PV+K  T+RV+L      
Subjt:  ISVKEALKSSHGKKAMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA-----

Query:  ----RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK
            RQLDINN  LNG+L EDV+M QP GF D     YVCKL K +YGLK
Subjt:  ----RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK

A0A803PM38 Uncharacterized protein2.2e-8528.06Show/hide
Query:  ISNSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPI
        +S+  ++NP +E+WI  DQ L GWL+GSMT  IA  V+   +S     ALE+++G+ SKA++++ R  +   + G + M +YL   +   + L LAG P 
Subjt:  ISNSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPI

Query:  SLGDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQS--SGQKSFSSNRTNLESFNNPHGANGK
            L+S VL GL+ +Y+ +   IE +  TTWQ+L  +L+  +  +     F+ + +L  +  +   +L N+    G    + N  N    +N  G+N +
Subjt:  SLGDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQS--SGQKSFSSNRTNLESFNNPHGANGK

Query:  DNQAGNGAGSASSSAYIATPKILCDPKWLADSGATNHIPAD-----------------------------AASNLMSCSNKSIKLDNMFHVPSIMRNLIS
            G G  S          K         + GA+NHI ++                                +L + S   + L  + HVPSI +NL+S
Subjt:  DNQAGNGAGSASSSAYIATPKILCDPKWLADSGATNHIPAD-----------------------------AASNLMSCSNKSIKLDNMFHVPSIMRNLIS

Query:  IAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWSANDYSSSSTSLQSIYGCSKS
        I+ LT+DN+V VEF SD C VKDK +  V+L G LKDGLYQ + P                   TS  S   ++SI         S  TS   +   +  
Subjt:  IAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWSANDYSSSSTSLQSIYGCSKS

Query:  SPIVYQVTRLSILS-KSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELIFCDLWGSSSVPS----------
        S +   +    + S K   H RLGH S   +++V+   N+K + N   SFCDACQLGKSH LPF  +  +   PLEL+  D+WG S + S          
Subjt:  SPIVYQVTRLSILS-KSLLHHRLGHASENTINSVIVACNLKVSSNEKFSFCDACQLGKSHRLPFPTSMTQTRHPLELIFCDLWGSSSVPS----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------TLGCS--HKGYKCLSSSERVYISRHVIFNENDF
                                                                            LG S  HKGYKCLSS+ R+YISR VIFNE++F
Subjt:  -------------------------------------------------------------------TLGCS--HKGYKCLSSSERVYISRHVIFNENDF

Query:  PFQHDFLSFISATVSDDIIIHWLPFPTSSSSM-------------------------TPPTADALP--HPYLNHST-----EFSTVSPTLDATPSKSTPT
        PF+  FL+         +++   PF T+SS +                         TP T+  +P    +  + T     +F  +    D    +   T
Subjt:  PFQHDFLSFISATVSDDIIIHWLPFPTSSSSM-------------------------TPPTADALP--HPYLNHST-----EFSTVSPTLDATPSKSTPT

Query:  APSSASSTVGFPDIGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL-------
            +++      I    +D    +VV  HP+ TRAK G+F+PK   ++L  +      S+P S++EAL+      AM  E+ AL RN  W+L       
Subjt:  APSSASSTVGFPDIGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL-------

Query:  -------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDY
               V+K KR ADGSF R KARLVAKGF + PG+DF ETF+PV+K  T+R++L+         RQLDINN  LNGH+ ED++MKQP GF D +  ++
Subjt:  -------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDY

Query:  VCKLDKVLYGLK
        VCKL K +YGL+
Subjt:  VCKLDKVLYGLK

A0A803PYD1 Uncharacterized protein1.3e-8231.2Show/hide
Query:  GNVISNSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAG
        GNV ++SL  N  YE  I  DQ L GWL+GSMT  I + V+  +++     ALE++YG+ S+A +++L+  L  T+ G   M EYL   ++  ++L + G
Subjt:  GNVISNSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAG

Query:  NPISLGDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSF---------------SSN
        +P     L   +L GL+++Y+ I   +E +   + QEL S L+ ++  L      +   +L         N P+ +  QK F               S+N
Subjt:  NPISLGDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSF---------------SSN

Query:  RTNLESFNNPHGAN--GKDNQAGNGAGSASS---------------------------------------------SAYIATPKILCDPKWLADSGATNH
          N      P G++  G   Q G G G   +                                             S  +ATP+ L D  W ADSGATNH
Subjt:  RTNLESFNNPHGAN--GKDNQAGNGAGSASS---------------------------------------------SAYIATPKILCDPKWLADSGATNH

Query:  IPADA----------------------------ASNLMSC-SNKSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKD
        +  D+                             SN+++   +K + L ++ HVPSI +NLISI+ LT+DNDV VEF SDFC VKD+ +  V+L  TLKD
Subjt:  IPADA----------------------------ASNLMSC-SNKSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKD

Query:  GLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVAC
        GLYQ    ++ N S              +NK +  S S         S S  SL                       K   H RLGH S   +N V+   
Subjt:  GLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVAC

Query:  NLKVSSNEKFS----------------------FCDACQLGKSHRLPFPTSMTQTRHPLELI--------FCDLWGSSSVP-----------------ST
        N+K  +    S                      + DA Q         PT   + + P E++        F  ++GS+  P                   
Subjt:  NLKVSSNEKFS----------------------FCDACQLGKSHRLPFPTSMTQTRHPLELI--------FCDLWGSSSVP-----------------ST

Query:  LGCS--HKGYKCLSSSERVYISRHVIFNENDFPFQHDFLSFI--SATVSDDIIIHWLPFP-----TSSSSMTP--PTADALPHPYLNHSTEFSTVSPTLD
        LG S  HKGYKCLS   R+YISR+V FNE++FP    F +       ++ D    W   P     T SS  +P  P A + P       +  S+ S +  
Subjt:  LGCS--HKGYKCLSSSERVYISRHVIFNENDFPFQHDFLSFI--SATVSDDIIIHWLPFP-----TSSSSMTP--PTADALPHPYLNHSTEFSTVSPTLD

Query:  ATPSKSTPTAPSSASSTVGFPDIGVLVADFPP--DSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNN
           +     + S  SS +  PDI       PP   +    HP+ TRAK G+F+PK + S    S        P SV EAL+      AM +E  AL R  
Subjt:  ATPSKSTPTAPSSASSTVGFPDIGVLVADFPP--DSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNN

Query:  IWRLV--------------FKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQP
         W LV              F+ K  ADGS  R KARLVAKGFH+ PG+DF ETF+PVVK PTIR++LA         RQ++INN  LNG   EDV+M QP
Subjt:  IWRLV--------------FKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQP

Query:  DGFVDASHIDYVCKLDKVLYGLK
        +GF D    D+VCKL K +YGLK
Subjt:  DGFVDASHIDYVCKLDKVLYGLK

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.6e-1437.59Show/hide
Query:  SHGKKAMDEELAALHRNNIW--------------RLVFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDI
        S  ++A++ EL A   NN W              R VF VK    G+  R KARLVA+GF +   ID+ ETF PV +  + R IL+          Q+D+
Subjt:  SHGKKAMDEELAALHRNNIW--------------RLVFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDI

Query:  NNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK
            LNG L E+++M+ P G   + + D VCKL+K +YGLK
Subjt:  NNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-1929.01Show/hide
Query:  ERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPSKSTPTAPSSASSTVGFPDIGV
        ++V  SR V+F E++     D    +S  V + II +++  P++S++ T                  S  S T + +     P         +   D GV
Subjt:  ERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIHWLPFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPSKSTPTAPSSASSTVGFPDIGV

Query:  LVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEAL---KSSHGKKAMDEELAALHRNNIWRL--------------VFKVKR
           + P        P++ R++    + + + S      +D    +P S+KE L   + +   KAM EE+ +L +N  ++L              VFK+K+
Subjt:  LVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEAL---KSSHGKKAMDEELAALHRNNIWRL--------------VFKVKR

Query:  QADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLKC
          D    R KARLV KGF +  GIDF E F+PVVK  +IR IL+          QLD+    L+G L E+++M+QP+GF  A     VCKL+K LYGLK 
Subjt:  QADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLKC

Query:  CFQAKVMIIDFITIPEIREKTFKE
          +   M  D     +   KT+ +
Subjt:  CFQAKVMIIDFITIPEIREKTFKE

P92520 Uncharacterized mitochondrial protein AtMg008206.6e-1539.87Show/hide
Query:  TRAKIGV--FQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGF
        TR+K G+    PK      + +       +P SV  ALK     +AM EEL AL RN  W L              VFK K  +DG+ DR KARLVAKGF
Subjt:  TRAKIGV--FQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGF

Query:  HKIPGIDFHETFNPVVKDPTIRVIL--ARQLDIN---NTLLNGHLIEDVFMKQ
        H+  GI F ET++PVV+  TIR IL  A+QL++    N +   H    +F K+
Subjt:  HKIPGIDFHETFNPVVKDPTIRVIL--ARQLDIN---NTLLNGHLIEDVFMKQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-4024.5Show/hide
Query:  LNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDLI
        +NP Y  W   D+ +   + G+++ ++   V    T+ +  + L KIY + S   + QLR  L     GT  + +Y+  +    + L L G P+   + +
Subjt:  LNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIMKLALESLQLAGNPISLGDLI

Query:  SYVLVGLNSDYVLIFCSIEDKDI-TTWQELSSILVQFEGTLADYSVFTNTRELPDLATHL---------------VYNLPNQSSGQKSFSSNRTNLESFN
          VL  L  +Y  +   I  KD   T  E+   L+  E  +   S  T      +  +H                 Y+  N ++  K +  + TN    N
Subjt:  SYVLVGLNSDYVLIFCSIEDKDI-TTWQELSSILVQFEGTLADYSVFTNTRELPDLATHL---------------VYNLPNQSSGQKSFSSNRTNLESFN

Query:  N---PH-------GANG----KDNQAGNGAGSASSS------------AYIATPKILCDPKWLADSGATNHIPAD-------------------------
        N   P+       G  G    + +Q  +   S +S             A +A         WL DSGAT+HI +D                         
Subjt:  N---PH-------GANG----KDNQAGNGAGSASSS------------AYIATPKILCDPKWLADSGATNHIPAD-------------------------

Query:  ---AASNLMSCSNKSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSN
             S  +S  ++ + L N+ +VP+I +NLIS+  L   N V VEF      VKD  +   +L G  KD LY+  + S Q  S   S +S     S   
Subjt:  ---AASNLMSCSNKSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKDGLYQIELPSIQNASSPISSTSSFVDVSTSN

Query:  KSYDKSKSIVLWSANDYS-------------------------------SSSTSLQSIYGCSKSSPIV---------------------------YQVTR
        +    + SI+    ++YS                               +S+  L+ IY    SSPI+                            QV  
Subjt:  KSYDKSKSIVLWSANDYS-------------------------------SSSTSLQSIYGCSKSSPIV---------------------------YQVTR

Query:  LSILSKSLLHHRL-------------------GHASENTINSV-------------------IVACNLKVSSNEK-----FSFCDACQLGKSHRLPFPTS
          I  K+LL +R                     + S++ I+ +                   IV   L + S+       + +  A  +   +RLP P  
Subjt:  LSILSKSLLHHRL-------------------GHASENTINSV-------------------IVACNLKVSSNEK-----FSFCDACQLGKSHRLPFPTS

Query:  MTQTRHPLELIF--------CDLWGSSSVP------------STLGCSHKGYKCLSS--------SERVYISRHVIFNENDFPFQHDFLSFISATVSD--
        + Q   P + +F          ++G +  P             +  C   GY    S        + R+YISRHV F+EN FPF  ++L+ +S       
Subjt:  MTQTRHPLELIF--------CDLWGSSSVP------------STLGCSHKGYKCLSS--------SERVYISRHVIFNENDFPFQHDFLSFISATVSD--

Query:  DIIIHW------------LPFPTSSS---SMTPPTADALPH--------------------------PYLN---------------HSTE-FSTVSPTLD
        +    W            LP P+ S    + TPP++ + P                           P  N               HS++  S  +PT +
Subjt:  DIIIHW------------LPFPTSSS---SMTPPTADALPH--------------------------PYLN---------------HSTE-FSTVSPTLD

Query:  ---------ATPSKS-----TPTAPSSASSTVGFPDIGVLVADFPP---------DSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEA
                 +TP++S     +PT  +S+SST   P   +L+   PP          + ++ H + TRAK G+ +P N    LA S A    S+P +  +A
Subjt:  ---------ATPSKS-----TPTAPSSASSTVGFPDIGVLVADFPP---------DSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEA

Query:  LKSSHGKKAMDEELAALHRNNIW---------------RLVFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------R
        LK    + AM  E+ A   N+ W               R +F  K  +DGS +R KARLVAKG+++ PG+D+ ETF+PV+K  +IR++L          R
Subjt:  LKSSHGKKAMDEELAALHRNNIW---------------RLVFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------R

Query:  QLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK
        QLD+NN  L G L +DV+M QP GF+D    +YVCKL K LYGLK
Subjt:  QLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-3036.12Show/hide
Query:  PFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPSK--STPTAPSSAS-STVGFPDI--GVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFL
        P P S S  +P     LP          S +S     TPS   S P +PSS+S ST   P +     +      + V+ H + TRAK G+ +P    S+ 
Subjt:  PFPTSSSSMTPPTADALPHPYLNHSTEFSTVSPTLDATPSK--STPTAPSSAS-STVGFPDI--GVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFL

Query:  AASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIW---------------RLVFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKD
         +  A+   S+P +  +A+K    ++AM  E+ A   N+ W               R +F  K  +DGS +R KARLVAKG+++ PG+D+ ETF+PV+K 
Subjt:  AASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIW---------------RLVFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKD

Query:  PTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK
         +IR++L          RQLD+NN  L G L ++V+M QP GFVD    DYVC+L K +YGLK
Subjt:  PTIRVILA---------RQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.8e-0531.58Show/hide
Query:  WLADSGATNHIPAD----------------------------AASNLMSCSNKSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRT
        WL DSGAT+HI +D                              S  +  S++S+ L+ + +VP+I +NLIS+  L   N V VEF      VKD  +  
Subjt:  WLADSGATNHIPAD----------------------------AASNLMSCSNKSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRT

Query:  VMLCGTLKDGLYQIELPSIQNAS---SPISSTS
         +L G  KD LY+  + S Q  S   SP S  +
Subjt:  VMLCGTLKDGLYQIELPSIQNAS---SPISSTS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-2040Show/hide
Query:  AMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLL
        AMD+E+ A+   + W +              V+K+K  +DG+ +R KARLVAKG+ +  GIDF ETF+PV K  ++++ILA          QLDI+N  L
Subjt:  AMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILA---------RQLDINNTLL

Query:  NGHLIEDVFMKQPDGFV----DASHIDYVCKLDKVLYGLK
        NG L E+++MK P G+     D+   + VC L K +YGLK
Subjt:  NGHLIEDVFMKQPDGFV----DASHIDYVCKLDKVLYGLK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.7e-1639.87Show/hide
Query:  TRAKIGV--FQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGF
        TR+K G+    PK      + +       +P SV  ALK     +AM EEL AL RN  W L              VFK K  +DG+ DR KARLVAKGF
Subjt:  TRAKIGV--FQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMDEELAALHRNNIWRL--------------VFKVKRQADGSFDRCKARLVAKGF

Query:  HKIPGIDFHETFNPVVKDPTIRVIL--ARQLDIN---NTLLNGHLIEDVFMKQ
        H+  GI F ET++PVV+  TIR IL  A+QL++    N +   H    +F K+
Subjt:  HKIPGIDFHETFNPVVKDPTIRVIL--ARQLDIN---NTLLNGHLIEDVFMKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGATGAAGGTTCGTCATCCTCTTCCTCTGTAGTAGTCTCAACTCCAATCACTTATGCTGGAAATGTGATTAGCAACTCGTTGGAACTGAATCCTCTGTATGAAGA
ATGGATTACAGTCGACCAAGCCTTATGTGGCTGGCTTTTTGGGTCCATGACCCCTGCGATTGCTGCCATTGTTGTAAGTTTTAAGACTTCTAGAGAAGGTTGCAAAGCGC
TCGAAAAGATATATGGGTCGACTAGCAAAGCTCGAATCAATCAACTACGAGGAGATTTGCATAACACAAAAAATGGAACTATGAAGATGATGGAGTATCTTGCCATCATG
AAGTTGGCTTTAGAGAGTTTGCAACTAGCAGGTAATCCAATTTCCCTGGGTGACCTAATTTCATATGTCTTGGTGGGTCTCAACTCCGATTATGTGTTGATTTTTTGCTC
GATTGAAGACAAAGACATCACTACTTGGCAAGAGTTATCGTCTATCCTAGTTCAATTTGAAGGCACGCTTGCTGACTACAGTGTTTTCACGAATACGAGGGAGCTTCCTG
ATCTTGCAACTCATCTAGTGTACAACCTTCCAAATCAATCTTCAGGACAGAAATCGTTCAGCTCTAATCGCACAAATTTAGAAAGCTTCAACAACCCGCATGGTGCAAAT
GGTAAGGACAATCAAGCAGGAAATGGGGCTGGTTCTGCTTCTTCTTCAGCTTATATTGCTACACCTAAAATCCTATGTGATCCTAAGTGGTTGGCTGATAGCGGAGCTAC
AAACCACATCCCTGCAGATGCGGCAAGCAATCTTATGTCATGTTCAAATAAATCTATTAAACTGGATAATATGTTTCATGTGCCAAGTATAATGAGAAATTTAATTAGTA
TTGCTCCCCTAACTGCTGATAATGATGTCTTCGTTGAGTTTCACTCGGATTTTTGTCTTGTGAAGGACAAAGCTTCAAGGACGGTGATGTTGTGTGGAACTCTTAAGGAT
GGTCTTTACCAAATAGAGTTGCCCTCAATTCAAAATGCTTCTTCACCTATCAGTTCCACCTCCTCTTTTGTTGATGTCTCTACCTCAAATAAATCTTATGATAAGTCTAA
GTCTATTGTTTTATGGTCTGCCAATGATTATTCGTCATCTTCAACATCTCTGCAGTCTATTTATGGTTGTTCTAAATCCAGTCCTATTGTTTATCAAGTTACAAGGTTGT
CTATTTTGTCTAAATCCTTGTTGCATCATCGACTTGGCCATGCTTCTGAAAATACTATCAATAGTGTTATTGTTGCTTGTAACTTGAAAGTTTCTAGTAATGAAAAATTT
TCTTTCTGTGATGCATGTCAATTAGGGAAGTCACATCGCTTACCTTTTCCTACTTCGATGACTCAAACCCGACACCCACTTGAACTTATATTTTGTGACCTTTGGGGGTC
ATCCTCTGTTCCATCCACCCTTGGATGTTCCCACAAAGGCTACAAATGTCTCAGTTCAAGTGAACGTGTATATATCTCCCGTCATGTCATCTTTAATGAGAATGATTTTC
CTTTTCAGCATGATTTTTTGTCCTTTATCTCTGCTACTGTCTCTGACGATATAATCATCCATTGGCTTCCTTTTCCAACCTCTTCTTCATCTATGACTCCTCCAACAGCT
GATGCCTTGCCACATCCTTATCTAAACCATTCCACTGAGTTCTCCACTGTATCACCTACCTTGGATGCTACCCCTTCTAAGTCTACTCCTACTGCACCATCTTCTGCCTC
TTCTACAGTTGGTTTTCCTGATATTGGTGTGCTTGTTGCTGACTTTCCTCCTGATTCGGTTGTCGATCTTCATCCAGTGCAAACAAGAGCCAAGATTGGTGTCTTTCAGC
CTAAAAATTGGGGGTCATTCCTTGCTGCTTCATTTGCGGATCTGCCCCCTTCTAAGCCGATTTCTGTCAAGGAGGCCTTAAAGTCTTCTCACGGGAAGAAAGCCATGGAT
GAAGAATTGGCTGCACTCCATCGTAATAACATTTGGCGATTGGTGTTCAAGGTCAAGCGACAGGCTGATGGTTCGTTTGATCGGTGCAAAGCTAGACTGGTTGCTAAAGG
GTTCCACAAAATACCGGGGATAGATTTTCATGAGACTTTCAATCCAGTGGTTAAAGATCCCACCATTCGTGTCATTCTTGCCAGACAGTTGGACATAAACAATACATTAC
TCAATGGGCATTTGATCGAAGATGTTTTTATGAAACAGCCTGATGGTTTTGTTGATGCCTCTCATATTGATTATGTGTGCAAACTTGACAAAGTGTTATATGGTCTCAAG
TGTTGTTTTCAAGCAAAAGTCATGATCATAGACTTCATCACCATACCAGAAATTCGAGAGAAAACATTTAAGGAATTAAGGAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGATGAAGGTTCGTCATCCTCTTCCTCTGTAGTAGTCTCAACTCCAATCACTTATGCTGGAAATGTGATTAGCAACTCGTTGGAACTGAATCCTCTGTATGAAGA
ATGGATTACAGTCGACCAAGCCTTATGTGGCTGGCTTTTTGGGTCCATGACCCCTGCGATTGCTGCCATTGTTGTAAGTTTTAAGACTTCTAGAGAAGGTTGCAAAGCGC
TCGAAAAGATATATGGGTCGACTAGCAAAGCTCGAATCAATCAACTACGAGGAGATTTGCATAACACAAAAAATGGAACTATGAAGATGATGGAGTATCTTGCCATCATG
AAGTTGGCTTTAGAGAGTTTGCAACTAGCAGGTAATCCAATTTCCCTGGGTGACCTAATTTCATATGTCTTGGTGGGTCTCAACTCCGATTATGTGTTGATTTTTTGCTC
GATTGAAGACAAAGACATCACTACTTGGCAAGAGTTATCGTCTATCCTAGTTCAATTTGAAGGCACGCTTGCTGACTACAGTGTTTTCACGAATACGAGGGAGCTTCCTG
ATCTTGCAACTCATCTAGTGTACAACCTTCCAAATCAATCTTCAGGACAGAAATCGTTCAGCTCTAATCGCACAAATTTAGAAAGCTTCAACAACCCGCATGGTGCAAAT
GGTAAGGACAATCAAGCAGGAAATGGGGCTGGTTCTGCTTCTTCTTCAGCTTATATTGCTACACCTAAAATCCTATGTGATCCTAAGTGGTTGGCTGATAGCGGAGCTAC
AAACCACATCCCTGCAGATGCGGCAAGCAATCTTATGTCATGTTCAAATAAATCTATTAAACTGGATAATATGTTTCATGTGCCAAGTATAATGAGAAATTTAATTAGTA
TTGCTCCCCTAACTGCTGATAATGATGTCTTCGTTGAGTTTCACTCGGATTTTTGTCTTGTGAAGGACAAAGCTTCAAGGACGGTGATGTTGTGTGGAACTCTTAAGGAT
GGTCTTTACCAAATAGAGTTGCCCTCAATTCAAAATGCTTCTTCACCTATCAGTTCCACCTCCTCTTTTGTTGATGTCTCTACCTCAAATAAATCTTATGATAAGTCTAA
GTCTATTGTTTTATGGTCTGCCAATGATTATTCGTCATCTTCAACATCTCTGCAGTCTATTTATGGTTGTTCTAAATCCAGTCCTATTGTTTATCAAGTTACAAGGTTGT
CTATTTTGTCTAAATCCTTGTTGCATCATCGACTTGGCCATGCTTCTGAAAATACTATCAATAGTGTTATTGTTGCTTGTAACTTGAAAGTTTCTAGTAATGAAAAATTT
TCTTTCTGTGATGCATGTCAATTAGGGAAGTCACATCGCTTACCTTTTCCTACTTCGATGACTCAAACCCGACACCCACTTGAACTTATATTTTGTGACCTTTGGGGGTC
ATCCTCTGTTCCATCCACCCTTGGATGTTCCCACAAAGGCTACAAATGTCTCAGTTCAAGTGAACGTGTATATATCTCCCGTCATGTCATCTTTAATGAGAATGATTTTC
CTTTTCAGCATGATTTTTTGTCCTTTATCTCTGCTACTGTCTCTGACGATATAATCATCCATTGGCTTCCTTTTCCAACCTCTTCTTCATCTATGACTCCTCCAACAGCT
GATGCCTTGCCACATCCTTATCTAAACCATTCCACTGAGTTCTCCACTGTATCACCTACCTTGGATGCTACCCCTTCTAAGTCTACTCCTACTGCACCATCTTCTGCCTC
TTCTACAGTTGGTTTTCCTGATATTGGTGTGCTTGTTGCTGACTTTCCTCCTGATTCGGTTGTCGATCTTCATCCAGTGCAAACAAGAGCCAAGATTGGTGTCTTTCAGC
CTAAAAATTGGGGGTCATTCCTTGCTGCTTCATTTGCGGATCTGCCCCCTTCTAAGCCGATTTCTGTCAAGGAGGCCTTAAAGTCTTCTCACGGGAAGAAAGCCATGGAT
GAAGAATTGGCTGCACTCCATCGTAATAACATTTGGCGATTGGTGTTCAAGGTCAAGCGACAGGCTGATGGTTCGTTTGATCGGTGCAAAGCTAGACTGGTTGCTAAAGG
GTTCCACAAAATACCGGGGATAGATTTTCATGAGACTTTCAATCCAGTGGTTAAAGATCCCACCATTCGTGTCATTCTTGCCAGACAGTTGGACATAAACAATACATTAC
TCAATGGGCATTTGATCGAAGATGTTTTTATGAAACAGCCTGATGGTTTTGTTGATGCCTCTCATATTGATTATGTGTGCAAACTTGACAAAGTGTTATATGGTCTCAAG
TGTTGTTTTCAAGCAAAAGTCATGATCATAGACTTCATCACCATACCAGAAATTCGAGAGAAAACATTTAAGGAATTAAGGAGATAG
Protein sequenceShow/hide protein sequence
MGDEGSSSSSSVVVSTPITYAGNVISNSLELNPLYEEWITVDQALCGWLFGSMTPAIAAIVVSFKTSREGCKALEKIYGSTSKARINQLRGDLHNTKNGTMKMMEYLAIM
KLALESLQLAGNPISLGDLISYVLVGLNSDYVLIFCSIEDKDITTWQELSSILVQFEGTLADYSVFTNTRELPDLATHLVYNLPNQSSGQKSFSSNRTNLESFNNPHGAN
GKDNQAGNGAGSASSSAYIATPKILCDPKWLADSGATNHIPADAASNLMSCSNKSIKLDNMFHVPSIMRNLISIAPLTADNDVFVEFHSDFCLVKDKASRTVMLCGTLKD
GLYQIELPSIQNASSPISSTSSFVDVSTSNKSYDKSKSIVLWSANDYSSSSTSLQSIYGCSKSSPIVYQVTRLSILSKSLLHHRLGHASENTINSVIVACNLKVSSNEKF
SFCDACQLGKSHRLPFPTSMTQTRHPLELIFCDLWGSSSVPSTLGCSHKGYKCLSSSERVYISRHVIFNENDFPFQHDFLSFISATVSDDIIIHWLPFPTSSSSMTPPTA
DALPHPYLNHSTEFSTVSPTLDATPSKSTPTAPSSASSTVGFPDIGVLVADFPPDSVVDLHPVQTRAKIGVFQPKNWGSFLAASFADLPPSKPISVKEALKSSHGKKAMD
EELAALHRNNIWRLVFKVKRQADGSFDRCKARLVAKGFHKIPGIDFHETFNPVVKDPTIRVILARQLDINNTLLNGHLIEDVFMKQPDGFVDASHIDYVCKLDKVLYGLK
CCFQAKVMIIDFITIPEIREKTFKELRR