; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041193 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041193
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr13:13521702..13523514
RNA-Seq ExpressionLag0041193
SyntenyLag0041193
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.9e-15553.39Show/hide
Query:  STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSSAEPVRQ-------------ETLNPAYTLWKKQDRMISSWL
        ST S       E ++P +Q F  GNKIS+VKL DD FLLWKFQIL ALE         S +EP  +              T NPAY +WK+QDR+ISSWL
Subjt:  STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSSAEPVRQ-------------ETLNPAYTLWKKQDRMISSWL

Query:  VGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVIS
        +GSMSEEIL+QM+HC S+KEIW +L+ IF++R LAQ M+ K KL  I+KG M LKEYF KI Q +DALA + KPV  +DHIL+ILAGLGS+Y+SM+ VIS
Subjt:  VGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVIS

Query:  AKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMVHS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF
        A+    +VQEVMSLLLTQE++ ESK+ S E++LPS N++  +  K  ES + ++N N++  N     R   GGRG   +NRG R   NRN+PQCQ+C K 
Subjt:  AKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMVHS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF

Query:  NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSF
         ++A +CFFRY P  +S+  +P S + ++   N   ++PQM AMV + DLN D+NWYPDSGATNHLTHS +NLSIG+EYGGGNQ++  NG+GLPI ++  
Subjt:  NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSF

Query:  TSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKL--PVQALTSQLF--
         SF+S     + F LNNLL V SITKNLISVSQFAKDN V+FEFHPTLCYVKD   GQVLLQG L+DGLY+        +EP  K      + T  +F  
Subjt:  TSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKL--PVQALTSQLF--

Query:  LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF
        + P + N  + DLWHRRLGH  L  VK+V+   +      NK  FC ACA+GK H LPF +S T+YT PLQL I  DLWGP    S  G+RY ISF+DA+
Subjt:  LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF

Query:  SRYTW
        SRYTW
Subjt:  SRYTW

KAF7832320.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora]1.8e-9940.33Show/hide
Query:  VKLTDDNFLLWKFQILMALEGS----------------SSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRN
        +KL + N+LLW+ QI+ A++G                  S E    E L+  Y  WKKQD+++ SWL+ SM+E ++ ++I CT S E+W  ++Q F T  
Subjt:  VKLTDDNFLLWKFQILMALEGS----------------SSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRN

Query:  LAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVG-KPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESS
         A++ + +T+L+ I+KG  S+ EY  KI+   DAL  +G   V   +H+  +L GL  EYES V  I  +  P TV E+  LL+ QE R+E  + +T + 
Subjt:  LAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVG-KPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESS

Query:  LPSANL-------MVHSKPPESDLQKSNTNHFSPNPGSG------NRGRGGG--RGGFNTNRGGRSWNNRNRP--QCQVCGKFNHTAPKCFFRYAPFGSS
         PSAN+          S  P +   +S+   F+   G G       RGRG G  RGG++  RGG S NN NRP   CQVC K  H A  C+ R+      
Subjt:  LPSANL-------MVHSKPPESDLQKSNTNHFSPNPGSG------NRGRGGG--RGGFNTNRGGRSWNNRNRP--QCQVCGKFNHTAPKCFFRYAPFGSS

Query:  NTPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLL
         +   +S +F QF +  S P M A + +P++  D+ W+PDSGATNH+T   NNL  G+EY G  Q+H+GNG GL I +   +   S    N    LN+LL
Subjt:  NTPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLL

Query:  HVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTL-HDGLYR---------SHLYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDL
        HV +ITKNLISVS+FAKDN V+FEFH   C VK QV  QVLL+GT+  DGLY+         S+ Y SS   P    P   + S    S S  + FVF++
Subjt:  HVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTL-HDGLYR---------SHLYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDL

Query:  --------WHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW
                WH RLGH++ + V +V++  N  +   +  +FC AC  GK H LP   S ++YT PL+L + TDLWGP  I S  GY Y ISF+DAFSRY W
Subjt:  --------WHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum]3.3e-10643.32Show/hide
Query:  ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVED
        E +NP +  W +QD+++ S+L+ SMSE    QMI C +S ++W  + Q+F TR+ A++M+ K +LQT++KG +S+K+Y  K++ YID LA  G  +  +D
Subjt:  ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVED

Query:  HILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIES-KIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTN
         IL IL G+G EYES+V  +++++   ++ EV +LLL  E RIE+  I    ++ PS N+        S  +  NT+   P      RGRG GR G    
Subjt:  HILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIES-KIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTN

Query:  RGGRS-WNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSPNFNQFNR-PPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYG
        RGGR  W+N  RP CQ+CG   H A  C++R+       + G    +  QFNR  PSYP      T  +   +  WYPDSGA++H+T+   NLS+ +EY 
Subjt:  RGGRS-WNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSPNFNQFNR-PPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYG

Query:  GGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLV
        GG++V VGNGAGL I N   ++ +    S+R F L NLLHV  ITKNLISVS+FA DN VYFEFHP+ C VKD     VLL+GTLH+GLYR +L S    
Subjt:  GGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLV

Query:  EPQDKLPVQALTSQLFLSPSNVNCF---VFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGP
               +Q+  S + +   +  C      D WH RLGH S++TVK V+   N R+  N+   FCS+C +GK H LPF  STT ++AP + ++ +DLWGP
Subjt:  EPQDKLPVQALTSQLFLSPSNVNCF---VFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGP

Query:  TYIPSSQGYRYCISFIDAFSRYTW
         +IPS  G RY ISF+DA++RYTW
Subjt:  TYIPSSQGYRYCISFIDAFSRYTW

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.6e-10639.19Show/hide
Query:  SPGNKISIVKLTDDNFLLWKFQILMALEG--------SSSAEPVRQET-------LNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLK
        SP +++  ++L DDNFL+WK+QI  A+ G         +   P +  T        NP +  +++QD ++ SWL+ S+    L Q++ C+S+ E+W ++ 
Subjt:  SPGNKISIVKLTDDNFLLWKFQILMALEG--------SSSAEPVRQET-------LNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLK

Query:  QIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKI
        Q F +++ A++M  K+++Q ++K G+++++Y +K++ Y D LA  G  +   DHIL I+ GLG EYES++ VIS+K    ++Q V S L+  E RI  KI
Subjt:  QIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKI

Query:  ASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGG----RGGFNTNRG---GRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTP-
        +S + S+   +   +  P  S     N+N + P+ G  NR + GG    RG F  NRG   GR+     +PQCQ+C KF HT  +CF+RY P    N P 
Subjt:  ASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGG----RGGFNTNRG---GRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTP-

Query:  -----------------GSFSP----NFNQFNRPPS--YPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFS
                         GS S     N  +++   +  Y +MEAMV +P+  Q+  W+PDSGATNH+TH   NL+ G EY G +++H+GNG GL I +  
Subjt:  -----------------GSFSP----NFNQFNRPPS--YPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFS

Query:  FTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHL------YSSSLVEPQDKLPVQALTS
         + F S    N++  L N+L V +I KNL+SVSQFA+DN VYFEFHP +C+VKD+    +LLQG LH GLY+ +L       +S L    DK  +    +
Subjt:  FTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHL------YSSSLVEPQDKLPVQALTS

Query:  QL-------FLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGY
         L       F   +N +  VFDLWH+RLGH +   V  V+          +    CSAC +GK HNLPF  S TVYT PLQL +V+DLWGP  I SS G+
Subjt:  QL-------FLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGY

Query:  RYCISFIDAFSRYTW
         Y +SF+DA+SRYTW
Subjt:  RYCISFIDAFSRYTW

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.9e-15553.39Show/hide
Query:  STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSSAEPVRQ-------------ETLNPAYTLWKKQDRMISSWL
        ST S       E ++P +Q F  GNKIS+VKL DD FLLWKFQIL ALE         S +EP  +              T NPAY +WK+QDR+ISSWL
Subjt:  STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSSAEPVRQ-------------ETLNPAYTLWKKQDRMISSWL

Query:  VGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVIS
        +GSMSEEIL+QM+HC S+KEIW +L+ IF++R LAQ M+ K KL  I+KG M LKEYF KI Q +DALA + KPV  +DHIL+ILAGLGS+Y+SM+ VIS
Subjt:  VGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVIS

Query:  AKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMVHS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF
        A+    +VQEVMSLLLTQE++ ESK+ S E++LPS N++  +  K  ES + ++N N++  N     R   GGRG   +NRG R   NRN+PQCQ+C K 
Subjt:  AKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMVHS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF

Query:  NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSF
         ++A +CFFRY P  +S+  +P S + ++   N   ++PQM AMV + DLN D+NWYPDSGATNHLTHS +NLSIG+EYGGGNQ++  NG+GLPI ++  
Subjt:  NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSF

Query:  TSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKL--PVQALTSQLF--
         SF+S     + F LNNLL V SITKNLISVSQFAKDN V+FEFHPTLCYVKD   GQVLLQG L+DGLY+        +EP  K      + T  +F  
Subjt:  TSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKL--PVQALTSQLF--

Query:  LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF
        + P + N  + DLWHRRLGH  L  VK+V+   +      NK  FC ACA+GK H LPF +S T+YT PLQL I  DLWGP    S  G+RY ISF+DA+
Subjt:  LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF

Query:  SRYTW
        SRYTW
Subjt:  SRYTW

TrEMBL top hitse value%identityAlignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein1.6e-10643.32Show/hide
Query:  ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVED
        E +NP +  W +QD+++ S+L+ SMSE    QMI C +S ++W  + Q+F TR+ A++M+ K +LQT++KG +S+K+Y  K++ YID LA  G  +  +D
Subjt:  ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVED

Query:  HILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIES-KIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTN
         IL IL G+G EYES+V  +++++   ++ EV +LLL  E RIE+  I    ++ PS N+        S  +  NT+   P      RGRG GR G    
Subjt:  HILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIES-KIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTN

Query:  RGGRS-WNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSPNFNQFNR-PPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYG
        RGGR  W+N  RP CQ+CG   H A  C++R+       + G    +  QFNR  PSYP      T  +   +  WYPDSGA++H+T+   NLS+ +EY 
Subjt:  RGGRS-WNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSPNFNQFNR-PPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYG

Query:  GGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLV
        GG++V VGNGAGL I N   ++ +    S+R F L NLLHV  ITKNLISVS+FA DN VYFEFHP+ C VKD     VLL+GTLH+GLYR +L S    
Subjt:  GGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLV

Query:  EPQDKLPVQALTSQLFLSPSNVNCF---VFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGP
               +Q+  S + +   +  C      D WH RLGH S++TVK V+   N R+  N+   FCS+C +GK H LPF  STT ++AP + ++ +DLWGP
Subjt:  EPQDKLPVQALTSQLFLSPSNVNCF---VFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGP

Query:  TYIPSSQGYRYCISFIDAFSRYTW
         +IPS  G RY ISF+DA++RYTW
Subjt:  TYIPSSQGYRYCISFIDAFSRYTW

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-10639.19Show/hide
Query:  SPGNKISIVKLTDDNFLLWKFQILMALEG--------SSSAEPVRQET-------LNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLK
        SP +++  ++L DDNFL+WK+QI  A+ G         +   P +  T        NP +  +++QD ++ SWL+ S+    L Q++ C+S+ E+W ++ 
Subjt:  SPGNKISIVKLTDDNFLLWKFQILMALEG--------SSSAEPVRQET-------LNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLK

Query:  QIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKI
        Q F +++ A++M  K+++Q ++K G+++++Y +K++ Y D LA  G  +   DHIL I+ GLG EYES++ VIS+K    ++Q V S L+  E RI  KI
Subjt:  QIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKI

Query:  ASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGG----RGGFNTNRG---GRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTP-
        +S + S+   +   +  P  S     N+N + P+ G  NR + GG    RG F  NRG   GR+     +PQCQ+C KF HT  +CF+RY P    N P 
Subjt:  ASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGG----RGGFNTNRG---GRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTP-

Query:  -----------------GSFSP----NFNQFNRPPS--YPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFS
                         GS S     N  +++   +  Y +MEAMV +P+  Q+  W+PDSGATNH+TH   NL+ G EY G +++H+GNG GL I +  
Subjt:  -----------------GSFSP----NFNQFNRPPS--YPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFS

Query:  FTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHL------YSSSLVEPQDKLPVQALTS
         + F S    N++  L N+L V +I KNL+SVSQFA+DN VYFEFHP +C+VKD+    +LLQG LH GLY+ +L       +S L    DK  +    +
Subjt:  FTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHL------YSSSLVEPQDKLPVQALTS

Query:  QL-------FLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGY
         L       F   +N +  VFDLWH+RLGH +   V  V+          +    CSAC +GK HNLPF  S TVYT PLQL +V+DLWGP  I SS G+
Subjt:  QL-------FLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGY

Query:  RYCISFIDAFSRYTW
         Y +SF+DA+SRYTW
Subjt:  RYCISFIDAFSRYTW

A0A438H844 Retrovirus-related Pol polyprotein from transposon RE12.1e-9838.95Show/hide
Query:  MEETANPSSQTFSPGNKISIV--KLTDDNFLLWKFQILMALEGSS----------------SAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQ
        MEET+  +S  F P +    V  KL + NFL+W+ QIL  L G                  S++   Q  +NP +  W++QD++I SWL+ S+++ +L +
Subjt:  MEETANPSSQTFSPGNKISIV--KLTDDNFLLWKFQILMALEGSS----------------SAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQ

Query:  MIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEV
        M++C +S ++W +L+  F T+  A++ + KT+L   +KG +S+ +Y  KI+  +D LA+VG  + V+DHI  I  GL  +YE+ +  +++++ P TV+E+
Subjt:  MIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEV

Query:  MSLLLTQENRIESKIASTESSLPS-ANLMVHSK--PPESDLQKSNTN-HFSPNPGSGNRGRGGGRGGFNTNRGGR----SWNNRNRPQCQVCGKFNHTAP
          LLL QE+RIE  I   + S PS A+L+  ++   P  + + S  N +F P   SGN G    RG F     GR    SW   N+PQCQ+CG+  H   
Subjt:  MSLLLTQENRIESKIASTESSLPS-ANLMVHSK--PPESDLQKSNTN-HFSPNPGSGNRGRGGGRGGFNTNRGGR----SWNNRNRPQCQVCGKFNHTAP

Query:  KCFFRYAPFGSSNTPGSFSPNFNQFNRPPSYPQM---------EAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNF
        +C++R+    S   P     N  Q N    + Q+             T+ ++ QD NWYPDSGAT+HLT + NNL   +++   ++V VGNG GLPI + 
Subjt:  KCFFRYAPFGSSNTPGSFSPNFNQFNRPPSYPQM---------EAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNF

Query:  SFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQ--------A
          TSFSS    ++   L  LLHV  ITKNL+SVS+FA DN V+FEFHPT C+VKD     VL+ G L  GLY   ++ ++    Q KLP+         A
Subjt:  SFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQ--------A

Query:  LTSQLFLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCI
        L S+    P++ +   F LWH RLGH S   V  V+ + N   L       CSAC MGK+H  PF +S + YT PL+L I TDLWGP   PSS G++Y I
Subjt:  LTSQLFLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCI

Query:  SFIDAFSRYTW
         FIDA+SR+TW
Subjt:  SFIDAFSRYTW

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-15553.39Show/hide
Query:  STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSSAEPVRQ-------------ETLNPAYTLWKKQDRMISSWL
        ST S       E ++P +Q F  GNKIS+VKL DD FLLWKFQIL ALE         S +EP  +              T NPAY +WK+QDR+ISSWL
Subjt:  STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSSAEPVRQ-------------ETLNPAYTLWKKQDRMISSWL

Query:  VGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVIS
        +GSMSEEIL+QM+HC S+KEIW +L+ IF++R LAQ M+ K KL  I+KG M LKEYF KI Q +DALA + KPV  +DHIL+ILAGLGS+Y+SM+ VIS
Subjt:  VGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVIS

Query:  AKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMVHS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF
        A+    +VQEVMSLLLTQE++ ESK+ S E++LPS N++  +  K  ES + ++N N++  N     R   GGRG   +NRG R   NRN+PQCQ+C K 
Subjt:  AKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMVHS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF

Query:  NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSF
         ++A +CFFRY P  +S+  +P S + ++   N   ++PQM AMV + DLN D+NWYPDSGATNHLTHS +NLSIG+EYGGGNQ++  NG+GLPI ++  
Subjt:  NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSF

Query:  TSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKL--PVQALTSQLF--
         SF+S     + F LNNLL V SITKNLISVSQFAKDN V+FEFHPTLCYVKD   GQVLLQG L+DGLY+        +EP  K      + T  +F  
Subjt:  TSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKL--PVQALTSQLF--

Query:  LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF
        + P + N  + DLWHRRLGH  L  VK+V+   +      NK  FC ACA+GK H LPF +S T+YT PLQL I  DLWGP    S  G+RY ISF+DA+
Subjt:  LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF

Query:  SRYTW
        SRYTW
Subjt:  SRYTW

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-15553.39Show/hide
Query:  STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSSAEPVRQ-------------ETLNPAYTLWKKQDRMISSWL
        ST S       E ++P +Q F  GNKIS+VKL DD FLLWKFQIL ALE         S +EP  +              T NPAY +WK+QDR+ISSWL
Subjt:  STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSSAEPVRQ-------------ETLNPAYTLWKKQDRMISSWL

Query:  VGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVIS
        +GSMSEEIL+QM+HC S+KEIW +L+ IF++R LAQ M+ K KL  I+KG M LKEYF KI Q +DALA + KPV  +DHIL+ILAGLGS+Y+SM+ VIS
Subjt:  VGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVIS

Query:  AKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMVHS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF
        A+    +VQEVMSLLLTQE++ ESK+ S E++LPS N++  +  K  ES + ++N N++  N     R   GGRG   +NRG R   NRN+PQCQ+C K 
Subjt:  AKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMVHS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF

Query:  NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSF
         ++A +CFFRY P  +S+  +P S + ++   N   ++PQM AMV + DLN D+NWYPDSGATNHLTHS +NLSIG+EYGGGNQ++  NG+GLPI ++  
Subjt:  NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSF

Query:  TSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKL--PVQALTSQLF--
         SF+S     + F LNNLL V SITKNLISVSQFAKDN V+FEFHPTLCYVKD   GQVLLQG L+DGLY+        +EP  K      + T  +F  
Subjt:  TSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKL--PVQALTSQLF--

Query:  LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF
        + P + N  + DLWHRRLGH  L  VK+V+   +      NK  FC ACA+GK H LPF +S T+YT PLQL I  DLWGP    S  G+RY ISF+DA+
Subjt:  LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF

Query:  SRYTW
        SRYTW
Subjt:  SRYTW

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-1421.72Show/hide
Query:  GNKISIVKLTDDN-FLLWK-----FQILMALEGSSSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQM
        G K  + K   DN F  W+       I   L      +  + +T+      W   D   +S +   +S+++++ +I   +++ IW  L+ ++ ++ L   
Subjt:  GNKISIVKLTDDN-FLLWK-----FQILMALEGSSSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQM

Query:  MKIKTKLQTIQKG-GMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSA
        + +K +L  +    G +   + +     I  LA +G  +E ED  + +L  L S Y+++   I        +++V S LL  E                 
Subjt:  MKIKTKLQTIQKG-GMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSA

Query:  NLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSPNFNQFNRPPSYPQ
              K PE+  Q   T           RGR   R   N  R G    ++NR + +V   +N   P  F R  P       G  S   N  N       
Subjt:  NLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSPNFNQFNRPPSYPQ

Query:  MEAMVTSPDLNQ--------DTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVS
         + +V   +  +        ++ W  D+ A++H T    +L      G    V +GN +   I         ++V    +  L ++ HV  +  NLIS  
Subjt:  MEAMVTSPDLNQ--------DTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVS

Query:  QFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSH--LYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFN
           +D    +  +      K  +   V+ +G     LYR++  +    L   QD++ V                   DLWH+R+GH S   ++ + ++  
Subjt:  QFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSH--LYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFN

Query:  PRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW
                 + C  C  GK H + F  S+      L L + +D+ GP  I S  G +Y ++FID  SR  W
Subjt:  PRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.7e-6531.84Show/hide
Query:  NKISIVKLTDDNFLLWK---------FQILMALEGSSSAEPVRQET-----LNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFT
        N  ++ KLT  N+L+W          +++   L+GS++  P    T     +NP YT WK+QD++I S ++G++S  +   +   T++ +IW +L++I+ 
Subjt:  NKISIVKLTDDNFLLWK---------FQILMALEGSSSAEPVRQET-----LNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFT

Query:  TRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTE
          +   + +++T+L+   KG  ++ +Y   +    D LA++GKP++ ++ +  +L  L  EY+ ++  I+AK  P T+ E+   LL  E++I +  ++T 
Subjt:  TRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTE

Query:  SSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSW----------NNRNRP---QCQVCGKFNHTAPKCFFRYAPFGSSNT
          + +AN + H     +    +N N+ + N    NR         N N   + W          NN+++P   +CQ+CG   H+A +C        S N+
Subjt:  SSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSW----------NNRNRP---QCQVCGKFNHTAPKCFFRYAPFGSSNT

Query:  PGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHV
            SP        P  P+    + SP      NW  DSGAT+H+T  FNNLS+   Y GG+ V V +G+ +PI +   TS S+    +R   L+N+L+V
Subjt:  PGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHV

Query:  SSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDLWHRRLGHSSLST
         +I KNLISV +    NGV  EF P    VKD   G  LLQG   D LY   + SS  V              LF SPS+        WH RLGH + S 
Subjt:  SSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDLWHRRLGHSSLST

Query:  VKSVIQRFNPRLL-INNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW
        + SVI  ++  +L  ++KF  CS C + K + +PF  ST   T PL+  I +D+W  + I S   YRY + F+D F+RYTW
Subjt:  VKSVIQRFNPRLL-INNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-5631.13Show/hide
Query:  NKISIVKLTDDNFLLWK---------FQILMALEGSSSAEPVRQET-----LNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFT
        N  ++ KLT  N+L+W          +++   L+GS+   P    T     +NP YT W++QD++I S ++G++S  +   +   T++ +IW +L++I+ 
Subjt:  NKISIVKLTDDNFLLWK---------FQILMALEGSSSAEPVRQET-----LNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFT

Query:  TRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTE
          +   +    T+L+ I +                D LA++GKP++ ++ +  +L  L  +Y+ ++  I+AK  P ++ E+   L+ +E+++ + + S E
Subjt:  TRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTE

Query:  SSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFN----TNRGGRSWNNRNRP---QCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSP
            +AN++ H        + +NTN    N G  NR         N    ++ G RS N + +P   +CQ+C    H+A +C   +    ++N   S SP
Subjt:  SSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFN----TNRGGRSWNNRNRP---QCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSP

Query:  NFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKN
                P  P+    V SP      NW  DSGAT+H+T  FNNLS    Y GG+ V + +G+ +PI   + T  +S   S+R   LN +L+V +I KN
Subjt:  NFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKN

Query:  LISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQ
        LISV +    N V  EF P    VKD   G  LLQG   D LY   + SS  V              +F SP +        WH RLGH SL+ + SVI 
Subjt:  LISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQ

Query:  RFN-PRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW
          + P L  ++K   CS C + K H +PF NST   + PL+  I +D+W  + I S   YRY + F+D F+RYTW
Subjt:  RFN-PRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).8.2e-1024.34Show/hide
Query:  SSTSASMEETANPSSQTFSP-----GNKISIVKLT--DDNFLLWKFQILMALE-----GSSSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQ
        + T  S+  T++P S  + P      +  SI KL+  +DN++ WK +    L      G       + +  +P Y  W++ + M+  WL+ SM++++L  
Subjt:  SSTSASMEETANPSSQTFSP-----GNKISIVKLT--DDNFLLWKFQILMALE-----GSSSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQ

Query:  MIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQ
        +++  ++ ++W  L+++F      ++ +++ +L T+++GG S++EYF K+ +
Subjt:  MIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQ

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.9e-1424.81Show/hide
Query:  LLWKFQILMALEGSSSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIH--CTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLK
        L   F +L  ++GSS+  P+ ++        WK++D ++  W+ G++++ +L  +I   CT ++++W+SL+ +F     A+ ++ + +L+T     +S+ 
Subjt:  LLWKFQILMALEGSSSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIH--CTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLK

Query:  EYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRI--ESKIASTESSLPSANLMVHSKPPESDLQKSN
        EY  K++   D L  V  P+     ++ +L GL  +Y+ ++ VI  K    +  E  S+LL +E+R+  +SK + + ++ PS + ++ + P + +     
Subjt:  EYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRI--ESKIASTESSLPSANLMVHSKPPESDLQKSN

Query:  TNHFSPNPGSG-----NRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCG------KFNHTAPKCFFRYAPF
         ++ + N G G     NRG G   G +N N   R     N+P   + G       + H  P+ F +   F
Subjt:  TNHFSPNPGSG-----NRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCG------KFNHTAPKCFFRYAPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGACGGAAAGCTCCACTAGTGCGTCCATGGAGGAAACTGCAAATCCGTCTTCTCAGACGTTTAGTCCCGGTAACAAAATATCTATAGTCAAGCTTACTGATGATAA
TTTTCTGTTATGGAAATTTCAGATCCTCATGGCTTTAGAAGGTTCCTCGTCCGCTGAACCAGTTCGACAGGAGACTCTAAATCCCGCCTATACCCTATGGAAGAAACAAG
ATCGAATGATCTCGTCGTGGCTAGTTGGTTCCATGTCTGAGGAAATACTCCATCAAATGATACATTGTACATCCTCCAAGGAGATTTGGGTAAGTCTCAAACAGATATTC
ACCACTCGAAATCTTGCCCAGATGATGAAAATAAAAACCAAGCTCCAAACAATACAAAAAGGAGGTATGTCTTTAAAAGAATACTTCTCGAAAATTCAGCAATATATTGA
TGCTCTTGCTGTTGTGGGGAAACCGGTAGAAGTTGAAGATCATATCCTTTTTATTTTAGCTGGTTTGGGATCTGAATATGAATCTATGGTGTTCGTTATCTCTGCTAAAA
TTGGTCCTCAAACGGTCCAAGAAGTTATGTCTCTGTTGTTAACTCAGGAAAATCGAATTGAAAGCAAAATAGCTTCCACTGAAAGCTCTCTTCCCTCGGCGAATCTCATG
GTTCATTCTAAACCGCCAGAGTCCGACTTGCAAAAGTCTAATACTAATCATTTTTCTCCCAATCCTGGTAGCGGTAACAGAGGAAGAGGTGGTGGTCGTGGAGGTTTTAA
CACAAATCGTGGAGGTCGTTCCTGGAACAATCGCAACCGACCACAGTGTCAAGTCTGTGGGAAATTCAACCATACAGCTCCCAAATGCTTCTTCCGATATGCTCCATTTG
GATCCTCAAATACTCCAGGTTCGTTCTCTCCAAATTTTAACCAATTTAATCGACCTCCCTCATATCCTCAGATGGAAGCCATGGTGACTTCCCCTGATCTGAATCAAGAT
ACCAATTGGTATCCGGACTCCGGTGCTACCAATCACCTTACTCATTCCTTCAACAACCTCTCGATTGGAACTGAATACGGTGGTGGCAATCAAGTGCACGTGGGAAATGG
AGCAGGTTTGCCTATCCTTAATTTTAGCTTTACTTCATTTTCTTCACATGTCTGTTCTAATAGAATTTTTCGATTAAACAACTTACTTCATGTGTCTTCTATCACCAAAA
ATTTAATCAGTGTTAGTCAATTTGCTAAGGACAATGGAGTTTATTTTGAGTTTCATCCTACCCTTTGCTATGTGAAGGACCAAGTCTTTGGGCAGGTTTTACTCCAAGGG
ACTCTCCATGATGGACTTTATCGCTCACATTTATATAGTTCATCGTTGGTGGAACCTCAAGATAAACTGCCAGTGCAAGCTCTCACTTCTCAACTTTTTTTGTCTCCTTC
TAATGTTAACTGTTTTGTGTTTGATCTTTGGCATAGGCGTCTAGGCCACTCTTCTCTTTCTACTGTTAAAAGTGTCATTCAGAGGTTTAATCCTCGATTGTTGATAAATA
ACAAATTTCAATTCTGTTCTGCGTGTGCTATGGGAAAAGTTCACAATCTTCCTTTTCATAATTCCACTACTGTCTATACAGCTCCTCTTCAACTAATTATTGTTACTGAT
CTATGGGGGCCTACCTATATACCATCTTCTCAAGGTTATAGATACTGTATTAGCTTCATAGATGCATTCAGTAGATACACCTGGTTTACTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGACGGAAAGCTCCACTAGTGCGTCCATGGAGGAAACTGCAAATCCGTCTTCTCAGACGTTTAGTCCCGGTAACAAAATATCTATAGTCAAGCTTACTGATGATAA
TTTTCTGTTATGGAAATTTCAGATCCTCATGGCTTTAGAAGGTTCCTCGTCCGCTGAACCAGTTCGACAGGAGACTCTAAATCCCGCCTATACCCTATGGAAGAAACAAG
ATCGAATGATCTCGTCGTGGCTAGTTGGTTCCATGTCTGAGGAAATACTCCATCAAATGATACATTGTACATCCTCCAAGGAGATTTGGGTAAGTCTCAAACAGATATTC
ACCACTCGAAATCTTGCCCAGATGATGAAAATAAAAACCAAGCTCCAAACAATACAAAAAGGAGGTATGTCTTTAAAAGAATACTTCTCGAAAATTCAGCAATATATTGA
TGCTCTTGCTGTTGTGGGGAAACCGGTAGAAGTTGAAGATCATATCCTTTTTATTTTAGCTGGTTTGGGATCTGAATATGAATCTATGGTGTTCGTTATCTCTGCTAAAA
TTGGTCCTCAAACGGTCCAAGAAGTTATGTCTCTGTTGTTAACTCAGGAAAATCGAATTGAAAGCAAAATAGCTTCCACTGAAAGCTCTCTTCCCTCGGCGAATCTCATG
GTTCATTCTAAACCGCCAGAGTCCGACTTGCAAAAGTCTAATACTAATCATTTTTCTCCCAATCCTGGTAGCGGTAACAGAGGAAGAGGTGGTGGTCGTGGAGGTTTTAA
CACAAATCGTGGAGGTCGTTCCTGGAACAATCGCAACCGACCACAGTGTCAAGTCTGTGGGAAATTCAACCATACAGCTCCCAAATGCTTCTTCCGATATGCTCCATTTG
GATCCTCAAATACTCCAGGTTCGTTCTCTCCAAATTTTAACCAATTTAATCGACCTCCCTCATATCCTCAGATGGAAGCCATGGTGACTTCCCCTGATCTGAATCAAGAT
ACCAATTGGTATCCGGACTCCGGTGCTACCAATCACCTTACTCATTCCTTCAACAACCTCTCGATTGGAACTGAATACGGTGGTGGCAATCAAGTGCACGTGGGAAATGG
AGCAGGTTTGCCTATCCTTAATTTTAGCTTTACTTCATTTTCTTCACATGTCTGTTCTAATAGAATTTTTCGATTAAACAACTTACTTCATGTGTCTTCTATCACCAAAA
ATTTAATCAGTGTTAGTCAATTTGCTAAGGACAATGGAGTTTATTTTGAGTTTCATCCTACCCTTTGCTATGTGAAGGACCAAGTCTTTGGGCAGGTTTTACTCCAAGGG
ACTCTCCATGATGGACTTTATCGCTCACATTTATATAGTTCATCGTTGGTGGAACCTCAAGATAAACTGCCAGTGCAAGCTCTCACTTCTCAACTTTTTTTGTCTCCTTC
TAATGTTAACTGTTTTGTGTTTGATCTTTGGCATAGGCGTCTAGGCCACTCTTCTCTTTCTACTGTTAAAAGTGTCATTCAGAGGTTTAATCCTCGATTGTTGATAAATA
ACAAATTTCAATTCTGTTCTGCGTGTGCTATGGGAAAAGTTCACAATCTTCCTTTTCATAATTCCACTACTGTCTATACAGCTCCTCTTCAACTAATTATTGTTACTGAT
CTATGGGGGCCTACCTATATACCATCTTCTCAAGGTTATAGATACTGTATTAGCTTCATAGATGCATTCAGTAGATACACCTGGTTTACTTCTTGA
Protein sequenceShow/hide protein sequence
MSTESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEGSSSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIF
TTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLM
VHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQD
TNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQG
TLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTD
LWGPTYIPSSQGYRYCISFIDAFSRYTWFTS