; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G25230 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G25230
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPolyprotein
Genome locationChr3:22468284..22470412
RNA-Seq ExpressionCSPI03G25230
SyntenyCSPI03G25230
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050130.1 uncharacterized protein E6C27_scaffold675G001500 [Cucumis melo var. makuwa]1.7e-26365.56Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE++SR KLRRLRHT +ILDYVKEFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAA E L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K ++  +K+ GK K FQ ++GK     +GE S  P PCF+C GPH TRDCPNRKA+NALVAK  E KQ +D   PQIGS+Q IG +K+       G
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
        LL+G+++I G+EA  MFDTGASHNFM+V EAKR+GLKF  E G VK VN++  AI GVA+GV VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ-----PQTVPSRE-KNSKPTVP-------AEHKDVM-----------------LEVKSGN
        S+  TL+I+DG +TTIP+++G+PV MLSA+  K GV + Q      +T+ + E K  +P VP        E+KD+                   E++   
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ-----PQTVPSRE-KNSKPTVP-------AEHKDVM-----------------LEVKSGN

Query:  ADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------
        A+ V + +S+KA+L  ITTSMP SDFL++IKEGM++D  AKNLLKL KEGKTRRFWE++G LLTIGNRLFVPRWG+LRKD+L+ECHDSLWAGHP      
Subjt:  ADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------

Query:  ----------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFF
                                    DKGEQQLPAGLLEPLPIAEKPWDSV MDFIVALPKS GFG+IMVVVDRFSKYATFIPC PDVKVDEAARLFF
Subjt:  ----------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFF

Query:  KNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIV
        KNVVKLWG+PKSIISDRDPRFTG+FWRELFKL+GT+LNFSTSFHPQSDGQTER+NALLE YLRHYVSAHQK+WATL+D+AQFSYNLQRSE+TGKSPFEI+
Subjt:  KNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIV

Query:  MNQQPNTPSALIIPYEGPNPTAF
        MNQQPNTP ALI PYEGPNP+AF
Subjt:  MNQQPNTPSALIIPYEGPNPTAF

KAA0050296.1 reverse transcriptase [Cucumis melo var. makuwa]1.8e-20443.77Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE ESR KLRRLRHTG+ILDYVKEFTT+ML+I DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIA AE L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSKPS-PCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K +    K+ GK K FQ R+GK     +GE S PS PCF+C GPH TRDCPNRKA+NALVAK  E KQ +D   PQIGS+Q IG +K+   T    
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSKPS-PCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
        LL+G+V+I G+EA AMFDTGASHNFM+V EAKR+GLKF  E GTVK VN++ Q I G+A+GV VKIGDWQ +LDF++LPMDDF IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAE------TQPQTVPSREKNSKPTVP-------AEHKDVM------------------------
        S+  TL+I+DG +TTI +++G+PV MLSA+  K GV++      T         K+ +P VP        E+KD+M                        
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAE------TQPQTVPSREKNSKPTVP-------AEHKDVM------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------LEVKSGNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRF
                                              LE K G A+ V + +S KA+L+TITTSMP SDFL++IKEGM++D  AKNLLKLAKEGKTRRF
Subjt:  --------------------------------------LEVKSGNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRF

Query:  WESNGALLTI-GNRLFVPRWGSLRKDILKECHDSLWAGHPDKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSP
         E+NG LLTI  ++ + PR     +  +K C         DKGEQQLPA LLEPLPIAEKPWDSV MDFIVALPKS GFG+IMVVVDRFSKYATFIPC P
Subjt:  WESNGALLTI-GNRLFVPRWGSLRKDILKECHDSLWAGHPDKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSP

Query:  DVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQR
        DVKVDEAARLFFKN+VK WGVPKSIISDRD RFTGKFWRELFKL G +LNFSTSFH QSDGQTER+N LLE YLRHYVSAHQK+WA L+D+AQFSYNLQR
Subjt:  DVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQR

Query:  SEATGKSPFEIVMNQQPNTPSALIIPYEGPNPTAF
        SE+TGKSPFEI+MNQQPNTP ALI PYEGPNP+AF
Subjt:  SEATGKSPFEIVMNQQPNTPSALIIPYEGPNPTAF

KAA0054276.1 polyprotein [Cucumis melo var. makuwa]1.1e-18238.4Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE+ESR                           DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K ++  +K+ GK K FQ ++GK     +GE S  P PCF+C GPH TRDCPNRKA+NALVAK  E KQ +D   PQIGS+Q IG            
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
                                     EAKR+GLK+  E GTVK VN++ Q I GVA+GV VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ----------------------------------------PQTVPSR---------EKNS
        S+  TL+I+DG +TTIP+++G+PV MLSA+  K GV + Q                                        P+ +P R         E  +
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ----------------------------------------PQTVPSR---------EKNS

Query:  KPTVPAEHK-------------------------------------------------------------------------------------------
        KP   A ++                                                                                           
Subjt:  KPTVPAEHK-------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------DVMLEVKS
                                                                                                    D  LE K 
Subjt:  --------------------------------------------------------------------------------------------DVMLEVKS

Query:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----
        G A+ V + +S+KA+L  IT SMP S+FL++IKEGM++D  AKNLLKLAKEGKTRRFWE++G LLT GNRLFVPRWG+LRKD+L+ECHDSLWAGHP    
Subjt:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----

Query:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL
                                      DKGEQQLPAGLLEPLPIAEKPWDS+ MDFIVALPKS GFG+IMVVVDRFSKYATFIPCSPDVKVDEAARL
Subjt:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL

Query:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE
        FFKNVVKLWG+PKSIISDRDPRFTGKFWRELFKL+GT+LNFSTSFHPQSDGQTER+NALLE YLRHYVSAHQK+W  L+D+AQFSYNLQRSEATGKSPFE
Subjt:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE

Query:  IVMNQQPNTPSALIIPYEGPNPTAF
        ++MNQQPNTP ALI PYEGPNP+AF
Subjt:  IVMNQQPNTPSALIIPYEGPNPTAF

KAA0065760.1 polyprotein [Cucumis melo var. makuwa]2.0e-21942.4Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE+ESR KLRRLRHTG+IL+YVKEFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K ++  +++ GK K FQ ++GK     +GE S  P PCF+C GPH TRDCPN+KA+NALVAK  E KQ +D   PQIGS+Q IG +K+       G
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
        LL+G+++I G+EA AMFDTGASHNFM+V EAKR+GLK+  E GTVK VN++ Q I GVA+GV VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ-----PQTVPSRE-KNSKPTVP-------AEHKDVM------------------------
        S+  TL+I+DG +TTIP+++G+P+ MLSA+  K GV + Q      +T+   E K  +P VP        E+KD+M                        
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ-----PQTVPSRE-KNSKPTVP-------AEHKDVM------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------LEVKS
                                                                                                       LE K 
Subjt:  -----------------------------------------------------------------------------------------------LEVKS

Query:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----
        G A+ V + +S+KA+L TIT SMP S+FL++IKEGM++D  AKNLLKLAKEGKTRRFWE++G LLT GNRLFV RWG+LRKD+L+ECHDSLWAGHP    
Subjt:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----

Query:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL
                                      DKGEQQLPAGLLEPLPIAEKPWDSV MDFIVALPKS GFG+IMVVVDRFSKYATFIPCSPDVKVDEAARL
Subjt:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL

Query:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE
        FFKNVVKLWG+PKSIISDRDPRFTGKFWRELFKL+GT+LNFSTSFHPQSDGQTER+NALLE YLRHYVSAHQK+WATL+D+AQFSYNLQRSE+TGKSPFE
Subjt:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE

Query:  IVMNQQPNTPSALIIPYEGPNPTAF
        I+MNQQPNTP  LI PYEGPNP+AF
Subjt:  IVMNQQPNTPSALIIPYEGPNPTAF

XP_008460615.1 PREDICTED: uncharacterized protein LOC103499392 [Cucumis melo]1.1e-18238.4Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE+ESR                           DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K ++  +K+ GK K FQ ++GK     +GE S  P PCF+C GPH TRDCPNRKA+NALVAK  E KQ +D   PQIGS+Q IG            
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
                                     EAKR+GLK+  E GTVK VN++ Q I GVA+GV VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ----------------------------------------PQTVPSR---------EKNS
        S+  TL+I+DG +TTIP+++G+PV MLSA+  K GV + Q                                        P+ +P R         E  +
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ----------------------------------------PQTVPSR---------EKNS

Query:  KPTVPAEHK-------------------------------------------------------------------------------------------
        KP   A ++                                                                                           
Subjt:  KPTVPAEHK-------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------DVMLEVKS
                                                                                                    D  LE K 
Subjt:  --------------------------------------------------------------------------------------------DVMLEVKS

Query:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----
        G A+ V + +S+KA+L  IT SMP S+FL++IKEGM++D  AKNLLKLAKEGKTRRFWE++G LLT GNRLFVPRWG+LRKD+L+ECHDSLWAGHP    
Subjt:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----

Query:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL
                                      DKGEQQLPAGLLEPLPIAEKPWDS+ MDFIVALPKS GFG+IMVVVDRFSKYATFIPCSPDVKVDEAARL
Subjt:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL

Query:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE
        FFKNVVKLWG+PKSIISDRDPRFTGKFWRELFKL+GT+LNFSTSFHPQSDGQTER+NALLE YLRHYVSAHQK+W  L+D+AQFSYNLQRSEATGKSPFE
Subjt:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE

Query:  IVMNQQPNTPSALIIPYEGPNPTAF
        ++MNQQPNTP ALI PYEGPNP+AF
Subjt:  IVMNQQPNTPSALIIPYEGPNPTAF

TrEMBL top hitse value%identityAlignment
A0A1S3CE17 uncharacterized protein LOC1034993925.6e-18338.4Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE+ESR                           DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K ++  +K+ GK K FQ ++GK     +GE S  P PCF+C GPH TRDCPNRKA+NALVAK  E KQ +D   PQIGS+Q IG            
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
                                     EAKR+GLK+  E GTVK VN++ Q I GVA+GV VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ----------------------------------------PQTVPSR---------EKNS
        S+  TL+I+DG +TTIP+++G+PV MLSA+  K GV + Q                                        P+ +P R         E  +
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ----------------------------------------PQTVPSR---------EKNS

Query:  KPTVPAEHK-------------------------------------------------------------------------------------------
        KP   A ++                                                                                           
Subjt:  KPTVPAEHK-------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------DVMLEVKS
                                                                                                    D  LE K 
Subjt:  --------------------------------------------------------------------------------------------DVMLEVKS

Query:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----
        G A+ V + +S+KA+L  IT SMP S+FL++IKEGM++D  AKNLLKLAKEGKTRRFWE++G LLT GNRLFVPRWG+LRKD+L+ECHDSLWAGHP    
Subjt:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----

Query:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL
                                      DKGEQQLPAGLLEPLPIAEKPWDS+ MDFIVALPKS GFG+IMVVVDRFSKYATFIPCSPDVKVDEAARL
Subjt:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL

Query:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE
        FFKNVVKLWG+PKSIISDRDPRFTGKFWRELFKL+GT+LNFSTSFHPQSDGQTER+NALLE YLRHYVSAHQK+W  L+D+AQFSYNLQRSEATGKSPFE
Subjt:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE

Query:  IVMNQQPNTPSALIIPYEGPNPTAF
        ++MNQQPNTP ALI PYEGPNP+AF
Subjt:  IVMNQQPNTPSALIIPYEGPNPTAF

A0A5A7U4C8 Uncharacterized protein8.4e-26465.56Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE++SR KLRRLRHT +ILDYVKEFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAA E L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K ++  +K+ GK K FQ ++GK     +GE S  P PCF+C GPH TRDCPNRKA+NALVAK  E KQ +D   PQIGS+Q IG +K+       G
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
        LL+G+++I G+EA  MFDTGASHNFM+V EAKR+GLKF  E G VK VN++  AI GVA+GV VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ-----PQTVPSRE-KNSKPTVP-------AEHKDVM-----------------LEVKSGN
        S+  TL+I+DG +TTIP+++G+PV MLSA+  K GV + Q      +T+ + E K  +P VP        E+KD+                   E++   
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ-----PQTVPSRE-KNSKPTVP-------AEHKDVM-----------------LEVKSGN

Query:  ADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------
        A+ V + +S+KA+L  ITTSMP SDFL++IKEGM++D  AKNLLKL KEGKTRRFWE++G LLTIGNRLFVPRWG+LRKD+L+ECHDSLWAGHP      
Subjt:  ADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------

Query:  ----------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFF
                                    DKGEQQLPAGLLEPLPIAEKPWDSV MDFIVALPKS GFG+IMVVVDRFSKYATFIPC PDVKVDEAARLFF
Subjt:  ----------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFF

Query:  KNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIV
        KNVVKLWG+PKSIISDRDPRFTG+FWRELFKL+GT+LNFSTSFHPQSDGQTER+NALLE YLRHYVSAHQK+WATL+D+AQFSYNLQRSE+TGKSPFEI+
Subjt:  KNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIV

Query:  MNQQPNTPSALIIPYEGPNPTAF
        MNQQPNTP ALI PYEGPNP+AF
Subjt:  MNQQPNTPSALIIPYEGPNPTAF

A0A5A7UJ25 Polyprotein5.6e-18338.4Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE+ESR                           DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K ++  +K+ GK K FQ ++GK     +GE S  P PCF+C GPH TRDCPNRKA+NALVAK  E KQ +D   PQIGS+Q IG            
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
                                     EAKR+GLK+  E GTVK VN++ Q I GVA+GV VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ----------------------------------------PQTVPSR---------EKNS
        S+  TL+I+DG +TTIP+++G+PV MLSA+  K GV + Q                                        P+ +P R         E  +
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ----------------------------------------PQTVPSR---------EKNS

Query:  KPTVPAEHK-------------------------------------------------------------------------------------------
        KP   A ++                                                                                           
Subjt:  KPTVPAEHK-------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------DVMLEVKS
                                                                                                    D  LE K 
Subjt:  --------------------------------------------------------------------------------------------DVMLEVKS

Query:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----
        G A+ V + +S+KA+L  IT SMP S+FL++IKEGM++D  AKNLLKLAKEGKTRRFWE++G LLT GNRLFVPRWG+LRKD+L+ECHDSLWAGHP    
Subjt:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----

Query:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL
                                      DKGEQQLPAGLLEPLPIAEKPWDS+ MDFIVALPKS GFG+IMVVVDRFSKYATFIPCSPDVKVDEAARL
Subjt:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL

Query:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE
        FFKNVVKLWG+PKSIISDRDPRFTGKFWRELFKL+GT+LNFSTSFHPQSDGQTER+NALLE YLRHYVSAHQK+W  L+D+AQFSYNLQRSEATGKSPFE
Subjt:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE

Query:  IVMNQQPNTPSALIIPYEGPNPTAF
        ++MNQQPNTP ALI PYEGPNP+AF
Subjt:  IVMNQQPNTPSALIIPYEGPNPTAF

A0A5A7VEX8 Polyprotein9.7e-22042.4Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE+ESR KLRRLRHTG+IL+YVKEFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K ++  +++ GK K FQ ++GK     +GE S  P PCF+C GPH TRDCPN+KA+NALVAK  E KQ +D   PQIGS+Q IG +K+       G
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSK-PSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
        LL+G+++I G+EA AMFDTGASHNFM+V EAKR+GLK+  E GTVK VN++ Q I GVA+GV VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ-----PQTVPSRE-KNSKPTVP-------AEHKDVM------------------------
        S+  TL+I+DG +TTIP+++G+P+ MLSA+  K GV + Q      +T+   E K  +P VP        E+KD+M                        
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAETQ-----PQTVPSRE-KNSKPTVP-------AEHKDVM------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------LEVKS
                                                                                                       LE K 
Subjt:  -----------------------------------------------------------------------------------------------LEVKS

Query:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----
        G A+ V + +S+KA+L TIT SMP S+FL++IKEGM++D  AKNLLKLAKEGKTRRFWE++G LLT GNRLFV RWG+LRKD+L+ECHDSLWAGHP    
Subjt:  GNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP----

Query:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL
                                      DKGEQQLPAGLLEPLPIAEKPWDSV MDFIVALPKS GFG+IMVVVDRFSKYATFIPCSPDVKVDEAARL
Subjt:  ------------------------------DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARL

Query:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE
        FFKNVVKLWG+PKSIISDRDPRFTGKFWRELFKL+GT+LNFSTSFHPQSDGQTER+NALLE YLRHYVSAHQK+WATL+D+AQFSYNLQRSE+TGKSPFE
Subjt:  FFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFE

Query:  IVMNQQPNTPSALIIPYEGPNPTAF
        I+MNQQPNTP  LI PYEGPNP+AF
Subjt:  IVMNQQPNTPSALIIPYEGPNPTAF

A0A5D3BWR0 Reverse transcriptase8.8e-20543.77Show/hide
Query:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE
        +KHFVPHNAE ESR KLRRLRHTG+ILDYVKEFTT+ML+I DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIA AE L+DY+ QSK +KP P K  
Subjt:  KKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKERKPSPSKRE

Query:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSKPS-PCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG
        G   K +    K+ GK K FQ R+GK     +GE S PS PCF+C GPH TRDCPNRKA+NALVAK  E KQ +D   PQIGS+Q IG +K+   T    
Subjt:  GNHYKHRDSEQKEKGK-KAFQ-RHGKYHKTQQGELSKPS-PCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDT--PQIGSLQHIGALKQANQTTKGG

Query:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN
        LL+G+V+I G+EA AMFDTGASHNFM+V EAKR+GLKF  E GTVK VN++ Q I G+A+GV VKIGDWQ +LDF++LPMDDF IVLGLGFF+KV+ +++
Subjt:  LLHGNVKINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIIN

Query:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAE------TQPQTVPSREKNSKPTVP-------AEHKDVM------------------------
        S+  TL+I+DG +TTI +++G+PV MLSA+  K GV++      T         K+ +P VP        E+KD+M                        
Subjt:  SSGSTLTIVDGQITTIPLKKGRPVTMLSAILSKEGVAE------TQPQTVPSREKNSKPTVP-------AEHKDVM------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------LEVKSGNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRF
                                              LE K G A+ V + +S KA+L+TITTSMP SDFL++IKEGM++D  AKNLLKLAKEGKTRRF
Subjt:  --------------------------------------LEVKSGNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRF

Query:  WESNGALLTI-GNRLFVPRWGSLRKDILKECHDSLWAGHPDKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSP
         E+NG LLTI  ++ + PR     +  +K C         DKGEQQLPA LLEPLPIAEKPWDSV MDFIVALPKS GFG+IMVVVDRFSKYATFIPC P
Subjt:  WESNGALLTI-GNRLFVPRWGSLRKDILKECHDSLWAGHPDKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSP

Query:  DVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQR
        DVKVDEAARLFFKN+VK WGVPKSIISDRD RFTGKFWRELFKL G +LNFSTSFH QSDGQTER+N LLE YLRHYVSAHQK+WA L+D+AQFSYNLQR
Subjt:  DVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQR

Query:  SEATGKSPFEIVMNQQPNTPSALIIPYEGPNPTAF
        SE+TGKSPFEI+MNQQPNTP ALI PYEGPNP+AF
Subjt:  SEATGKSPFEIVMNQQPNTPSALIIPYEGPNPTAF

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein7.3e-3932.03Show/hide
Query:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------
        DF +++     ND    NLL    +         +G L+   +++ +P    L + I+K+ H+     HP                              
Subjt:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------

Query:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK
            +K     P G L+P+P +E+PW+S+ MDFI ALP+S G+ ++ VVVDRFSK A  +PC+  +  ++ AR+F + V+  +G PK II+D D  FT +
Subjt:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK

Query:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP
         W++        + FS  + PQ+DGQTER N  +E  LR   S H   W   I + Q SYN     AT  +PFEIV    P
Subjt:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP

P0CT35 Transposon Tf2-2 polyprotein7.3e-3932.03Show/hide
Query:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------
        DF +++     ND    NLL    +         +G L+   +++ +P    L + I+K+ H+     HP                              
Subjt:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------

Query:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK
            +K     P G L+P+P +E+PW+S+ MDFI ALP+S G+ ++ VVVDRFSK A  +PC+  +  ++ AR+F + V+  +G PK II+D D  FT +
Subjt:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK

Query:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP
         W++        + FS  + PQ+DGQTER N  +E  LR   S H   W   I + Q SYN     AT  +PFEIV    P
Subjt:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP

P0CT36 Transposon Tf2-3 polyprotein7.3e-3932.03Show/hide
Query:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------
        DF +++     ND    NLL    +         +G L+   +++ +P    L + I+K+ H+     HP                              
Subjt:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------

Query:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK
            +K     P G L+P+P +E+PW+S+ MDFI ALP+S G+ ++ VVVDRFSK A  +PC+  +  ++ AR+F + V+  +G PK II+D D  FT +
Subjt:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK

Query:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP
         W++        + FS  + PQ+DGQTER N  +E  LR   S H   W   I + Q SYN     AT  +PFEIV    P
Subjt:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP

P0CT41 Transposon Tf2-12 polyprotein7.3e-3932.03Show/hide
Query:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------
        DF +++     ND    NLL    +         +G L+   +++ +P    L + I+K+ H+     HP                              
Subjt:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------

Query:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK
            +K     P G L+P+P +E+PW+S+ MDFI ALP+S G+ ++ VVVDRFSK A  +PC+  +  ++ AR+F + V+  +G PK II+D D  FT +
Subjt:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK

Query:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP
         W++        + FS  + PQ+DGQTER N  +E  LR   S H   W   I + Q SYN     AT  +PFEIV    P
Subjt:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP

Q9UR07 Transposon Tf2-11 polyprotein7.3e-3932.03Show/hide
Query:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------
        DF +++     ND    NLL    +         +G L+   +++ +P    L + I+K+ H+     HP                              
Subjt:  DFLDKIKEGMKNDATAKNLLKLAKEGKTRRFWESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHP------------------------------

Query:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK
            +K     P G L+P+P +E+PW+S+ MDFI ALP+S G+ ++ VVVDRFSK A  +PC+  +  ++ AR+F + V+  +G PK II+D D  FT +
Subjt:  ----DKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFFKNVVKLWGVPKSIISDRDPRFTGK

Query:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP
         W++        + FS  + PQ+DGQTER N  +E  LR   S H   W   I + Q SYN     AT  +PFEIV    P
Subjt:  FWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCACAACTTTGGTGGCGTCGAAGATTTACCATCCAAAATGGGAATGCCATCCAAAAAGCACTTTGTTCCCCACAATGCCGAAATGGAGTCAAGAGCCAAACTTCG
AAGGCTACGACACACAGGTAACATTCTCGACTATGTGAAGGAATTTACCACTATTATGCTTGAAATTAGTGATCTGCCCGAGAAGGAAGCCCTATTCCAATTTAGAGATG
GATTGAAAGATTGGGCCAAGGTTGAGCTAGACCGTCGCAATGTCCAAACATTGGACGATGCTATAGCCGCAGCTGAGATGCTTATCGATTATACAACCCAATCAAAGGAG
AGAAAACCTAGCCCAAGTAAACGTGAAGGCAATCACTACAAGCATAGAGACTCCGAGCAAAAGGAAAAGGGAAAGAAGGCATTCCAAAGACATGGCAAGTATCACAAAAC
TCAACAAGGTGAGTTGTCCAAACCTTCTCCCTGTTTTGTTTGTAATGGCCCTCATCGGACAAGAGACTGTCCGAATAGAAAAGCAATGAATGCACTTGTTGCCAAATTGC
TCGAGTCCAAGCAAGGAGATGATACACCACAGATTGGTTCCTTGCAACACATTGGAGCTCTGAAACAGGCCAATCAAACAACTAAAGGAGGCCTACTCCATGGAAATGTA
AAGATAAACGGGAGAGAGGCTATTGCCATGTTTGACACCGGTGCGAGTCATAACTTCATGAACGTGAATGAGGCTAAGAGAATAGGCCTCAAGTTCACAAACGAAGAGGG
GACTGTCAAGGCGGTAAACTCGGAAGCACAAGCCATCAAAGGTGTAGCAAGAGGTGTAACAGTCAAGATCGGTGATTGGCAAGGTAAGCTCGACTTCACCGTGCTACCTA
TGGATGACTTTGATATTGTGCTTGGCTTAGGGTTCTTCAATAAGGTAATTGCCATCATTAATTCTTCCGGAAGCACCCTAACTATAGTGGATGGTCAAATAACAACCATC
CCATTGAAGAAGGGAAGGCCAGTGACCATGCTCTCAGCCATACTGAGTAAGGAAGGAGTCGCTGAAACCCAACCTCAAACGGTACCGTCGAGAGAGAAAAATTCTAAACC
AACAGTCCCAGCAGAACACAAGGACGTTATGCTCGAGGTAAAATCAGGAAACGCCGATGCAGTCAATAACGTCGTAAGCCAGAAGGCAAAACTGGCTACTATAACCACCA
GCATGCCCAGGAGTGACTTCCTTGACAAAATCAAGGAAGGGATGAAGAATGATGCAACAGCCAAGAACTTATTAAAGTTGGCCAAGGAAGGCAAAACCAGACGTTTTTGG
GAGAGCAATGGTGCCCTCCTTACAATTGGAAATCGGTTGTTCGTGCCACGATGGGGATCCTTACGGAAGGATATTTTAAAGGAGTGTCACGATTCTTTATGGGCCGGACA
CCCAGACAAGGGGGAACAGCAACTCCCAGCAGGGTTACTTGAACCCTTGCCTATAGCAGAAAAACCTTGGGATAGTGTTATAATGGACTTCATTGTAGCCTTACCTAAGT
CCCAAGGGTTTGGAAGCATCATGGTAGTGGTGGATAGATTCAGCAAGTATGCAACATTTATTCCATGCTCGCCAGATGTGAAAGTCGATGAAGCTGCTAGGCTGTTCTTC
AAAAACGTTGTGAAGTTATGGGGAGTACCTAAAAGCATCATCAGCGATCGAGACCCCCGCTTCACAGGCAAGTTTTGGCGAGAACTCTTTAAACTTGTGGGGACAGAACT
AAACTTCTCTACTAGTTTCCATCCCCAGAGCGACGGCCAAACGGAACGAGTAAATGCCTTACTGGAGCACTACCTAAGGCATTATGTCAGTGCACACCAAAAGAATTGGG
CAACTCTCATCGATATAGCCCAATTTTCTTATAACCTTCAAAGGAGTGAGGCTACCGGAAAGAGCCCCTTTGAAATAGTCATGAATCAACAACCCAACACTCCTAGTGCC
CTGATTATTCCATATGAAGGACCCAATCCAACGGCATTTGCGAAGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCACAACTTTGGTGGCGTCGAAGATTTACCATCCAAAATGGGAATGCCATCCAAAAAGCACTTTGTTCCCCACAATGCCGAAATGGAGTCAAGAGCCAAACTTCG
AAGGCTACGACACACAGGTAACATTCTCGACTATGTGAAGGAATTTACCACTATTATGCTTGAAATTAGTGATCTGCCCGAGAAGGAAGCCCTATTCCAATTTAGAGATG
GATTGAAAGATTGGGCCAAGGTTGAGCTAGACCGTCGCAATGTCCAAACATTGGACGATGCTATAGCCGCAGCTGAGATGCTTATCGATTATACAACCCAATCAAAGGAG
AGAAAACCTAGCCCAAGTAAACGTGAAGGCAATCACTACAAGCATAGAGACTCCGAGCAAAAGGAAAAGGGAAAGAAGGCATTCCAAAGACATGGCAAGTATCACAAAAC
TCAACAAGGTGAGTTGTCCAAACCTTCTCCCTGTTTTGTTTGTAATGGCCCTCATCGGACAAGAGACTGTCCGAATAGAAAAGCAATGAATGCACTTGTTGCCAAATTGC
TCGAGTCCAAGCAAGGAGATGATACACCACAGATTGGTTCCTTGCAACACATTGGAGCTCTGAAACAGGCCAATCAAACAACTAAAGGAGGCCTACTCCATGGAAATGTA
AAGATAAACGGGAGAGAGGCTATTGCCATGTTTGACACCGGTGCGAGTCATAACTTCATGAACGTGAATGAGGCTAAGAGAATAGGCCTCAAGTTCACAAACGAAGAGGG
GACTGTCAAGGCGGTAAACTCGGAAGCACAAGCCATCAAAGGTGTAGCAAGAGGTGTAACAGTCAAGATCGGTGATTGGCAAGGTAAGCTCGACTTCACCGTGCTACCTA
TGGATGACTTTGATATTGTGCTTGGCTTAGGGTTCTTCAATAAGGTAATTGCCATCATTAATTCTTCCGGAAGCACCCTAACTATAGTGGATGGTCAAATAACAACCATC
CCATTGAAGAAGGGAAGGCCAGTGACCATGCTCTCAGCCATACTGAGTAAGGAAGGAGTCGCTGAAACCCAACCTCAAACGGTACCGTCGAGAGAGAAAAATTCTAAACC
AACAGTCCCAGCAGAACACAAGGACGTTATGCTCGAGGTAAAATCAGGAAACGCCGATGCAGTCAATAACGTCGTAAGCCAGAAGGCAAAACTGGCTACTATAACCACCA
GCATGCCCAGGAGTGACTTCCTTGACAAAATCAAGGAAGGGATGAAGAATGATGCAACAGCCAAGAACTTATTAAAGTTGGCCAAGGAAGGCAAAACCAGACGTTTTTGG
GAGAGCAATGGTGCCCTCCTTACAATTGGAAATCGGTTGTTCGTGCCACGATGGGGATCCTTACGGAAGGATATTTTAAAGGAGTGTCACGATTCTTTATGGGCCGGACA
CCCAGACAAGGGGGAACAGCAACTCCCAGCAGGGTTACTTGAACCCTTGCCTATAGCAGAAAAACCTTGGGATAGTGTTATAATGGACTTCATTGTAGCCTTACCTAAGT
CCCAAGGGTTTGGAAGCATCATGGTAGTGGTGGATAGATTCAGCAAGTATGCAACATTTATTCCATGCTCGCCAGATGTGAAAGTCGATGAAGCTGCTAGGCTGTTCTTC
AAAAACGTTGTGAAGTTATGGGGAGTACCTAAAAGCATCATCAGCGATCGAGACCCCCGCTTCACAGGCAAGTTTTGGCGAGAACTCTTTAAACTTGTGGGGACAGAACT
AAACTTCTCTACTAGTTTCCATCCCCAGAGCGACGGCCAAACGGAACGAGTAAATGCCTTACTGGAGCACTACCTAAGGCATTATGTCAGTGCACACCAAAAGAATTGGG
CAACTCTCATCGATATAGCCCAATTTTCTTATAACCTTCAAAGGAGTGAGGCTACCGGAAAGAGCCCCTTTGAAATAGTCATGAATCAACAACCCAACACTCCTAGTGCC
CTGATTATTCCATATGAAGGACCCAATCCAACGGCATTTGCGAAGGAATGA
Protein sequenceShow/hide protein sequence
MPHNFGGVEDLPSKMGMPSKKHFVPHNAEMESRAKLRRLRHTGNILDYVKEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEMLIDYTTQSKE
RKPSPSKREGNHYKHRDSEQKEKGKKAFQRHGKYHKTQQGELSKPSPCFVCNGPHRTRDCPNRKAMNALVAKLLESKQGDDTPQIGSLQHIGALKQANQTTKGGLLHGNV
KINGREAIAMFDTGASHNFMNVNEAKRIGLKFTNEEGTVKAVNSEAQAIKGVARGVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTI
PLKKGRPVTMLSAILSKEGVAETQPQTVPSREKNSKPTVPAEHKDVMLEVKSGNADAVNNVVSQKAKLATITTSMPRSDFLDKIKEGMKNDATAKNLLKLAKEGKTRRFW
ESNGALLTIGNRLFVPRWGSLRKDILKECHDSLWAGHPDKGEQQLPAGLLEPLPIAEKPWDSVIMDFIVALPKSQGFGSIMVVVDRFSKYATFIPCSPDVKVDEAARLFF
KNVVKLWGVPKSIISDRDPRFTGKFWRELFKLVGTELNFSTSFHPQSDGQTERVNALLEHYLRHYVSAHQKNWATLIDIAQFSYNLQRSEATGKSPFEIVMNQQPNTPSA
LIIPYEGPNPTAFAKE