; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036513 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036513
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr3:47700801..47713815
RNA-Seq ExpressionLag0036513
SyntenyLag0036513
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005174 - Domain unknown function DUF295
IPR012337 - Ribonuclease H-like superfamily
IPR025314 - Domain of unknown function DUF4219
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8673149.1 hypothetical protein F3Y22_tig00111810pilonHSYRG00151 [Hibiscus syriacus]8.8e-22949.83Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P  EDA                                   L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+H L+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG----------------------------EPDSKSTSNAMKKED---------
        AVQGW  QPSL++ EN L GQEA+ KQM  V+LK  +EEAL++ + +G                            +P   +T+ +  KE+         
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG----------------------------EPDSKSTSNAMKKED---------

Query:  ----------LTLHA-EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVS
                  LT+   E + Y+NDWIVDSG SNHMTGDK+KLQN  EY G RVVVTA+NS+LPITH+GKT++ PR N+ QV+L++V++VPG+KK L+SV+
Subjt:  ----------LTLHA-EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVS

Query:  QLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQY
        QLTSSG++V+FG  DVKVY+D+K+S  P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH +LGHVSY+KL +M+ KSMLKGLPQLD+R D VC GCQY
Subjt:  QLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQY

Query:  GKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE-------------------------
        GKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+KFKEF++  E                         
Subjt:  GKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE-------------------------

Query:  --------------------------------------------------------------------------------------------------DH
                                                                                                          DH
Subjt:  --------------------------------------------------------------------------------------------------DH

Query:  LRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQ
        LRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E  P D R F + L+++M +   V ++   D  E+ NG++ EQ  TQ
Subjt:  LRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQ

Query:  SPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
        +PW++GV+ Q               + QLRRSTR R+PNPKY NA  AI+E   EP  +EE SK+ QW   ++EE+ AL++ + WD+
Subjt:  SPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL

KAE8684576.1 hypothetical protein F3Y22_tig00111127pilonHSYRG00074 [Hibiscus syriacus]1.2e-23049.08Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P  EDA                                   L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
        AVQGW  QPSL++ ENLL GQEA+ KQM  V+LK  +EEAL++                         Q KG P S   S                    
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------

Query:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
                                            A ++E+L L     E + Y+NDWIVDSGCSNHMTGDK+KLQN  EY G RVVVTA+NS+LPITH
Subjt:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH

Query:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
        +GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++  P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH

Query:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
        VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK

Query:  FKEFKEQVE-------------------------------------------------------------------------------------------
        FKEF++  E                                                                                           
Subjt:  FKEFKEQVE-------------------------------------------------------------------------------------------

Query:  --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG
                                        DHLRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E  P D R F + 
Subjt:  --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG

Query:  LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI
        L+++M +   V ++   D  E+ NG++ EQ  TQ+PWQ+GV+ Q               + QLRRSTR R+PNPKY NA  AI+E   EP  +EE SK+ 
Subjt:  LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI

Query:  QWRKVVEEEVGALRRKRRWDL
        +W   ++EE+ AL++ + WD+
Subjt:  QWRKVVEEEVGALRRKRRWDL

KAE8705435.1 hypothetical protein F3Y22_tig00110429pilonHSYRG01243 [Hibiscus syriacus]8.0e-23048.97Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P  EDA                                   L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
        AVQGW  QPSL++ ENLL GQEA+ KQM  V+LK  +EEAL++                         Q KG P S   S                    
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------

Query:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
                                            A ++E+L L     E + Y+NDWIVDSGCSNHMTGDK+KLQN  EY G RVVVTA+NS+LPITH
Subjt:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH

Query:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
        +GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++  P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH

Query:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
        VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK

Query:  FKEFKEQVE-------------------------------------------------------------------------------------------
        FKEF++  E                                                                                           
Subjt:  FKEFKEQVE-------------------------------------------------------------------------------------------

Query:  --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG
                                        DHLRSKFDKKA+ CIFVGYD+ RKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E  P D R F + 
Subjt:  --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG

Query:  LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI
        L+++M +   V ++   D  E+ NG++ EQ  TQ+PWQ+GV+ Q               + QLRRSTR R+PNPKY NA  AI+E   EP  +EE SK+ 
Subjt:  LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI

Query:  QWRKVVEEEVGALRRKRRWDL
        +W   ++EE+ AL++ + WD+
Subjt:  QWRKVVEEEVGALRRKRRWDL

KAE8715296.1 hypothetical protein F3Y22_tig00110183pilonHSYRG00102 [Hibiscus syriacus]6.1e-23048.97Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P  EDA                                   L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
        AVQGW  QPSL++ ENLL GQEA+ KQM  V+LK  +EEAL++                         Q KG P S   S                    
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------

Query:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
                                            A ++E+L L     E + Y+NDWIVDSGCSNHMTGDK+KLQN  EY G RVVVTA+NS+LPITH
Subjt:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH

Query:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
        +GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++  P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH

Query:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
        VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK

Query:  FKEFKEQVE-------------------------------------------------------------------------------------------
        FKEF++  E                                                                                           
Subjt:  FKEFKEQVE-------------------------------------------------------------------------------------------

Query:  --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG
                                        DHLRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E  P D R F + 
Subjt:  --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG

Query:  LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI
        L+ ++ +   V ++   D  E+ NG++ EQ  TQ+PWQ+GV+ Q               + QLRRSTR R+PNPKY NA  AI+E   EP  +EE SK+ 
Subjt:  LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI

Query:  QWRKVVEEEVGALRRKRRWDL
        +W   ++EE+ AL++ + WD+
Subjt:  QWRKVVEEEVGALRRKRRWDL

KAE8733549.1 hypothetical protein F3Y22_tig00001120pilonHSYRG00173 [Hibiscus syriacus]1.2e-23350.62Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P  EDA                                   L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
        AVQGW  QPSL++ ENLL GQEA+ KQM  V+LK  +EEAL++                         Q KG P S   S                    
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------

Query:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
                                            A ++E+L L     E + Y+NDWIVDSGCSNHMTGDK+KLQN  EY G RVVVTA+NS+LPITH
Subjt:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH

Query:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
        +GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++  P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH

Query:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
        VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK

Query:  FKEFKEQVE-------------------------------------------------------------------------------------------
        FKEF++  E                                                                                           
Subjt:  FKEFKEQVE-------------------------------------------------------------------------------------------

Query:  ----DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-
            DHLRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E  P D R F + L+ +M +   V ++   D  E+ NG++ 
Subjt:  ----DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-

Query:  EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
        EQ  TQ+PWQ+GV+ Q               + QLRRSTR R+PNPKY NA  AI+E   EP  +EE SK+ +W   ++EE+ AL++ + WD+
Subjt:  EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL

TrEMBL top hitse value%identityAlignment
A0A2N9EQ78 Uncharacterized protein8.9e-23551.03Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TC+ESYLQGQDLWEVVGG+EVT P  EDA                                +  L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSK+ND RLQ LENELLS+AQR+MTI QYF+KVK + REIS+LDPTA I +SR++RIIIHGL+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG----------------------------------------------------
        A+QGW  QPSL++ ENLL  QEA+ KQM  V+LK  +EEAL++ + +G                                                    
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG----------------------------------------------------

Query:  -------------EPDSKSTSN--------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
                     E ++ ++S+              AM++E+L L     E++ Y+NDWI+DSGCSNHMTGDK KLQN  EYKG RVVVTA+NS+LPI H
Subjt:  -------------EPDSKSTSN--------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH

Query:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
        +GKT++ PR NS QV L++V++VPGMKKNL+SV+QLT SG++V+FGP DVKVY+DLK+S  P+M+G+R++S+YVMSAE+AYV++TRKNETTDLWH RLGH
Subjt:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH

Query:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
        VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP+EESKFKAK+PLELVHSDVFGPVKQPSI GMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK

Query:  FKEFKEQVE----------------------------------------------------------------------------DHLRSKFDKKAINCI
        FKEF+E  E                                                                            +HLRSKFDKKA+ CI
Subjt:  FKEFKEQVE----------------------------------------------------------------------------DHLRSKFDKKAINCI

Query:  FVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---
        FVGYD+QRKGW+C DP +GRCYTSR+V+FDEA+SWW+ + E   +D R F + L+++M +   V ++   D   + NG++ EQ   Q+PWQ+GV+ Q   
Subjt:  FVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---

Query:  ------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
                    + QLRRSTR R+PNPKY N  +   A + EP  +EE S++ +W K +EEE+ AL++ + WDL
Subjt:  ------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL

A0A2N9HNS8 Integrase catalytic domain-containing protein2.0e-23450.79Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TC+ESYLQGQDLWEVVGG+EVT P  EDA                                +  L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSK+ND RLQ LENELLS+AQR+MTI QYF+KVK + REIS+LDPTA I +SR++RIIIHGL+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG---------------------------------EPDSKSTSN---------A
        A+QGW  QPSL++ ENLL  QEA+ KQM  V+LK  +EEAL++ + +G                                 E  SK  S          A
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG---------------------------------EPDSKSTSN---------A

Query:  MKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLT
        M++++L L     E++ Y+NDWIVDSGCSNHMTGDK KLQN  EYKG RVVVTA+NS+LPI H+GKT++ PR NS QV L++V++VPGMKKNL+SV+QLT
Subjt:  MKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLT

Query:  SSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKA
         SG++V+FGP DVKVY+DLK+S  P+M+G+R++S+YVMSAE+AYV++TRKNETTDLWH RLGHVSY+KL IM+ KSMLKGLPQLD+R D VCAGCQYGKA
Subjt:  SSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKA

Query:  HQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE----------------------------
        HQLP++ESKFKAK+PLELVHSDVFGPVKQPSI GMRYMVTFIDD SRYVWVFFMKEKS+TF+KFKEF+E  E                            
Subjt:  HQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE----------------------------

Query:  -----------------------------------------------------------------------------------------------DHLRS
                                                                                                       +HLRS
Subjt:  -----------------------------------------------------------------------------------------------DHLRS

Query:  KFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPW
        KFDKKA+ CIFVGYD+QRKGW+C DP +GRCYTSR+V+FDEA+SWW+ + E   +D R F + L+++M +   V ++   D   ++NG++ EQ   Q+PW
Subjt:  KFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPW

Query:  QSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
        Q+GV+ Q               + QLRRSTR R+PNPKY NA +   A + EP  +EE S++ +W K +EEE+ AL++ + WDL
Subjt:  QSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL

A0A2N9HY47 Integrase catalytic domain-containing protein1.7e-23350.57Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TC+ESYLQGQDLWEVVGG+EVT P  EDA                                +  L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSK+ND RLQ LENELLS+AQR+MTI QYF+KVK + REIS+LDPTA I +SR++RIIIHGL+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKGE------PDSKSTSN------------------------------------A
        A+QGW  QPSL++ ENLL  QEA+ KQM  V+LK  +EEAL++ + +G        +SK   +                                    A
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKGE------PDSKSTSN------------------------------------A

Query:  MKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLT
        M++E+L L     E++ Y+NDWIVDSGCSNHMTGDK KLQN  EYKG RVVVTA+NS+LPI H+GKT++ PR NS QV L++V++VPGMKKNL+SV+QLT
Subjt:  MKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLT

Query:  SSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKA
         SG++V+FGP DVKVY+DLK+S  P+M+G+R++S+YVMSAE+AYV++TRKNETTDLWH RLGHVSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKA
Subjt:  SSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKA

Query:  HQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE----------------------------
        HQLP++ESKFKAK+PLELVHSDVFGPVKQPSI GMRYMVTFIDD SRYVWVFFMKEKS+TF+KFKEF+E  E                            
Subjt:  HQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE----------------------------

Query:  -----------------------------------------------------------------------------------------------DHLRS
                                                                                                       +HLRS
Subjt:  -----------------------------------------------------------------------------------------------DHLRS

Query:  KFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPW
        KFDKKA+ CIFVGYD+QRKGW+C DP +GRCYTSR+V+FDEA+SWW+ + E   +D R F + L+++M +   V ++   D   + NG++ EQ   Q+PW
Subjt:  KFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPW

Query:  QSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
        Q+GV+ Q               + QLRRSTR R+PNPKY NA +   A + EP  +EE S++ +W K +EEE+ AL++ + WDL
Subjt:  QSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL

A0A2N9IHF5 Uncharacterized protein3.2e-23249.83Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TC+ESYLQGQDLWEVVGG+EVT P  EDA                                +  L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSK+ND RLQ LENELLS+AQR+MTI QYF+KVK + REIS+LDPTA I +SR++RIIIHGL+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG-------------------------------------------EPDSKSTSN
        A+QGW  QPSL++ ENLL  QEA+ KQM  V+LK  +EEAL++ + +G                                           E ++ ++S+
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG-------------------------------------------EPDSKSTSN

Query:  --------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFY
                      AM++E+L L     E++ Y+NDWIVDSGCSNHMTGDK KLQN  EYKG RVVVTA+NS+LPI H+GKT++ PR NS QV L++V++
Subjt:  --------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFY

Query:  VPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLD
        VPGMKKNL+SV+QLT SG++V+FGP DVKVY+DLK+S  P+M+G+R++S+YVMSAE+AYV++TRKNETTDLWH RLGHVSY+KL IM+ KSMLKGLPQLD
Subjt:  VPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLD

Query:  IREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE-------------
        +R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQPSI GMRYMVTFIDD SRYVWVFFMKEKS+TF+KFKEF+E  E             
Subjt:  IREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEE
                  +HLRSKFDKKA+ CIFVGYD+QRKGW+C DP +GRCYTSR+V+FDEA+SWW+ + E   +D R F + L+++M +   V ++   D   +
Subjt:  ----------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEE

Query:  NNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
         NG++ EQ   Q+PWQ+GV+ Q               + QLRRSTR R+PNPKY NA +   A + EP  +EE S++ +W K +EEE+ AL++ + WDL
Subjt:  NNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL

A0A6A3D2P3 Uncharacterized protein5.8e-23450.62Show/hide
Query:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
        MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P  EDA                                   L+KWKIKAGKAMF 
Subjt:  MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV

Query:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
        +KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+ 
Subjt:  IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
        AVQGW  QPSL++ ENLL GQEA+ KQM  V+LK  +EEAL++                         Q KG P S   S                    
Subjt:  AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------

Query:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
                                            A ++E+L L     E + Y+NDWIVDSGCSNHMTGDK+KLQN  EY G RVVVTA+NS+LPITH
Subjt:  ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH

Query:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
        +GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++  P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt:  VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH

Query:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
        VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt:  VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK

Query:  FKEFKEQVE-------------------------------------------------------------------------------------------
        FKEF++  E                                                                                           
Subjt:  FKEFKEQVE-------------------------------------------------------------------------------------------

Query:  ----DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-
            DHLRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E  P D R F + L+ +M +   V ++   D  E+ NG++ 
Subjt:  ----DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-

Query:  EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
        EQ  TQ+PWQ+GV+ Q               + QLRRSTR R+PNPKY NA  AI+E   EP  +EE SK+ +W   ++EE+ AL++ + WD+
Subjt:  EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.6e-2329.01Show/hide
Query:  WIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKV
        +++DSG S+H+  D+    ++ E      +  A   +    +  K  I+   N  ++ LE+V +      NL+SV +L  +G  + F  + V + +    
Subjt:  WIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKV

Query:  SGMPLMKGRRM-DSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIRE--DMVCAGCQYGKAHQLPFEESKFKA--KQPL
        +G+ ++K   M +++ V++ +A  +N   KN    LWH R GH+S  KL  +  K+M      L+  E    +C  C  GK  +LPF++ K K   K+PL
Subjt:  SGMPLMKGRRM-DSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIRE--DMVCAGCQYGKAHQLPFEESKFKA--KQPL

Query:  ELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK
         +VHSDV GP+   ++    Y V F+D  + Y   + +K KS+ F+ F++F  + E H   K
Subjt:  ELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-2223.36Show/hide
Query:  GQRKGEPDSKSTSNAMKKED---LTLHAEEVSY-----ENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVEL
        G+  G+ +  +T+  ++  D   L ++ EE        E++W+VD+  S+H T   + L   +       V   N S   I  +G   I        V L
Subjt:  GQRKGEPDSKSTSNAMKKED---LTLHAEEVSY-----ENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVEL

Query:  ENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAY--VNKTRKNETTDLWHARLGHVSYNKLKIMISKSML
        ++V +VP ++ NL+S   L   G    F     +    L    + + KG    ++Y  +AE     +N  +   + DLWH R+GH+S   L+I+  KS++
Subjt:  ENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAY--VNKTRKNETTDLWHARLGHVSYNKLKIMISKSML

Query:  KGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE------
               ++    C  C +GK H++ F+ S  +    L+LV+SDV GP++  S+ G +Y VTFIDD SR +WV+ +K K + F  F++F   VE      
Subjt:  KGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --DHL---------------RSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEA-TSWWAPKSEK-------------APTDERSFKEGL
           HL               R+K D K+I CIF+GY ++  G+R  DPV  +   SR+V+F E+     A  SEK             + ++  +  E  
Subjt:  --DHL---------------RSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEA-TSWWAPKSEK-------------APTDERSFKEGL

Query:  KEEMSQVQQVPIEEKEDPPEENNGEEEQLRTQSPWQSGVHGQEPQLRRSTRQRKPNPKYVNATLAIL---EEPTIYEET----SKNIQWRKVVEEEVGAL
         +E+S+  + P E  E   + + G EE    + P Q     Q   LRRS R R  + +Y +    ++    EP   +E      KN Q  K ++EE+ +L
Subjt:  KEEMSQVQQVPIEEKEDPPEENNGEEEQLRTQSPWQSGVHGQEPQLRRSTRQRKPNPKYVNATLAIL---EEPTIYEET----SKNIQWRKVVEEEVGAL

Query:  RRKRRWDL
        ++   + L
Subjt:  RRKRRWDL

P93293 Uncharacterized mitochondrial protein AtMg003001.2e-1034.86Show/hide
Query:  LMKGRRMDSIYVM--SAEAAYVN--KTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHS
        ++KG R DS+Y++  S E    N  +T K+ET  LWH+RL H+S   +++++ K  L       ++    C  C YGK H++ F   +   K PL+ VHS
Subjt:  LMKGRRMDSIYVM--SAEAAYVN--KTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHS

Query:  DVFGPVKQP
        D++G    P
Subjt:  DVFGPVKQP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-3323.17Show/hide
Query:  VGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPPEDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFVIKTTIDEE
        +  + KL + NY  WS  + +   G +L   + G+   PP  AT+G  A   V+                   P  T   +WK +       +   I   
Subjt:  VGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPPEDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFVIKTTIDEE

Query:  MLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIVAVQGWSVQ
        +   +  A T    W+T   +++  +   +  L  +L    +   TI+ Y   + T + +++ L       +   R  ++  L  EY+  I  +      
Subjt:  MLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIVAVQGWSVQ

Query:  PSLIDLENLLVGQEALGKQMSRVTLKSNKEEALF-----------SGQRKGEPDSKSTSNAMK---KEDLTLH----------------------AEEVS
        P+L ++   L+  E+    +S  T+      A+            +G R    D+++ +N  K   +     H                      A+  S
Subjt:  PSLIDLENLLVGQEALGKQMSRVTLKSNKEEALF-----------SGQRKGEPDSKSTSNAMK---KEDLTLH----------------------AEEVS

Query:  Y---------------------------------ENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVF
                                           N+W++DSG ++H+T D   L     Y G   V+ A+ S +PI+H G T +   + S+ + L N+ 
Subjt:  Y---------------------------------ENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVF

Query:  YVPGMKKNLVSVSQL-TSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIY----VMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLK
        YVP + KNL+SV +L  ++G  V F P   +V +DL  +G+PL++G+  D +Y      S   +         T   WHARLGH + + L  +IS   L 
Subjt:  YVPGMKKNLVSVSQL-TSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIY----VMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLK

Query:  GLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVF-GPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK
         L      + + C+ C   K++++PF +S   + +PLE ++SDV+  P+   S    RY V F+D  +RY W++ +K+KS+    F  FK  +E+  +++
Subjt:  GLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVF-GPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-3234.21Show/hide
Query:  NDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNF-VVFGPNDVKVYQD
        N+W++DSG ++H+T D   L     Y G   V+ A+ S +PITH G   +   ++S+ ++L  V YVP + KNL+SV +L ++    V F P   +V +D
Subjt:  NDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNF-VVFGPNDVKVYQD

Query:  LKVSGMPLMKGRRMDSIY---VMSAEAAYVNKTRKNETT-DLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMV-CAGCQYGKAHQLPFEESKFKAK
        L  +G+PL++G+  D +Y   + S++A  +  +  ++ T   WH+RLGH S   L  +IS      LP L+    ++ C+ C   K+H++PF  S   + 
Subjt:  LKVSGMPLMKGRRMDSIY---VMSAEAAYVNKTRKNETT-DLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMV-CAGCQYGKAHQLPFEESKFKAK

Query:  QPLELVHSDVF-GPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK
        +PLE ++SDV+  P+   SI   RY V F+D  +RY W++ +K+KS+    F  FK  VE+  +++
Subjt:  QPLELVHSDVF-GPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein8.5e-0422.02Show/hide
Query:  NYKTWSTCMESYLQGQDLWEVVGGTEVTPPEDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFVIKTTIDEEMLEYIRGAET
        NY+ W+  M++ L  + LW++V               Y +    SK P   T      L ++  SA       +K  KA+ ++++ + + +        +
Subjt:  NYKTWSTCMESYLQGQDLWEVVGGTEVTPPEDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFVIKTTIDEEMLEYIRGAET

Query:  PKAAWDTF------ASLFSKRNDARLQFLENELLSVAQREMTI-----------NQYFNK-VKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
         K  WD        A L  ++      F  N L +   +++T+            +  NK +K +           ++ +  M+R + +G+ P+      
Subjt:  PKAAWDTF------ASLFSKRNDARLQFLENELLSVAQREMTI-----------NQYFNK-VKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV

Query:  AVQGWSVQPSL---IDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKGEPDS--KSTSNAMKKEDLTLHAEEVS------------YENDWIVDSGC
             S  P L   I  E++   +E + K M  +      E   FS      PDS  + T +A+  +DL     EV             +EN W++ S  
Subjt:  AVQGWSVQPSL---IDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKGEPDS--KSTSNAMKKEDLTLHAEEVS------------YENDWIVDSGC

Query:  SNHMTGDKKKLQNTFEYKGSRV-VVTANNSKLPITHV-GKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSG
        SNHMT   K        +  +V  ++ + S+  +  V G   +   +N     ++NV YVPG++ N +SVSQL  +G
Subjt:  SNHMTGDKKKLQNTFEYKGSRV-VVTANNSKLPITHV-GKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSG

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.1e-0623.6Show/hide
Query:  PSATALKKWKIKAGKAMFVIKTTIDEEMLE-YIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISK
        P+    K+WK + G     I  TI + +L+  I+   T +  W +  +LF    +AR    ENEL +    ++++++Y  K+K+L   ++ +D  + IS 
Subjt:  PSATALKKWKIKAGKAMFVIKTTIDEEMLE-YIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISK

Query:  SRMRRIIIHGLKPEYRSFIVAVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEAL
          +   +++GL  +Y   +  ++  S  PS  +  ++L+ +E+     S+ +L      +L
Subjt:  SRMRRIIIHGLKPEYRSFIVAVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEAL

ATMG00300.1 Gag-Pol-related retrotransposon family protein8.5e-1234.86Show/hide
Query:  LMKGRRMDSIYVM--SAEAAYVN--KTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHS
        ++KG R DS+Y++  S E    N  +T K+ET  LWH+RL H+S   +++++ K  L       ++    C  C YGK H++ F   +   K PL+ VHS
Subjt:  LMKGRRMDSIYVM--SAEAAYVN--KTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHS

Query:  DVFGPVKQP
        D++G    P
Subjt:  DVFGPVKQP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAAGAAGAAAGGGCCACCGTTGACGGCGACGGGTTCGAAGGTGTTGATAGGTTGTGTGCGCCATGCTGCTGACATTGGTGAGAGTGGCCTGTTCGCCGACTCAAT
AAGCCTACCATTTTGGGGACAAGACCGAGTGAGGAGCTGGGAACATAGTCTTACAAGATGGAATTTACTCCTTCCCGACATTAGGCCAATTAATCTCATACCGTTGGAGC
TTCTGATCTGTAGGTCCATTAGGTCCTACCGGAGAATACTGGTGATACTACGTGGTGGTGTTCGTGAGCGATTTGCAGCATCAAAAGAAAAAGTTCTTGCTGCTGGAATT
TTTGGATTCGACGAGGTTTTGTTGGCGAAACGATCGAGAACGACTTCTAAATATTTTGATCTTGAGAAAACACAGTATGGTTTCTTGTTCTTTTCGCTACGTTGGGCTTT
AATCCCTTCAGACAGCCCTCCAGATCTGCTCGCAACAACCCTCCGAATCTGGATTATTAGCCCCCTGAAGGAACGAAGATGGGGCTGTGGGATTTTGACCCCCTGGAGAA
ACGAATATCGATATGTGGGAGAAACGAAGATCGAACTGAGGGGTTTTGGCATTATTAGCCCCCCCGGATCTGGATTGTTAACCCAGAGAAATCGATGTGGTGAAGAAAAT
GTAGGGAAGAAGAAAGAAGAGAAAAAGAGGAAAGAAGAAAAAGGAAAAAAACAAAAATCGCCGGCGGCGAGCAGTGGCTGCCGGTCGCCGGTCGCCGGACATAGGAAGAA
GAAGAAGAGGAAGGAGAAGGCTCATCATCTTCAGTGCGGCGAAACGGATTCTTCTCCCCCGATGGACGAAAGCTCTGCACGGTGGTCCGATCTCCCTCCTGAACTTTGGA
CGGACTGTGAAGCTTGTTGTCTGGGACACAGCTTACTCGTGGAGTGCATTCATCACTACGAGATGATGGGAGATCTTCAAGTTGTTGGAGGAATCAAGAAGCTCAACACT
CAGAACTACAAAACATGGTCCACGTGCATGGAGTCGTATCTCCAAGGCCAAGACTTATGGGAAGTCGTGGGAGGCACTGAAGTCACGCCACCTGAAGATGCTACTGTTGG
GTTTTATGCCCTAAAACTCGTAGATAGTAAATGTCCATTAGGTCCCACCGGTAGCTCACTAGGGGCGTTGAGGGTTTTTATCCCTTCAGCTACAGCCTTGAAGAAATGGA
AGATCAAGGCAGGTAAGGCCATGTTTGTTATTAAAACTACAATTGATGAAGAAATGTTGGAATACATTAGGGGAGCAGAGACGCCTAAAGCGGCATGGGACACGTTTGCC
TCACTTTTCTCAAAGAGGAATGATGCAAGACTGCAGTTTCTAGAGAACGAGCTTCTGTCAGTTGCTCAAAGGGAGATGACTATCAATCAGTACTTCAACAAGGTAAAAAC
TCTTTACCGTGAAATCTCTGAATTAGATCCTACTGCTACTATTTCAAAATCAAGAATGAGGAGGATTATCATTCATGGTCTTAAACCTGAATATAGAAGTTTTATTGTTG
CCGTTCAAGGTTGGTCAGTCCAACCTTCTCTCATTGACCTTGAAAATTTGCTTGTCGGTCAAGAAGCATTGGGTAAGCAAATGTCGAGGGTCACATTAAAGAGCAATAAG
GAAGAAGCGCTCTTTAGTGGCCAAAGAAAAGGAGAACCTGACTCAAAGTCGACATCCAATGCAATGAAGAAGGAAGACTTAACTCTCCACGCAGAAGAGGTAAGTTATGA
AAATGATTGGATTGTCGATTCAGGTTGCTCTAACCATATGACAGGTGATAAAAAGAAGTTACAAAACACATTTGAGTACAAAGGAAGTCGAGTTGTCGTGACTGCAAACA
ACTCGAAATTGCCAATAACTCATGTTGGCAAAACTATGATAATGCCTCGCTCCAATTCCAAGCAAGTAGAGCTGGAGAATGTATTTTATGTACCTGGAATGAAGAAGAAT
TTGGTATCAGTATCACAACTAACATCATCAGGCAACTTCGTCGTGTTTGGACCTAATGATGTCAAGGTGTACCAAGATCTCAAAGTCAGTGGTATGCCACTAATGAAAGG
ACGAAGAATGGACTCCATCTATGTCATGTCAGCAGAGGCCGCCTATGTGAATAAGACACGAAAGAATGAAACAACAGATTTGTGGCATGCAAGACTTGGTCATGTCAGTT
ACAACAAATTGAAGATAATGATAAGCAAGTCTATGCTCAAGGGGTTGCCTCAACTTGATATTAGAGAAGACATGGTGTGTGCTGGTTGCCAGTATGGGAAGGCACATCAA
CTACCATTTGAGGAGTCCAAATTCAAAGCAAAGCAACCATTGGAGCTGGTGCACTCTGACGTATTTGGTCCTGTCAAGCAACCTTCAATCAGTGGCATGCGCTATATGGT
GACCTTTATTGATGACCTCTCCAGGTATGTTTGGGTGTTCTTTATGAAAGAAAAGTCTGAAACATTTACAAAATTCAAGGAATTCAAAGAACAAGTTGAAGATCATCTGA
GAAGCAAGTTTGACAAGAAAGCAATCAACTGCATCTTTGTTGGTTATGACAACCAAAGAAAAGGGTGGAGGTGCATTGATCCTGTCACTGGACGATGCTATACATCAAGG
AATGTTATATTCGATGAAGCAACATCATGGTGGGCACCCAAATCAGAGAAGGCACCTACGGACGAGAGGTCTTTCAAGGAAGGATTAAAAGAGGAGATGAGTCAAGTGCA
ACAAGTTCCAATAGAAGAAAAAGAAGACCCTCCAGAAGAGAACAATGGAGAGGAAGAACAGTTAAGGACACAAAGTCCATGGCAAAGTGGTGTACATGGTCAAGAGCCGC
AACTACGAAGATCAACCAGACAAAGAAAACCAAATCCCAAGTACGTGAATGCAACATTGGCAATATTGGAAGAGCCAACAATATATGAAGAAACATCGAAGAACATCCAA
TGGAGGAAAGTTGTGGAAGAAGAAGTTGGCGCACTACGAAGAAAACGAAGATGGGACTTGTGTTTGTATCTCCCTTTTTGTGCTAATTTCTTCAACAATACCTACATCGA
CGTCCTTCGGTGTCGCGGCGTCTACCGATCATGGCGTGCTTCTCTTCCTTCGTTCAACGCCTTTTCTCCTCTTCTACCTCTCCATGTCCCTTGCCCTCCCAACGACGACG
CCCATTACCGAATCAAAGACACTGTACTCATCCGAAAGATAATCTATCGCCTCAGTCCTCTTCAGGATCATCAAACTCCTAACTCTGTTTCTTCTTCTTCTTCTTCTTCT
CGGGCGAAGGTTTGTTTGGAAGGAAGATTGGAGGAGGTGAAGAATTTGGGGAAGAAAGCGTTTGTTTTGGGGGCAGAGGACAGTTTCTCAGTTTCAGCAAGAGATTTTGA
AGGAATGAAAGGGAATTGCATATATTATCCTCGAGCATTCAATAACGCCTCTCATGGATATGGATATGGTATCACTCACAATCTGGTTGTTTCTTGTAAAGGGAGAGGGA
CGAACCTAGTGCTGTTGTCGTCCGTTCACCGTGGAGCTGCTACTCGTGAACGCCGGTGTGGGTTCGTGAAGGGCGTGGGTTGTATAGATCTGAGGCTTCAGATCATCCGC
TTCAGATCTACGATTGAAGGGTGTGGGTTCGTGAAGTGTGTGGGTTGCATCTGCTTCAGATCTGCGATTGAAGGGCGTGGGTTCGTGAAGAGCGTGGGTTGTAGATCTGC
GACTTCAAATCATCCACTTCAGATCTGCGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAAAGAAGAAAGGGCCACCGTTGACGGCGACGGGTTCGAAGGTGTTGATAGGTTGTGTGCGCCATGCTGCTGACATTGGTGAGAGTGGCCTGTTCGCCGACTCAAT
AAGCCTACCATTTTGGGGACAAGACCGAGTGAGGAGCTGGGAACATAGTCTTACAAGATGGAATTTACTCCTTCCCGACATTAGGCCAATTAATCTCATACCGTTGGAGC
TTCTGATCTGTAGGTCCATTAGGTCCTACCGGAGAATACTGGTGATACTACGTGGTGGTGTTCGTGAGCGATTTGCAGCATCAAAAGAAAAAGTTCTTGCTGCTGGAATT
TTTGGATTCGACGAGGTTTTGTTGGCGAAACGATCGAGAACGACTTCTAAATATTTTGATCTTGAGAAAACACAGTATGGTTTCTTGTTCTTTTCGCTACGTTGGGCTTT
AATCCCTTCAGACAGCCCTCCAGATCTGCTCGCAACAACCCTCCGAATCTGGATTATTAGCCCCCTGAAGGAACGAAGATGGGGCTGTGGGATTTTGACCCCCTGGAGAA
ACGAATATCGATATGTGGGAGAAACGAAGATCGAACTGAGGGGTTTTGGCATTATTAGCCCCCCCGGATCTGGATTGTTAACCCAGAGAAATCGATGTGGTGAAGAAAAT
GTAGGGAAGAAGAAAGAAGAGAAAAAGAGGAAAGAAGAAAAAGGAAAAAAACAAAAATCGCCGGCGGCGAGCAGTGGCTGCCGGTCGCCGGTCGCCGGACATAGGAAGAA
GAAGAAGAGGAAGGAGAAGGCTCATCATCTTCAGTGCGGCGAAACGGATTCTTCTCCCCCGATGGACGAAAGCTCTGCACGGTGGTCCGATCTCCCTCCTGAACTTTGGA
CGGACTGTGAAGCTTGTTGTCTGGGACACAGCTTACTCGTGGAGTGCATTCATCACTACGAGATGATGGGAGATCTTCAAGTTGTTGGAGGAATCAAGAAGCTCAACACT
CAGAACTACAAAACATGGTCCACGTGCATGGAGTCGTATCTCCAAGGCCAAGACTTATGGGAAGTCGTGGGAGGCACTGAAGTCACGCCACCTGAAGATGCTACTGTTGG
GTTTTATGCCCTAAAACTCGTAGATAGTAAATGTCCATTAGGTCCCACCGGTAGCTCACTAGGGGCGTTGAGGGTTTTTATCCCTTCAGCTACAGCCTTGAAGAAATGGA
AGATCAAGGCAGGTAAGGCCATGTTTGTTATTAAAACTACAATTGATGAAGAAATGTTGGAATACATTAGGGGAGCAGAGACGCCTAAAGCGGCATGGGACACGTTTGCC
TCACTTTTCTCAAAGAGGAATGATGCAAGACTGCAGTTTCTAGAGAACGAGCTTCTGTCAGTTGCTCAAAGGGAGATGACTATCAATCAGTACTTCAACAAGGTAAAAAC
TCTTTACCGTGAAATCTCTGAATTAGATCCTACTGCTACTATTTCAAAATCAAGAATGAGGAGGATTATCATTCATGGTCTTAAACCTGAATATAGAAGTTTTATTGTTG
CCGTTCAAGGTTGGTCAGTCCAACCTTCTCTCATTGACCTTGAAAATTTGCTTGTCGGTCAAGAAGCATTGGGTAAGCAAATGTCGAGGGTCACATTAAAGAGCAATAAG
GAAGAAGCGCTCTTTAGTGGCCAAAGAAAAGGAGAACCTGACTCAAAGTCGACATCCAATGCAATGAAGAAGGAAGACTTAACTCTCCACGCAGAAGAGGTAAGTTATGA
AAATGATTGGATTGTCGATTCAGGTTGCTCTAACCATATGACAGGTGATAAAAAGAAGTTACAAAACACATTTGAGTACAAAGGAAGTCGAGTTGTCGTGACTGCAAACA
ACTCGAAATTGCCAATAACTCATGTTGGCAAAACTATGATAATGCCTCGCTCCAATTCCAAGCAAGTAGAGCTGGAGAATGTATTTTATGTACCTGGAATGAAGAAGAAT
TTGGTATCAGTATCACAACTAACATCATCAGGCAACTTCGTCGTGTTTGGACCTAATGATGTCAAGGTGTACCAAGATCTCAAAGTCAGTGGTATGCCACTAATGAAAGG
ACGAAGAATGGACTCCATCTATGTCATGTCAGCAGAGGCCGCCTATGTGAATAAGACACGAAAGAATGAAACAACAGATTTGTGGCATGCAAGACTTGGTCATGTCAGTT
ACAACAAATTGAAGATAATGATAAGCAAGTCTATGCTCAAGGGGTTGCCTCAACTTGATATTAGAGAAGACATGGTGTGTGCTGGTTGCCAGTATGGGAAGGCACATCAA
CTACCATTTGAGGAGTCCAAATTCAAAGCAAAGCAACCATTGGAGCTGGTGCACTCTGACGTATTTGGTCCTGTCAAGCAACCTTCAATCAGTGGCATGCGCTATATGGT
GACCTTTATTGATGACCTCTCCAGGTATGTTTGGGTGTTCTTTATGAAAGAAAAGTCTGAAACATTTACAAAATTCAAGGAATTCAAAGAACAAGTTGAAGATCATCTGA
GAAGCAAGTTTGACAAGAAAGCAATCAACTGCATCTTTGTTGGTTATGACAACCAAAGAAAAGGGTGGAGGTGCATTGATCCTGTCACTGGACGATGCTATACATCAAGG
AATGTTATATTCGATGAAGCAACATCATGGTGGGCACCCAAATCAGAGAAGGCACCTACGGACGAGAGGTCTTTCAAGGAAGGATTAAAAGAGGAGATGAGTCAAGTGCA
ACAAGTTCCAATAGAAGAAAAAGAAGACCCTCCAGAAGAGAACAATGGAGAGGAAGAACAGTTAAGGACACAAAGTCCATGGCAAAGTGGTGTACATGGTCAAGAGCCGC
AACTACGAAGATCAACCAGACAAAGAAAACCAAATCCCAAGTACGTGAATGCAACATTGGCAATATTGGAAGAGCCAACAATATATGAAGAAACATCGAAGAACATCCAA
TGGAGGAAAGTTGTGGAAGAAGAAGTTGGCGCACTACGAAGAAAACGAAGATGGGACTTGTGTTTGTATCTCCCTTTTTGTGCTAATTTCTTCAACAATACCTACATCGA
CGTCCTTCGGTGTCGCGGCGTCTACCGATCATGGCGTGCTTCTCTTCCTTCGTTCAACGCCTTTTCTCCTCTTCTACCTCTCCATGTCCCTTGCCCTCCCAACGACGACG
CCCATTACCGAATCAAAGACACTGTACTCATCCGAAAGATAATCTATCGCCTCAGTCCTCTTCAGGATCATCAAACTCCTAACTCTGTTTCTTCTTCTTCTTCTTCTTCT
CGGGCGAAGGTTTGTTTGGAAGGAAGATTGGAGGAGGTGAAGAATTTGGGGAAGAAAGCGTTTGTTTTGGGGGCAGAGGACAGTTTCTCAGTTTCAGCAAGAGATTTTGA
AGGAATGAAAGGGAATTGCATATATTATCCTCGAGCATTCAATAACGCCTCTCATGGATATGGATATGGTATCACTCACAATCTGGTTGTTTCTTGTAAAGGGAGAGGGA
CGAACCTAGTGCTGTTGTCGTCCGTTCACCGTGGAGCTGCTACTCGTGAACGCCGGTGTGGGTTCGTGAAGGGCGTGGGTTGTATAGATCTGAGGCTTCAGATCATCCGC
TTCAGATCTACGATTGAAGGGTGTGGGTTCGTGAAGTGTGTGGGTTGCATCTGCTTCAGATCTGCGATTGAAGGGCGTGGGTTCGTGAAGAGCGTGGGTTGTAGATCTGC
GACTTCAAATCATCCACTTCAGATCTGCGGTTGA
Protein sequenceShow/hide protein sequence
MTKKKGPPLTATGSKVLIGCVRHAADIGESGLFADSISLPFWGQDRVRSWEHSLTRWNLLLPDIRPINLIPLELLICRSIRSYRRILVILRGGVRERFAASKEKVLAAGI
FGFDEVLLAKRSRTTSKYFDLEKTQYGFLFFSLRWALIPSDSPPDLLATTLRIWIISPLKERRWGCGILTPWRNEYRYVGETKIELRGFGIISPPGSGLLTQRNRCGEEN
VGKKKEEKKRKEEKGKKQKSPAASSGCRSPVAGHRKKKKRKEKAHHLQCGETDSSPPMDESSARWSDLPPELWTDCEACCLGHSLLVECIHHYEMMGDLQVVGGIKKLNT
QNYKTWSTCMESYLQGQDLWEVVGGTEVTPPEDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFVIKTTIDEEMLEYIRGAETPKAAWDTFA
SLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIVAVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNK
EEALFSGQRKGEPDSKSTSNAMKKEDLTLHAEEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKN
LVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQ
LPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSR
NVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEEEQLRTQSPWQSGVHGQEPQLRRSTRQRKPNPKYVNATLAILEEPTIYEETSKNIQ
WRKVVEEEVGALRRKRRWDLCLYLPFCANFFNNTYIDVLRCRGVYRSWRASLPSFNAFSPLLPLHVPCPPNDDAHYRIKDTVLIRKIIYRLSPLQDHQTPNSVSSSSSSS
RAKVCLEGRLEEVKNLGKKAFVLGAEDSFSVSARDFEGMKGNCIYYPRAFNNASHGYGYGITHNLVVSCKGRGTNLVLLSSVHRGAATRERRCGFVKGVGCIDLRLQIIR
FRSTIEGCGFVKCVGCICFRSAIEGRGFVKSVGCRSATSNHPLQICG