; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G18590 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G18590
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionBeta-galactosidase
Genome locationChr5:19642509..19647061
RNA-Seq ExpressionCSPI05G18590
SyntenyCSPI05G18590
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0000166 - nucleotide binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
GO:0016874 - ligase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031941.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0065.96Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGE  RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK IPVC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY PC GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD+  S+ +    ++ K  
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-

Query:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS
               PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KI ILRSDNGR+FQ HNLSEFLAS
Subjt:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS

Query:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG
        KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFG
Subjt:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG

Query:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR
        PNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT   VS+I+PH I+LPTNQVPWKTYYR
Subjt:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR

Query:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT
        RN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+EA++GHT K DEYD SLDIPIALRKGT
Subjt:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT

Query:  SS
         S
Subjt:  SS

KAA0034386.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0064.22Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGEI RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK IPVC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY PC GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQD-----------------
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD                 
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQD-----------------

Query:  -------YQTSTSALASSLKLK---------WLPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQF
                +   S+L+  + ++           PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQF
Subjt:  -------YQTSTSALASSLKLK---------WLPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQF

Query:  HQKITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKE
        H KI ILRSDNGR+FQ HNLSEFLASKGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKE
Subjt:  HQKITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKE

Query:  SYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPT
        SYPSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT
Subjt:  SYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPT

Query:  PSVVSNINPHTIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNE
           VS+I+PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+E
Subjt:  PSVVSNINPHTIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNE

Query:  AKRGHTGKPDEYDSSLDIPIALRKGTSS
        A++GHT K DEYD SLDIPIALRKGT S
Subjt:  AKRGHTGKPDEYDSSLDIPIALRKGTSS

KAA0052172.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0065.74Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGE  RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK I VC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY  C GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD+  S  +    ++ K  
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-

Query:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS
               PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KI ILRSDNGR+FQ HNLSEFLAS
Subjt:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS

Query:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG
        KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFG
Subjt:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG

Query:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR
        PNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT   VS+I+PH I+LPTNQVPWKTYYR
Subjt:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR

Query:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT
        RN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+EA++GHT K DEYD SLDIPIALRKGT
Subjt:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT

Query:  SS
         S
Subjt:  SS

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0065.96Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGEI RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK IPVC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY PC GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD+  S+ +    ++ K  
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-

Query:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS
               PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KI ILRSDNGR+FQ HNLSEFLAS
Subjt:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS

Query:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG
        KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFG
Subjt:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG

Query:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR
        PNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT   VS+I+PH I+LPTNQVPWKTYYR
Subjt:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR

Query:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT
        RN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+EEK+  DE EVRIET N+EA++GHT K DEYD SLDIPIALRKGT
Subjt:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT

Query:  SS
         S
Subjt:  SS

TYK31717.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0065.96Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGEI RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK IPVC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY PC GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD+  S+ +    ++ K  
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-

Query:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS
               PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KI ILRSDNGR+FQ HNLSEFLAS
Subjt:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS

Query:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG
        KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFG
Subjt:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG

Query:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR
        PNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT   VS+I+PH I+LPTNQVPWKTYYR
Subjt:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR

Query:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT
        RN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+EEK+  DE EVRIET N+EA++GHT K DEYD SLDIPIALRKGT
Subjt:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT

Query:  SS
         S
Subjt:  SS

TrEMBL top hitse value%identityAlignment
A0A5A7SM64 Beta-galactosidase0.0e+0064.22Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGEI RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK IPVC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY PC GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQD-----------------
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD                 
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQD-----------------

Query:  -------YQTSTSALASSLKLK---------WLPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQF
                +   S+L+  + ++           PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQF
Subjt:  -------YQTSTSALASSLKLK---------WLPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQF

Query:  HQKITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKE
        H KI ILRSDNGR+FQ HNLSEFLASKGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKE
Subjt:  HQKITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKE

Query:  SYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPT
        SYPSTR VSEVPLRVFGCTAYVHNFGPNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT
Subjt:  SYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPT

Query:  PSVVSNINPHTIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNE
           VS+I+PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+E
Subjt:  PSVVSNINPHTIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNE

Query:  AKRGHTGKPDEYDSSLDIPIALRKGTSS
        A++GHT K DEYD SLDIPIALRKGT S
Subjt:  AKRGHTGKPDEYDSSLDIPIALRKGTSS

A0A5A7SQW1 Beta-galactosidase0.0e+0065.96Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGE  RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK IPVC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY PC GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD+  S+ +    ++ K  
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-

Query:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS
               PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KI ILRSDNGR+FQ HNLSEFLAS
Subjt:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS

Query:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG
        KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFG
Subjt:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG

Query:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR
        PNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT   VS+I+PH I+LPTNQVPWKTYYR
Subjt:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR

Query:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT
        RN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+EA++GHT K DEYD SLDIPIALRKGT
Subjt:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT

Query:  SS
         S
Subjt:  SS

A0A5A7U8U2 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0065.74Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGE  RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK I VC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY  C GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD+  S  +    ++ K  
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-

Query:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS
               PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KI ILRSDNGR+FQ HNLSEFLAS
Subjt:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS

Query:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG
        KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFG
Subjt:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG

Query:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR
        PNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT   VS+I+PH I+LPTNQVPWKTYYR
Subjt:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR

Query:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT
        RN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+EA++GHT K DEYD SLDIPIALRKGT
Subjt:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT

Query:  SS
         S
Subjt:  SS

A0A5D3E603 Beta-galactosidase0.0e+0065.96Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGEI RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK IPVC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY PC GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD+  S+ +    ++ K  
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-

Query:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS
               PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KI ILRSDNGR+FQ HNLSEFLAS
Subjt:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS

Query:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG
        KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFG
Subjt:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG

Query:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR
        PNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT   VS+I+PH I+LPTNQVPWKTYYR
Subjt:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR

Query:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT
        RN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+EEK+  DE EVRIET N+EA++GHT K DEYD SLDIPIALRKGT
Subjt:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT

Query:  SS
         S
Subjt:  SS

A0A5D3E6F8 Beta-galactosidase0.0e+0065.96Show/hide
Query:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------
        MYS+NPV SF N  S Y+T     SS  + SGEKLNG NYFSWS S+KM LEGR +F FLTGEI RP P                               
Subjt:  MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLP-------------------------------

Query:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC
             AKD+WDT  TLYSKR NASRLYT + ++                              V   P D  QY+++EE  R+YDFLAGLNPKFD V   
Subjt:  -----AKDIWDTAHTLYSKRHNASRLYTEKAKL------------------------------VLRDPTDGVQYSRVEENGRIYDFLAGLNPKFDVVRRC

Query:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS
        ILGQRP+PSLM+VC E+RLEED T+AM + TTPTIDSAAFSARSSN   +K+NGK IPVC+HCKKQW TK+QCWKLHGRPPGGKKR SNEK N+G+ Y+S
Subjt:  ILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVS

Query:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS
        E+  A T Q +DP  ++T     TLGAI QS                        DHLTGSSEHF+SY PC GNE IRIADGSLAPIAGKG+I P  G +
Subjt:  ES--AETPQQSDPHKNKTDLNLATLGAIVQS------------------------DHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLS

Query:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-
        L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV  QD+SSGR   TARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQD+  S+ +    ++ K  
Subjt:  LHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKW-

Query:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS
               PY        +  DVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KI ILRSDNGR+FQ HNLSEFLAS
Subjt:  ------LPY--------LEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLAS

Query:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG
        KGIVHQ SCAYTPQQNGVAERKNRHL+EVARSLMLSTSLPSY+WGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTR VSEVPLRVFGCTAYVHNFG
Subjt:  KGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFG

Query:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR
        PNQTKFTPRAQA VFVGYP HQ GYKCFHPPSRKYFVTMD                GE+VSEESNNTFEF++PT   VS+I+PH I+LPTNQVPWKTYYR
Subjt:  PNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMD----------------GESVSEESNNTFEFIKPTPSVVSNINPHTIVLPTNQVPWKTYYR

Query:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT
        RN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+EEK+  DE EVRIET N+EA++GHT K DEYD SLDIPIALRKGT
Subjt:  RNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIPIALRKGT

Query:  SS
         S
Subjt:  SS

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-3539.39Show/hide
Query:  DVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAE
        DV GP    T   K +FV F+D  T     YLI  KS+V SMFQ+F    E  F+ K+  L  DNGR++  + + +F   KGI +  +  +TPQ NGV+E
Subjt:  DVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAE

Query:  RKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRIL--HLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQASVFVGY
        R  R + E AR+++    L    WG+A+LTA +LINR+PSR L    +TP +      P  +H     LRVFG T YVH     Q KF  ++  S+FVGY
Subjt:  RKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRIL--HLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQASVFVGY

Query:  PPHQRGYKCFHPPSRKYFVTMDGESVSEESN
         P+  G+K +   + K+ V  D   V +E+N
Subjt:  PPHQRGYKCFHPPSRKYFVTMDGESVSEESN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.9e-4025.28Show/hide
Query:  AKDIWDTAHTLYSKRHNASRLYTEKAKLVLRDPTDGVQY------------------SRVEENGRIYDFLAGLNPKFDVVRRCILGQRPIPSLMKVCSEI
        A+ IW    +LY  +   ++LY +K +L     ++G  +                   ++EE  +    L  L   +D +   IL  +    L  V S +
Subjt:  AKDIWDTAHTLYSKRHNASRLYTEKAKLVLRDPTDGVQY------------------SRVEENGRIYDFLAGLNPKFDVVRRCILGQRPIPSLMKVCSEI

Query:  RLEEDWTSAMNISTTPTIDSA---AFSARSSNSSMNKHNGKP-------IPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQT----------
         L E             I      ++   S+N   +   GK        +  C +C +    K  C      P  GK   S +K++              
Subjt:  RLEEDWTSAMNISTTPTIDSA---AFSARSSNSSMNKHNGKP-------IPVCKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQT----------

Query:  -YVSESAETPQQSDPHKNKTDLNLATLGAIVQSDHLTGSSEHFVSYIPCVGN-ETIRIADGSLAPIAGKG----KISPCAGLSLHNVLHVPKLSYNLLS-
         +++E  E    S P         A       S H T   + F  Y+   G+  T+++ + S + IAG G    K +    L L +V HVP L  NL+S 
Subjt:  -YVSESAETPQQSDPHKNKTDLNLATLGAIVQSDHLTGSSEHFVSYIPCVGN-ETIRIADGSLAPIAGKG----KISPCAGLSLHNVLHVPKLSYNLLS-

Query:  -------------------------ISK--------ITHELNCKAIF--LPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSE
                                 I+K         T+   C+       D +S+ DL   RM   +   +GL +L   +  S    T++    +    
Subjt:  -------------------------ISK--------ITHELNCKAIF--LPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSE

Query:  QDYQTSTSALASSLKLKWLPYLEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSE
        + ++ S    +S  KL  L  +  DV GP +I +  G ++FVTFIDD +R  WVY++  K +V  +FQ F+  +E +  +K+  LRSDNG ++      E
Subjt:  QDYQTSTSALASSLKLKWLPYLEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSE

Query:  FLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYV
        + +S GI H+ +   TPQ NGVAER NR ++E  RS++    LP   WG+A+ TA +LINR PS  L  + P     E   + + VS   L+VFGC A+ 
Subjt:  FLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYV

Query:  HNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMDGESVSEESNNTFEFIKPTPSVVSNINPHTIVLP-TNQVPWKTYYRRNHKKEVGSPT
        H     +TK   ++   +F+GY   + GY+ + P  +K   + D      E     +    +  V + I P+ + +P T+  P       +   E G   
Subjt:  HNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMDGESVSEESNNTFEFIKPTPSVVSNINPHTIVLP-TNQVPWKTYYRRNHKKEVGSPT

Query:  SQPPAPVQDSEPPRDQGMENPTEP
         + P  V +     D+G+E    P
Subjt:  SQPPAPVQDSEPPRDQGMENPTEP

Q12491 Transposon Ty2-B Gag-Pol polyprotein4.8e-1723.7Show/hide
Query:  CVGNETIRIADGS-LAPIAGKGKISPCAGLSLHNVL--HVPKLSYNLL----SISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTS
        C    T+  +DG+ LAPI   G       LS   ++  H+ KL+ N +    S++K  + L  + +   +  SIQ         + + +   YL + D  
Subjt:  CVGNETIRIADGS-LAPIAGKGKISPCAGLSLHNVL--HVPKLSYNLL----SISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTS

Query:  SSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKWLPYLEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVS--SMFQNFYHTIETQFHQ
         S+             S +      S L      +   YL  D++GP      S   +F++F D+ TR  WVY + D+ E S  ++F +    I+ QF+ 
Subjt:  SSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKWLPYLEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVS--SMFQNFYHTIETQFHQ

Query:  KITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESY
        ++ +++ D G ++    L +F  ++GI    +     + +GVAER NR LL   R+L+  + LP+++W  A+  +  + N + S            K   
Subjt:  KITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESY

Query:  PSTRHVSEVPLRV-----FGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRK------YFVTMDGESVSEESN
         + +H     L +     FG    V+N  P+ +K  PR      +    +  GY  + P  +K      Y +  D +S  ++ N
Subjt:  PSTRHVSEVPLRV-----FGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRK------YFVTMDGESVSEESN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.8e-4024.79Show/hide
Query:  IPRPLPAKDIWDTAHTLYSKRH--NASRLYTE-----KAKLVLRDPTDGV--QYSRVEENGRIYD-------FLAGLNPKFDVVRRCILGQRPIPSLMKV
        + R   A  IW+T   +Y+     + ++L T+     K    + D   G+  ++ ++   G+  D        L  L  ++  V   I  +   P+L ++
Subjt:  IPRPLPAKDIWDTAHTLYSKRH--NASRLYTE-----KAKLVLRDPTDGV--QYSRVEENGRIYD-------FLAGLNPKFDVVRRCILGQRPIPSLMKV

Query:  CSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKL----------HGRPPGGK------KRHSNEKHNTGQT
           +   E    A++ +T   I + A S R++ ++ N +NG      ++  +      + W+             +P  GK      + HS ++ +  Q 
Subjt:  CSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKL----------HGRPPGGK------KRHSNEKHNTGQT

Query:  YVS--ESAETPQQSDPHKNKTDLNLAT-------LGAIVQSDHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKIS---PCAGLSLHNVLHVPKLS
        ++S   S + P    P + + +L L +       L     + H+T    +   + P  G + + +ADGS  PI+  G  S       L+LHN+L+VP + 
Subjt:  YVS--ESAETPQQSDPHKNKTDLNLAT-------LGAIVQSDHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKIS---PCAGLSLHNVLHVPKLS

Query:  YNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLY-----------LLDDDTSSSS---------IPRTSLLSSYFTT--------SEQD
         NL+S+ ++ +       F P S  ++DL++G      +    LY           L    +S ++          P  S+L+S  +         S + 
Subjt:  YNLLSISKITHELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLY-----------LLDDDTSSSS---------IPRTSLLSSYFTT--------SEQD

Query:  YQTSTSALASSLKLKW----------LPYLEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRK
           S   +  S K+ +          L Y+  DVW  S I +    R++V F+D  TR TW+Y +  KS+V   F  F + +E +F  +I    SDNG +
Subjt:  YQTSTSALASSLKLKW----------LPYLEIDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRK

Query:  FQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLR
        F    L E+ +  GI H  S  +TP+ NG++ERK+RH++E   +L+   S+P   W  A   A +LINR+P+ +L L++P   L  + P+        LR
Subjt:  FQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLR

Query:  VFGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMDGESVSEESNNTFEFIKPTPSVVSN--------INPHTIVLPTNQVPWK
        VFGC  Y      NQ K   +++  VF+GY   Q  Y C H  + + +++       +E+   F     T S V           +PHT  LPT + P  
Subjt:  VFGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMDGESVSEESNNTFEFIKPTPSVVSN--------INPHTIVLPTNQVPWK

Query:  TYYRRNHKKEVGSPTSQPPAPVQDSE
             +      +P S P AP ++S+
Subjt:  TYYRRNHKKEVGSPTSQPPAPVQDSE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-4227.98Show/hide
Query:  SARSSNSSMNKHNGKPIPV---CKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVSESAETPQQSDPHKNKTDLNLATLGAIVQSDHLTGSSE
        S + S+S     N +P P    C+ C  Q  + ++C +LH       ++ S       Q   + +  +P  ++         L   GA   + H+T    
Subjt:  SARSSNSSMNKHNGKPIPV---CKHCKKQWRTKEQCWKLHGRPPGGKKRHSNEKHNTGQTYVSESAETPQQSDPHKNKTDLNLATLGAIVQSDHLTGSSE

Query:  HFVSYIPCVGNETIRIADGSLAPIAGKGKIS---PCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSG-----------------
        +   + P  G + + IADGS  PI   G  S       L L+ VL+VP +  NL+S+ ++ +       F P S  ++DL++G                 
Subjt:  HFVSYIPCVGNETIRIADGSLAPIAGKGKIS---PCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSIQDLSSG-----------------

Query:  --------------RMTSTARHSR----GLYLLDDDTSSSSIP------RTSLLSSYFTTSEQDYQTSTSALASSLKLKWLPYLEIDVWGPSKITTSSGK
                      + T ++ HSR     L +L+   S+ S+P      +    S  F         S S + SS   K L Y+  DVW  S I +    
Subjt:  --------------RMTSTARHSR----GLYLLDDDTSSSSIP------RTSLLSSYFTTSEQDYQTSTSALASSLKLKWLPYLEIDVWGPSKITTSSGK

Query:  RWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLM
        R++V F+D  TR TW+Y +  KS+V   F  F   +E +F  +I  L SDNG +F    L ++L+  GI H  S  +TP+ NG++ERK+RH++E+  +L+
Subjt:  RWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLM

Query:  LSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRK
           S+P   W  A   A +LINR+P+ +L LQ+P   L    P+        L+VFGC  Y      N+ K   +++   F+GY   Q  Y C H P+ +
Subjt:  LSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRK

Query:  YFVT
         + +
Subjt:  YFVT

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.9e-0540Show/hide
Query:  IQLIEDTGLLGAKSTFVPMDPTTKLNAYNKNIIHDATPYVRLIGRIGASTQTPVAQSQDLVFSFLLNE-SQI-KVPSLAH
        + L+++TGLLG K + VPMDP+   +A++     DA  Y RLIGR+         Q   L  SF +N+ SQ  + P LAH
Subjt:  IQLIEDTGLLGAKSTFVPMDPTTKLNAYNKNIIHDATPYVRLIGRIGASTQTPVAQSQDLVFSFLLNE-SQI-KVPSLAH

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.1e-0734.83Show/hide
Query:  NRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQ
        NR ++E  RS++    LP     DA  TA H+IN+ PS  ++   P +   +S P+  +     LR FGC AY+H    ++ K  PRA+
Subjt:  NRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINRMPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTCTGAGAATCCGGTAAACTCGTTCCATAATTTATCCTCTCCTTATGTGACTAATAAGGGGGCTCAATCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGG
CAACAACTATTTCTCATGGTCTTGGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCTACCAGCCAAGGATATTT
GGGACACAGCCCACACACTTTACTCAAAACGTCATAATGCCTCTCGTCTATACACTGAGAAAGCAAAACTAGTCTTGCGTGATCCCACTGATGGTGTGCAGTACTCGAGA
GTTGAAGAGAATGGCAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAAGGTGTATACTAGGCCAAAGACCGATTCCCTCCCTGATGAAAGT
TTGTTCTGAAATCCGCCTCGAGGAAGATTGGACAAGTGCTATGAATATTTCCACAACCCCTACTATTGACTCCGCTGCGTTTAGTGCAAGGTCTTCTAACAGTAGCATGA
ACAAGCATAATGGAAAACCAATTCCTGTCTGCAAGCATTGCAAAAAACAATGGCGTACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAGGTAAGAAACGC
CATTCCAACGAAAAACATAACACAGGGCAGACGTATGTGAGTGAGTCTGCTGAAACTCCTCAACAATCTGATCCACACAAAAACAAAACTGATCTCAATCTTGCCACTTT
AGGTGCCATTGTCCAATCAGATCATTTGACTGGGTCCTCTGAACATTTTGTGTCTTACATTCCTTGTGTTGGGAACGAGACAATTAGAATTGCAGATGGCTCATTGGCCC
CCATTGCTGGAAAGGGGAAGATTTCTCCCTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAGCTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCAT
GAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTATTCAGGACTTGAGCTCGGGGAGGATGACTAGCACTGCCCGACATAGTAGGGGACTCTACCTCCTTGA
TGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTACCAAACTTCGACTTCTGCGTTAGCAAGTTCTCTAA
AATTGAAATGGCTACCTTATCTTGAGATTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTT
ACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTACTATTCTTCGAAGTGA
TAATGGTCGGAAATTCCAAAAACATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGC
GAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACGTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGA
ATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGC
TTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGAGCTCAGGCAAGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCAC
CATCCAGAAAATACTTTGTCACTATGGATGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTTATCAAACCCACTCCTAGTGTTGTGTCTAACATCAATCCT
CATACCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGA
CTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAATGTGGAAG
AAAAGGATAGTGGTGATGAGATTGAGGTCCGAATAGAAACCCGTAATAATGAAGCGAAACGGGGTCATACAGGAAAACCGGATGAGTATGATTCCTCTCTTGACATTCCC
ATTGCTCTGAGAAAAGGCACCAGTTCTTTCTTGGCCTTGAATTGGAAAGAAGCTCATCTGGCCTGTGATATTCAACTTATTGAAGATACAGGCCTTCTAGGTGCAAAATC
AACATTTGTCCCTATGGATCCCACCACAAAATTGAATGCTTATAATAAAAACATTATTCATGATGCCACTCCATACGTACGTCTTATTGGTCGAATTGGGGCCTCTACCC
AGACACCTGTCGCTCAGTCACAGGATTTAGTGTTTTCCTTCCTTCTAAATGAATCACAAATTAAAGTTCCATCTCTAGCCCATCTGTTTTATGGCAATCAAGTCGCCATC
TACATTGCAACTCGCAGACACTTTTACAAAGCCTCTTCTAGCAAACATCTTGTTTCCATGGATAAGCAAGATGGGTGTCCAAGACATATTTGCCTCACCTTGAAGGGGAT
TATCAAAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTATTCTGAGAATCCGGTAAACTCGTTCCATAATTTATCCTCTCCTTATGTGACTAATAAGGGGGCTCAATCTTCCATGTATCATCTTTCAGGAGAAAAGTTGAATGG
CAACAACTATTTCTCATGGTCTTGGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAATACCTCGCCCCCTACCAGCCAAGGATATTT
GGGACACAGCCCACACACTTTACTCAAAACGTCATAATGCCTCTCGTCTATACACTGAGAAAGCAAAACTAGTCTTGCGTGATCCCACTGATGGTGTGCAGTACTCGAGA
GTTGAAGAGAATGGCAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAAGGTGTATACTAGGCCAAAGACCGATTCCCTCCCTGATGAAAGT
TTGTTCTGAAATCCGCCTCGAGGAAGATTGGACAAGTGCTATGAATATTTCCACAACCCCTACTATTGACTCCGCTGCGTTTAGTGCAAGGTCTTCTAACAGTAGCATGA
ACAAGCATAATGGAAAACCAATTCCTGTCTGCAAGCATTGCAAAAAACAATGGCGTACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAGGTAAGAAACGC
CATTCCAACGAAAAACATAACACAGGGCAGACGTATGTGAGTGAGTCTGCTGAAACTCCTCAACAATCTGATCCACACAAAAACAAAACTGATCTCAATCTTGCCACTTT
AGGTGCCATTGTCCAATCAGATCATTTGACTGGGTCCTCTGAACATTTTGTGTCTTACATTCCTTGTGTTGGGAACGAGACAATTAGAATTGCAGATGGCTCATTGGCCC
CCATTGCTGGAAAGGGGAAGATTTCTCCCTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAGCTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCAT
GAGTTAAACTGCAAAGCAATATTCTTACCTGATTCTGTCTCTATTCAGGACTTGAGCTCGGGGAGGATGACTAGCACTGCCCGACATAGTAGGGGACTCTACCTCCTTGA
TGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTACCAAACTTCGACTTCTGCGTTAGCAAGTTCTCTAA
AATTGAAATGGCTACCTTATCTTGAGATTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTT
ACCTGGGTCTACCTTATCACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTACTATTCTTCGAAGTGA
TAATGGTCGGAAATTCCAAAAACATAACCTTAGTGAATTTCTTGCTTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTCCTCAACAAAATGGAGTGGCCGAGC
GAAAAAACCGTCACCTTCTGGAAGTAGCCCGTTCCCTTATGCTTTCTACTTCCCTTCCTTCATACGTGTGGGGAGATGCTATTCTTACAGCAGCTCATTTAATCAATAGA
ATGCCTTCTCGTATTCTTCATCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCCTACCCATCGACTCGTCATGTTTCTGAGGTTCCTCTTCGTGTGTTTGGGTGTACCGC
TTATGTCCATAATTTTGGCCCTAATCAAACCAAATTTACCCCTCGAGCTCAGGCAAGTGTGTTTGTTGGGTATCCCCCTCACCAGCGTGGTTATAAATGTTTTCACCCAC
CATCCAGAAAATACTTTGTCACTATGGATGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTTATCAAACCCACTCCTAGTGTTGTGTCTAACATCAATCCT
CATACCATAGTCCTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGA
CTCTGAACCTCCTCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAATGTGGAAG
AAAAGGATAGTGGTGATGAGATTGAGGTCCGAATAGAAACCCGTAATAATGAAGCGAAACGGGGTCATACAGGAAAACCGGATGAGTATGATTCCTCTCTTGACATTCCC
ATTGCTCTGAGAAAAGGCACCAGTTCTTTCTTGGCCTTGAATTGGAAAGAAGCTCATCTGGCCTGTGATATTCAACTTATTGAAGATACAGGCCTTCTAGGTGCAAAATC
AACATTTGTCCCTATGGATCCCACCACAAAATTGAATGCTTATAATAAAAACATTATTCATGATGCCACTCCATACGTACGTCTTATTGGTCGAATTGGGGCCTCTACCC
AGACACCTGTCGCTCAGTCACAGGATTTAGTGTTTTCCTTCCTTCTAAATGAATCACAAATTAAAGTTCCATCTCTAGCCCATCTGTTTTATGGCAATCAAGTCGCCATC
TACATTGCAACTCGCAGACACTTTTACAAAGCCTCTTCTAGCAAACATCTTGTTTCCATGGATAAGCAAGATGGGTGTCCAAGACATATTTGCCTCACCTTGAAGGGGAT
TATCAAAAAATAA
Protein sequenceShow/hide protein sequence
MYSENPVNSFHNLSSPYVTNKGAQSSMYHLSGEKLNGNNYFSWSWSVKMVLEGRQKFSFLTGEIPRPLPAKDIWDTAHTLYSKRHNASRLYTEKAKLVLRDPTDGVQYSR
VEENGRIYDFLAGLNPKFDVVRRCILGQRPIPSLMKVCSEIRLEEDWTSAMNISTTPTIDSAAFSARSSNSSMNKHNGKPIPVCKHCKKQWRTKEQCWKLHGRPPGGKKR
HSNEKHNTGQTYVSESAETPQQSDPHKNKTDLNLATLGAIVQSDHLTGSSEHFVSYIPCVGNETIRIADGSLAPIAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITH
ELNCKAIFLPDSVSIQDLSSGRMTSTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDYQTSTSALASSLKLKWLPYLEIDVWGPSKITTSSGKRWFVTFIDDHTRL
TWVYLITDKSEVSSMFQNFYHTIETQFHQKITILRSDNGRKFQKHNLSEFLASKGIVHQNSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYVWGDAILTAAHLINR
MPSRILHLQTPLDCLKESYPSTRHVSEVPLRVFGCTAYVHNFGPNQTKFTPRAQASVFVGYPPHQRGYKCFHPPSRKYFVTMDGESVSEESNNTFEFIKPTPSVVSNINP
HTIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAKRGHTGKPDEYDSSLDIP
IALRKGTSSFLALNWKEAHLACDIQLIEDTGLLGAKSTFVPMDPTTKLNAYNKNIIHDATPYVRLIGRIGASTQTPVAQSQDLVFSFLLNESQIKVPSLAHLFYGNQVAI
YIATRRHFYKASSSKHLVSMDKQDGCPRHICLTLKGIIKK