CuGenDBv2

CSPI05G02130 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G02130
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Beta-galactosidase
Genome location: Chr5:2877194..2881807
RNA-Seq Expression: CSPI05G02130
Synteny: CSPI05G02130
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025363.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value 0.0e+00; identity 69.66%
Query:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF
        MVSE+ N  TLE    +T  E +           VAAA     +AA+++LL  LQK        +PQ  APP D    H P   G   H  P      PF
Subjt:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF

Query:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L
          +   +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T L
Subjt:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L

Query:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP
        PMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKP
Subjt:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP

Query:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG
        LL+AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V G
Subjt:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG

Query:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV
        RILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+
Subjt:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV

Query:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL
        SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G 
Subjt:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL

Query:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM
        +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM
Subjt:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM

Query:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ
        +HLFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQ
Subjt:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ

Query:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------
        FH KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT                                                             
Subjt:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------

Query:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP
                                                                       VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EP
Subjt:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP

Query:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN
        T   VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+
Subjt:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN

Query:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK
        EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHK
Subjt:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK

Query:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-
        TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT+ VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V 
Subjt:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-

Query:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
                                                  SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
Subjt:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE

Query:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG
        GISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPG
Subjt:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG

Query:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI
        KGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAI
Subjt:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI

Query:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        SIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

KAA0050140.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value 0.0e+00; identity 70.88%
Query:  MDELLSRLQKTSENNFSSLPQSSAPPPDHHEPGFLPHTAPTIPSVQPFSSSTAYIAPHAPIYVL---PSNSNRLPPLLPSNLYGQPPNDPSYHPD-VKNS
        M++LL  LQK        +PQ  APPP H     +P  AP+   VQP S+ + +  PHAP       PS  N         LY  P   P +  + +   
Subjt:  MDELLSRLQKTSENNFSSLPQSSAPPPDHHEPGFLPHTAPTIPSVQPFSSSTAYIAPHAPIYVL---PSNSNRLPPLLPSNLYGQPPNDPSYHPD-VKNS

Query:  QIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKM
        Q  S  E GESS +S                                + LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM
Subjt:  QIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKM

Query:  VLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNK
         LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNK
Subjt:  VLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNK

Query:  LSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSS
        LSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  S
Subjt:  LSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSS

Query:  DKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWI
        DK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWI
Subjt:  DKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWI

Query:  LDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTA
        LDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTA
Subjt:  LDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTA

Query:  RHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVW
        RHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVW
Subjt:  RHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVW

Query:  GPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-----------
        GPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT           
Subjt:  GPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEP
                     VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EP
Subjt:  -------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEP

Query:  PRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRA
        PRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRA
Subjt:  PRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRA

Query:  FTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT
        FTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT
Subjt:  FTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT

Query:  IIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-------------------------------------------SKTGKIAV
        + VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V                                           SKTGKIA+
Subjt:  IIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-------------------------------------------SKTGKIAV

Query:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ
        LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQ
Subjt:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ

Query:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ
        RLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQ
Subjt:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGL
        SVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGL
Subjt:  SVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGL

Query:  LRPNFDFCVSKLGLIDIYVPT
        LRP+FD CVSKLGLIDIY+PT
Subjt:  LRPNFDFCVSKLGLIDIYVPT

KAA0052775.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value 0.0e+00; identity 69.66%
Query:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF
        MVSE+ N  TLE    +T  E  TE  A               +AAM++LL  LQK        +PQ  APP D    H P   G   H  P      PF
Subjt:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF

Query:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L
          +   +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T L
Subjt:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L

Query:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP
        PMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKP
Subjt:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP

Query:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG
        LL+A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V G
Subjt:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG

Query:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV
        RILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+
Subjt:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV

Query:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL
        SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G 
Subjt:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL

Query:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM
        +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM
Subjt:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM

Query:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ
        +HLFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQ
Subjt:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ

Query:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------
        FH KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT                                                             
Subjt:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------

Query:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP
                                                                       VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EP
Subjt:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP

Query:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN
        T   VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+
Subjt:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN

Query:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK
        EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHK
Subjt:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK

Query:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-
        TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT+ VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V 
Subjt:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-

Query:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
                                                  SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
Subjt:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE

Query:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG
        GISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPG
Subjt:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG

Query:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI
        KGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAI
Subjt:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI

Query:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        SIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

TYK11240.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value 0.0e+00; identity 69.74%
Query:  SERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPFSS
        SE+ N  TLE    +T  E      A + +A ++AA+DA ++AAM++LL  LQK        +PQ  APP D    H P   G   H  P      PF  
Subjt:  SERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPFSS

Query:  STAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPM
        +   +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPM
Subjt:  STAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPM

Query:  YSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLL
        YS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL
Subjt:  YSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLL

Query:  FAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRI
        +A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRI
Subjt:  FAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRI

Query:  LGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSE
        LGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+SE
Subjt:  LGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSE

Query:  S--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSL
        +  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L
Subjt:  S--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSL

Query:  HNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKH
         NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+H
Subjt:  HNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKH

Query:  LFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFH
        LFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH
Subjt:  LFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFH

Query:  QKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT---------------------------------------------------------------
         KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT                                                               
Subjt:  QKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT---------------------------------------------------------------

Query:  -------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTP
                                                                     VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT 
Subjt:  -------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTP

Query:  SVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA
          VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+EEK+  DE EVRIET N+EA
Subjt:  SVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA

Query:  EQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTV
        EQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTV
Subjt:  EQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTV

Query:  GCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV---
        GCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT+ VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V   
Subjt:  GCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV---

Query:  ----------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI
                                                SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI
Subjt:  ----------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI

Query:  SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKG
        SVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKG
Subjt:  SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKG

Query:  LMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISI
        LMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAISI
Subjt:  LMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISI

Query:  ANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

TYK23439.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value 0.0e+00; identity 69.78%
Query:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF
        MVSE+ N  TLE    +T  E +        +AA AAA+DA ++AA+++LL  LQK        +PQ  APP D    H P   G   H  P      PF
Subjt:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF

Query:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L
          +   +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T L
Subjt:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L

Query:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP
        PMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKP
Subjt:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP

Query:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG
        LL+A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V G
Subjt:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG

Query:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV
        RILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+
Subjt:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV

Query:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL
        SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G 
Subjt:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL

Query:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM
        +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM
Subjt:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM

Query:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ
        +HLFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQ
Subjt:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ

Query:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------
        FH KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT                                                             
Subjt:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------

Query:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP
                                                                       VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EP
Subjt:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP

Query:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN
        T   VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+EEK+  DE EVRIET N+
Subjt:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN

Query:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK
        EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHK
Subjt:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK

Query:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-
        TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT+ VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V 
Subjt:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-

Query:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
                                                  SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
Subjt:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE

Query:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG
        GISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPG
Subjt:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG

Query:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI
        KGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAI
Subjt:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI

Query:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        SIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SL21 Beta-galactosidase; e value 0.0e+00; identity 69.66%
Query:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF
        MVSE+ N  TLE    +T  E +           VAAA     +AA+++LL  LQK        +PQ  APP D    H P   G   H  P      PF
Subjt:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF

Query:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L
          +   +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T L
Subjt:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L

Query:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP
        PMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKP
Subjt:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP

Query:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG
        LL+AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V G
Subjt:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG

Query:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV
        RILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+
Subjt:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV

Query:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL
        SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G 
Subjt:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL

Query:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM
        +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM
Subjt:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM

Query:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ
        +HLFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQ
Subjt:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ

Query:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------
        FH KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT                                                             
Subjt:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------

Query:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP
                                                                       VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EP
Subjt:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP

Query:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN
        T   VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+
Subjt:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN

Query:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK
        EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHK
Subjt:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK

Query:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-
        TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT+ VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V 
Subjt:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-

Query:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
                                                  SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
Subjt:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE

Query:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG
        GISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPG
Subjt:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG

Query:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI
        KGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAI
Subjt:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI

Query:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        SIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

A0A5A7U4D7 | Beta-galactosidase | 0.0e+00 | 70.88
Query:  MDELLSRLQKTSENNFSSLPQSSAPPPDHHEPGFLPHTAPTIPSVQPFSSSTAYIAPHAPIYVL---PSNSNRLPPLLPSNLYGQPPNDPSYHPD-VKNS
        M++LL  LQK        +PQ  APPP H     +P  AP+   VQP S+ + +  PHAP       PS  N         LY  P   P +  + +   
Subjt:  MDELLSRLQKTSENNFSSLPQSSAPPPDHHEPGFLPHTAPTIPSVQPFSSSTAYIAPHAPIYVL---PSNSNRLPPLLPSNLYGQPPNDPSYHPD-VKNS

Query:  QIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKM
        Q  S  E GESS +S                                + LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM
Subjt:  QIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKM

Query:  VLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNK
         LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNK
Subjt:  VLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNK

Query:  LSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSS
        LSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  S
Subjt:  LSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSS

Query:  DKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWI
        DK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWI
Subjt:  DKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWI

Query:  LDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTA
        LDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTA
Subjt:  LDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTA

Query:  RHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVW
        RHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVW
Subjt:  RHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVW

Query:  GPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-----------
        GPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT           
Subjt:  GPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEP
                     VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT   VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EP
Subjt:  -------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEP

Query:  PRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRA
        PRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRA
Subjt:  PRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRA

Query:  FTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT
        FTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT
Subjt:  FTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT

Query:  IIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-------------------------------------------SKTGKIAV
        + VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V                                           SKTGKIA+
Subjt:  IIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-------------------------------------------SKTGKIAV

Query:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ
        LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQ
Subjt:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ

Query:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ
        RLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQ
Subjt:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGL
        SVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGL
Subjt:  SVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGL

Query:  LRPNFDFCVSKLGLIDIYVPT
        LRP+FD CVSKLGLIDIY+PT
Subjt:  LRPNFDFCVSKLGLIDIYVPT

A0A5A7UGB2 | Beta-galactosidase | 0.0e+00 | 69.66
Query:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF
        MVSE+ N  TLE    +T  E  TE  A               +AAM++LL  LQK        +PQ  APP D    H P   G   H  P      PF
Subjt:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF

Query:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L
          +   +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T L
Subjt:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L

Query:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP
        PMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKP
Subjt:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP

Query:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG
        LL+A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V G
Subjt:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG

Query:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV
        RILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+
Subjt:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV

Query:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL
        SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G 
Subjt:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL

Query:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM
        +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM
Subjt:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM

Query:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ
        +HLFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQ
Subjt:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ

Query:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------
        FH KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT                                                             
Subjt:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------

Query:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP
                                                                       VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EP
Subjt:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP

Query:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN
        T   VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++AVLEN+EEK+  DE EVRIET N+
Subjt:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN

Query:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK
        EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHK
Subjt:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK

Query:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-
        TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT+ VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V 
Subjt:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-

Query:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
                                                  SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
Subjt:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE

Query:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG
        GISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPG
Subjt:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG

Query:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI
        KGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAI
Subjt:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI

Query:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        SIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

A0A5D3CIR0 | Beta-galactosidase | 0.0e+00 | 69.74
Query:  SERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPFSS
        SE+ N  TLE    +T  E      A + +A ++AA+DA ++AAM++LL  LQK        +PQ  APP D    H P   G   H  P      PF  
Subjt:  SERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPFSS

Query:  STAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPM
        +   +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPM
Subjt:  STAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPM

Query:  YSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLL
        YS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL
Subjt:  YSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLL

Query:  FAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRI
        +A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRI
Subjt:  FAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRI

Query:  LGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSE
        LGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+SE
Subjt:  LGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSE

Query:  S--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSL
        +  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G +L
Subjt:  S--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSL

Query:  HNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKH
         NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM+H
Subjt:  HNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKH

Query:  LFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFH
        LFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQFH
Subjt:  LFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFH

Query:  QKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT---------------------------------------------------------------
         KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT                                                               
Subjt:  QKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT---------------------------------------------------------------

Query:  -------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTP
                                                                     VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EPT 
Subjt:  -------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTP

Query:  SVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA
          VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+EEK+  DE EVRIET N+EA
Subjt:  SVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEA

Query:  EQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTV
        EQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHKTV
Subjt:  EQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTV

Query:  GCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV---
        GCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT+ VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V   
Subjt:  GCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV---

Query:  ----------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI
                                                SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI
Subjt:  ----------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGI

Query:  SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKG
        SVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPGKG
Subjt:  SVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKG

Query:  LMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISI
        LMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAISI
Subjt:  LMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISI

Query:  ANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

A0A5D3DJM7 | Beta-galactosidase | 0.0e+00 | 69.78
Query:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF
        MVSE+ N  TLE    +T  E +        +AA AAA+DA ++AA+++LL  LQK        +PQ  APP D    H P   G   H  P      PF
Subjt:  MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDH---HEP---GFLPHTAPTIPSVQPF

Query:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L
          +   +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T L
Subjt:  SSSTAYIAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-L

Query:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP
        PMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQS+KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKP
Subjt:  PMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKP

Query:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG
        LL+A TAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V G
Subjt:  LLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRG

Query:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV
        RILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAFSARSSN  SDK+NGK IPVCEHCKKQWHTK+QCWKLHGRPPG KKR SN+KQN+GRAY+
Subjt:  RILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYV

Query:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL
        SE+  A   Q +DP  +QT     TLGAI QSG+P S GL+S+DGKNPWILDSGATDHLTGSSEHF+SY PCAGNE IRIADGSLAP+AGKG+I P  G 
Subjt:  SES--AEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGL

Query:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM
        +L NVLHVPKLSYNLLSISKIT EL+CKAIFLP+SV FQD+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLSSYF+TSEQDCMLWHFRLGHPNF YM
Subjt:  SLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYM

Query:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ
        +HLFPHLFSKV++++LSCDVCI+AKQHRVSFPSQPYKPTQPF L+HSDVWGPSK+TTSSGKRWFVTFIDDHTRLTWVYLI+DKSEV S+FQNFYHTI+TQ
Subjt:  KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQ

Query:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------
        FH KIAILRSDNGREFQNHNLSEFLASKGIVHQ SCAYT                                                             
Subjt:  FHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT-------------------------------------------------------------

Query:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP
                                                                       VTMDVTFCE+RPYFPVSHLQGE+VSEESNNTFEF+EP
Subjt:  ---------------------------------------------------------------VTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEP

Query:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN
        T   VS+I PH I+LPTNQVPWKTYYRRN +KEVGSPTSQPPAPVQ+ EPPRDQGMENPT+PCT N +SEND+S++A LEN+EEK+  DE EVRIET N+
Subjt:  TPSVVSNIIPHSIVLPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNN

Query:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK
        EAEQGHT K DEYD SLDIPIALRKGTRSCTKHPICNYVSYD+LSPQFRAFTA+LDSTIIPK+IYTAL+ PEWKNAVMEEMKALEKN TW+IC LPKGHK
Subjt:  EAEQGHTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHK

Query:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-
        TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNT+ VLLSVAVNKDW LYQLD+KNAFLNGDLVEEVYMSPPPGFEAQFGQ V 
Subjt:  TVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHV-

Query:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
                                                  SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE
Subjt:  ------------------------------------------SKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE

Query:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG
        GISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+HM+AVNRILRYLK+TPG
Subjt:  GISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPG

Query:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI
        KGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+D+HQECETPLKLF DNKAAI
Subjt:  KGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAI

Query:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        SIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  SIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

SwissProt top hits | e value | %identity | Alignment
P04146 | Copia protein | 7.3e-108 | 24.98
Query:  NGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHE
        +G  Y  W   ++ +L  +     + G +P  +     +  WK  +   +S +I  +            TA+ I +    +Y ++  AS+L  LRK++  
Subjt:  NGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHE

Query:  CK-QGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFD-----------------VVRGRILGQRPIPSLMEVCSEI
         K    M + S F+         +L  EL+          ++IEE D+I   L  L   +D                  V+ R+L Q           EI
Subjt:  CK-QGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFD-----------------VVRGRILGQRPIPSLMEVCSEI

Query:  RLEEDRT-------SAMNISATPTIDSAAFSARSSNSSS-DKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQS
        +++ D         +A+  +   T  +  F  R +      K N K    C HC ++ H K+ C+  H +   + K   N+KQ                 
Subjt:  RLEEDRT-------SAMNISATPTIDSAAFSARSSNSSS-DKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQS

Query:  DPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNP-------WILDSGATDHLTGSSEHFVSYIPCAGNETIRIA-DGSLAPVAGKG--KISPCAGLSL
                         VQ+   H    +  +  N        ++LDSGA+DHL      +   +       I +A  G       +G  ++     ++L
Subjt:  DPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNP-------WILDSGATDHLTGSSEHFVSYIPCAGNETIRIA-DGSLAPVAGKG--KISPCAGLSL

Query:  HNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLS-SYFTTSEQDCMLWHFRLGHPNFQYM-
         +VL   + + NL+S+ ++              +S +   SG  I       GL ++ +    +++P  +  + S     + +  LWH R GH +   + 
Subjt:  HNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLS-SYFTTSEQDCMLWHFRLGHPNFQYM-

Query:  ----KHLF--PHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKP--TQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQN
            K++F    L + +E++   C+ C+  KQ R+ F     K    +P  +VHSDV GP    T   K +FV F+D  T     YLI  KS+V SMFQ+
Subjt:  ----KHLF--PHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKP--TQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQN

Query:  FYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT------------------------VTMDVTFCED-------------------
        F    E  F+ K+  L  DNGRE+ ++ + +F   KGI +  +  +T                          +D +F  +                   
Subjt:  FYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT------------------------VTMDVTFCED-------------------

Query:  ------------RPYFP------------VSHLQGESVSEESNNTFEFIEPT----------PSVVSN--IIPHSIVLPTNQVPWKTYYRRNHKKEVGSP
                    +PY              + + QG+   +   + F   EP             +V+   ++  + ++ +  V ++T + ++ K+     
Subjt:  ------------RPYFP------------VSHLQGESVSEESNNTFEFIEPT----------PSVVSN--IIPHSIVLPTNQVPWKTYYRRNHKKEVGSP

Query:  TSQPPAPVQDSEPPRD-------QGMENPTEPCTKN------------------------MISENDRSNVAVLENVEEKDSGDEI-EVRIETRNNEAEQG
               +  +E P +       Q +++  E   KN                         + ++  SN   L   +++   D + E +     NE+ + 
Subjt:  TSQPPAPVQDSEPPRD-------QGMENPTEPCTKN------------------------MISENDRSNVAVLENVEEKDSGDEI-EVRIETRNNEAEQG

Query:  HTGKS------DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSP-QFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKG
         T +       D    +  I I  R+  R  TK  I      +SL+     A T   D      +I        W+ A+  E+ A + N+TW I   P+ 
Subjt:  HTGKS------DEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSP-QFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKG

Query:  HKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGF-------
           V  +WVFS+KY   G   R+KARLVA+GFTQ Y IDY ETF+PVA++++   +LS+ +  +  ++Q+D+K AFLNG L EE+YM  P G        
Subjt:  HKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGF-------

Query:  --------------------------EAQFGQ-------HVSKTGKI---AVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARS
                                  E +F         ++   G I     +++YVDD+V+   D   ++  K+ + ++F + DL  +K+F+G+ +   
Subjt:  --------------------------EAQFGQ-------HVSKTGKI---AVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARS

Query:  KEGISVSQRKYILDLLTETGMLGCRPTDTPI--EFNCKLGNSDDQVPVDKEQYQRLVGKLIYLS-HTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYL
        ++ I +SQ  Y+  +L++  M  C    TP+  + N +L NSD+         + L+G L+Y+   TRPD++ AV+++S++    N E  + + R+LRYL
Subjt:  KEGISVSQRKYILDLLTETGMLGCRPTDTPI--EFNCKLGNSDDQVPVDKEQYQRLVGKLIYLS-HTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYL

Query:  KSTPGKGLMFRK--TDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWG-NLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKL
        K T    L+F+K       I  Y DSDWAGS +DRKST+GY   ++  NL+ W +K+Q+ VA SS EAEY A+   + E +WL+ +LT ++ + E P+K+
Subjt:  KSTPGKGLMFRK--TDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWG-NLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKL

Query:  FYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLI
        + DN+  ISIANNP  H R KH++I  HF +E++ +  IC+ YIP+  Q+AD+ TK L    F     KLGL+
Subjt:  FYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLI

P10978 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.2e-119 | 28.58
Query:  KLNGNNYFS-WSQSVK--MVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLR
        K NG+N FS W + ++  ++ +G  K   +  + P  +  +     W   D    S +   +   +   ++   TA+ IW   ++LY  +   ++LY L+
Subjt:  KLNGNNYFS-WSQSVK--MVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLR

Query:  KQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED-RTSAMNI
        KQ++       + T+F + L++          L+ +    GV   +IEE D+    L  L   +D +   IL  +    L +V S + L E  R    N 
Subjt:  KQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEED-RTSAMNI

Query:  SATPTIDSAAFS-ARSSNS--------SSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQSDPHKNQTDLSL
              +    S  RSSN+         S   +   +  C +C +  H K  C       P  +K      + +G+     +A   Q +D          
Subjt:  SATPTIDSAAFS-ARSSNS--------SSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQSDPHKNQTDLSL

Query:  ATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGN-ETIRIADGSLAPVAGKG----KISPCAGLSLHNVLHVPKLSYNLLSI
          L    +    H  G      ++ W++D+ A+ H T   + F  Y+  AG+  T+++ + S + +AG G    K +    L L +V HVP L  NL+S 
Subjt:  ATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGN-ETIRIADGSLAPVAGKG----KISPCAGLSLHNVLHVPKLSYNLLSI

Query:  SKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRG-LYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLF-PHLFSKVEMTTL
          +  +   ++ F         L+ G ++     +RG LY  + +     +             E    LWH R+GH + + ++ L    L S  + TT+
Subjt:  SKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRG-LYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLF-PHLFSKVEMTTL

Query:  S-CDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGRE
          CD C+  KQHRVSF +   +      LV+SDV GP +I +  G ++FVTFIDD +R  WVY++  K +V  +FQ F+  +E +  +K+  LRSDNG E
Subjt:  S-CDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGRE

Query:  FQNHNLSEFLASKGIVHQNSCAYT-------VTMDVTFCED-RPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHK
        + +    E+ +S GI H+ +   T         M+ T  E  R    ++ L      E        I  +PSV     P +  +P  +V        +H 
Subjt:  FQNHNLSEFLASKGIVHQNSCAYT-------VTMDVTFCED-RPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIVLPTNQVPWKTYYRRNHK

Query:  KEVGS------PTSQPPAPVQDSEPPRDQGMENPT------EPCTKNMISEND----RSNVAVLENVEEKDSGDEIE--VRIETRNNE-----------A
        K  G       P  Q       S P    G  +        +P  K +I   D     S V    ++ EK     I   V I + +N            +
Subjt:  KEVGS------PTSQPPAPVQDSEPPRDQGMENPT------EPCTKNMISEND----RSNVAVLENVEEKDSGDEIE--VRIETRNNE-----------A

Query:  EQG-HTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTAS----------LDSTIIPKDIYTALKYPEWKN----AVMEEMKALEKN
        EQG   G+  E    LD      +G     +HP      +  L    R    S          +     P+ +   L +PE KN    A+ EEM++L+KN
Subjt:  EQG-HTGKSDEYDSSLDIPIALRKGTRSCTKHPICNYVSYDSLSPQFRAFTAS----------LDSTIIPKDIYTALKYPEWKN----AVMEEMKALEKN

Query:  STWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSP
         T+ +  LPKG + + CKWVF LK   D  L R+KARLV KGF Q  GID+ E FSPV K+ +I  +LS+A + D  + QLD+K AFL+GDL EE+YM  
Subjt:  STWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSP

Query:  PPGFEAQFGQH-VSKTGK-------------------------------------------IAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGN
        P GFE    +H V K  K                                             +L++YVDD+++ G D+  I++LK  +   F++KDLG 
Subjt:  PPGFEAQFGQH-VSKTGK-------------------------------------------IAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGN

Query:  LKYFLGMEVARSKEG--ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNS------DDQVPVDKEQYQRLVGKLIY-LSHTRPDISFAVSVVSQFM
         +  LGM++ R +    + +SQ KYI  +L    M   +P  TP+  + KL         +++  + K  Y   VG L+Y +  TRPDI+ AV VVS+F+
Subjt:  LKYFLGMEVARSKEG--ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNS------DDQVPVDKEQYQRLVGKLIY-LSHTRPDISFAVSVVSQFM

Query:  QTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQK
        + P +EH +AV  ILRYL+ T G  L F  +D   ++ YTD+D AG + +RKS++GY     G  ++W+SK Q  VA S+ EAEY A +    E IWL++
Subjt:  QTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQK

Query:  VLTD--MHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGL
         L +  +HQ+      ++ D+++AI ++ N + H RTKH+++  H+I+E +D  S+ +  I +++  AD+LTK + R  F+ C   +G+
Subjt:  VLTD--MHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGL

P92519 Uncharacterized mitochondrial protein AtMg00810 | 1.4e-45 | 40.62
Query:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ
        L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY   +L   GML C+P  TP+        S  + P D   ++
Subjt:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ

Query:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL+ TRPDIS+AV++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+ST+G+CTF+  N+++W +K+Q
Subjt:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVARSSAEAEYRAMSLGICEEIW
          V+RSS E EYRA++L   E  W
Subjt:  SVVARSSAEAEYRAMSLGICEEIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.2e-145 | 27.62
Query:  KLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPG---------DPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNAS
        KL   NY  WS+ V  + +G +   FL G    P            +P    WK +D ++ S ++ ++   +   +  A TA  IW+T + +Y+   +  
Subjt:  KLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPG---------DPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNAS

Query:  RLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRT
         +  LR Q+ +  +GT  +  +   L   + ++ L  + +  D     Q  R+ EN         L  ++  V  +I  +   P+L E+   +   E + 
Subjt:  RLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRT

Query:  SAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKE-QCWKLHGRPPGSKKRP-------------SNDKQNTGRAYVS--ESAEPPQQ
         A++ +    I + A S R++ ++++ +NG      ++     ++K  Q    +  P  ++ +P             S  + +  + ++S   S +PP  
Subjt:  SAMNISATPTIDSAAFSARSSNSSSDKHNGKPIPVCEHCKKQWHTKE-QCWKLHGRPPGSKKRP-------------SNDKQNTGRAYVS--ESAEPPQQ

Query:  SDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKIS---PCAGLSLHNVLHV
          P + + +L+L         G P+S         N W+LDSGAT H+T    +   + P  G + + +ADGS  P++  G  S       L+LHN+L+V
Subjt:  SDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKIS---PCAGLSLHNVLHV

Query:  PKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLF
        P +  NL+S+ ++ +       F P S   +DL++G  +   +    LY    +   +S    SL +S   +S+     WH RLGHP    +  +  +  
Subjt:  PKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLF

Query:  SKV---EMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKI
          V       LSC  C+  K ++V F       T+P   ++SDVW  S I +    R++V F+D  TR TW+Y +  KS+V   F  F + +E +F  +I
Subjt:  SKV---EMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSSMFQNFYHTIETQFHQKI

Query:  AILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT--------------VTMDVTFCEDR-------PY-----------FPVSHLQGESVSEESNNT---
            SDNG EF    L E+ +  GI H  S  +T              V   +T            PY            P   LQ ES  ++   T   
Subjt:  AILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT--------------VTMDVTFCEDR-------PY-----------FPVSHLQGESVSEESNNT---

Query:  -----------FEFIEP--------------------TPSVV-------------------SNIIPHSIVLPT----------NQVPWKTYYRRNHKKEV
                   + ++ P                    T S                      N  P S  L T          +   W  +     +  V
Subjt:  -----------FEFIEP--------------------TPSVV-------------------SNIIPHSIVLPT----------NQVPWKTYYRRNHKKEV

Query:  ------------GSPTSQPPAPVQDSE---------------------PPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNE
                     +P S P AP ++S+                      PR  G +  T+P T+     +   N +      E  S     +    +++ 
Subjt:  ------------GSPTSQPPAPVQDSE---------------------PPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNE

Query:  AEQGHTGKSDEYDSSLDIP------------IALRKGTRSCTKHPICNYVSYDSLSPQFR-AFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNS
        +    T  +    +S   P            I           H +        + P  + +   SL +   P+    ALK   W+NA+  E+ A   N 
Subjt:  AEQGHTGKSDEYDSSLDIP------------IALRKGTRSCTKHPICNYVSYDSLSPQFR-AFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNS

Query:  TWDICTLPKGHKT-VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSP
        TWD+   P  H T VGC+W+F+ KY +DG+L+R+KARLVAKG+ Q  G+DY+ETFSPV K  +I ++L VAV++ W + QLD+ NAFL G L ++VYMS 
Subjt:  TWDICTLPKGHKT-VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSP

Query:  PPGF-------------EAQFG----------------------QHVSKTG--------KIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNL
        PPGF             +A +G                        VS T          I  ++VYVDDI++TG+D   +      +   F +KD   L
Subjt:  PPGF-------------EAQFG----------------------QHVSKTG--------KIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNL

Query:  KYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKA
         YFLG+E  R   G+ +SQR+YILDLL  T M+  +P  TP+  + KL         D  +Y+ +VG L YL+ TRPDIS+AV+ +SQFM  P EEH++A
Subjt:  KYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKA

Query:  VNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECE
        + RILRYL  TP  G+  +K +  ++ AY+D+DWAG   D  ST+GY  ++  + ++W SKKQ  V RSS EAEYR+++    E  W+  +LT++     
Subjt:  VNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECE

Query:  TPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGL
         P  ++ DN  A  +  NPV H R KH+ ID HFI+ ++ SG++ + ++ +  Q+AD LTK L R  F    SK+G+
Subjt:  TPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.4e-135 | 26.98
Query:  KLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRP--------LPG-DPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNAS
        KL   NY  WS+ V  + +G +   FL G  P P        +P  +P    W+ +D ++ S ++ ++   +   +  A TA  IW+T + +Y+   N S
Subjt:  KLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRP--------LPG-DPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNAS

Query:  RLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRT
          +  +          +   + F++L+L+ + MD                     ++++   L  L   +  V  +I  +   PSL E+   +   E + 
Subjt:  RLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRT

Query:  SAMNISATPTID------------------------------SAAFSARSSNSSSDKHNGKP-IPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQN
         A+N +    I                               S ++   SS S SD    KP +  C+ C  Q H+ ++C +LH              Q+
Subjt:  SAMNISATPTID------------------------------SAAFSARSSNSSSDKHNGKP-IPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQN

Query:  TGRAYVSESAEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKIS-
        T     S S   P Q  P  N                      + S    N W+LDSGAT H+T    +   + P  G + + IADGS  P+   G  S 
Subjt:  TGRAYVSESAEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLTGSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKIS-

Query:  --PCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLG
              L L+ VL+VP +  NL+S+ ++ +       F P S   +DL++G  +   +    LY    +   +S    S+ +S    S+     WH RLG
Subjt:  --PCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSSYFTTSEQDCMLWHFRLG

Query:  HPNFQYM-----KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSS
        HP+   +      H  P L    ++  LSC  C   K H+V F +     ++P   ++SDVW  S I +    R++V F+D  TR TW+Y +  KS+V  
Subjt:  HPNFQYM-----KHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLITDKSEVSS

Query:  MFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT--------------VTMDVTFCEDR-------PY---------------
         F  F   +E +F  +I  L SDNG EF    L ++L+  GI H  S  +T              V M +T            PY               
Subjt:  MFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYT--------------VTMDVTFCEDR-------PY---------------

Query:  -----FPVSHLQGESVSEESNNTF-----EFIEP-------TPSVVSNIIPHSIV--------LPTNQV-------------PWKT---------YYRRN
              P   L G+  + E    F      ++ P         S     + +S+         +PT ++             P+ T           R +
Subjt:  -----FPVSHLQGESVSEESNNTF-----EFIEP-------TPSVVSNIIPHSIV--------LPTNQV-------------PWKT---------YYRRN

Query:  HKKEVGSPTSQPPAPVQDSEPP-----RDQGMENPTEP---CTKNMIS---------------------------------ENDRSNVAVLENVEEKDSG
              S T+ P  P+    PP      D     P+ P   CT  + S                                 +N  SN  +L N       
Subjt:  HKKEVGSPTSQPPAPVQDSEPP-----RDQGMENPTEP---CTKNMIS---------------------------------ENDRSNVAVLENVEEKDSG

Query:  DE-------------IEVRIETRNNEAEQGHTGKSDEYDSS-----LDIPIALRKGTRS-CTKHPICNYVSYDSLSP-QFRAFTASLDSTIIPKDIYTAL
                           I T +    + ++  S    +      L  P  ++   ++    H +          P Q  ++  SL +   P+    A+
Subjt:  DE-------------IEVRIETRNNEAEQGHTGKSDEYDSS-----LDIPIALRKGTRS-CTKHPICNYVSYDSLSP-QFRAFTASLDSTIIPKDIYTAL

Query:  KYPEWKNAVMEEMKALEKNSTWDICTLPKGHKT-VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQ
        K   W+ A+  E+ A   N TWD+   P    T VGC+W+F+ K+ +DG+L+R+KARLVAKG+ Q  G+DY+ETFSPV K  +I ++L VAV++ W + Q
Subjt:  KYPEWKNAVMEEMKALEKNSTWDICTLPKGHKT-VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQ

Query:  LDIKNAFLNGDLVEEVYMSPPPGF-----------------------------------EAQFGQHVSKTG--------KIAVLIVYVDDIVLTGDDQAE
        LD+ NAFL G L +EVYMS PPGF                                      F   +S T          I  ++VYVDDI++TG+D   
Subjt:  LDIKNAFLNGDLVEEVYMSPPPGF-----------------------------------EAQFGQHVSKTG--------KIAVLIVYVDDIVLTGDDQAE

Query:  ISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLG-NSDDQVPVDKEQYQRLVGKLIYLSHTRPDI
        +      +   F +K+  +L YFLG+E  R  +G+ +SQR+Y LDLL  T ML  +P  TP+  + KL  +S  ++P D  +Y+ +VG L YL+ TRPD+
Subjt:  ISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLG-NSDDQVPVDKEQYQRLVGKLIYLSHTRPDI

Query:  SFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMS
        S+AV+ +SQ+M  P ++H  A+ R+LRYL  TP  G+  +K +  ++ AY+D+DWAG   D  ST+GY  ++  + ++W SKKQ  V RSS EAEYR+++
Subjt:  SFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMS

Query:  LGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLID
            E  W+  +LT++  +   P  ++ DN  A  +  NPV H R KH+ +D HFI+ ++ SG++ + ++ +  Q+AD LTK L R  F     K+G+I 
Subjt:  LGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLID

Query:  I
        +
Subjt:  I

Arabidopsis top hits | e value | % identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 1.8e-16 | 25.59
Query:  YLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTL
        YL   +   S + +     + +NY +W    +  L   +KF F+ G +P+P P  P  + W+  ++++   L+NSM  ++ + +++A TA  +W+  + +
Subjt:  YLTNTVAQSSMYHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTL

Query:  YSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCREL-------VWRDPTDGVQYSRIEENDRIYDFLAG--LNPKFDVVRGRILGQRPI
        +    +  ++Y LR+++   +QG   V  +F KLS +W E+     +          + T   + +R  E ++ Y+FL G  LN  F+ V  +I+ Q+P 
Subjt:  YSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCREL-------VWRDPTDGVQYSRIEENDRIYDFLAG--LNPKFDVVRGRILGQRPI

Query:  PSLMEVCSEIR
        PSL E  + ++
Subjt:  PSLMEVCSEIR

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 1.0e-112 | 40.72
Query:  SCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAK
        S T H I  ++SY+ +SP + +F   +     P     A ++  W  A+ +E+ A+E   TW+ICTLP   K +GCKWV+ +KY +DGT++R+KARLVAK
Subjt:  SCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAK

Query:  GFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFG--------------------------------
        G+TQ  GID+ ETFSPV KL ++ ++L+++   ++ L+QLDI NAFLNGDL EE+YM  PPG+ A+ G                                
Subjt:  GFTQTYGIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFG--------------------------------

Query:  --------QHVSKTGKIAV-------LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRP
                 H   T  + +       ++VYVDDI++  ++ A + +LK ++   F+++DLG LKYFLG+E+ARS  GI++ QRKY LDLL ETG+LGC+P
Subjt:  --------QHVSKTGKIAV-------LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRP

Query:  TDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGS
        +  P++ +           VD + Y+RL+G+L+YL  TR DISFAV+ +SQF + P   H +AV +IL Y+K T G+GL +       ++ ++D+ +   
Subjt:  TDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGS

Query:  VVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKE
           R+ST+GYC F+  +L++W+SKKQ VV++SSAEAEYRA+S    E +WL +   ++      P  LF DN AAI IA N V H+RTKH+E D H ++E
Subjt:  VVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISIANNPVQHDRTKHVEIDRHFIKE

Query:  K
        +
Subjt:  K

ATMG00240.1 Gag-Pol-related retrotransposon family protein | 4.1e-13 | 41.46
Query:  IYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFV
        +YL+ TRPD++FAV+ +SQF        M+AV ++L Y+K T G+GL +  T    ++A+ DSDWA     R+S +G+C+ V
Subjt:  IYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFV

ATMG00810.1 DNA/RNA polymerases superfamily protein | 9.7e-47 | 40.62
Query:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ
        L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY   +L   GML C+P  TP+        S  + P D   ++
Subjt:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ

Query:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL+ TRPDIS+AV++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+ST+G+CTF+  N+++W +K+Q
Subjt:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVARSSAEAEYRAMSLGICEEIW
          V+RSS E EYRA++L   E  W
Subjt:  SVVARSSAEAEYRAMSLGICEEIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 8.2e-22 | 52.04
Query:  PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVA
        PK +  ALK P W  A+ EE+ AL +N TW +   P     +GCKWVF  K  +DGTLDR KARLVAKGF Q  GI + ET+SPV +  TI  +L+VA
Subjt:  PKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTIIVLLSVA


Sequences
CDS sequence
ATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGTGACAGCCATCAGTTTTAGTGCTGCCGTAGCTGC
TGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGGCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGCCACCGG
ACCACCACGAGCCTGGTTTTCTTCCTCATACGGCACCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCTACGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTG
CCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAAC
ATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGG
CTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATG
TATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAAT
ACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTAT
TGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAG
CAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTA
CTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGA
TGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGT
AGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAA
GAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCTGATCCACACAAAAACCAAACTGATCTCAGTCTTG
CCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACT
GGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCGTTGCTGGAAAGGGGAAGATTTCTCCTTG
TGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTG
ATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGG
ACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCT
CTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTC
TTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATC
ACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCA
AAACCATAATCTTAGTGAATTTCTTGCGTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTC
CCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTC
CTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCC
TCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTG
GTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGA
AAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACC
AAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGG
GACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTAT
GGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTATAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCATTTATATCAGCTGGATATTAA
GAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTTTCCAAAACAGGAAAGATTGCTGTTC
TAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTG
AAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCC
CACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAATATCAACGTCTCGTGGGTAAATTAATTTACTTATCTC
ATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCA
ACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTA
TTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTG
AGGAAATTTGGCTTCAGAAAGTTTTGACAGATATGCATCAGGAATGTGAGACACCATTGAAGCTTTTCTATGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTT
CAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGA
TGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA
mRNA sequence
ATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGTGACAGCCATCAGTTTTAGTGCTGCCGTAGCTGC
TGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGGCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGCCACCGG
ACCACCACGAGCCTGGTTTTCTTCCTCATACGGCACCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCTACGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTG
CCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAAC
ATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGG
CTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATG
TATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTCAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAAT
ACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTAT
TGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAG
CAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTA
CTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGA
TGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCTGCTGCTTTTAGTGCAAGATCTTCTAACAGT
AGCAGTGACAAGCATAATGGAAAACCAATTCCTGTCTGCGAGCATTGCAAAAAACAATGGCATACCAAAGAACAATGTTGGAAGTTACATGGTCGTCCCCCAGGAAGTAA
GAAACGCCCTTCCAACGACAAACAGAACACAGGGCGGGCGTATGTGAGTGAGTCTGCTGAACCTCCTCAACAATCTGATCCACACAAAAACCAAACTGATCTCAGTCTTG
CCACTTTAGGTGCCATTGTCCAATCAGGTATACCTCATTCCTTCGGTCTTGTTAGTATTGATGGGAAGAACCCCTGGATTCTGGATTCTGGTGCCACAGATCATTTGACT
GGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGAACGAGACAATTAGAATTGCAGATGGCTCCTTGGCCCCCGTTGCTGGAAAGGGGAAGATTTCTCCTTG
TGCAGGGCTCTCCTTACATAATGTTTTGCATGTGCCCAAACTATCTTATAATTTGCTTTCGATAAGCAAGATCACTCATGAGTTAAACTGCAAAGCAATATTCTTACCTG
ATTCTGTCTCTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGG
ACTAGTCTCTTATCTTCCTATTTCACTACTTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCT
CTTCTCTAAAGTTGAGATGACTACCTTATCTTGTGATGTGTGTATTCAGGCCAAACAACATCGAGTCTCTTTTCCCTCACAACCATACAAACCAACCCAACCCTTCACTC
TTGTTCATAGTGATGTCTGGGGACCATCCAAGATAACAACCTCATCTGGAAAACGGTGGTTCGTAACCTTCATTGATGATCATACCCGTCTTACCTGGGTCTACCTTATC
ACTGATAAATCTGAGGTTTCCTCTATGTTTCAAAATTTCTATCACACCATTGAAACACAATTCCATCAAAAAATTGCTATTCTTCGGAGTGATAATGGTCGGGAATTCCA
AAACCATAATCTTAGTGAATTTCTTGCGTCCAAGGGGATTGTTCATCAAAACTCGTGCGCCTACACTGTCACTATGGATGTTACTTTCTGTGAGGATCGACCCTACTTTC
CCGTTAGCCATCTTCAGGGGGAGAGTGTGAGTGAAGAGTCTAACAACACCTTTGAATTCATCGAACCCACTCCTAGTGTTGTGTCTAACATCATTCCTCATTCCATAGTC
CTACCCACAAACCAAGTCCCCTGGAAAACGTACTACAGGAGGAATCACAAAAAGGAAGTCGGTTCCCCTACTAGTCAGCCGCCGGCTCCAGTCCAAGACTCTGAACCTCC
TCGAGATCAAGGTATGGAAAACCCTACTGAACCCTGTACTAAGAATATGATAAGTGAGAATGACAGGTCTAATGTTGCTGTTCTTGAAAACGTGGAAGAAAAGGACAGTG
GTGATGAGATTGAGGTCAGAATAGAAACCCGTAATAATGAAGCGGAACAGGGTCATACAGGAAAATCAGATGAGTATGATTCCTCTCTTGACATTCCCATTGCTCTGAGA
AAAGGCACCAGGTCTTGTACTAAACACCCCATTTGCAATTATGTTTCCTACGATAGTCTCTCTCCTCAGTTCAGAGCTTTTACAGCAAGCCTTGACTCTACCATAATACC
AAAAGATATCTACACTGCTTTAAAGTATCCTGAATGGAAGAATGCTGTCATGGAAGAGATGAAAGCTCTTGAAAAGAATAGTACTTGGGACATTTGTACTCTACCTAAGG
GACACAAAACTGTGGGATGCAAATGGGTGTTCTCTCTCAAATACAAAGCTGATGGTACTCTTGACAGACACAAGGCAAGGTTAGTTGCGAAGGGATTTACTCAAACCTAT
GGTATTGACTATTCAGAAACTTTTTCTCCAGTTGCTAAGTTGAATACTATTATAGTTCTGTTATCTGTTGCTGTGAACAAAGATTGGCATTTATATCAGCTGGATATTAA
GAATGCCTTTTTGAATGGAGACCTCGTAGAGGAAGTCTACATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTTTCCAAAACAGGAAAGATTGCTGTTC
TAATAGTTTATGTGGATGACATTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTG
AAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCC
CACTGACACTCCTATTGAATTCAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAATATCAACGTCTCGTGGGTAAATTAATTTACTTATCTC
ATACTCGTCCTGATATTTCCTTTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCA
ACACCTGGTAAAGGGCTGATGTTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTA
TTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTG
AGGAAATTTGGCTTCAGAAAGTTTTGACAGATATGCATCAGGAATGTGAGACACCATTGAAGCTTTTCTATGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTT
CAACATGATAGAACTAAACATGTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGA
TGTTCTTACCAAAGGGCTTCTCAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA
Protein sequence
MVSERDNENTLETQKNQTTYENQTEVTAISFSAAVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPPPDHHEPGFLPHTAPTIPSVQPFSSSTAYIAPHAPIYVL
PSNSNRLPPLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSM
YHLSGEKLNGNNYFSWSQSVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECK
QGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFSARSSNS
SSDKHNGKPIPVCEHCKKQWHTKEQCWKLHGRPPGSKKRPSNDKQNTGRAYVSESAEPPQQSDPHKNQTDLSLATLGAIVQSGIPHSFGLVSIDGKNPWILDSGATDHLT
GSSEHFVSYIPCAGNETIRIADGSLAPVAGKGKISPCAGLSLHNVLHVPKLSYNLLSISKITHELNCKAIFLPDSVSFQDLSSGRMIGTARHSRGLYLLDDDTSSSSIPR
TSLLSSYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDVCIQAKQHRVSFPSQPYKPTQPFTLVHSDVWGPSKITTSSGKRWFVTFIDDHTRLTWVYLI
TDKSEVSSMFQNFYHTIETQFHQKIAILRSDNGREFQNHNLSEFLASKGIVHQNSCAYTVTMDVTFCEDRPYFPVSHLQGESVSEESNNTFEFIEPTPSVVSNIIPHSIV
LPTNQVPWKTYYRRNHKKEVGSPTSQPPAPVQDSEPPRDQGMENPTEPCTKNMISENDRSNVAVLENVEEKDSGDEIEVRIETRNNEAEQGHTGKSDEYDSSLDIPIALR
KGTRSCTKHPICNYVSYDSLSPQFRAFTASLDSTIIPKDIYTALKYPEWKNAVMEEMKALEKNSTWDICTLPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTY
GIDYSETFSPVAKLNTIIVLLSVAVNKDWHLYQLDIKNAFLNGDLVEEVYMSPPPGFEAQFGQHVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNL
KYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKS
TPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDMHQECETPLKLFYDNKAAISIANNPV
QHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT