; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G009140 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G009140
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionBeta-galactosidase
Genome locationCG_Chr11:14522289..14527170
RNA-Seq ExpressionClCG11G009140
SyntenyClCG11G009140
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031941.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0052.89Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+  +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE DRVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA 
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SETT  S SQ  +   + + T +LGAIAQSG+ QSL L+S+  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS AT HLTG+S++F+SY P                                                    EL+C+  F P+ V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDF-------------------------------------------------------------
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE DF                                                             
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDF-------------------------------------------------------------

Query:  -----FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTS-------
             +L++DKSEV SIFQ FY TIKTQF+ KI+ILRSDNGREF  + L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+EVARSLMLSTS       
Subjt:  -----FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTS-------

Query:  -------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITF
               HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP AQ CVFVGYPLHQ GYKCFHP S+KYF++MD+TF
Subjt:  -------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITF

Query:  LEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVE
         E++P+FPVS L GE+ +EE N      EP + T      H  +LPT Q PW TYYR+NLRKE+ +PT + PAP+Q+ EP + QG    T P  +    E
Subjt:  LEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVE

Query:  NDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEI
        ND  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  +++SY+NLS +FRAF A+LD   IPKNIY  +E 
Subjt:  NDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEI

Query:  LEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDV
         EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAKGFTQT+G+DYS+TFSP+AKLNT+RVLLSVAVNKDWPLYQ DV
Subjt:  LEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDV

Query:  KNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSVSGKVAVLIVYVDDIVLSGDDTAEINK
        KNAFLN  L E VYMSPPPGFEAQF Q V                                           S +GK+A+LIVYVDDIVL+GDD  EI++
Subjt:  KNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSVSGKVAVLIVYVDDIVLSGDDTAEINK

Query:  LKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAV
        LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG S D+VPV+KE+YQRLVG+L YL+HT+PDIS+AV
Subjt:  LKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAV

Query:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGI
        SVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVA+SSAEAEYRAMS GI
Subjt:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGI

KAA0034386.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0051.88Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+I +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE DRVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA 
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SETT  S SQ  +   + + T +LGAIAQSG+ QSL L+S+  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS AT HLTG+S++F+SY P                                                    EL+C+  F P+ V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSEND--------------------------------------------------------------
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE D                                                              
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSEND--------------------------------------------------------------

Query:  ------------------------------FFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVA
                                       +L++DKSEV SIFQ FY TIKTQF+ KI+ILRSDNGREF  + L EFL++KGIVHQ+SCAYTPQQNGVA
Subjt:  ------------------------------FFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVA

Query:  ERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVG
        ERKNRHL+EVARSLMLSTS              HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP AQ CVFVG
Subjt:  ERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVG

Query:  YPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAP
        YPLHQ GYKCFHP S+KYF++MD+TF E++P+FPVS L GE+ +EE N      EP + T      H  +LPT Q PW TYYR+NLRKE+ +PT + PAP
Subjt:  YPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAP

Query:  IQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNL
        +Q+ EP + QG    T P  +    END  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  +++SY+NL
Subjt:  IQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNL

Query:  SSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSP
        S +FRAF A+LD   IPKNIY  +E  EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAKGFTQT+G+DYS+TFSP
Subjt:  SSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSP

Query:  IAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSV
        +AKLNT+RVLLSVAVNKDWPLYQ DVKNAFLN  L E VYMSPPPGFEAQF Q V                                           S 
Subjt:  IAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSV

Query:  SGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPV
        +GK+A+LIVYVDDIVL+GDD  EI++LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG S D+VPV
Subjt:  SGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPV

Query:  NKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVT
        +KE+YQRLVG+L YL+HT+PDIS+AVSVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCTFVWGNLVT
Subjt:  NKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVT

Query:  WRSKKQSVVAKSSAEAEYRAMSQGICEK
        WRSKKQSVVA+SSAEAEYRAMS GICE+
Subjt:  WRSKKQSVVAKSSAEAEYRAMSQGICEK

KAA0052172.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0052.84Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+  +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE DRVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA 
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SET   S SQ  +   + + T +LGAIAQSG+ QSL L+SI  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS AT HLTG+S++F+SY                                                      EL+C+  F P+ V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDF-------------------------------------------------------------
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE DF                                                             
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDF-------------------------------------------------------------

Query:  -----FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTS-------
             +L++DKSEV SIFQ FY TIKTQF+ KI+ILRSDNGREF  + L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+EVARSLMLSTS       
Subjt:  -----FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTS-------

Query:  -------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITF
               HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP AQ CVFVGYPLHQ GYKCFHP S+KYF++MD+TF
Subjt:  -------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITF

Query:  LEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVE
         E++P+FPVS L GE+ +EE N      EP + T      H  +LPT Q PW TYYR+NLRKE+ +PT + PAP+Q+ EP + QG    T P  +    E
Subjt:  LEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVE

Query:  NDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEI
        ND  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  +++SY+NLS +FRAF A+LD   IPKNIY  +E 
Subjt:  NDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEI

Query:  LEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDV
         EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAKGFTQT+G+DYS+TFSP+AKLNT+RVLLSVAVNKDWPLYQ DV
Subjt:  LEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDV

Query:  KNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSVSGKVAVLIVYVDDIVLSGDDTAEINK
        KNAFLN  L E VYMSPPPGFEAQF Q V                                           S +GK+A+LIVYVDDIVL+GDD  EI++
Subjt:  KNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSVSGKVAVLIVYVDDIVLSGDDTAEINK

Query:  LKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAV
        LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG S D+VPV+KE+YQRLVG+L YL+HT+PDIS+AV
Subjt:  LKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAV

Query:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGIC
        SVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVA+SSAEAEYRAMS GIC
Subjt:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGIC

Query:  EK
        E+
Subjt:  EK

KAA0052775.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0051.88Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+I +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE DRVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA 
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SETT  S SQ  +   + + T +LGAIAQSG+ QSL L+S+  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS AT HLTG+S++F+SY P                                                    EL+C+  F P+ V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSEND--------------------------------------------------------------
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE D                                                              
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSEND--------------------------------------------------------------

Query:  ------------------------------FFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVA
                                       +L++DKSEV SIFQ FY TIKTQF+ KI+ILRSDNGREF  + L EFL++KGIVHQ+SCAYTPQQNGVA
Subjt:  ------------------------------FFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVA

Query:  ERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVG
        ERKNRHL+EVARSLMLSTS              HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP AQ CVFVG
Subjt:  ERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVG

Query:  YPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAP
        YPLHQ GYKCFHP S+KYF++MD+TF E++P+FPVS L GE+ +EE N      EP + T      H  +LPT Q PW TYYR+NLRKE+ +PT + PAP
Subjt:  YPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAP

Query:  IQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNL
        +Q+ EP + QG    T P  +    END  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  +++SY+NL
Subjt:  IQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNL

Query:  SSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSP
        S +FRAF A+LD   IPKNIY  +E  EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAKGFTQT+G+DYS+TFSP
Subjt:  SSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSP

Query:  IAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSV
        +AKLNT+RVLLSVAVNKDWPLYQ DVKNAFLN  L E VYMSPPPGFEAQF Q V                                           S 
Subjt:  IAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSV

Query:  SGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPV
        +GK+A+LIVYVDDIVL+GDD  EI++LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG S D+VPV
Subjt:  SGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPV

Query:  NKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVT
        +KE+YQRLVG+L YL+HT+PDIS+AVSVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCTFVWGNLVT
Subjt:  NKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVT

Query:  WRSKKQSVVAKSSAEAEYRAMSQGICEK
        WRSKKQSVVA+SSAEAEYRAMS GICE+
Subjt:  WRSKKQSVVAKSSAEAEYRAMSQGICEK

KAA0067540.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0055.26Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+  +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE +RVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA+
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SETT  S SQ  +   + + T +LGAIAQSG+ QSL L+SI  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS A  HLTG+S++F+SY P                                                    EL+C+  F  + V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDFFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAY
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE D+     KSEV SIFQ FY TIKTQF+ KI+ILRSDN REF  + L EFL++KGIVHQ+SCAY
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDFFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAY

Query:  TPQQNGVAERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPH
        TPQQNGVAERKNRHL+EVARSLMLSTS              HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP 
Subjt:  TPQQNGVAERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPH

Query:  AQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVA
        AQ CVFVGYPLHQRGYKCFHP S+KYF++MD+TF E++P+FPVS L GE+ +EE N      EP   T      H  +LPT Q PW TYYR+NLRK++ +
Subjt:  AQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVA

Query:  PT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRY
        PT + PAPIQ+ EP + QG    T P  +    END  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  
Subjt:  PT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRY

Query:  SFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGV
        +++SY+NLS +FRAF ASLD   IPKNIY  +E  EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAK FTQT+G+
Subjt:  SFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGV

Query:  DYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV-------------------------------------
        DYS+TFSP+AKLNT+RVLLSVAVNKDWPLYQ DVKNAFLN  L E VYMSPPPGFEAQF Q V                                     
Subjt:  DYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV-------------------------------------

Query:  -----KSVSGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLG
              S +GK+A+LIVYVDDIVL+GDD  EI++LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG
Subjt:  -----KSVSGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLG

Query:  ESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCT
         S D+VPV+KE+Y RLVG+L YL+HT+PDIS+AVSVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCT
Subjt:  ESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCT

Query:  FVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGICEK
        FVWGNLVTWRSKKQSVVA+SSAEAEYRAMS GICE+
Subjt:  FVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGICEK

TrEMBL top hitse value%identityAlignment
A0A5A7SM64 Beta-galactosidase0.0e+0051.88Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+I +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE DRVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA 
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SETT  S SQ  +   + + T +LGAIAQSG+ QSL L+S+  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS AT HLTG+S++F+SY P                                                    EL+C+  F P+ V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSEND--------------------------------------------------------------
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE D                                                              
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSEND--------------------------------------------------------------

Query:  ------------------------------FFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVA
                                       +L++DKSEV SIFQ FY TIKTQF+ KI+ILRSDNGREF  + L EFL++KGIVHQ+SCAYTPQQNGVA
Subjt:  ------------------------------FFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVA

Query:  ERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVG
        ERKNRHL+EVARSLMLSTS              HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP AQ CVFVG
Subjt:  ERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVG

Query:  YPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAP
        YPLHQ GYKCFHP S+KYF++MD+TF E++P+FPVS L GE+ +EE N      EP + T      H  +LPT Q PW TYYR+NLRKE+ +PT + PAP
Subjt:  YPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAP

Query:  IQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNL
        +Q+ EP + QG    T P  +    END  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  +++SY+NL
Subjt:  IQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNL

Query:  SSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSP
        S +FRAF A+LD   IPKNIY  +E  EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAKGFTQT+G+DYS+TFSP
Subjt:  SSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSP

Query:  IAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSV
        +AKLNT+RVLLSVAVNKDWPLYQ DVKNAFLN  L E VYMSPPPGFEAQF Q V                                           S 
Subjt:  IAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSV

Query:  SGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPV
        +GK+A+LIVYVDDIVL+GDD  EI++LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG S D+VPV
Subjt:  SGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPV

Query:  NKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVT
        +KE+YQRLVG+L YL+HT+PDIS+AVSVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCTFVWGNLVT
Subjt:  NKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVT

Query:  WRSKKQSVVAKSSAEAEYRAMSQGICEK
        WRSKKQSVVA+SSAEAEYRAMS GICE+
Subjt:  WRSKKQSVVAKSSAEAEYRAMSQGICEK

A0A5A7SQW1 Beta-galactosidase0.0e+0052.89Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+  +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE DRVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA 
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SETT  S SQ  +   + + T +LGAIAQSG+ QSL L+S+  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS AT HLTG+S++F+SY P                                                    EL+C+  F P+ V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDF-------------------------------------------------------------
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE DF                                                             
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDF-------------------------------------------------------------

Query:  -----FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTS-------
             +L++DKSEV SIFQ FY TIKTQF+ KI+ILRSDNGREF  + L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+EVARSLMLSTS       
Subjt:  -----FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTS-------

Query:  -------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITF
               HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP AQ CVFVGYPLHQ GYKCFHP S+KYF++MD+TF
Subjt:  -------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITF

Query:  LEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVE
         E++P+FPVS L GE+ +EE N      EP + T      H  +LPT Q PW TYYR+NLRKE+ +PT + PAP+Q+ EP + QG    T P  +    E
Subjt:  LEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVE

Query:  NDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEI
        ND  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  +++SY+NLS +FRAF A+LD   IPKNIY  +E 
Subjt:  NDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEI

Query:  LEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDV
         EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAKGFTQT+G+DYS+TFSP+AKLNT+RVLLSVAVNKDWPLYQ DV
Subjt:  LEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDV

Query:  KNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSVSGKVAVLIVYVDDIVLSGDDTAEINK
        KNAFLN  L E VYMSPPPGFEAQF Q V                                           S +GK+A+LIVYVDDIVL+GDD  EI++
Subjt:  KNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSVSGKVAVLIVYVDDIVLSGDDTAEINK

Query:  LKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAV
        LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG S D+VPV+KE+YQRLVG+L YL+HT+PDIS+AV
Subjt:  LKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAV

Query:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGI
        SVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVA+SSAEAEYRAMS GI
Subjt:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGI

A0A5A7U8U2 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0052.84Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+  +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE DRVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA 
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SET   S SQ  +   + + T +LGAIAQSG+ QSL L+SI  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS AT HLTG+S++F+SY                                                      EL+C+  F P+ V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDF-------------------------------------------------------------
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE DF                                                             
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDF-------------------------------------------------------------

Query:  -----FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTS-------
             +L++DKSEV SIFQ FY TIKTQF+ KI+ILRSDNGREF  + L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+EVARSLMLSTS       
Subjt:  -----FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTS-------

Query:  -------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITF
               HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP AQ CVFVGYPLHQ GYKCFHP S+KYF++MD+TF
Subjt:  -------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITF

Query:  LEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVE
         E++P+FPVS L GE+ +EE N      EP + T      H  +LPT Q PW TYYR+NLRKE+ +PT + PAP+Q+ EP + QG    T P  +    E
Subjt:  LEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVE

Query:  NDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEI
        ND  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  +++SY+NLS +FRAF A+LD   IPKNIY  +E 
Subjt:  NDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEI

Query:  LEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDV
         EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAKGFTQT+G+DYS+TFSP+AKLNT+RVLLSVAVNKDWPLYQ DV
Subjt:  LEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDV

Query:  KNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSVSGKVAVLIVYVDDIVLSGDDTAEINK
        KNAFLN  L E VYMSPPPGFEAQF Q V                                           S +GK+A+LIVYVDDIVL+GDD  EI++
Subjt:  KNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSVSGKVAVLIVYVDDIVLSGDDTAEINK

Query:  LKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAV
        LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG S D+VPV+KE+YQRLVG+L YL+HT+PDIS+AV
Subjt:  LKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAV

Query:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGIC
        SVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVA+SSAEAEYRAMS GIC
Subjt:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGIC

Query:  EK
        E+
Subjt:  EK

A0A5A7V0Y9 Beta-galactosidase0.0e+0051.88Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+I +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE DRVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA 
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SETT  S SQ  +   + + T +LGAIAQSG+ QSL L+S+  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS AT HLTG+S++F+SY P                                                    EL+C+  F P+ V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSEND--------------------------------------------------------------
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE D                                                              
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSEND--------------------------------------------------------------

Query:  ------------------------------FFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVA
                                       +L++DKSEV SIFQ FY TIKTQF+ KI+ILRSDNGREF  + L EFL++KGIVHQ+SCAYTPQQNGVA
Subjt:  ------------------------------FFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVA

Query:  ERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVG
        ERKNRHL+EVARSLMLSTS              HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP AQ CVFVG
Subjt:  ERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVG

Query:  YPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAP
        YPLHQ GYKCFHP S+KYF++MD+TF E++P+FPVS L GE+ +EE N      EP + T      H  +LPT Q PW TYYR+NLRKE+ +PT + PAP
Subjt:  YPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVAPT-RMPAP

Query:  IQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNL
        +Q+ EP + QG    T P  +    END  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  +++SY+NL
Subjt:  IQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNL

Query:  SSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSP
        S +FRAF A+LD   IPKNIY  +E  EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAKGFTQT+G+DYS+TFSP
Subjt:  SSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSP

Query:  IAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSV
        +AKLNT+RVLLSVAVNKDWPLYQ DVKNAFLN  L E VYMSPPPGFEAQF Q V                                           S 
Subjt:  IAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV------------------------------------------KSV

Query:  SGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPV
        +GK+A+LIVYVDDIVL+GDD  EI++LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG S D+VPV
Subjt:  SGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLGESVDKVPV

Query:  NKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVT
        +KE+YQRLVG+L YL+HT+PDIS+AVSVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCTFVWGNLVT
Subjt:  NKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVT

Query:  WRSKKQSVVAKSSAEAEYRAMSQGICEK
        WRSKKQSVVA+SSAEAEYRAMS GICE+
Subjt:  WRSKKQSVVAKSSAEAEYRAMSQGICEK

A0A5A7VK49 Beta-galactosidase0.0e+0055.26Show/hide
Query:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV
        + SIK+ LEGR++FG+LTG+  +P P D  E            + +NS                  W+ +        ++  L    + V   ++G++DV
Subjt:  NSSIKIVLEGRHKFGYLTGQISKPRPRDPQE------------LSVNSQH------------GTSNWEASVIC---NYDSGYLECYSETV--LQEGSMDV

Query:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE
        T+YFNKLSL+WQEMDLCRE VWD P +  QY KLEE +RVYDFLA                           EVRLEEDR++AM + TT   DSAAFSA+
Subjt:  TSYFNKLSLIWQEMDLCRELVWDYPCEGVQYYKLEEVDRVYDFLA---------------------------EVRLEEDRSSAMNITTTSVTDSAAFSAE

Query:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK
        SS    DK NGK   V           + CWKLHGRPP G++R  N K N  +A +SETT  S SQ  +   + + T +LGAIAQSG+ QSL L+SI  K
Subjt:  SSGITGDKQNGKPPYV-----------NTCWKLHGRPPNGRRRPPNNKTN--QALVSETTSGSQSQHQEIYPTDSGTISLGAIAQSGISQSLSLLSITSK

Query:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT
         PWILDS A  HLTG+S++F+SY P                                                    EL+C+  F  + V FQD+SS RT
Subjt:  KPWILDSRATYHLTGTSDNFLSYHP----------------------------------------------------ELNCRVAFSPDDVIFQDLSSRRT

Query:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDFFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAY
        IGT +H+RGLY L+ DTS     R SLL SYFSTSE D+     KSEV SIFQ FY TIKTQF+ KI+ILRSDN REF  + L EFL++KGIVHQ+SCAY
Subjt:  IGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTSENDFFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAY

Query:  TPQQNGVAERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPH
        TPQQNGVAERKNRHL+EVARSLMLSTS              HLINRMPSR+L+ QTPLDCLK SYP  +TRL+ +VPLRVFGCTA+VH+FGPNQTKFTP 
Subjt:  TPQQNGVAERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPH

Query:  AQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVA
        AQ CVFVGYPLHQRGYKCFHP S+KYF++MD+TF E++P+FPVS L GE+ +EE N      EP   T      H  +LPT Q PW TYYR+NLRK++ +
Subjt:  AQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMAT------HDTVLPTRQAPWITYYRKNLRKEIVA

Query:  PT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRY
        PT + PAPIQ+ EP + QG    T P  +    END  D+  L     +    E     E+ N   E+    K  EYD SLD+PIALRKGTRSCT++P  
Subjt:  PT-RMPAPIQSSEPTQVQG---TTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEE-ELEKAGEYDASLDMPIALRKGTRSCTRYPRY

Query:  SFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGV
        +++SY+NLS +FRAF ASLD   IPKNIY  +E  EWK  VMEE++ALEKN  WE  ALPKGHK VGCKWVF++KYK++GTLDR+KARLVAK FTQT+G+
Subjt:  SFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGV

Query:  DYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV-------------------------------------
        DYS+TFSP+AKLNT+RVLLSVAVNKDWPLYQ DVKNAFLN  L E VYMSPPPGFEAQF Q V                                     
Subjt:  DYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNV-------------------------------------

Query:  -----KSVSGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLG
              S +GK+A+LIVYVDDIVL+GDD  EI++LKQ+M DEFEI DLGNLKYFLGMEV RS+EGI VSQRK               PADTPIE N KLG
Subjt:  -----KSVSGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRK---------------PADTPIESNAKLG

Query:  ESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCT
         S D+VPV+KE+Y RLVG+L YL+HT+PDIS+AVSVVSQFMQAPYE+HMEAVNRILRYLK+TP K L+FRK++R+ IEAYTDSD AGSVIDRKSTSGYCT
Subjt:  ESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCT

Query:  FVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGICEK
        FVWGNLVTWRSKKQSVVA+SSAEAEYRAMS GICE+
Subjt:  FVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGICEK

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.4e-7428.61Show/hide
Query:  FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSL--------------MLS
        +L+  KS+V S+FQ F    +  FN K+  L  DNGRE+L+N +R+F   KGI +  +  +TPQ NGV+ER  R + E AR++              +L+
Subjt:  FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSL--------------MLS

Query:  TSHLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDI----TFL
         ++LINR+PSR L     +D  K  Y     +      LRVFG T +VH     Q KF   + K +FVGY     G+K +   ++K+ ++ D+    T +
Subjt:  TSHLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDI----TFL

Query:  EDKPFFPVSPLHGESSNEEINCHNPPPEPIMATHDTVLPTRQAPWITYYR-------KNLRKEIVAPTRMPAPIQSSEPTQVQGTTDPNNHTVCVENDIV
         +        +  + S E  N + P     +   +    +++   I + +       KN   +     +   P +S E   +Q   D         N   
Subjt:  EDKPFFPVSPLHGESSNEEINCHNPPPEPIMATHDTVLPTRQAPWITYYR-------KNLRKEIVAPTRMPAPIQSSEPTQVQGTTDPNNHTVCVENDIV

Query:  DLTKLPVKNNRGDMTENNQVAESDNSTTEEELEKAGEYDASLD--MPIALRKGTRSCTRYPRYSFLSYNNLSSK--FRAFAASLDIVTIPKNIYVVMEIL
        +  K    ++  +   +    ES  S T E L++ G  + + +  + I  R+  R  T+ P+ S+   +N  +K    A     D+      I    +  
Subjt:  DLTKLPVKNNRGDMTENNQVAESDNSTTEEELEKAGEYDASLD--MPIALRKGTRSCTRYPRYSFLSYNNLSSK--FRAFAASLDIVTIPKNIYVVMEIL

Query:  EWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVK
         W+  +  E+ A + NN W     P+    V  +WVF++KY   G   RYKARLVA+GFTQ + +DY +TF+P+A++++ R +LS+ +  +  ++Q DVK
Subjt:  EWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVK

Query:  NAFLNDYLEEVVYMSPPPGFEAQ------------------------FDQNVK-------SVSGKVAVL-----------IVYVDDIVLSGDDTAEINKL
         AFLN  L+E +YM  P G                            F+Q +K       SV   + +L           ++YVDD+V++  D   +N  
Subjt:  NAFLNDYLEEVVYMSPPPGFEAQ------------------------FDQNVK-------SVSGKVAVL-----------IVYVDDIVLSGDDTAEINKL

Query:  KQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQR---------------KPADTPIESNAKLGESVDKVPVNKERYQRLVGRLNY-LAHTKPDISYAV
        K+ + ++F + DL  +K+F+G+ +    + I++SQ                    TP+ S     E ++         + L+G L Y +  T+PD++ AV
Subjt:  KQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQR---------------KPADTPIESNAKLGESVDKVPVNKERYQRLVGRLNY-LAHTKPDISYAV

Query:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRK--SDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWG-NLVTWRSKKQSVVAKSSAEAEYRAMSQ
        +++S++      E  + + R+LRYLK T + +LIF+K  +    I  Y DSD AGS IDRKST+GY   ++  NL+ W +K+Q+ VA SS EAEY A+ +
Subjt:  SVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRK--SDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWG-NLVTWRSKKQSVVAKSSAEAEYRAMSQ

Query:  GICE
         + E
Subjt:  GICE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-8430.34Show/hide
Query:  FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLM--------------LS
        ++L  K +V  +FQ+F+  ++ +   K+  LRSDNG E+ +    E+ S+ GI H+ +   TPQ NGVAER NR ++E  RS++               +
Subjt:  FLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLM--------------LS

Query:  TSHLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKP
          +LINR PS  L F+ P           T + +    L+VFGC AF H     +TK    +  C+F+GY   + GY+ + P  +K   S D+ F E   
Subjt:  TSHLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKP

Query:  FFPVSPLHGESSNEEINCHNPPPEPIMATHDTVLPTRQAPWITYYRKNLRKEIVAPTRMPAPIQSSEPTQVQGTTDPNNHTVCVENDIVDLTKLPVKNNR
                      E+       E +                        K  + P  +  P  S+ PT  + TTD          ++ +  + P     
Subjt:  FFPVSPLHGESSNEEINCHNPPPEPIMATHDTVLPTRQAPWITYYRKNLRKEIVAPTRMPAPIQSSEPTQVQGTTDPNNHTVCVENDIVDLTKLPVKNNR

Query:  GDMTENNQVAESDNSTTEEELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEK
        G++ E  +  + D    E E    GE     +    LR+  R      RY    Y  L S  R   +  ++++ P+   ++         + EE+ +L+K
Subjt:  GDMTENNQVAESDNSTTEEELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEK

Query:  NNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMS
        N  ++ V LPKG +P+ CKWVF +K   +  L RYKARLV KGF Q  G+D+ + FSP+ K+ +IR +LS+A + D  + Q DVK AFL+  LEE +YM 
Subjt:  NNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMS

Query:  PPPGFEA--------------------------QFDQNVKSVS-----------------GKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLG
         P GFE                           +FD  +KS +                     +L++YVDD+++ G D   I KLK  ++  F++ DLG
Subjt:  PPPGFEA--------------------------QFDQNVKSVS-----------------GKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLG

Query:  NLKYFLGMEVTRSREG--IFVSQR---------------KPADTPIESNAKLGESV------DKVPVNKERYQRLVGRLNY-LAHTKPDISYAVSVVSQF
          +  LGM++ R R    +++SQ                KP  TP+  + KL + +      +K  + K  Y   VG L Y +  T+PDI++AV VVS+F
Subjt:  NLKYFLGMEVTRSREG--IFVSQR---------------KPADTPIESNAKLGESV------DKVPVNKERYQRLVGRLNY-LAHTKPDISYAVSVVSQF

Query:  MQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQ
        ++ P +EH EAV  ILRYL+ T    L F  SD   ++ YTD+D AG + +RKS++GY     G  ++W+SK Q  VA S+ EAEY A ++
Subjt:  MQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQ

P92519 Uncharacterized mitochondrial protein AtMg008103.0e-3336.57Show/hide
Query:  LIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQR---------------KPADTPIESNAKLGESVDKVPVNKERYQ
        L++YVDDI+L+G     +N L  +++  F + DLG + YFLG+++     G+F+SQ                KP  TP+        S  K P +   ++
Subjt:  LIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQR---------------KPADTPIESNAKLGESVDKVPVNKERYQ

Query:  RLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL  T+PDISYAV++V Q M  P     + + R+LRY+K T    L   K+ +  ++A+ DSD AG    R+ST+G+CTF+  N+++W +K+Q
Subjt:  RLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVAKSSAEAEYRAMS
          V++SS E EYRA++
Subjt:  SVVAKSSAEAEYRAMS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-9430.13Show/hide
Query:  TVQHNRGLYFLNGDT-----SSRHNYRASLLFSYFSTSENDFFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSS
        T+   R L ++  D       S  NYR  ++F    T     + L  KS+V   F  F   ++ +F  +I    SDNG EF+   L E+ S  GI H +S
Subjt:  TVQHNRGLYFLNGDT-----SSRHNYRASLLFSYFSTSENDFFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSS

Query:  CAYTPQQNGVAERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKF
          +TP+ NG++ERK+RH++E   +L+   S              +LINR+P+ +L  ++P   L  + P           LRVFGC  +      NQ K 
Subjt:  CAYTPQQNGVAERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKF

Query:  TPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFP-------VSPLHGESSNEEINC----HN--PPPEPIMATHDTVLPTRQAPWITYY
           +++CVF+GY L Q  Y C H  + + +IS  + F  D+  FP       +SP+  +    E +C    H   P   P++       P   A   +  
Subjt:  TPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFP-------VSPLHGESSNEEINC----HN--PPPEPIMATHDTVLPTRQAPWITYY

Query:  RKNLRKEIVAPTRMPAPIQSSEPTQVQGTTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEEELEKAGEYDASLDMPIALRKGTRSCT
            R   V+ + + +   SS P+  + T    N            T+   + +    T  N       S   + L    +  +S   P      + +  
Subjt:  RKNLRKEIVAPTRMPAPIQSSEPTQVQGTTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEEELEKAGEYDASLDMPIALRKGTRSCT

Query:  RYP-----------------RYSFLSYNNLSSKFRA----------FAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKP-VG
          P                   + L+ +++ ++ +A           A SL   + P+     ++   W+  +  E+ A   N+ W+ V  P  H   VG
Subjt:  RYP-----------------RYSFLSYNNLSSKFRA----------FAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKP-VG

Query:  CKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQN-----
        C+W+FT KY S+G+L+RYKARLVAKG+ Q  G+DY++TFSP+ K  +IR++L VAV++ WP+ Q DV NAFL   L + VYMS PPGF  +   N     
Subjt:  CKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQN-----

Query:  ---------------------------VKSVS---------GK-VAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIF
                                   V SVS         GK +  ++VYVDDI+++G+D   ++     ++  F + D   L YFLG+E  R   G+ 
Subjt:  ---------------------------VKSVS---------GK-VAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIF

Query:  VSQR---------------KPADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRL
        +SQR               KP  TP+  + KL         +   Y+ +VG L YLA T+PDISYAV+ +SQFM  P EEH++A+ RILRYL  TP   +
Subjt:  VSQR---------------KPADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRL

Query:  IFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMS
          +K +   + AY+D+D AG   D  ST+GY  ++  + ++W SKKQ  V +SS EAEYR+++
Subjt:  IFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-9029.64Show/hide
Query:  TVQHNRGLYFLNGDTSSR-----HNYRASLLFSYFSTSENDFFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSS
        T+  ++ L ++  D  S       NYR  ++F    T     + L  KS+V   F  F + ++ +F  +I  L SDNG EF+   LR++LS  GI H +S
Subjt:  TVQHNRGLYFLNGDTSSR-----HNYRASLLFSYFSTSENDFFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSS

Query:  CAYTPQQNGVAERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKF
          +TP+ NG++ERK+RH++E+  +L+   S              +LINR+P+ +L  Q+P   L    P           L+VFGC  +      N+ K 
Subjt:  CAYTPQQNGVAERKNRHLLEVARSLMLSTS--------------HLINRMPSRVLNFQTPLDCLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKF

Query:  TPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLH-GESSNEEINCHNPPPEPIMATHDTV-------------------LPTRQ
           +++C F+GY L Q  Y C H  + + + S  + F  D+  FP S  + G S+++E    + P  P   T  T                     P+  
Subjt:  TPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLH-GESSNEEINCHNPPPEPIMATHDTV-------------------LPTRQ

Query:  APWIT--YYRKNLRKEIVA----------------PTRMPAPIQSSEP-TQVQGTTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAE----SDNS
        +P  T      NL    ++                PT  P   Q+S   + +    +PN+ +    N    L + P+ +     T +  ++E    S +S
Subjt:  APWIT--YYRKNLRKEIVA----------------PTRMPAPIQSSEP-TQVQGTTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAE----SDNS

Query:  TTEEELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKP
        T+   L         + +       T S     +      N   S    +A SL   + P+     M+   W+  +  E+ A   N+ W+ V  P     
Subjt:  TTEEELEKAGEYDASLDMPIALRKGTRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKP

Query:  -VGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGF---------
         VGC+W+FT K+ S+G+L+RYKARLVAKG+ Q  G+DY++TFSP+ K  +IR++L VAV++ WP+ Q DV NAFL   L + VYMS PPGF         
Subjt:  -VGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGF---------

Query:  --------------------------EAQFDQNVKSVS-------GKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSRE
                                     F  ++   S         +  ++VYVDDI+++G+DT  +      ++  F + +  +L YFLG+E  R  +
Subjt:  --------------------------EAQFDQNVKSVS-------GKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSRE

Query:  GIFVSQR---------------KPADTPIESNAKLG-ESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTP
        G+ +SQR               KP  TP+ ++ KL   S  K+P +   Y+ +VG L YLA T+PD+SYAV+ +SQ+M  P ++H  A+ R+LRYL  TP
Subjt:  GIFVSQR---------------KPADTPIESNAKLG-ESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTP

Query:  EKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMS
        +  +  +K +   + AY+D+D AG   D  ST+GY  ++  + ++W SKKQ  V +SS EAEYR+++
Subjt:  EKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.9e-7634.21Show/hide
Query:  VAESDNSTTEEELEKAGEYDASLDMPIALRKGTRSCTRYPRY------------------SFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAV
        V+++D ST+   ++     +   D+P      +   TR P Y                   FLSY  +S  + +F   +     P       E L W   
Subjt:  VAESDNSTTEEELEKAGEYDASLDMPIALRKGTRSCTRYPRY------------------SFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAV

Query:  VMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLN
        + +E+ A+E  + WE   LP   KP+GCKWV+ +KY S+GT++RYKARLVAKG+TQ  G+D+ +TFSP+ KL +++++L+++   ++ L+Q D+ NAFLN
Subjt:  VMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLN

Query:  DYLEEVVYMSPPPGFEAQFDQNV---------KSVSG----------KVAV---------------------------LIVYVDDIVLSGDDTAEINKLK
          L+E +YM  PPG+ A+   ++         KS+ G          K +V                           ++VYVDDI++  ++ A +++LK
Subjt:  DYLEEVVYMSPPPGFEAQFDQNV---------KSVSG----------KVAV---------------------------LIVYVDDIVLSGDDTAEINKLK

Query:  QKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRKPADTPIESNAKLGESVDKVP---------------VNKERYQRLVGRLNYLAHTKPDISYAVSV
         ++   F++ DLG LKYFLG+E+ RS  GI + QRK A   ++    LG     VP               V+ + Y+RL+GRL YL  T+ DIS+AV+ 
Subjt:  QKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQRKPADTPIESNAKLGESVDKVP---------------VNKERYQRLVGRLNYLAHTKPDISYAVSV

Query:  VSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMS
        +SQF +AP   H +AV +IL Y+K T  + L +       ++ ++D+        R+ST+GYC F+  +L++W+SKKQ VV+KSSAEAEYRA+S
Subjt:  VSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMS

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.4e-0835.8Show/hide
Query:  YLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFV
        YL  T+PD+++AV+ +SQF  A     M+AV ++L Y+K T  + L +  +    ++A+ DSD A     R+S +G+C+ V
Subjt:  YLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFV

ATMG00810.1 DNA/RNA polymerases superfamily protein2.1e-3436.57Show/hide
Query:  LIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQR---------------KPADTPIESNAKLGESVDKVPVNKERYQ
        L++YVDDI+L+G     +N L  +++  F + DLG + YFLG+++     G+F+SQ                KP  TP+        S  K P +   ++
Subjt:  LIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLKYFLGMEVTRSREGIFVSQR---------------KPADTPIESNAKLGESVDKVPVNKERYQ

Query:  RLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL  T+PDISYAV++V Q M  P     + + R+LRY+K T    L   K+ +  ++A+ DSD AG    R+ST+G+CTF+  N+++W +K+Q
Subjt:  RLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCIEAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVAKSSAEAEYRAMS
          V++SS E EYRA++
Subjt:  SVVAKSSAEAEYRAMS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.0e-1841.03Show/hide
Query:  NNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKT
        N L+ K+ +   +  I   PK++   ++   W   + EE+ AL +N  W  V  P     +GCKWVF  K  S+GTLDR KARLVAKGF Q  G+ + +T
Subjt:  NNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGVDYSKT

Query:  FSPIAKLNTIRVLLSVA
        +SP+ +  TIR +L+VA
Subjt:  FSPIAKLNTIRVLLSVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGAGCATGGTGATGAAACCTCAGAAACCCTAGCCGTCATTGATGAAAATGATACTAGGCAAGATCCGCCGCCACCACCACCCTACCTGTCGTCGTTGCCGAT
TGAGTTTGCCGCCGCCACCAATTCCGCTGTCCGAGAAGCTGTCAACCACACTATTACAGTCCCTGCCGCCGCTTCCCTTAATCCGTCACTCGATTCAATGCGAGTGAGTC
TTGCTCCAAGTCTCGCTCCAAGTGAAGTCGCGTGTAGTCTCGCTCCAAGTGCAGTCACATGTTCTTCTTCACAACCAGCGAGAGTGAGTCTCGCTCCAAGTCTCGCTCCA
ACTGCATTCATGGGTTCTTATTCTCAACCAGCGAGAGTGAGTCTCGCTCCAAGTCTCGCTCAAGCTGCATATGCGAGTTCTTCTCAACCAGCGAGAGTGAGTCACATTTC
AAGCACTATTGCGGGAACTACTTCTCAAGGACTAGCGTTTCATGAGGCTGATGGCGCCAACTTCAATCCACATGCTTTGGGTTGTCACTATGTCCCTCCTACAGCTGCTG
CTGTCACTGGAATTCCTCACCAAGGACAATTTGATAATCTTCAAGCAAGTTTCCAACAACAAATTGCGGCTCTTGGGACAGCCCTTGGCGCTTCAACCATCCTTGACCAA
GGTGCAGTAAATTTAGGTATTCAATCAGGTTTACCGACGTATCCAGAGAATTCGTCCATTAAAATAGTACTTGAAGGACGCCACAAATTTGGGTATCTAACTGGCCAAAT
ATCGAAACCCAGACCAAGAGACCCTCAAGAGCTCTCTGTTAATTCACAGCATGGAACCTCAAATTGGGAAGCCTCTGTTATATGCAACTACGACTCAGGATATTTGGAAT
GCTATTCAGAAACTGTACTCCAAGAGGGGTCCATGGATGTTACGTCATATTTTAATAAATTATCTCTTATCTGGCAAGAGATGGACTTATGTCGTGAACTTGTCTGGGAC
TATCCTTGTGAAGGTGTCCAATATTATAAACTTGAAGAGGTTGACCGAGTGTATGATTTTTTAGCAGAGGTCCGTCTGGAGGAAGATAGATCGAGTGCCATGAATATCAC
TACTACCTCCGTAACTGATTCCGCCGCCTTCAGTGCGGAATCATCTGGAATTACTGGAGACAAGCAAAACGGGAAACCACCATATGTGAACACTTGCTGGAAGCTACATG
GTCGACCCCCGAATGGTAGACGACGACCTCCAAACAACAAAACCAACCAGGCTCTAGTCAGCGAGACTACTAGTGGCTCTCAATCACAACATCAGGAGATCTACCCAACT
GACAGCGGTACTATTTCTCTTGGGGCAATTGCACAATCAGGTATTTCTCAATCCTTGAGTCTTCTCAGTATTACTAGCAAGAAACCCTGGATCCTTGATTCAAGAGCCAC
ATACCACCTAACTGGAACTTCTGACAATTTTCTCTCTTATCATCCAGAGCTAAACTGTCGTGTGGCTTTCTCACCGGATGATGTTATATTTCAGGACTTGAGCTCAAGGA
GGACGATTGGCACTGTCCAGCACAATAGGGGACTCTATTTCCTTAATGGTGATACTTCCTCTAGGCACAATTATAGGGCTAGTCTATTATTTTCCTACTTTTCAACTTCT
GAAAATGACTTCTTCCTTCTAACAGATAAGTCTGAGGTTTCATCTATTTTTCAACAGTTTTACACCACCATTAAAACTCAGTTTAATGCCAAAATTTCCATCCTTCGGAG
TGACAATGGTCGTGAATTTCTCACCAATACCCTTCGTGAGTTCTTATCCACTAAAGGTATTGTTCACCAAAGCTCGTGTGCCTATACCCCCCAACAAAATGGAGTGGCTG
AAAGAAAAAACCGTCATCTCCTCGAAGTTGCTCGGTCTCTCATGTTGTCGACCTCTCATCTTATAAATCGGATGCCTTCCCGTGTTCTCAACTTCCAAACTCCTCTTGAC
TGTCTCAAATTATCTTATCCCACCACCACCACTCGCCTTATACCTGATGTCCCTCTCCGTGTATTTGGATGTACTGCATTTGTCCATAGCTTCGGTCCCAACCAGACTAA
GTTTACCCCTCATGCCCAGAAATGTGTCTTTGTTGGGTATCCTCTCCACCAGCGTGGTTATAAGTGCTTCCATCCCACTTCCCAGAAATACTTCATCTCTATGGATATCA
CCTTCCTGGAAGATAAACCTTTCTTTCCCGTTAGTCCTCTTCATGGGGAGAGTAGTAATGAAGAGATTAACTGCCATAATCCTCCTCCTGAACCCATCATGGCAACCCAC
GACACTGTCCTACCCACTAGACAGGCCCCATGGATAACGTACTACAGGAAGAATCTTAGAAAGGAAATAGTAGCCCCTACTCGCATGCCGGCTCCGATCCAATCTTCTGA
ACCAACACAGGTTCAAGGTACTACTGATCCTAATAATCATACTGTTTGCGTTGAAAATGATATTGTTGATCTGACAAAACTGCCAGTAAAGAATAATAGAGGTGATATGA
CAGAAAATAACCAGGTTGCTGAGAGCGATAATTCCACAACTGAAGAGGAACTCGAAAAAGCAGGAGAGTATGATGCCAGCCTTGATATGCCCATTGCTCTGAGAAAGGGC
ACAAGGTCTTGCACCAGGTATCCCAGGTATAGCTTCCTTTCTTATAATAATCTGTCTTCTAAGTTCAGAGCGTTTGCTGCCAGCCTTGACATTGTAACTATACCGAAGAA
CATATATGTGGTTATGGAAATTCTGGAGTGGAAGGCTGTTGTTATGGAAGAAGTGAGAGCTCTTGAGAAAAATAATATATGGGAACAAGTTGCTCTTCCGAAAGGGCATA
AACCAGTTGGGTGTAAGTGGGTGTTCACAATGAAGTACAAATCTAATGGAACTCTTGACAGGTATAAAGCTAGGCTTGTAGCAAAAGGGTTCACTCAAACTTTTGGGGTA
GACTACTCTAAGACTTTCTCCCCTATTGCGAAGTTAAACACGATTAGGGTTCTTCTGTCGGTTGCTGTCAACAAAGATTGGCCTCTTTATCAGCCTGATGTGAAAAACGC
TTTTTTAAACGATTATTTAGAAGAAGTGGTGTATATGAGCCCTCCTCCTGGCTTTGAAGCCCAATTTGACCAAAATGTAAAGTCTGTCTCAGGTAAGGTTGCCGTTTTGA
TTGTGTACGTTGATGACATTGTCTTATCAGGAGATGATACTGCTGAAATCAACAAATTGAAACAGAAAATGGCAGACGAATTTGAAATCAATGATCTAGGAAACCTAAAG
TATTTCCTTGGGATGGAGGTCACGAGGTCAAGAGAAGGAATCTTCGTTTCTCAACGGAAGCCTGCTGACACTCCTATCGAGTCTAATGCCAAATTAGGAGAGTCTGTTGA
CAAAGTTCCTGTTAACAAAGAAAGGTATCAGCGTCTAGTTGGAAGACTTAATTATCTAGCGCACACTAAACCAGACATTTCCTATGCTGTCAGCGTTGTGAGTCAGTTTA
TGCAGGCTCCCTACGAAGAACACATGGAAGCTGTAAACAGGATTCTGAGATATTTGAAAAGCACACCAGAAAAAAGGTTGATATTTAGAAAATCTGACAGAAGATGTATT
GAAGCATATACAGACTCTGACAGCGCAGGATCAGTGATTGACAGAAAATCTACCTCTGGGTACTGTACCTTTGTTTGGGGTAACCTTGTAACTTGGAGAAGTAAAAAGCA
AAGTGTGGTTGCCAAAAGCAGTGCTGAAGCAGAATACAGAGCCATGAGTCAAGGCATTTGTGAAAAATCTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGAGCATGGTGATGAAACCTCAGAAACCCTAGCCGTCATTGATGAAAATGATACTAGGCAAGATCCGCCGCCACCACCACCCTACCTGTCGTCGTTGCCGAT
TGAGTTTGCCGCCGCCACCAATTCCGCTGTCCGAGAAGCTGTCAACCACACTATTACAGTCCCTGCCGCCGCTTCCCTTAATCCGTCACTCGATTCAATGCGAGTGAGTC
TTGCTCCAAGTCTCGCTCCAAGTGAAGTCGCGTGTAGTCTCGCTCCAAGTGCAGTCACATGTTCTTCTTCACAACCAGCGAGAGTGAGTCTCGCTCCAAGTCTCGCTCCA
ACTGCATTCATGGGTTCTTATTCTCAACCAGCGAGAGTGAGTCTCGCTCCAAGTCTCGCTCAAGCTGCATATGCGAGTTCTTCTCAACCAGCGAGAGTGAGTCACATTTC
AAGCACTATTGCGGGAACTACTTCTCAAGGACTAGCGTTTCATGAGGCTGATGGCGCCAACTTCAATCCACATGCTTTGGGTTGTCACTATGTCCCTCCTACAGCTGCTG
CTGTCACTGGAATTCCTCACCAAGGACAATTTGATAATCTTCAAGCAAGTTTCCAACAACAAATTGCGGCTCTTGGGACAGCCCTTGGCGCTTCAACCATCCTTGACCAA
GGTGCAGTAAATTTAGGTATTCAATCAGGTTTACCGACGTATCCAGAGAATTCGTCCATTAAAATAGTACTTGAAGGACGCCACAAATTTGGGTATCTAACTGGCCAAAT
ATCGAAACCCAGACCAAGAGACCCTCAAGAGCTCTCTGTTAATTCACAGCATGGAACCTCAAATTGGGAAGCCTCTGTTATATGCAACTACGACTCAGGATATTTGGAAT
GCTATTCAGAAACTGTACTCCAAGAGGGGTCCATGGATGTTACGTCATATTTTAATAAATTATCTCTTATCTGGCAAGAGATGGACTTATGTCGTGAACTTGTCTGGGAC
TATCCTTGTGAAGGTGTCCAATATTATAAACTTGAAGAGGTTGACCGAGTGTATGATTTTTTAGCAGAGGTCCGTCTGGAGGAAGATAGATCGAGTGCCATGAATATCAC
TACTACCTCCGTAACTGATTCCGCCGCCTTCAGTGCGGAATCATCTGGAATTACTGGAGACAAGCAAAACGGGAAACCACCATATGTGAACACTTGCTGGAAGCTACATG
GTCGACCCCCGAATGGTAGACGACGACCTCCAAACAACAAAACCAACCAGGCTCTAGTCAGCGAGACTACTAGTGGCTCTCAATCACAACATCAGGAGATCTACCCAACT
GACAGCGGTACTATTTCTCTTGGGGCAATTGCACAATCAGGTATTTCTCAATCCTTGAGTCTTCTCAGTATTACTAGCAAGAAACCCTGGATCCTTGATTCAAGAGCCAC
ATACCACCTAACTGGAACTTCTGACAATTTTCTCTCTTATCATCCAGAGCTAAACTGTCGTGTGGCTTTCTCACCGGATGATGTTATATTTCAGGACTTGAGCTCAAGGA
GGACGATTGGCACTGTCCAGCACAATAGGGGACTCTATTTCCTTAATGGTGATACTTCCTCTAGGCACAATTATAGGGCTAGTCTATTATTTTCCTACTTTTCAACTTCT
GAAAATGACTTCTTCCTTCTAACAGATAAGTCTGAGGTTTCATCTATTTTTCAACAGTTTTACACCACCATTAAAACTCAGTTTAATGCCAAAATTTCCATCCTTCGGAG
TGACAATGGTCGTGAATTTCTCACCAATACCCTTCGTGAGTTCTTATCCACTAAAGGTATTGTTCACCAAAGCTCGTGTGCCTATACCCCCCAACAAAATGGAGTGGCTG
AAAGAAAAAACCGTCATCTCCTCGAAGTTGCTCGGTCTCTCATGTTGTCGACCTCTCATCTTATAAATCGGATGCCTTCCCGTGTTCTCAACTTCCAAACTCCTCTTGAC
TGTCTCAAATTATCTTATCCCACCACCACCACTCGCCTTATACCTGATGTCCCTCTCCGTGTATTTGGATGTACTGCATTTGTCCATAGCTTCGGTCCCAACCAGACTAA
GTTTACCCCTCATGCCCAGAAATGTGTCTTTGTTGGGTATCCTCTCCACCAGCGTGGTTATAAGTGCTTCCATCCCACTTCCCAGAAATACTTCATCTCTATGGATATCA
CCTTCCTGGAAGATAAACCTTTCTTTCCCGTTAGTCCTCTTCATGGGGAGAGTAGTAATGAAGAGATTAACTGCCATAATCCTCCTCCTGAACCCATCATGGCAACCCAC
GACACTGTCCTACCCACTAGACAGGCCCCATGGATAACGTACTACAGGAAGAATCTTAGAAAGGAAATAGTAGCCCCTACTCGCATGCCGGCTCCGATCCAATCTTCTGA
ACCAACACAGGTTCAAGGTACTACTGATCCTAATAATCATACTGTTTGCGTTGAAAATGATATTGTTGATCTGACAAAACTGCCAGTAAAGAATAATAGAGGTGATATGA
CAGAAAATAACCAGGTTGCTGAGAGCGATAATTCCACAACTGAAGAGGAACTCGAAAAAGCAGGAGAGTATGATGCCAGCCTTGATATGCCCATTGCTCTGAGAAAGGGC
ACAAGGTCTTGCACCAGGTATCCCAGGTATAGCTTCCTTTCTTATAATAATCTGTCTTCTAAGTTCAGAGCGTTTGCTGCCAGCCTTGACATTGTAACTATACCGAAGAA
CATATATGTGGTTATGGAAATTCTGGAGTGGAAGGCTGTTGTTATGGAAGAAGTGAGAGCTCTTGAGAAAAATAATATATGGGAACAAGTTGCTCTTCCGAAAGGGCATA
AACCAGTTGGGTGTAAGTGGGTGTTCACAATGAAGTACAAATCTAATGGAACTCTTGACAGGTATAAAGCTAGGCTTGTAGCAAAAGGGTTCACTCAAACTTTTGGGGTA
GACTACTCTAAGACTTTCTCCCCTATTGCGAAGTTAAACACGATTAGGGTTCTTCTGTCGGTTGCTGTCAACAAAGATTGGCCTCTTTATCAGCCTGATGTGAAAAACGC
TTTTTTAAACGATTATTTAGAAGAAGTGGTGTATATGAGCCCTCCTCCTGGCTTTGAAGCCCAATTTGACCAAAATGTAAAGTCTGTCTCAGGTAAGGTTGCCGTTTTGA
TTGTGTACGTTGATGACATTGTCTTATCAGGAGATGATACTGCTGAAATCAACAAATTGAAACAGAAAATGGCAGACGAATTTGAAATCAATGATCTAGGAAACCTAAAG
TATTTCCTTGGGATGGAGGTCACGAGGTCAAGAGAAGGAATCTTCGTTTCTCAACGGAAGCCTGCTGACACTCCTATCGAGTCTAATGCCAAATTAGGAGAGTCTGTTGA
CAAAGTTCCTGTTAACAAAGAAAGGTATCAGCGTCTAGTTGGAAGACTTAATTATCTAGCGCACACTAAACCAGACATTTCCTATGCTGTCAGCGTTGTGAGTCAGTTTA
TGCAGGCTCCCTACGAAGAACACATGGAAGCTGTAAACAGGATTCTGAGATATTTGAAAAGCACACCAGAAAAAAGGTTGATATTTAGAAAATCTGACAGAAGATGTATT
GAAGCATATACAGACTCTGACAGCGCAGGATCAGTGATTGACAGAAAATCTACCTCTGGGTACTGTACCTTTGTTTGGGGTAACCTTGTAACTTGGAGAAGTAAAAAGCA
AAGTGTGGTTGCCAAAAGCAGTGCTGAAGCAGAATACAGAGCCATGAGTCAAGGCATTTGTGAAAAATCTGGTTGA
Protein sequenceShow/hide protein sequence
MVSEHGDETSETLAVIDENDTRQDPPPPPPYLSSLPIEFAAATNSAVREAVNHTITVPAAASLNPSLDSMRVSLAPSLAPSEVACSLAPSAVTCSSSQPARVSLAPSLAP
TAFMGSYSQPARVSLAPSLAQAAYASSSQPARVSHISSTIAGTTSQGLAFHEADGANFNPHALGCHYVPPTAAAVTGIPHQGQFDNLQASFQQQIAALGTALGASTILDQ
GAVNLGIQSGLPTYPENSSIKIVLEGRHKFGYLTGQISKPRPRDPQELSVNSQHGTSNWEASVICNYDSGYLECYSETVLQEGSMDVTSYFNKLSLIWQEMDLCRELVWD
YPCEGVQYYKLEEVDRVYDFLAEVRLEEDRSSAMNITTTSVTDSAAFSAESSGITGDKQNGKPPYVNTCWKLHGRPPNGRRRPPNNKTNQALVSETTSGSQSQHQEIYPT
DSGTISLGAIAQSGISQSLSLLSITSKKPWILDSRATYHLTGTSDNFLSYHPELNCRVAFSPDDVIFQDLSSRRTIGTVQHNRGLYFLNGDTSSRHNYRASLLFSYFSTS
ENDFFLLTDKSEVSSIFQQFYTTIKTQFNAKISILRSDNGREFLTNTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSHLINRMPSRVLNFQTPLD
CLKLSYPTTTTRLIPDVPLRVFGCTAFVHSFGPNQTKFTPHAQKCVFVGYPLHQRGYKCFHPTSQKYFISMDITFLEDKPFFPVSPLHGESSNEEINCHNPPPEPIMATH
DTVLPTRQAPWITYYRKNLRKEIVAPTRMPAPIQSSEPTQVQGTTDPNNHTVCVENDIVDLTKLPVKNNRGDMTENNQVAESDNSTTEEELEKAGEYDASLDMPIALRKG
TRSCTRYPRYSFLSYNNLSSKFRAFAASLDIVTIPKNIYVVMEILEWKAVVMEEVRALEKNNIWEQVALPKGHKPVGCKWVFTMKYKSNGTLDRYKARLVAKGFTQTFGV
DYSKTFSPIAKLNTIRVLLSVAVNKDWPLYQPDVKNAFLNDYLEEVVYMSPPPGFEAQFDQNVKSVSGKVAVLIVYVDDIVLSGDDTAEINKLKQKMADEFEINDLGNLK
YFLGMEVTRSREGIFVSQRKPADTPIESNAKLGESVDKVPVNKERYQRLVGRLNYLAHTKPDISYAVSVVSQFMQAPYEEHMEAVNRILRYLKSTPEKRLIFRKSDRRCI
EAYTDSDSAGSVIDRKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEAEYRAMSQGICEKSG