; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G02000 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G02000
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionBeta-galactosidase
Genome locationChr5:2728558..2730843
RNA-Seq ExpressionCSPI05G02000
SyntenyCSPI05G02000
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025363.1 Beta-galactosidase [Cucumis melo var. makuwa]2.1e-17349.47Show/hide
Query:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY
        MVSE+ N  TLE    +T  E +          V AAA     +AA+++LL  LQK        +PQ  AP  D   +     +     +  PF  +A  
Subjt:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY

Query:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY
        +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPMYS+ 
Subjt:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY

Query:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT
        PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+AAT
Subjt:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT

Query:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR
        AKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQR
Subjt:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR

Query:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A
        P+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  SDK+N K IP                                  N+ RAY+SE   A
Subjt:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A

Query:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL
           Q +DP  ++      TLGAI                 NPWILD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVL
Subjt:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL

Query:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH
        H                                D+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPH
Subjt:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH

Query:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ
        LFSKV++++LSCD                               GPSK+TTSS K+
Subjt:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ

KAA0050140.1 Beta-galactosidase [Cucumis melo var. makuwa]3.0e-17250.98Show/hide
Query:  MDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVL---PSNSNRLPPLLPSNLYGQPPNDPSYHPD-VKNS
        M++LL  LQK        +PQ  AP P H     +P  AP+   VQP S+ + +  PHAP       PS  N         LY  P   P +  + +   
Subjt:  MDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVL---PSNSNRLPPLLPSNLYGQPPNDPSYHPD-VKNS

Query:  QIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKM
        Q  S  E GESS +S                                + LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM
Subjt:  QIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKM

Query:  VLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNK
         LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNK
Subjt:  VLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNK

Query:  LSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSS
        LSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  S
Subjt:  LSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSS

Query:  DKHNRKPIP----------------------------------NTWRAYVSEY--AEPPQQSDPHKNKIDLSLATLGAI-----------------NPWI
        DK+N K IP                                  N+ RAY+SE   A   Q +DP  ++      TLGAI                 NPWI
Subjt:  DKHNRKPIP----------------------------------NTWRAYVSEY--AEPPQQSDPHKNKIDLSLATLGAI-----------------NPWI

Query:  LDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVLH--------------------------------DLSSGRMIGTA
        LD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVLH                                D+SSGR IGTA
Subjt:  LDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVLH--------------------------------DLSSGRMIGTA

Query:  RHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCD-------------------------------
        RHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCD                               
Subjt:  RHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCD-------------------------------

Query:  GPSKITTSSEKQ
        GPSK+TTSS K+
Subjt:  GPSKITTSSEKQ

KAA0052775.1 Beta-galactosidase [Cucumis melo var. makuwa]7.9e-17349.47Show/hide
Query:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY
        MVSE+ N  TLE    +T  E  TE  A               +AAM++LL  LQK        +PQ  AP  D   +     +     +  PF  +A  
Subjt:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY

Query:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY
        +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPMYS+ 
Subjt:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY

Query:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT
        PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+A T
Subjt:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT

Query:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR
        AKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQR
Subjt:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR

Query:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A
        P+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  SDK+N K IP                                  N+ RAY+SE   A
Subjt:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A

Query:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL
           Q +DP  ++      TLGAI                 NPWILD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVL
Subjt:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL

Query:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH
        H                                D+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPH
Subjt:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH

Query:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ
        LFSKV++++LSCD                               GPSK+TTSS K+
Subjt:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ

TYK11240.1 Beta-galactosidase [Cucumis melo var. makuwa]1.2e-17649.73Show/hide
Query:  SERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAYIA
        SE+ N  TLE    +T  E      A   +A ++AA+DA ++AAM++LL  LQK        +PQ  AP  D   +     +     +  PF  +A  + 
Subjt:  SERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAYIA

Query:  PHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEYPV
         +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPMYS+ PV
Subjt:  PHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEYPV

Query:  NSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAK
         SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+A TAK
Subjt:  NSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAK

Query:  DIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPI
        D+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+
Subjt:  DIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPI

Query:  PSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--AEP
        PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  SDK+N K IP                                  N+ RAY+SE   A  
Subjt:  PSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--AEP

Query:  PQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVLH-
         Q +DP  ++      TLGAI                 NPWILD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVLH 
Subjt:  PQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVLH-

Query:  -------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLF
                                       D+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPHLF
Subjt:  -------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLF

Query:  SKVEMTTLSCD-------------------------------GPSKITTSSEKQ
        SKV++++LSCD                               GPSK+TTSS K+
Subjt:  SKVEMTTLSCD-------------------------------GPSKITTSSEKQ

TYK23439.1 Beta-galactosidase [Cucumis melo var. makuwa]9.0e-17749.74Show/hide
Query:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY
        MVSE+ N  TLE    +T  E +        +A  AAA+DA ++AA+++LL  LQK        +PQ  AP  D   +     +     +  PF  +A  
Subjt:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY

Query:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY
        +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPMYS+ 
Subjt:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY

Query:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT
        PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+A T
Subjt:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT

Query:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR
        AKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQR
Subjt:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR

Query:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A
        P+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  SDK+N K IP                                  N+ RAY+SE   A
Subjt:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A

Query:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL
           Q +DP  ++      TLGAI                 NPWILD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVL
Subjt:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL

Query:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH
        H                                D+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPH
Subjt:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH

Query:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ
        LFSKV++++LSCD                               GPSK+TTSS K+
Subjt:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ

TrEMBL top hitse value%identityAlignment
A0A5A7SL21 Beta-galactosidase1.0e-17349.47Show/hide
Query:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY
        MVSE+ N  TLE    +T  E +          V AAA     +AA+++LL  LQK        +PQ  AP  D   +     +     +  PF  +A  
Subjt:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY

Query:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY
        +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPMYS+ 
Subjt:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY

Query:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT
        PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+AAT
Subjt:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT

Query:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR
        AKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQR
Subjt:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR

Query:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A
        P+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  SDK+N K IP                                  N+ RAY+SE   A
Subjt:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A

Query:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL
           Q +DP  ++      TLGAI                 NPWILD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVL
Subjt:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL

Query:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH
        H                                D+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPH
Subjt:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH

Query:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ
        LFSKV++++LSCD                               GPSK+TTSS K+
Subjt:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ

A0A5A7U4D7 Beta-galactosidase1.5e-17250.98Show/hide
Query:  MDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVL---PSNSNRLPPLLPSNLYGQPPNDPSYHPD-VKNS
        M++LL  LQK        +PQ  AP P H     +P  AP+   VQP S+ + +  PHAP       PS  N         LY  P   P +  + +   
Subjt:  MDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVL---PSNSNRLPPLLPSNLYGQPPNDPSYHPD-VKNS

Query:  QIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKM
        Q  S  E GESS +S                                + LPMYS+ PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM
Subjt:  QIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKM

Query:  VLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNK
         LEGR +F FLTGE  RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+AATAKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNK
Subjt:  VLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNK

Query:  LSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSS
        LSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  S
Subjt:  LSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSS

Query:  DKHNRKPIP----------------------------------NTWRAYVSEY--AEPPQQSDPHKNKIDLSLATLGAI-----------------NPWI
        DK+N K IP                                  N+ RAY+SE   A   Q +DP  ++      TLGAI                 NPWI
Subjt:  DKHNRKPIP----------------------------------NTWRAYVSEY--AEPPQQSDPHKNKIDLSLATLGAI-----------------NPWI

Query:  LDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVLH--------------------------------DLSSGRMIGTA
        LD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVLH                                D+SSGR IGTA
Subjt:  LDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVLH--------------------------------DLSSGRMIGTA

Query:  RHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCD-------------------------------
        RHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPHLFSKV++++LSCD                               
Subjt:  RHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCD-------------------------------

Query:  GPSKITTSSEKQ
        GPSK+TTSS K+
Subjt:  GPSKITTSSEKQ

A0A5A7UGB2 Beta-galactosidase3.8e-17349.47Show/hide
Query:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY
        MVSE+ N  TLE    +T  E  TE  A               +AAM++LL  LQK        +PQ  AP  D   +     +     +  PF  +A  
Subjt:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY

Query:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY
        +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPMYS+ 
Subjt:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY

Query:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT
        PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+A T
Subjt:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT

Query:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR
        AKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQR
Subjt:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR

Query:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A
        P+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  SDK+N K IP                                  N+ RAY+SE   A
Subjt:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A

Query:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL
           Q +DP  ++      TLGAI                 NPWILD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVL
Subjt:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL

Query:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH
        H                                D+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPH
Subjt:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH

Query:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ
        LFSKV++++LSCD                               GPSK+TTSS K+
Subjt:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ

A0A5D3CIR0 Beta-galactosidase5.7e-17749.73Show/hide
Query:  SERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAYIA
        SE+ N  TLE    +T  E      A   +A ++AA+DA ++AAM++LL  LQK        +PQ  AP  D   +     +     +  PF  +A  + 
Subjt:  SERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAYIA

Query:  PHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEYPV
         +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPMYS+ PV
Subjt:  PHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEYPV

Query:  NSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAK
         SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+A TAK
Subjt:  NSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAK

Query:  DIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPI
        D+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQRP+
Subjt:  DIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPI

Query:  PSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--AEP
        PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  SDK+N K IP                                  N+ RAY+SE   A  
Subjt:  PSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--AEP

Query:  PQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVLH-
         Q +DP  ++      TLGAI                 NPWILD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVLH 
Subjt:  PQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVLH-

Query:  -------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLF
                                       D+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPHLF
Subjt:  -------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLF

Query:  SKVEMTTLSCD-------------------------------GPSKITTSSEKQ
        SKV++++LSCD                               GPSK+TTSS K+
Subjt:  SKVEMTTLSCD-------------------------------GPSKITTSSEKQ

A0A5D3DJM7 Beta-galactosidase4.4e-17749.74Show/hide
Query:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY
        MVSE+ N  TLE    +T  E +        +A  AAA+DA ++AA+++LL  LQK        +PQ  AP  D   +     +     +  PF  +A  
Subjt:  MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAY

Query:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY
        +  +AP  V PSN +  P P  PS   GQ P+  +        Q++   +  +   +S   +              R  I A E++  +  T LPMYS+ 
Subjt:  IAPHAPIYVLPSNSNRLP-PLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTST-LPMYSEY

Query:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT
        PV SFPN  S Y+T ++  SS  + SGEKLNG NYFSWSQ +KM LEGR +F FLTGEI RP PGD  ER WK EDS++RS+LINSMEPQIGKPLL+A T
Subjt:  PVNSFPNVSSPYLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAAT

Query:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR
        AKD+WDT QTLYSKRQNASRLYTLRKQVH CKQGT+DVT++FNKLSL+WQEMDLCRE VW  P D  QY+++EE DR+YDFLAGLNPKFD V GRILGQR
Subjt:  AKDIWDTAQTLYSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQR

Query:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A
        P+PSLMEVC E+RLEEDRT+AM +  TPTIDSAAF AR SN  SDK+N K IP                                  N+ RAY+SE   A
Subjt:  PIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKHNRKPIP----------------------------------NTWRAYVSEY--A

Query:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL
           Q +DP  ++      TLGAI                 NPWILD G TDHLTGSSEHF+SY PCAGNE IRIADGSL PIAGK +     G +L NVL
Subjt:  EPPQQSDPHKNKIDLSLATLGAI-----------------NPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVL

Query:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH
        H                                D+SSGR IGTARHSRGLY+LDDDTS SS+ R SLLS+YF+TSEQDCMLWHFRLGHPNF YM+HLFPH
Subjt:  H--------------------------------DLSSGRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPH

Query:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ
        LFSKV++++LSCD                               GPSK+TTSS K+
Subjt:  LFSKVEMTTLSCD-------------------------------GPSKITTSSEKQ

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.7e-1620.8Show/hide
Query:  KLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPG---------DPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNAS
        KL   NY  WS+ V  + +G +   FL G    P            +P    WK +D ++ S ++ ++   +   +  A TA  IW+T + +Y+   +  
Subjt:  KLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPG---------DPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNAS

Query:  RLYTLRKQVHECKQGTMDVTSF-------FNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLN--PKFDVVRGRILGQRPIPSLMEVCS
         +  LR Q+ +  +GT  +  +       F++L+L+ + MD        +  + V  +  EE   + D +A  +  P    +  R+L        +   +
Subjt:  RLYTLRKQVHECKQGTMDVTSF-------FNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLN--PKFDVVRGRILGQRPIPSLMEVCS

Query:  EIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKH---------NRKPIPNTWRAYVSE-------------------------YAEPPQQSDPHKNK
         I +  +  S  N + T   ++     R+ N +++ +         N  P  N  + Y+ +                           +PP    P + +
Subjt:  EIRLEEDRTSAMNISATPTIDSAAFCARFSNSSSDKH---------NRKPIPNTWRAYVSE-------------------------YAEPPQQSDPHKNK

Query:  IDLSLATLGAINPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIA---GKREDFSCAGLSLHNVLH
         +L+L +  + N W+LD G T H+T    +   + P  G + + +ADGS  PI+             L+LHN+L+
Subjt:  IDLSLATLGAINPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIA---GKREDFSCAGLSLHNVLH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-1120.95Show/hide
Query:  KLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRP--------LPG-DPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNAS
        KL   NY  WS+ V  + +G +   FL G  P P        +P  +P    W+ +D ++ S ++ ++   +   +  A TA  IW+T + +Y+   N S
Subjt:  KLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRP--------LPG-DPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNAS

Query:  RLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRT
          +  +          +   + F++L+L+ + MD                     ++++   L  L   +  V  +I  +   PSL E+   +   E + 
Subjt:  RLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRT

Query:  SAMNISATPTIDSAAFCARFSN----------------------------SSSDKHNRKPIPNTWRAYVS----------------EYAEPPQQS----D
         A+N +    I +     R +N                            S S   NR+P P   R  +                 +     QQS     
Subjt:  SAMNISATPTIDSAAFCARFSN----------------------------SSSDKHNRKPIPNTWRAYVS----------------EYAEPPQQS----D

Query:  PHKNKIDLSLATLGAINPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPI
        P + + +L++ +    N W+LD G T H+T    +   + P  G + + IADGS  PI
Subjt:  PHKNKIDLSLATLGAINPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPI

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.7e-1725.59Show/hide
Query:  YLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTL
        YL   +   S + +     + +NY +W    +  L   +KF F+ G +P+P P  P  + W+  ++++   L+NSM  ++ + +++A TA  +W+  + +
Subjt:  YLTNTVAQSSMYHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTL

Query:  YSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCREL-------VWRDPTDGVQYSRIEENDRIYDFLAG--LNPKFDVVRGRILGQRPI
        +    +  ++Y LR+++   +QG   V  +F KLS +W E+     +          + T   + +R  E ++ Y+FL G  LN  F+ V  +I+ Q+P 
Subjt:  YSKRQNASRLYTLRKQVHECKQGTMDVTSFFNKLSLIWQEMDLCREL-------VWRDPTDGVQYSRIEENDRIYDFLAG--LNPKFDVVRGRILGQRPI

Query:  PSLMEVCSEIR
        PSL E  + ++
Subjt:  PSLMEVCSEIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGGGACAGCCTTCAATTTTAGTGCTGTCGTAGCTGC
TGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGTCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGTCACCGG
ATCACCACGCACTTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTG
CCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAAC
ATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGG
CTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATG
TATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTTAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAAT
ACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTAT
TGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAG
CAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTA
CTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGA
TGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCCGCTGCTTTTTGTGCAAGGTTTTCTAACAGT
AGCAGTGACAAACATAATCGAAAACCAATTCCTAACACATGGCGGGCGTATGTGAGTGAGTATGCTGAACCTCCTCAACAATCTGATCCACACAAAAACAAAATTGATCT
CAGTCTTGCCACTTTAGGTGCCATTAACCCCTGGATTCTGGATTTTGGTGTCACCGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGA
ACGAGACAATTAGAATTGCAGATGGCTCCTTGACCCCCATTGCTGGAAAAAGGGAAGATTTCTCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGACTTGAGCTCG
GGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTACCTATTTCACTAC
TTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTAT
CTTGTGATGGACCATCCAAGATAACAACCTCATCTGAAAAACAACAACCTCATCTGGAAAACGGTGGTTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGAGCGGGACAATGAAAACACCCTAGAAACCCAAAAAAACCAAACCACTTATGAAAATCAAACAGAAGGGACAGCCTTCAATTTTAGTGCTGTCGTAGCTGC
TGCCATCGATGCTCGGATGAGTGCTGCCATGGACGAATTATTAAGCCGTCTACAGAAAACGTCCGAAAATAATTTTTCGTCATTACCGCAGTCGTCCGCGCCGTCACCGG
ATCACCACGCACTTGGTTTTCTTCCTCAGACGGCGCCGACCATCCCATCTGTCCAACCCTTTTCTTCGTCCGCGGCCTATATTGCTCCCCACGCCCCGATTTATGTTCTG
CCATCTAATTCCAATCGGCTACCACCGCTTCTGCCGTCAAATCTGTATGGCCAGCCACCCAATGATCCTAGCTACCATCCCGATGTTAAAAACTCTCAAATTCACTCAAC
ATTTGAGGTTGGTGAATCTTCGGCATATTCCAACCGTAACGTGCAAGCTTCCTCGGGAATAGTTCATCAACAATTGGAAGGGCTTCGACAACAGATAGCAGCACTTGAGG
CTACCTTAGGGACGACATCCACTCTACCGATGTATTCTGAGTATCCGGTAAACTCGTTCCCTAATGTATCCTCTCCTTATTTGACTAATACGGTGGCTCAGTCTTCCATG
TATCATCTTTCAGGAGAAAAGTTGAATGGCAACAACTATTTCTCATGGTCTCAGTTAGTAAAGATGGTCCTCGAAGGACGACAAAAATTTAGCTTTCTGACAGGGGAAAT
ACCTCGCCCCCTACCGGGCGACCCACATGAACGATATTGGAAGGCAGAAGACTCTATTCTTCGATCCATATTGATCAATAGTATGGAACCTCAAATTGGCAAGCCGTTAT
TGTTTGCTGCAACAGCCAAGGATATTTGGGACACAGCCCAGACACTTTACTCAAAACGTCAGAATGCCTCTCGTCTATACACGCTGAGAAAGCAAGTTCATGAATGCAAG
CAAGGAACCATGGATGTCACATCCTTTTTCAATAAGCTTTCTCTTATATGGCAAGAAATGGACCTATGCAGAGAACTAGTCTGGCGTGATCCCACTGATGGTGTACAGTA
CTCGAGAATTGAAGAGAATGACAGGATTTATGACTTTCTTGCTGGTCTTAATCCTAAGTTTGATGTAGTTCGAGGGCGTATACTAGGTCAAAGACCGATTCCCTCCCTGA
TGGAAGTTTGCTCTGAAATCCGCCTCGAGGAAGATCGCACAAGTGCTATGAATATTTCCGCAACCCCTACTATTGACTCCGCTGCTTTTTGTGCAAGGTTTTCTAACAGT
AGCAGTGACAAACATAATCGAAAACCAATTCCTAACACATGGCGGGCGTATGTGAGTGAGTATGCTGAACCTCCTCAACAATCTGATCCACACAAAAACAAAATTGATCT
CAGTCTTGCCACTTTAGGTGCCATTAACCCCTGGATTCTGGATTTTGGTGTCACCGATCATTTGACTGGGTCCTCTGAACATTTTGTATCTTACATTCCTTGTGCTGGGA
ACGAGACAATTAGAATTGCAGATGGCTCCTTGACCCCCATTGCTGGAAAAAGGGAAGATTTCTCTTGTGCAGGGCTCTCCTTACATAATGTTTTGCATGACTTGAGCTCG
GGGAGGATGATTGGCACTGCCCGGCATAGTAGGGGACTCTACCTCCTTGATGACGATACCTCTTCTAGTAGCATTCCTAGGACTAGTCTCTTATCTACCTATTTCACTAC
TTCTGAACAAGATTGTATGTTGTGGCATTTTCGTTTAGGCCACCCTAATTTTCAATATATGAAACATTTATTTCCACATCTCTTCTCTAAAGTTGAGATGACTACCTTAT
CTTGTGATGGACCATCCAAGATAACAACCTCATCTGAAAAACAACAACCTCATCTGGAAAACGGTGGTTCGTAA
Protein sequenceShow/hide protein sequence
MVSERDNENTLETQKNQTTYENQTEGTAFNFSAVVAAAIDARMSAAMDELLSRLQKTSENNFSSLPQSSAPSPDHHALGFLPQTAPTIPSVQPFSSSAAYIAPHAPIYVL
PSNSNRLPPLLPSNLYGQPPNDPSYHPDVKNSQIHSTFEVGESSAYSNRNVQASSGIVHQQLEGLRQQIAALEATLGTTSTLPMYSEYPVNSFPNVSSPYLTNTVAQSSM
YHLSGEKLNGNNYFSWSQLVKMVLEGRQKFSFLTGEIPRPLPGDPHERYWKAEDSILRSILINSMEPQIGKPLLFAATAKDIWDTAQTLYSKRQNASRLYTLRKQVHECK
QGTMDVTSFFNKLSLIWQEMDLCRELVWRDPTDGVQYSRIEENDRIYDFLAGLNPKFDVVRGRILGQRPIPSLMEVCSEIRLEEDRTSAMNISATPTIDSAAFCARFSNS
SSDKHNRKPIPNTWRAYVSEYAEPPQQSDPHKNKIDLSLATLGAINPWILDFGVTDHLTGSSEHFVSYIPCAGNETIRIADGSLTPIAGKREDFSCAGLSLHNVLHDLSS
GRMIGTARHSRGLYLLDDDTSSSSIPRTSLLSTYFTTSEQDCMLWHFRLGHPNFQYMKHLFPHLFSKVEMTTLSCDGPSKITTSSEKQQPHLENGGS