; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G012940 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G012940
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionBeta-galactosidase
Genome locationCG_Chr01:26014381..26017285
RNA-Seq ExpressionClCG01G012940
SyntenyClCG01G012940
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031941.1 Beta-galactosidase [Cucumis melo var. makuwa]3.3e-16747.19Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        +GLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

KAA0052172.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.6e-16647.19Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHLQCV----------NICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L++I GK PWILDS
Subjt:  SRTGNHLQCV----------NICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY  CAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        +GLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

KAA0061447.1 Beta-galactosidase [Cucumis melo var. makuwa]1.6e-16647.06Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NP T +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        +GLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]8.6e-16847.33Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGEI +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        +GLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

TYK31717.1 Beta-galactosidase [Cucumis melo var. makuwa]1.9e-16745.92Show/hide
Query:  APTLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPM
        AP+  QP+       P +A+ ++G           QQ S V        +L+N  +  Q  I +L   L +   +DQ      +++G          LPM
Subjt:  APTLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPM

Query:  YPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------
        Y  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM LEG ++F  LTGEI +P P D  +R WKGEDSL+                 
Subjt:  YPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------

Query:  ------------------RQNASRLYTLRKQIHECKQGSMD-------------EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD------
                          RQNASRLYTLRKQ+H CKQG++D             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD      
Subjt:  ------------------RQNASRLYTLRKQIHECKQGSMD-------------EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD------

Query:  --------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN--------
                       VR  EDR++AM +  + T DSAAFSA+SS        G  +             + CWKLHGRPP GK+R  N K N        
Subjt:  --------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN--------

Query:  -------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN
               Q+   T+     P LG +   G  QSL L+++ GK PWILDS ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN
Subjt:  -------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN

Query:  -----------------------------------DLSSGKAIGTAQHNKGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------
                                           D+SSG+ IGTA+H++GLY L+D++S     R SLLSSYFST E D          CI        
Subjt:  -----------------------------------DLSSGKAIGTAQHNKGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------

Query:  ------------------------ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGI
                                +TTSSGKRWFVTFIDDHTRLT VYL++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI
Subjt:  ------------------------ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGI

Query:  IHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        +HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAAHLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  IHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

TrEMBL top hitse value%identityAlignment
A0A5A7SQW1 Beta-galactosidase1.6e-16747.19Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        +GLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

A0A5A7U8U2 Retrovirus-related Pol polyprotein from transposon TNT 1-947.8e-16747.19Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHLQCV----------NICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L++I GK PWILDS
Subjt:  SRTGNHLQCV----------NICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY  CAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        +GLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

A0A5A7V3J5 Beta-galactosidase7.8e-16747.06Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NP T +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        +GLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

A0A5D3E603 Beta-galactosidase4.1e-16847.33Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGEI +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        +GLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  KGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

A0A5D3E6F8 Beta-galactosidase9.2e-16845.92Show/hide
Query:  APTLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPM
        AP+  QP+       P +A+ ++G           QQ S V        +L+N  +  Q  I +L   L +   +DQ      +++G          LPM
Subjt:  APTLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPM

Query:  YPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------
        Y  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM LEG ++F  LTGEI +P P D  +R WKGEDSL+                 
Subjt:  YPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------

Query:  ------------------RQNASRLYTLRKQIHECKQGSMD-------------EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD------
                          RQNASRLYTLRKQ+H CKQG++D             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD      
Subjt:  ------------------RQNASRLYTLRKQIHECKQGSMD-------------EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD------

Query:  --------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN--------
                       VR  EDR++AM +  + T DSAAFSA+SS        G  +             + CWKLHGRPP GK+R  N K N        
Subjt:  --------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN--------

Query:  -------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN
               Q+   T+     P LG +   G  QSL L+++ GK PWILDS ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN
Subjt:  -------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN

Query:  -----------------------------------DLSSGKAIGTAQHNKGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------
                                           D+SSG+ IGTA+H++GLY L+D++S     R SLLSSYFST E D          CI        
Subjt:  -----------------------------------DLSSGKAIGTAQHNKGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------

Query:  ------------------------ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGI
                                +TTSSGKRWFVTFIDDHTRLT VYL++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI
Subjt:  ------------------------ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGI

Query:  IHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        +HQ+SCAYTPQQNGVAERKN HL+EVARSLML TSLPSYLWGDA+LTAAHLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  IHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-2440Show/hide
Query:  TSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEV
        T   K +FV F+D  T   + YL+  KS V  +FQ F    E  FN K+  L   NGRE+L+N + +F   KGI +  +  +TPQ NGV+ER    + E 
Subjt:  TSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEV

Query:  ARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRIL--NFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH
        AR+++    L    WG+AVLTA +LINRIPSR L  + +TP        P  +      LRVF  T +VH
Subjt:  ARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRIL--NFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-2737.7Show/hide
Query:  NSSYRTSLLSSYFSTFENDCIITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQ
        +S  + ++L   +S       I +  G ++FVTFIDD +R   VY+L  K +V  +FQ+F+  +E +   K+  L+S NG E+ +    E+ SS GI H+
Subjt:  NSSYRTSLLSSYFSTFENDCIITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQ

Query:  SSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH
         +   TPQ NGVAER N  ++E  RS++ +  LP   WG+AV TA +LINR PS  L F+ P         T + +    L+VF C AF H
Subjt:  SSCAYTPQQNGVAERKNHHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.3e-1232.06Show/hide
Query:  SGKRWFVTFIDDHTRLTLVYLLTDKSKVSF--IFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEV
        S   +F++F D+ TR   VY L D+ + S   +F      I+ +FN ++ ++Q   G E+   TLH+F +++GI    +     + +GVAER N  LL  
Subjt:  SGKRWFVTFIDDHTRLTLVYLLTDKSKVSF--IFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEV

Query:  ARSLMLLTSLPSYLWGDAVLTAAHLINRIPS
         R+L+  + LP++LW  AV  +  + N + S
Subjt:  ARSLMLLTSLPSYLWGDAVLTAAHLINRIPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.3e-2135.71Show/hide
Query:  ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLL
        I +    R++V F+D  TR T +Y L  KS+V   F  F   +E +F T+I    S NG EF+   L E+ S  GI H +S  +TP+ NG++ERK+ H++
Subjt:  ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLL

Query:  EVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAF
        E   +L+   S+P   W  A   A +LINR+P+ +L  ++P   L  + P         LRVF C  +
Subjt:  EVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-2137.27Show/hide
Query:  RWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLM
        R++V F+D  TR T +Y L  KS+V   F  F + +E +F T+I  L S NG EF+   L ++LS  GI H +S  +TP+ NG++ERK+ H++E+  +L+
Subjt:  RWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLM

Query:  LLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAF
           S+P   W  A   A +LINR+P+ +L  Q+P   L    P         L+VF C  +
Subjt:  LLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAF

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.4e-0536Show/hide
Query:  NHHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH
        N  ++E  RS++    LP     DA  TA H+IN+ PS  +NF  P      S PT        LR F C A++H
Subjt:  NHHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAAACCCGCCGCCGCCGCTACCTCCGAGTGAACCATCCAAGCCACCGTCGACCGCGACCTCTAATCACTCAACCCAGAAGTTGCCGCCGCCCTCTTTCGAAAT
CCGATCGTTGCCAACGGGGAATGGTGAAGCCGCATTCGATTTTGCAAGATTGAGTCTCGATCCAAGTATCGCTCTTGAGTGTCGCTACGAATCGCATCTCGCTCCAAGTC
TCGCTCCAACTCTCGCTCAGACCGCAGTCGAAGGAACCTCTCAATCAGCGAGAGTGAATCTCGCTCCAACTCTCGCTCCAAATCCAGTGAGAGTGCATCTCGCTGCTCCG
ACTCTCGCTCAACCTGCAATGGATCTTTGGGAGTATTTCCCTCCTACAGCTGCTGCCACTGCTGGAATTTCTCACCAAGGGGTCGGAAACTACTATGAGCAACAAGGATC
GGCTGTCGCTTCCTCGGCTATGGTTCAACAGCACCTTGCTAACCTACAAGCAAGTTTCCAACAACAAATTGCAGCTCTTGGGGCAGCCCTTGGTGCTTCAACCACTCTAG
ATCAAAGTGCAGAAAGTTTAGGTGTTCAATCAGGGTTACCGATGTATCCAGACAATCCGGTGACTATCTACCCTACCTTAACAACTACACATCATATGTCTGGACAAATG
GGTAACTCTACATGGTTAATTGTTGGTGAAAAATTGAATGGCCAAAACTATTTCTCCTGGTCTCAGTCAGTTAAAATGGTGCTTGAAGGGTGCCACAAGTTCGAATGCCT
GACTGGTGAAATACCTAAACCCAGACCCAAAGATCCTCAGAAGCGTTTTTGGAAGGGAGAAGATTCATTACTTCGGCAGAATGCATCACGACTTTACACTCTGAGAAAAC
AAATCCATGAATGTAAACAGGGATCCATGGATGAGATGGACTTATGTCGTGAGCTTATCTGGGACTGTTCTTGTGGAGGAGTCCAATATTATAAACTTGAAGAGGTTGAC
CGTGTTTATGATTTCTTAGCAGGCCTCAATTCTAAGTTTGATGTTGTACGGAGTTCGGAAGATAGATCAAGCGCCATGAATATTACTGCCTCTTCCACAGCTGATTCTGC
TGCATTTAGTGCGAAATCATCTGGAACTACTGGGACAAGTAGAACAGGAAACCACCTCCAGTGTGTGAACATTTGTTGGAAGTTACACGGTCGGCCCCCGAATGGAAAAC
GACGACCTCCAAATAACAAGCCCAACCAGGCTTTGAGAACCACCCTAGTGATACCAGTACTTCCTCCCTTGGGGCAATTGCACAATCAGGGTACTTCTCAATCCTTGAGT
CTCCTCAATATTACAGGTAAGAAACCTTGGATTCTTGACTCAAGAGCTACAGACCATTTATCTGGAACTTCTGCAAACTTCATATCTTATCATCCGTGTGCCGGTAATGA
GAAAATTCGGATTGCTGAGAGGACACTTACCCTAGTGGCCGGCAAGGGTCATGTTTCTCCTTATGATGGTTTAATATTACAGAATGACTTGAGCTCGGGGAAGGCAATTG
GCACTGCCCAGCACAATAAAGGACTCTATTTCCTTAATGATAATTCTTCCTATAGGACTAGTCTGCTATCTTCCTACTTTTCAACTTTTGAAAATGACTGTATAATTACC
ACTTCCTCTGGTAAACGTTGGTTTGTCACCTTCATTGATGACCACACTCGTCTTACTTTGGTTTACCTTCTTACAGATAAATCTAAAGTCTCATTCATCTTCCAACAATT
TTACACCACCATTGAAACTAAGTTTAATACCAAAATTGCCATCCTTCAGAGTGGCAATGGTCGTGAATTCCTTACCAATACCCTCCATGAGTTCTTATCCTCTAAAGGCA
TTATTCACCAGAGTTCATGTGCTTATACTCCCCAACAAAATGGAGTGGCTGAAAGGAAAAATCATCATCTCCTTGAAGTTGCTCGATCTCTCATGTTGTTGACCTCTCTA
CCGTCTTACTTGTGGGGGGATGCAGTCTTGACTGCCGCCCATCTCATTAACCGGATACCTTCCCGCATTCTTAATTTCCAAACTCCTCTAAACTGCCTCAAATTGTCTTA
TCCGACCACTCGCCTAATACCTGACGTCCCTCTCCGAGTATTTGAGTGTACTGCATTTGTCCATAGCTTTGGTGTCACACCCCCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAAACCCGCCGCCGCCGCTACCTCCGAGTGAACCATCCAAGCCACCGTCGACCGCGACCTCTAATCACTCAACCCAGAAGTTGCCGCCGCCCTCTTTCGAAAT
CCGATCGTTGCCAACGGGGAATGGTGAAGCCGCATTCGATTTTGCAAGATTGAGTCTCGATCCAAGTATCGCTCTTGAGTGTCGCTACGAATCGCATCTCGCTCCAAGTC
TCGCTCCAACTCTCGCTCAGACCGCAGTCGAAGGAACCTCTCAATCAGCGAGAGTGAATCTCGCTCCAACTCTCGCTCCAAATCCAGTGAGAGTGCATCTCGCTGCTCCG
ACTCTCGCTCAACCTGCAATGGATCTTTGGGAGTATTTCCCTCCTACAGCTGCTGCCACTGCTGGAATTTCTCACCAAGGGGTCGGAAACTACTATGAGCAACAAGGATC
GGCTGTCGCTTCCTCGGCTATGGTTCAACAGCACCTTGCTAACCTACAAGCAAGTTTCCAACAACAAATTGCAGCTCTTGGGGCAGCCCTTGGTGCTTCAACCACTCTAG
ATCAAAGTGCAGAAAGTTTAGGTGTTCAATCAGGGTTACCGATGTATCCAGACAATCCGGTGACTATCTACCCTACCTTAACAACTACACATCATATGTCTGGACAAATG
GGTAACTCTACATGGTTAATTGTTGGTGAAAAATTGAATGGCCAAAACTATTTCTCCTGGTCTCAGTCAGTTAAAATGGTGCTTGAAGGGTGCCACAAGTTCGAATGCCT
GACTGGTGAAATACCTAAACCCAGACCCAAAGATCCTCAGAAGCGTTTTTGGAAGGGAGAAGATTCATTACTTCGGCAGAATGCATCACGACTTTACACTCTGAGAAAAC
AAATCCATGAATGTAAACAGGGATCCATGGATGAGATGGACTTATGTCGTGAGCTTATCTGGGACTGTTCTTGTGGAGGAGTCCAATATTATAAACTTGAAGAGGTTGAC
CGTGTTTATGATTTCTTAGCAGGCCTCAATTCTAAGTTTGATGTTGTACGGAGTTCGGAAGATAGATCAAGCGCCATGAATATTACTGCCTCTTCCACAGCTGATTCTGC
TGCATTTAGTGCGAAATCATCTGGAACTACTGGGACAAGTAGAACAGGAAACCACCTCCAGTGTGTGAACATTTGTTGGAAGTTACACGGTCGGCCCCCGAATGGAAAAC
GACGACCTCCAAATAACAAGCCCAACCAGGCTTTGAGAACCACCCTAGTGATACCAGTACTTCCTCCCTTGGGGCAATTGCACAATCAGGGTACTTCTCAATCCTTGAGT
CTCCTCAATATTACAGGTAAGAAACCTTGGATTCTTGACTCAAGAGCTACAGACCATTTATCTGGAACTTCTGCAAACTTCATATCTTATCATCCGTGTGCCGGTAATGA
GAAAATTCGGATTGCTGAGAGGACACTTACCCTAGTGGCCGGCAAGGGTCATGTTTCTCCTTATGATGGTTTAATATTACAGAATGACTTGAGCTCGGGGAAGGCAATTG
GCACTGCCCAGCACAATAAAGGACTCTATTTCCTTAATGATAATTCTTCCTATAGGACTAGTCTGCTATCTTCCTACTTTTCAACTTTTGAAAATGACTGTATAATTACC
ACTTCCTCTGGTAAACGTTGGTTTGTCACCTTCATTGATGACCACACTCGTCTTACTTTGGTTTACCTTCTTACAGATAAATCTAAAGTCTCATTCATCTTCCAACAATT
TTACACCACCATTGAAACTAAGTTTAATACCAAAATTGCCATCCTTCAGAGTGGCAATGGTCGTGAATTCCTTACCAATACCCTCCATGAGTTCTTATCCTCTAAAGGCA
TTATTCACCAGAGTTCATGTGCTTATACTCCCCAACAAAATGGAGTGGCTGAAAGGAAAAATCATCATCTCCTTGAAGTTGCTCGATCTCTCATGTTGTTGACCTCTCTA
CCGTCTTACTTGTGGGGGGATGCAGTCTTGACTGCCGCCCATCTCATTAACCGGATACCTTCCCGCATTCTTAATTTCCAAACTCCTCTAAACTGCCTCAAATTGTCTTA
TCCGACCACTCGCCTAATACCTGACGTCCCTCTCCGAGTATTTGAGTGTACTGCATTTGTCCATAGCTTTGGTGTCACACCCCCTCCTTGA
Protein sequenceShow/hide protein sequence
MVSNPPPPLPPSEPSKPPSTATSNHSTQKLPPPSFEIRSLPTGNGEAAFDFARLSLDPSIALECRYESHLAPSLAPTLAQTAVEGTSQSARVNLAPTLAPNPVRVHLAAP
TLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSGLPMYPDNPVTIYPTLTTTHHMSGQM
GNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLLRQNASRLYTLRKQIHECKQGSMDEMDLCRELIWDCSCGGVQYYKLEEVD
RVYDFLAGLNSKFDVVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHLQCVNICWKLHGRPPNGKRRPPNNKPNQALRTTLVIPVLPPLGQLHNQGTSQSLS
LLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQNDLSSGKAIGTAQHNKGLYFLNDNSSYRTSLLSSYFSTFENDCIIT
TSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNHHLLEVARSLMLLTSL
PSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFGVTPPP