; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G12775 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G12775
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBeta-galactosidase
Genome locationClcChr01:24509140..24512044
RNA-Seq ExpressionClc01G12775
SyntenyClc01G12775
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031941.1 Beta-galactosidase [Cucumis melo var. makuwa]3.8e-16847.46Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        RGLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

KAA0052172.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.9e-16747.46Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHLQCV----------NICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L++I GK PWILDS
Subjt:  SRTGNHLQCV----------NICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY  CAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        RGLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

KAA0061447.1 Beta-galactosidase [Cucumis melo var. makuwa]1.9e-16747.33Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NP T +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        RGLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]1.0e-16847.59Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGEI +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        RGLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

TYK31717.1 Beta-galactosidase [Cucumis melo var. makuwa]1.7e-16846.17Show/hide
Query:  APTLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPM
        AP+  QP+       P +A+ ++G           QQ S V        +L+N  +  Q  I +L   L +   +DQ      +++G          LPM
Subjt:  APTLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPM

Query:  YPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------
        Y  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM LEG ++F  LTGEI +P P D  +R WKGEDSL+                 
Subjt:  YPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------

Query:  ------------------RQNASRLYTLRKQIHECKQGSMD-------------EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD------
                          RQNASRLYTLRKQ+H CKQG++D             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD      
Subjt:  ------------------RQNASRLYTLRKQIHECKQGSMD-------------EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD------

Query:  --------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN--------
                       VR  EDR++AM +  + T DSAAFSA+SS        G  +             + CWKLHGRPP GK+R  N K N        
Subjt:  --------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN--------

Query:  -------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN
               Q+   T+     P LG +   G  QSL L+++ GK PWILDS ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN
Subjt:  -------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN

Query:  -----------------------------------DLSSGKAIGTAQHNRGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------
                                           D+SSG+ IGTA+H+RGLY L+D++S     R SLLSSYFST E D          CI        
Subjt:  -----------------------------------DLSSGKAIGTAQHNRGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------

Query:  ------------------------ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGI
                                +TTSSGKRWFVTFIDDHTRLT VYL++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI
Subjt:  ------------------------ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGI

Query:  IHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        +HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAAHLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  IHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

TrEMBL top hitse value%identityAlignment
A0A5A7SQW1 Beta-galactosidase1.9e-16847.46Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        RGLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

A0A5A7U8U2 Retrovirus-related Pol polyprotein from transposon TNT 1-949.2e-16847.46Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHLQCV----------NICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L++I GK PWILDS
Subjt:  SRTGNHLQCV----------NICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY  CAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        RGLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

A0A5A7V3J5 Beta-galactosidase9.2e-16847.33Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NP T +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGE  +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        RGLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

A0A5D3E603 Beta-galactosidase4.9e-16947.59Show/hide
Query:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV
        +L+N  +  Q  +  L   L +   +DQ      +++G          LPMY  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM 
Subjt:  HLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPMYPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMV

Query:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------
        LEG ++F  LTGEI +P P D  +R WKGEDSL+                                   RQNASRLYTLRKQ+H CKQG++D        
Subjt:  LEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------------------------RQNASRLYTLRKQIHECKQGSMD--------

Query:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT
             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD                     VR  EDR++AM +  + T DSAAFSA+SS     
Subjt:  -----EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD--------------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGT

Query:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS
           G  +             + CWKLHGRPP GK+R  N K N               Q+   T+     P LG +   G  QSL L+++ GK PWILDS
Subjt:  SRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN---------------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDS

Query:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN
         ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN                                   D+SSG+ IGTA+H+
Subjt:  RATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN-----------------------------------DLSSGKAIGTAQHN

Query:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL
        RGLY L+D++S     R SLLSSYFST E D          CI                                +TTSSGKRWFVTFIDDHTRLT VYL
Subjt:  RGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------------------------------ITTSSGKRWFVTFIDDHTRLTLVYL

Query:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA
        ++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI+HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAA
Subjt:  LTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAA

Query:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        HLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  HLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

A0A5D3E6F8 Beta-galactosidase8.4e-16946.17Show/hide
Query:  APTLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPM
        AP+  QP+       P +A+ ++G           QQ S V        +L+N  +  Q  I +L   L +   +DQ      +++G          LPM
Subjt:  APTLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSG----------LPM

Query:  YPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------
        Y  NPVT +P  + +++++G +G+ST    GEKLNGQNYFSWSQS+KM LEG ++F  LTGEI +P P D  +R WKGEDSL+                 
Subjt:  YPDNPVTIYPTLTTTHHMSGQMGNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLL-----------------

Query:  ------------------RQNASRLYTLRKQIHECKQGSMD-------------EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD------
                          RQNASRLYTLRKQ+H CKQG++D             EMDLCRE +WD      QY KLEE DRVYDFLAGLN KFD      
Subjt:  ------------------RQNASRLYTLRKQIHECKQGSMD-------------EMDLCRELIWDCSCGGVQYYKLEEVDRVYDFLAGLNSKFD------

Query:  --------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN--------
                       VR  EDR++AM +  + T DSAAFSA+SS        G  +             + CWKLHGRPP GK+R  N K N        
Subjt:  --------------VVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHL----------QCVNICWKLHGRPPNGKRRPPNNKPN--------

Query:  -------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN
               Q+   T+     P LG +   G  QSL L+++ GK PWILDS ATDHL+G+S +FISY PCAGNEKIRIA+ +L  +AGKG + P+DG  LQN
Subjt:  -------QALRTTLVIPVLPPLGQLHNQGTSQSLSLLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQN

Query:  -----------------------------------DLSSGKAIGTAQHNRGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------
                                           D+SSG+ IGTA+H+RGLY L+D++S     R SLLSSYFST E D          CI        
Subjt:  -----------------------------------DLSSGKAIGTAQHNRGLYFLNDNSS----YRTSLLSSYFSTFEND----------CI--------

Query:  ------------------------ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGI
                                +TTSSGKRWFVTFIDDHTRLT VYL++DKS+V  IFQ FY TI+T+F+TKIAIL+S NGREF  + L EFL+SKGI
Subjt:  ------------------------ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGI

Query:  IHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG
        +HQ+SCAYTPQQNGVAERKNRHL+EVARSLML TSLPSYLWGDA+LTAAHLINR+PSRIL+ QTPL+CLK SYP+TRL+ +VPLRVF CTA+VH+FG
Subjt:  IHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFG

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.9e-2540.59Show/hide
Query:  TSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEV
        T   K +FV F+D  T   + YL+  KS V  +FQ F    E  FN K+  L   NGRE+L+N + +F   KGI +  +  +TPQ NGV+ER  R + E 
Subjt:  TSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEV

Query:  ARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRIL--NFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH
        AR+++    L    WG+AVLTA +LINRIPSR L  + +TP        P  +      LRVF  T +VH
Subjt:  ARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRIL--NFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-2838.22Show/hide
Query:  NSSYRTSLLSSYFSTFENDCIITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQ
        +S  + ++L   +S       I +  G ++FVTFIDD +R   VY+L  K +V  +FQ+F+  +E +   K+  L+S NG E+ +    E+ SS GI H+
Subjt:  NSSYRTSLLSSYFSTFENDCIITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQ

Query:  SSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH
         +   TPQ NGVAER NR ++E  RS++ +  LP   WG+AV TA +LINR PS  L F+ P         T + +    L+VF C AF H
Subjt:  SSCAYTPQQNGVAERKNRHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH

Q12491 Transposon Ty2-B Gag-Pol polyprotein3.3e-1332.82Show/hide
Query:  SGKRWFVTFIDDHTRLTLVYLLTDKSKVSF--IFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEV
        S   +F++F D+ TR   VY L D+ + S   +F      I+ +FN ++ ++Q   G E+   TLH+F +++GI    +     + +GVAER NR LL  
Subjt:  SGKRWFVTFIDDHTRLTLVYLLTDKSKVSF--IFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEV

Query:  ARSLMLLTSLPSYLWGDAVLTAAHLINRIPS
         R+L+  + LP++LW  AV  +  + N + S
Subjt:  ARSLMLLTSLPSYLWGDAVLTAAHLINRIPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.7e-2236.31Show/hide
Query:  ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLL
        I +    R++V F+D  TR T +Y L  KS+V   F  F   +E +F T+I    S NG EF+   L E+ S  GI H +S  +TP+ NG++ERK+RH++
Subjt:  ITTSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLL

Query:  EVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAF
        E   +L+   S+P   W  A   A +LINR+P+ +L  ++P   L  + P         LRVF C  +
Subjt:  EVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.9e-2237.89Show/hide
Query:  RWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLM
        R++V F+D  TR T +Y L  KS+V   F  F + +E +F T+I  L S NG EF+   L ++LS  GI H +S  +TP+ NG++ERK+RH++E+  +L+
Subjt:  RWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLM

Query:  LLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAF
           S+P   W  A   A +LINR+P+ +L  Q+P   L    P         L+VF C  +
Subjt:  LLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAF

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0537.33Show/hide
Query:  NRHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH
        NR ++E  RS++    LP     DA  TA H+IN+ PS  +NF  P      S PT        LR F C A++H
Subjt:  NRHLLEVARSLMLLTSLPSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAAACCCGCCGCCGCCGCTACCTCCGAGTGAACCATCCAAGCCACCGTCGACCGCGACCTCTAATCACTCAACCCAGAAGTTGCCGCCGCCCTCTTTCGAAAT
CCGATCGTTGCCAACGGGGAATGGTGAAGCCGCATTCGATTTTGCAAGATTGAGTCTCGATCCAAGTATCGCTCTTGAGTGTCGCTACGAATCGCATCTCGCTCCAACTC
TCGCTCCAACTCTCGCTCAGACCGCAGTCGAAGGAACCTCTCAATCAGCGAGAGTGAATCTCGCTTCAACTCTCGCTCCAAATCCAGTGAGAGTGCATCTCGCTGCTCCG
ACTCTCGCTCAACCTGCAATGGATCTTTGGGAGTATTTCCCTCCTACAGCTGCTGCCACTGCTGGAATTTCTCACCAAGGGGTCGGAAACTACTATGAGCAACAAGGATC
GGCTGTCGCTTCCTCGGCTATGGTTCAACAGCACCTTGCTAACCTACAAGCAAGTTTCCAACAACAAATTGCAGCTCTTGGGGCAGCCCTTGGTGCTTCAACCACTCTAG
ATCAAAGTGCAGAAAGTTTAGGTGTTCAATCAGGGTTACCGATGTATCCAGACAATCCGGTAACTATCTACCCTACCTTAACAACTACACATCATATGTCTGGACAAATG
GGTAACTCTACATGGTTAATTGTTGGTGAAAAATTGAATGGCCAAAACTATTTCTCCTGGTCTCAGTCAGTTAAAATGGTGCTTGAAGGGTGCCACAAGTTCGAATGCCT
GACTGGTGAAATACCTAAACCCAGACCCAAAGATCCTCAGAAGCGTTTTTGGAAGGGAGAAGATTCATTACTTCGGCAGAATGCATCACGACTTTACACTCTGAGAAAAC
AAATCCATGAATGTAAACAGGGATCCATGGATGAGATGGACTTATGTCGTGAGCTTATCTGGGACTGTTCTTGTGGAGGAGTCCAATATTATAAACTTGAAGAGGTTGAC
CGTGTTTATGATTTCTTAGCAGGCCTCAATTCTAAGTTTGATGTTGTACGGAGTTCGGAAGATAGATCAAGCGCCATGAATATTACTGCCTCTTCCACAGCTGATTCTGC
TGCATTTAGTGCGAAATCATCTGGAACTACTGGGACAAGTAGAACAGGAAACCACCTCCAGTGTGTGAACATTTGTTGGAAGTTACACGGTCGGCCCCCGAATGGAAAAC
GACGACCTCCAAATAACAAGCCCAACCAGGCTTTGAGAACCACCCTAGTGATACCAGTACTTCCTCCCTTGGGGCAATTGCACAATCAGGGTACTTCTCAATCCTTGAGT
CTCCTCAATATTACAGGTAAGAAACCTTGGATTCTTGACTCAAGAGCTACAGACCATTTATCTGGAACTTCTGCAAACTTCATATCTTATCATCCGTGTGCCGGTAATGA
GAAAATTCGGATTGCTGAGAGGACACTTACCCTAGTGGCCGGCAAGGGTCATGTTTCTCCTTATGATGGTTTAATATTACAGAATGACTTGAGCTCGGGGAAGGCAATTG
GCACTGCCCAGCACAATAGAGGACTCTATTTCCTTAATGATAATTCTTCCTATAGGACTAGTCTGCTATCTTCCTACTTTTCAACTTTTGAAAATGACTGTATAATTACC
ACTTCCTCTGGTAAACGTTGGTTTGTCACCTTCATTGATGACCACACTCGTCTTACTTTGGTTTACCTTCTTACAGATAAATCTAAAGTCTCATTCATCTTCCAACAATT
TTACACCACCATTGAAACTAAGTTTAATACCAAAATTGCCATCCTTCAGAGTGGCAATGGTCGTGAATTCCTTACCAATACCCTCCATGAGTTCTTATCCTCTAAAGGCA
TTATTCACCAGAGTTCATGTGCTTATACTCCCCAACAAAATGGAGTGGCTGAAAGGAAAAATCGTCATCTCCTTGAAGTTGCTCGATCTCTCATGTTGTTGACCTCTCTA
CCGTCTTACTTGTGGGGGGATGCAGTCTTGACTGCCGCCCATCTCATTAACCGGATACCTTCCCGCATTCTTAATTTCCAAACTCCTCTAAACTGCCTCAAATTGTCTTA
TCCGACCACTCGCCTAATACCTGACGTCCCTCTCCGAGTATTTGAGTGTACTGCATTTGTCCATAGCTTTGGTGTCACACCCCCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAAACCCGCCGCCGCCGCTACCTCCGAGTGAACCATCCAAGCCACCGTCGACCGCGACCTCTAATCACTCAACCCAGAAGTTGCCGCCGCCCTCTTTCGAAAT
CCGATCGTTGCCAACGGGGAATGGTGAAGCCGCATTCGATTTTGCAAGATTGAGTCTCGATCCAAGTATCGCTCTTGAGTGTCGCTACGAATCGCATCTCGCTCCAACTC
TCGCTCCAACTCTCGCTCAGACCGCAGTCGAAGGAACCTCTCAATCAGCGAGAGTGAATCTCGCTTCAACTCTCGCTCCAAATCCAGTGAGAGTGCATCTCGCTGCTCCG
ACTCTCGCTCAACCTGCAATGGATCTTTGGGAGTATTTCCCTCCTACAGCTGCTGCCACTGCTGGAATTTCTCACCAAGGGGTCGGAAACTACTATGAGCAACAAGGATC
GGCTGTCGCTTCCTCGGCTATGGTTCAACAGCACCTTGCTAACCTACAAGCAAGTTTCCAACAACAAATTGCAGCTCTTGGGGCAGCCCTTGGTGCTTCAACCACTCTAG
ATCAAAGTGCAGAAAGTTTAGGTGTTCAATCAGGGTTACCGATGTATCCAGACAATCCGGTAACTATCTACCCTACCTTAACAACTACACATCATATGTCTGGACAAATG
GGTAACTCTACATGGTTAATTGTTGGTGAAAAATTGAATGGCCAAAACTATTTCTCCTGGTCTCAGTCAGTTAAAATGGTGCTTGAAGGGTGCCACAAGTTCGAATGCCT
GACTGGTGAAATACCTAAACCCAGACCCAAAGATCCTCAGAAGCGTTTTTGGAAGGGAGAAGATTCATTACTTCGGCAGAATGCATCACGACTTTACACTCTGAGAAAAC
AAATCCATGAATGTAAACAGGGATCCATGGATGAGATGGACTTATGTCGTGAGCTTATCTGGGACTGTTCTTGTGGAGGAGTCCAATATTATAAACTTGAAGAGGTTGAC
CGTGTTTATGATTTCTTAGCAGGCCTCAATTCTAAGTTTGATGTTGTACGGAGTTCGGAAGATAGATCAAGCGCCATGAATATTACTGCCTCTTCCACAGCTGATTCTGC
TGCATTTAGTGCGAAATCATCTGGAACTACTGGGACAAGTAGAACAGGAAACCACCTCCAGTGTGTGAACATTTGTTGGAAGTTACACGGTCGGCCCCCGAATGGAAAAC
GACGACCTCCAAATAACAAGCCCAACCAGGCTTTGAGAACCACCCTAGTGATACCAGTACTTCCTCCCTTGGGGCAATTGCACAATCAGGGTACTTCTCAATCCTTGAGT
CTCCTCAATATTACAGGTAAGAAACCTTGGATTCTTGACTCAAGAGCTACAGACCATTTATCTGGAACTTCTGCAAACTTCATATCTTATCATCCGTGTGCCGGTAATGA
GAAAATTCGGATTGCTGAGAGGACACTTACCCTAGTGGCCGGCAAGGGTCATGTTTCTCCTTATGATGGTTTAATATTACAGAATGACTTGAGCTCGGGGAAGGCAATTG
GCACTGCCCAGCACAATAGAGGACTCTATTTCCTTAATGATAATTCTTCCTATAGGACTAGTCTGCTATCTTCCTACTTTTCAACTTTTGAAAATGACTGTATAATTACC
ACTTCCTCTGGTAAACGTTGGTTTGTCACCTTCATTGATGACCACACTCGTCTTACTTTGGTTTACCTTCTTACAGATAAATCTAAAGTCTCATTCATCTTCCAACAATT
TTACACCACCATTGAAACTAAGTTTAATACCAAAATTGCCATCCTTCAGAGTGGCAATGGTCGTGAATTCCTTACCAATACCCTCCATGAGTTCTTATCCTCTAAAGGCA
TTATTCACCAGAGTTCATGTGCTTATACTCCCCAACAAAATGGAGTGGCTGAAAGGAAAAATCGTCATCTCCTTGAAGTTGCTCGATCTCTCATGTTGTTGACCTCTCTA
CCGTCTTACTTGTGGGGGGATGCAGTCTTGACTGCCGCCCATCTCATTAACCGGATACCTTCCCGCATTCTTAATTTCCAAACTCCTCTAAACTGCCTCAAATTGTCTTA
TCCGACCACTCGCCTAATACCTGACGTCCCTCTCCGAGTATTTGAGTGTACTGCATTTGTCCATAGCTTTGGTGTCACACCCCCTCCTTGA
Protein sequenceShow/hide protein sequence
MVSNPPPPLPPSEPSKPPSTATSNHSTQKLPPPSFEIRSLPTGNGEAAFDFARLSLDPSIALECRYESHLAPTLAPTLAQTAVEGTSQSARVNLASTLAPNPVRVHLAAP
TLAQPAMDLWEYFPPTAAATAGISHQGVGNYYEQQGSAVASSAMVQQHLANLQASFQQQIAALGAALGASTTLDQSAESLGVQSGLPMYPDNPVTIYPTLTTTHHMSGQM
GNSTWLIVGEKLNGQNYFSWSQSVKMVLEGCHKFECLTGEIPKPRPKDPQKRFWKGEDSLLRQNASRLYTLRKQIHECKQGSMDEMDLCRELIWDCSCGGVQYYKLEEVD
RVYDFLAGLNSKFDVVRSSEDRSSAMNITASSTADSAAFSAKSSGTTGTSRTGNHLQCVNICWKLHGRPPNGKRRPPNNKPNQALRTTLVIPVLPPLGQLHNQGTSQSLS
LLNITGKKPWILDSRATDHLSGTSANFISYHPCAGNEKIRIAERTLTLVAGKGHVSPYDGLILQNDLSSGKAIGTAQHNRGLYFLNDNSSYRTSLLSSYFSTFENDCIIT
TSSGKRWFVTFIDDHTRLTLVYLLTDKSKVSFIFQQFYTTIETKFNTKIAILQSGNGREFLTNTLHEFLSSKGIIHQSSCAYTPQQNGVAERKNRHLLEVARSLMLLTSL
PSYLWGDAVLTAAHLINRIPSRILNFQTPLNCLKLSYPTTRLIPDVPLRVFECTAFVHSFGVTPPP