; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh05G002090 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh05G002090
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionTransposon Ty1-LR2 Gag-Pol polyprotein
Genome locationCmo_Chr05:905025..907163
RNA-Seq ExpressionCmoCh05G002090
SyntenyCmoCh05G002090
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAF23632.1 Os08g0389500 [Oryza sativa Japonica Group]7.2e-24459.07Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        MTG RSAFSEL++GIRGTVKFGDGSVV IEGRGT+LF  K GEH+ L  VY IPRL  N+VSLGQLDE     S E G+LKI + QRRLL +  R+ NRL
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        YV++L I +PV L+A+  +++WRWHAR+GHLNF ALEKL +  +V GLP I  V+++CD CL+GKQRR PFPS+  YRA E LELVHGDICGP+ PATP 
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        G  LFLLLVDD SR+MWL LL +K +A+ A+KR  A AEAE  +K+R LRTDRGGEFT+ +F++YC E GIQRHLTAPY+PQQNGVVERRNQT++G ARS
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        ++    +PG FWGEAV TAV+LLNR+PT+ +DGKTP+E W+  KP VH  R FGCVA++K     LAKLD R + +VF+GYE G+KAYR Y+PV  R HV
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWNDVIEA--DRDPNQFTVEYLVTEPE-EGG-------AQHQETSPP-----------------------PAGAPPEPVEFATPRTAD
        SRD VF+E   W+W     A  D D   F VE+L T P  +GG       A  + TS P                       PA A    +EFA+P   D
Subjt:  SRDVVFDENTFWQWNDVIEA--DRDPNQFTVEYLVTEPE-EGG-------AQHQETSPP-----------------------PAGAPPEPVEFATPRTAD

Query:  STLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKR
          LD DHD D+  R+R +D+L+G   PPGLA RE+ E   L     DEP T  EA++   WR+AM EEM SI  N+TWSL ++P G RAIGLKWVFK+K+
Subjt:  STLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKR

Query:  NEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQ
        +E G + KHKARLVAKGYVQ+QG+D+EEVFAPVAR+ESVR LLA+AAH SW VHHMDVKSAFLNG+L E VYV+QPPGF+   +  KVL+LHKALYGL+Q
Subjt:  NEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQ

Query:  APRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG
        APRAWN+KLD +LL L F R   EHG+YT   G
Subjt:  APRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG

CAE03692.2 OSJNBb0026E15.10 [Oryza sativa Japonica Group]3.6e-25961.68Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        MTG+RSAF+ELD+ + GTV+FGDGSVV IEGR T+LF  + GEHR +  VY+IPRL AN+VSLGQLD +G  + I  G+L + D +  LL + RR+ + L
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        Y ++L+ID+PV L+A++ E +WRWHARYGHLNFPAL KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP 
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        G   FLLLVDD SR+MWLT++++K EAA A+K  +ARAE E  +K+R LR DRG EFTS  F +YC  +G+ R LTAPYSPQQNGVVERRNQTIV TARS
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY + P VH  R FGCV ++K+T+P L KLD R   +V +GYE GSKAYRLYDPV  R HV
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWNDVI-EADRDPNQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRM
        SRDVVFDE+  W W  V  +       FTVE +VT               P         T  PP+   PE VEF TP T DS LDAD D D+  RYR +
Subjt:  SRDVVFDENTFWQWNDVI-EADRDPNQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRM

Query:  DDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGY
        D+L+G   PPG A R LE++ ELH VSADEP + AEAE +P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKWV+KLKR+E+G +V++KARLVAKGY
Subjt:  DDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGY

Query:  VQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNF
        VQ+QGVDF+EVFA VARLESVR LLA+AAH  W+VHHMDVKSAFLNGEL E VYV QPPGF+D+++ NKV RLHKALYGLRQAPRAWNAKLD +LLSL F
Subjt:  VQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNF

Query:  KRCASEHGMYTYGHG
         R +SEHG+YT   G
Subjt:  KRCASEHGMYTYGHG

CAH66352.1 OSIGBa0135C09.3 [Oryza sativa]1.5e-24960.2Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        MTG+RSAF++LD+ + GTV+FGDGSVV IEGRGT+LF  + GEHR +  VY+IPRL AN+VSLGQLD +G  + I  G+L++ D +  LL + RR+ + L
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        Y ++L ID+PV L+A++ + +WRWHARYGHLNFP+L KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP 
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        G   FLLLVDD SR+MWLTL+++K EAA A+K  +A AE E  +K+R LRTDRGGEFTS  F +YC  + + R LTAPYSPQQNGVVERRNQTIV TARS
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY ++P VH  R FGCV ++K+T+P L KLD R   +V +GYE GSKAYRLYDPV  R HV
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWNDVIEADRDP--NQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRR
        SRDVVFDE+  W W   +  D  P    FTVE +VT               P         T  PP+   PE VEF TP T DS LDAD D D+  RY  
Subjt:  SRDVVFDENTFWQWNDVIEADRDP--NQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRR

Query:  MDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKG
        +D+L+G   PPG A R LE++ ELH VSADEP + AEAE +P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKW                ARLVAKG
Subjt:  MDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKG

Query:  YVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLN
        YVQ+QGVDF+EVFAPVARLE VR LLAIAAH  W+VHHMDVKSAFLNGEL E VYV QPPGF+D+++ NKV RLHKALYGLRQAPRAWN KLD +LLSL 
Subjt:  YVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLN

Query:  FKRCASEHGMYTYGHG
        F R +SEHG+YT   G
Subjt:  FKRCASEHGMYTYGHG

EEC67008.1 hypothetical protein OsI_33720 [Oryza sativa Indica Group]2.5e-23657.26Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN++S+GQLDETG  +  E G++++ D QRRLL +  RT  RL
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        Y+L++ + +PV L+A  +E +WRWHAR GH+NF  L K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+DEPL L+HGD+CGPI PATP 
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K T P+L KLD R   ++F+GYEPGSKAYR YDP   R H+
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWNDVIEADRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------VEFATPRT-ADSTLDADHDTD
        SRD+VFDE   W W+    AD D   F VEY  V  P       Q+   PPA +   P                     VEF +P T A + LDADHD D
Subjt:  SRDVVFDENTFWQWNDVIEADRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------VEFATPRT-ADSTLDADHDTD

Query:  LEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK
           R+R MD+++G    PGLA RE++E  EL  VS +EP TFA+AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Subjt:  LEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK

Query:  ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLD
        ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD
Subjt:  ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLD

Query:  GTLLSLNFKRCASEHGMYTYGHG
         TL  L FK+   EHG+Y  G G
Subjt:  GTLLSLNFKRCASEHGMYTYGHG

RLN27572.1 hypothetical protein C2845_PM05G34840 [Panicum miliaceum]1.9e-23655.7Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        M+GARSAF+ELD+G+ G V+FGDGS + IEGRGT+LF  K GEH+KLT VY IPRL+AN++SLGQL+E    I +E G LKI D   RL+   RR  NRL
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        YVL   +D+PV L+A+ EE SWRWHARYG L F  L+KL K E+V GLP I+ V+++CDGCL+GKQRR PFP+ + +RA   LELVH D+CGP+ P TPG
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        GK LFLL +DD SR+MWL LL  K EAA A+  +++RAEAE  +K+  LRTDRGGEFT+ +F +YC   GIQRHLTAPY+PQQNGVVERRNQT++G AR 
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        +L    +PGRFWGEAV  AV++LNR+PT++L   TPYEAWY  KP VH FR+FGCVA++KV   HL KLD R   +V IGYEPG+KAYRLY+P   R HV
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWND---VIEADRDPNQFTVEYLVTEPEEGGAQHQETSPPPA--------------------------------------GAP-----
        SRDVVF+E   W W +      A  D + F VEY  T    G     E  P  A                                      GAP     
Subjt:  SRDVVFDENTFWQWND---VIEADRDPNQFTVEYLVTEPEEGGAQHQETSPPPA--------------------------------------GAP-----

Query:  --------PEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWS
                P  VEF +P +    LD DHD     R+R++DD++   +P       +E+   + AV   EP  F EA +   W KAM+EEMTSI +N TW 
Subjt:  --------PEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWS

Query:  LEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGF
        L D+P GH+ IGLKWVFKLKR+E G+VVKHKARLVAKGYVQ+QG+DF+EVFAPVAR+ESVR LLA AA   W VHHMDVKSAFLNGELKE VYV+QPPGF
Subjt:  LEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGF

Query:  LDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG
        +   +  KVL+L+KALYGLRQAPRAWN KLD +L  + F RC SEHGMYT G G
Subjt:  LDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG

TrEMBL top hitse value%identityAlignment
A0A3L6ST08 Integrase catalytic domain-containing protein9.2e-23755.7Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        M+GARSAF+ELD+G+ G V+FGDGS + IEGRGT+LF  K GEH+KLT VY IPRL+AN++SLGQL+E    I +E G LKI D   RL+   RR  NRL
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        YVL   +D+PV L+A+ EE SWRWHARYG L F  L+KL K E+V GLP I+ V+++CDGCL+GKQRR PFP+ + +RA   LELVH D+CGP+ P TPG
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        GK LFLL +DD SR+MWL LL  K EAA A+  +++RAEAE  +K+  LRTDRGGEFT+ +F +YC   GIQRHLTAPY+PQQNGVVERRNQT++G AR 
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        +L    +PGRFWGEAV  AV++LNR+PT++L   TPYEAWY  KP VH FR+FGCVA++KV   HL KLD R   +V IGYEPG+KAYRLY+P   R HV
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWND---VIEADRDPNQFTVEYLVTEPEEGGAQHQETSPPPA--------------------------------------GAP-----
        SRDVVF+E   W W +      A  D + F VEY  T    G     E  P  A                                      GAP     
Subjt:  SRDVVFDENTFWQWND---VIEADRDPNQFTVEYLVTEPEEGGAQHQETSPPPA--------------------------------------GAP-----

Query:  --------PEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWS
                P  VEF +P +    LD DHD     R+R++DD++   +P       +E+   + AV   EP  F EA +   W KAM+EEMTSI +N TW 
Subjt:  --------PEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWS

Query:  LEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGF
        L D+P GH+ IGLKWVFKLKR+E G+VVKHKARLVAKGYVQ+QG+DF+EVFAPVAR+ESVR LLA AA   W VHHMDVKSAFLNGELKE VYV+QPPGF
Subjt:  LEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGF

Query:  LDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG
        +   +  KVL+L+KALYGLRQAPRAWN KLD +L  + F RC SEHGMYT G G
Subjt:  LDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG

A0B9X7 OSIGBa0135C09.3 protein7.2e-25060.2Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        MTG+RSAF++LD+ + GTV+FGDGSVV IEGRGT+LF  + GEHR +  VY+IPRL AN+VSLGQLD +G  + I  G+L++ D +  LL + RR+ + L
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        Y ++L ID+PV L+A++ + +WRWHARYGHLNFP+L KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP 
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        G   FLLLVDD SR+MWLTL+++K EAA A+K  +A AE E  +K+R LRTDRGGEFTS  F +YC  + + R LTAPYSPQQNGVVERRNQTIV TARS
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY ++P VH  R FGCV ++K+T+P L KLD R   +V +GYE GSKAYRLYDPV  R HV
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWNDVIEADRDP--NQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRR
        SRDVVFDE+  W W   +  D  P    FTVE +VT               P         T  PP+   PE VEF TP T DS LDAD D D+  RY  
Subjt:  SRDVVFDENTFWQWNDVIEADRDP--NQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRR

Query:  MDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKG
        +D+L+G   PPG A R LE++ ELH VSADEP + AEAE +P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKW                ARLVAKG
Subjt:  MDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKG

Query:  YVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLN
        YVQ+QGVDF+EVFAPVARLE VR LLAIAAH  W+VHHMDVKSAFLNGEL E VYV QPPGF+D+++ NKV RLHKALYGLRQAPRAWN KLD +LLSL 
Subjt:  YVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLN

Query:  FKRCASEHGMYTYGHG
        F R +SEHG+YT   G
Subjt:  FKRCASEHGMYTYGHG

B8BH06 Integrase catalytic domain-containing protein1.2e-23657.26Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        MTG+R AF++LD+ I G V+ GDGSVV I GRGTILF  K GEHR L++ Y++PRL AN++S+GQLDETG  +  E G++++ D QRRLL +  RT  RL
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        Y+L++ + +PV L+A  +E +WRWHAR GH+NF  L K+ K+ELV GLP +  V+++C+ CL GK RR+PFP +   R+DEPL L+HGD+CGPI PATP 
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        G   FLLLVDD SR+MW+ LL  K  A  A+KRI+A AE +  +K+R LRTDRGGEFTS  F++YC E+G++R LTAPYSPQQNGVVERRNQ++VGTARS
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        +L   G+PG FWGEA+ TAVYLLNRS ++ + GKTPY  W    P VHH R FGCVA++K T P+L KLD R   ++F+GYEPGSKAYR YDP   R H+
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWNDVIEADRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------VEFATPRT-ADSTLDADHDTD
        SRD+VFDE   W W+    AD D   F VEY  V  P       Q+   PPA +   P                     VEF +P T A + LDADHD D
Subjt:  SRDVVFDENTFWQWNDVIEADRDPNQFTVEY-LVTEPEEGGAQHQETSPPPAGAPPEP---------------------VEFATPRT-ADSTLDADHDTD

Query:  LEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK
           R+R MD+++G    PGLA RE++E  EL  VS +EP TFA+AE++  WR+AM +E++SI EN+TW L D+P GHR IGLKWV+KLK++ +G VVKHK
Subjt:  LEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHK

Query:  ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLD
        ARLVAKGYVQ+ G+DF+EVFAPVARL+SVR LLA+AA   W VHHMDVKSAFLNGEL E VYV QPPGF  +   NKV RL KALYGLRQAPRAWN KLD
Subjt:  ARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLD

Query:  GTLLSLNFKRCASEHGMYTYGHG
         TL  L FK+   EHG+Y  G G
Subjt:  GTLLSLNFKRCASEHGMYTYGHG

Q0J5Y3 Os08g0389500 protein3.5e-24459.07Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        MTG RSAFSEL++GIRGTVKFGDGSVV IEGRGT+LF  K GEH+ L  VY IPRL  N+VSLGQLDE     S E G+LKI + QRRLL +  R+ NRL
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        YV++L I +PV L+A+  +++WRWHAR+GHLNF ALEKL +  +V GLP I  V+++CD CL+GKQRR PFPS+  YRA E LELVHGDICGP+ PATP 
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        G  LFLLLVDD SR+MWL LL +K +A+ A+KR  A AEAE  +K+R LRTDRGGEFT+ +F++YC E GIQRHLTAPY+PQQNGVVERRNQT++G ARS
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        ++    +PG FWGEAV TAV+LLNR+PT+ +DGKTP+E W+  KP VH  R FGCVA++K     LAKLD R + +VF+GYE G+KAYR Y+PV  R HV
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWNDVIEA--DRDPNQFTVEYLVTEPE-EGG-------AQHQETSPP-----------------------PAGAPPEPVEFATPRTAD
        SRD VF+E   W+W     A  D D   F VE+L T P  +GG       A  + TS P                       PA A    +EFA+P   D
Subjt:  SRDVVFDENTFWQWNDVIEA--DRDPNQFTVEYLVTEPE-EGG-------AQHQETSPP-----------------------PAGAPPEPVEFATPRTAD

Query:  STLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKR
          LD DHD D+  R+R +D+L+G   PPGLA RE+ E   L     DEP T  EA++   WR+AM EEM SI  N+TWSL ++P G RAIGLKWVFK+K+
Subjt:  STLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKR

Query:  NEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQ
        +E G + KHKARLVAKGYVQ+QG+D+EEVFAPVAR+ESVR LLA+AAH SW VHHMDVKSAFLNG+L E VYV+QPPGF+   +  KVL+LHKALYGL+Q
Subjt:  NEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQ

Query:  APRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG
        APRAWN+KLD +LL L F R   EHG+YT   G
Subjt:  APRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG

Q7XPB1 OSJNBb0026E15.10 protein1.7e-25961.68Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL
        MTG+RSAF+ELD+ + GTV+FGDGSVV IEGR T+LF  + GEHR +  VY+IPRL AN+VSLGQLD +G  + I  G+L + D +  LL + RR+ + L
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRL

Query:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG
        Y ++L+ID+PV L+A++ E +WRWHARYGHLNFPAL KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP 
Subjt:  YVLELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPG

Query:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS
        G   FLLLVDD SR+MWLT++++K EAA A+K  +ARAE E  +K+R LR DRG EFTS  F +YC  +G+ R LTAPYSPQQNGVVERRNQTIV TARS
Subjt:  GKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARS

Query:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV
        ++   G+PGRFWGEA+ TAV+LLNRSPT+SLD +TPYEAWY + P VH  R FGCV ++K+T+P L KLD R   +V +GYE GSKAYRLYDPV  R HV
Subjt:  LLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHV

Query:  SRDVVFDENTFWQWNDVI-EADRDPNQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRM
        SRDVVFDE+  W W  V  +       FTVE +VT               P         T  PP+   PE VEF TP T DS LDAD D D+  RYR +
Subjt:  SRDVVFDENTFWQWNDVI-EADRDPNQFTVEYLVT--------------EPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRM

Query:  DDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGY
        D+L+G   PPG A R LE++ ELH VSADEP + AEAE +P WR AMQ+E+ +I +N TWSL D+P GHRAIGLKWV+KLKR+E+G +V++KARLVAKGY
Subjt:  DDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGY

Query:  VQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNF
        VQ+QGVDF+EVFA VARLESVR LLA+AAH  W+VHHMDVKSAFLNGEL E VYV QPPGF+D+++ NKV RLHKALYGLRQAPRAWNAKLD +LLSL F
Subjt:  VQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNF

Query:  KRCASEHGMYTYGHG
         R +SEHG+YT   G
Subjt:  KRCASEHGMYTYGHG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-9230.08Show/hide
Query:  VEIEGRGTILFISKGG------EHR-KLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEE
        + +  +G  ++ +K G      +H   L DV F      NL+S+ +L E G  I  ++  + I  N   ++ +     N + V+     Q  S++AK + 
Subjt:  VEIEGRGTILFISKGG------EHR-KLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAKTEE

Query:  VSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVN---KLCDGCLIGKQRRTPFPS-RTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRF
            WH R+GH++   L ++++K +      +  +    ++C+ CL GKQ R PF   +       PL +VH D+CGPI P T   K+ F++ VD  + +
Subjt:  VSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVN---KLCDGCLIGKQRRTPFPS-RTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRF

Query:  MWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEA
            L++ KS+     +   A++EA    K+  L  D G E+ S    ++C + GI  HLT P++PQ NGV ER  +TI   AR+++  A +   FWGEA
Subjt:  MWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEA

Query:  VMTAVYLLNRSPTRSL--DGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFD------
        V+TA YL+NR P+R+L    KTPYE W+NKKP + H RVFG   Y+ +      K D +  K +F+GYEP    ++L+D V  +  V+RDVV D      
Subjt:  VMTAVYLLNRSPTRSL--DGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFD------

Query:  ------ENTFWQWNDVIEADRDPNQFTVEYLVTEPEEG----------GAQHQETSPPPAGA--------PPEPVEFATPRTADSTLDADHDTDLEARYR
              E  F + +   E    PN          P E            ++  E    P  +        P E  E    +    + +++     E++ R
Subjt:  ------ENTFWQWNDVIEADRDPNQFTVEYLVTEPEEG----------GAQHQETSPPPAGA--------PPEPVEFATPRTADSTLDADHDTDLEARYR

Query:  RMDDLV----GGGEP---------------------------------------PGLAARELEE-----VAELHAVSADEPNTFAE---AEKNPCWRKAM
        + DD +    G G P                                       P ++  E +      V   H +  D PN+F E    +    W +A+
Subjt:  RMDDLV----GGGEP---------------------------------------PGLAARELEE-----VAELHAVSADEPNTFAE---AEKNPCWRKAM

Query:  QEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNG
          E+ +   N TW++   P     +  +WVF +K NE G  +++KARLVA+G+ QK  +D+EE FAPVAR+ S RF+L++   ++ +VH MDVK+AFLNG
Subjt:  QEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNG

Query:  ELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG
         LKE +Y+R P G   + N + V +L+KA+YGL+QA R W    +  L    F   + +  +Y    G
Subjt:  ELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-11436.56Show/hide
Query:  TGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLY
        T  R  F    +G  GTVK G+ S  +I G G I   +  G    L DV  +P L+ NL+S   LD  G          ++      +     R T  LY
Subjt:  TGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLY

Query:  VLELEIDQPVSLSAKTEEVSW-RWHARYGHLNFPALEKLQKKELVHGLPEIKGVN-KLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATP
            EI Q   L+A  +E+S   WH R GH++   L+ L KK L+      KG   K CD CL GKQ R  F + ++ R    L+LV+ D+CGP++  + 
Subjt:  VLELEIDQPVSLSAKTEEVSW-RWHARYGHLNFPALEKLQKKELVHGLPEIKGVN-KLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATP

Query:  GGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTAR
        GG   F+  +DD SR +W+ +L+ K +  +  ++  A  E E  +K++ LR+D GGE+TS  F +YC   GI+   T P +PQ NGV ER N+TIV   R
Subjt:  GGKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTAR

Query:  SLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAH
        S+L  A +P  FWGEAV TA YL+NRSP+  L  + P   W NK+ +  H +VFGC A+  V +    KLD + +  +FIGY      YRL+DPV  +  
Subjt:  SLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAH

Query:  VSRDVVFDENTFWQWNDVIEADRD---PNQFTVEYLVTEPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPG
         SRDVVF E+      D+ E  ++   PN  T+      P    +   E S    G  P  V     +  +   + +H T  E +++ +       E P 
Subjt:  VSRDVVFDENTFWQWNDVIEADRD---PNQFTVEYLVTEPEEGGAQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPG

Query:  LAARELEEVAELHAVSADEPNTFAEA----EKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVD
        + +R       +      EP +  E     EKN    KAMQEEM S+ +N T+ L ++P G R +  KWVFKLK++   ++V++KARLV KG+ QK+G+D
Subjt:  LAARELEEVAELHAVSADEPNTFAEA----EKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVD

Query:  FEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEH
        F+E+F+PV ++ S+R +L++AA    EV  +DVK+AFL+G+L+E +Y+ QP GF      + V +L+K+LYGL+QAPR W  K D  + S  + +  S+ 
Subjt:  FEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEH

Query:  GMY
         +Y
Subjt:  GMY

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.0e-1931.43Show/hide
Query:  HARYGHLNFPALEKLQKKELVHGLPE--IKGVNK---LCDGCLIGK--QRRTPFPSRTAYRAD-EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFM
        H   GH NF +++K  KK  V  L E  I+  N     C  CLIGK  + R    SR  Y+   EP + +H DI GP+        S F+   D+K+RF 
Subjt:  HARYGHLNFPALEKLQKKELVHGLPE--IKGVNK---LCDGCLIGK--QRRTPFPSRTAYRAD-EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFM

Query:  WLTLLQAKSEAA--EAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGE
        W+  L  + E +       I A  + +   ++ V++ DRG E+T+ +  K+    GI    T     + +GV ER N+T++   R+LL  +G+P   W  
Subjt:  WLTLLQAKSEAA--EAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGE

Query:  AVMTAVYLLN
        AV  +  + N
Subjt:  AVMTAVYLLN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.1e-8027.81Show/hide
Query:  VKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQL-DETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSL--SA
        V   DGS + I   G+    +K      L ++ ++P +  NL+S+ +L +  G  +       ++ D    +     +T + LY   +   QPVSL  S 
Subjt:  VKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQL-DETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSL--SA

Query:  KTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKL--CDGCLIGKQRRTPFPSRTAYRADEPLELVHGDI-CGPIKPATPGGKSLFLLLVDDK
         ++     WHAR GH   PA   L      + L  +   +K   C  CLI K  + PF S++   +  PLE ++ D+   PI   +      +++ VD  
Subjt:  KTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKL--CDGCLIGKQRRTPFPSRTAYRADEPLELVHGDI-CGPIKPATPGGKSLFLLLVDDK

Query:  SRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFW
        +R+ WL  L+ KS+  E     K   E   + ++    +D GGEF   +  +Y  + GI    + P++P+ NG+ ER+++ IV T  +LL  A +P  +W
Subjt:  SRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFW

Query:  GEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFW
          A   AVYL+NR PT  L  ++P++  +   P     RVFGC  Y  +   +  KLD +  + VF+GY     AY        R ++SR V FDEN F 
Subjt:  GEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFW

Query:  QWNDVIEADRDPNQFTVEYLVTEPEEGGAQHQETSPPP-------AGAPPE--PVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEP---------
          N +        Q      V  P           P P       A  PP      F   + + S LD+   +   +          G +P         
Subjt:  QWNDVIEADRDPNQFTVEYLVTEPEEGGAQHQETSPPP-------AGAPPE--PVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEP---------

Query:  -----------------PGLAARELEEVAE-------------------------------------------------------------------LHA
                         P   A+ L   A+                                                                   +  
Subjt:  -----------------PGLAARELEEVAE-------------------------------------------------------------------LHA

Query:  VSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGH-RAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFL
         +  EP T  +A K+  WR AM  E+ +   N TW L   PP H   +G +W+F  K N  G + ++KARLVAKGY Q+ G+D+ E F+PV +  S+R +
Subjt:  VSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGH-RAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFL

Query:  LAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG
        L +A   SW +  +DV +AFL G L + VY+ QPPGF+D D PN V +L KALYGL+QAPRAW  +L   LL++ F    S+  ++    G
Subjt:  LAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.7e-7627.39Show/hide
Query:  VKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIE--RGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAK
        V   DGS + I   G+   +        L  V ++P +  NL+S+ +L  T   +S+E      ++ D    +     +T + LY   +   Q VS+ A 
Subjt:  VKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIE--RGLLKICDNQRRLLTQARRTTNRLYVLELEIDQPVSLSAK

Query:  --TEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKL--CDGCLIGKQRRTPFPSRTAYRADEPLELVHGDI-CGPIKPATPGGKSLFLLLVDD
          ++     WH+R GH   P+L  L      H LP +   +KL  C  C I K  + PF S +   + +PLE ++ D+   PI   +      +++ VD 
Subjt:  --TEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKL--CDGCLIGKQRRTPFPSRTAYRADEPLELVHGDI-CGPIKPATPGGKSLFLLLVDD

Query:  KSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRF
         +R+ WL  L+ KS+  +     K+  E   + ++  L +D GGEF       Y  + GI    + P++P+ NG+ ER+++ IV    +LL  A +P  +
Subjt:  KSRFMWLTLLQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRF

Query:  WGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTF
        W  A   AVYL+NR PT  L  ++P++  + + P     +VFGC  Y  +   +  KL+ +  +  F+GY     AY       GR + SR V FDE  F
Subjt:  WGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTF

Query:  --------WQWNDVIEADRDPNQFTVEYLVTEPEEGGA-----QHQETSPPPAGAPP------------EPVEFATPRTADSTLDADHDTDLEARYRRMD
                   +    +D  PN  +   L T P    A      H +TSP P  +P                  ++P +++ T  + +     A+  +  
Subjt:  --------WQWNDVIEADRDPNQFTVEYLVTEPEEGGA-----QHQETSPPPAGAPP------------EPVEFATPRTADSTLDADHDTDLEARYRRMD

Query:  DLVGGGE------------------------------------------------------PPGLAARELEEV-----AELHAVSA--------------
        +                                                            PP L A  + +V        H+++               
Subjt:  DLVGGGE------------------------------------------------------PPGLAARELEEV-----AELHAVSA--------------

Query:  --------DEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSL-EDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLE
                 EP T  +A K+  WR+AM  E+ +   N TW L    PP    +G +W+F  K N  G + ++KARLVAKGY Q+ G+D+ E F+PV +  
Subjt:  --------DEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSL-EDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLE

Query:  SVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG
        S+R +L +A   SW +  +DV +AFL G L + VY+ QPPGF+D D P+ V RL KA+YGL+QAPRAW  +L   LL++ F    S+  ++    G
Subjt:  SVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHG

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein5.7e-0530.63Show/hide
Query:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISI-ERGLLKICDNQRRLLTQARRTTN-
        MT     F+ LD   + TV   DG+V+ +EG+G +    K G+ + + +V F+P L  N++S G++      IS   +G   +CD     L  A   T+ 
Subjt:  MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISI-ERGLLKICDNQRRLLTQARRTTN-

Query:  -----RLYVLE
             RL V+E
Subjt:  -----RLYVLE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.6e-4242.93Show/hide
Query:  ADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI
        A EP+T+ EA++   W  AM +E+ ++    TW +  +PP  + IG KWV+K+K N  G + ++KARLVAKGY Q++G+DF E F+PV +L SV+ +LAI
Subjt:  ADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI

Query:  AAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFL----DNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEH
        +A +++ +H +D+ +AFLNG+L E +Y++ PPG+     D+  PN V  L K++YGL+QA R W  K   TL+   F +  S+H
Subjt:  AAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFL----DNDNPNKVLRLHKALYGLRQAPRAWNAKLDGTLLSLNFKRCASEH

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.8e-1137.89Show/hide
Query:  NQTIVGTARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGS
        N+TI+   RS+L   G+P  F  +A  TAV+++N+ P+ +++   P E W+   PT  + R FGCVAY+        KL PR  K    G E GS
Subjt:  NQTIVGTARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.0e-1742.42Show/hide
Query:  EPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIA
        EP +   A K+P W +AMQEE+ +++ N+TW L   P     +G KWVFK K +  G + + KARLVAKG+ Q++G+ F E ++PV R  ++R +L +A
Subjt:  EPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATTCTGTT
CATCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAACTCGATGAGACAGGTTGCTTCATTT
CCATCGAGCGCGGACTACTCAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACGACAAACCGCCTTTACGTCCTGGAGTTAGAGATAGACCAACCC
GTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGG
CTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGATGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGG
AGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGTGGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTG
CTGCAAGCGAAAAGTGAGGCGGCAGAGGCGGTTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGTACAGACCGAGGCGGAGAATT
CACCTCGGCAAGTTTCAGTAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCA
TTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTCTATCTCCTCAATCGGTCACCAACCCGAAGC
CTCGACGGAAAGACGCCATATGAGGCCTGGTACAACAAAAAACCAACAGTACATCACTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCCCATCTCGC
CAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTAGGGGGGCGAGCTCACGTGTCTCGCGACG
TCGTCTTCGACGAAAACACCTTCTGGCAGTGGAATGACGTGATCGAGGCAGACCGTGATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGA
GCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGA
TCTGGAGGCTAGGTATCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAACTCGAAGAAGTGGCCGAACTACATGCTGTCAGTGCAG
ATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTAGAGGATATG
CCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCA
GAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCTTGCTAGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGG
ACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTTCTGCGCCTGCACAAA
GCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTA
CGGCCACGGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCGGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGGGATCCGTGGGACGGTGAAATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATTCTGTT
CATCAGTAAGGGAGGCGAGCATCGCAAGCTGACCGACGTCTACTTCATCCCGAGGCTCAAGGCTAACCTTGTGAGCCTGGGTCAACTCGATGAGACAGGTTGCTTCATTT
CCATCGAGCGCGGACTACTCAAAATCTGCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACGACAAACCGCCTTTACGTCCTGGAGTTAGAGATAGACCAACCC
GTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGG
CTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGATGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGG
AGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGTGGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTG
CTGCAAGCGAAAAGTGAGGCGGCAGAGGCGGTTAAGCGCATTAAAGCGCGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTGCGTACAGACCGAGGCGGAGAATT
CACCTCGGCAAGTTTCAGTAAGTACTGCGACGAGATCGGCATACAACGGCACCTAACGGCGCCCTACTCCCCCCAACAGAACGGAGTGGTAGAGCGCCGAAATCAGACCA
TTGTCGGGACAGCGAGGTCATTGTTGGTGACGGCCGGGATGCCTGGGAGATTCTGGGGAGAGGCAGTAATGACGGCTGTCTATCTCCTCAATCGGTCACCAACCCGAAGC
CTCGACGGAAAGACGCCATATGAGGCCTGGTACAACAAAAAACCAACAGTACATCACTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACGTCCCCATCTCGC
CAAGCTCGATCCCAGGGGGCTGAAGGTCGTCTTCATCGGCTACGAACCCGGGAGCAAGGCGTACAGACTCTATGATCCTGTAGGGGGGCGAGCTCACGTGTCTCGCGACG
TCGTCTTCGACGAAAACACCTTCTGGCAGTGGAATGACGTGATCGAGGCAGACCGTGATCCAAATCAATTCACGGTGGAGTACCTCGTCACCGAGCCTGAAGAAGGAGGA
GCCCAGCATCAGGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGA
TCTGGAGGCTAGGTATCGGAGGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAACTCGAAGAAGTGGCCGAACTACATGCTGTCAGTGCAG
ATGAACCAAACACCTTCGCCGAAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTAGAGGATATG
CCACCGGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAACGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCA
GAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGTAGCAAGGTTAGAATCCGTTCGTTTCTTGCTAGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGG
ACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTATGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTTCTGCGCCTGCACAAA
GCACTCTACGGGCTTCGACAAGCCCCACGAGCCTGGAACGCGAAGCTCGACGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGGCATGTACACGTA
CGGCCACGGGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGA
Protein sequenceShow/hide protein sequence
MTGARSAFSELDSGIRGTVKFGDGSVVEIEGRGTILFISKGGEHRKLTDVYFIPRLKANLVSLGQLDETGCFISIERGLLKICDNQRRLLTQARRTTNRLYVLELEIDQP
VSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTL
LQAKSEAAEAVKRIKARAEAECEKKMRVLRTDRGGEFTSASFSKYCDEIGIQRHLTAPYSPQQNGVVERRNQTIVGTARSLLVTAGMPGRFWGEAVMTAVYLLNRSPTRS
LDGKTPYEAWYNKKPTVHHFRVFGCVAYMKVTRPHLAKLDPRGLKVVFIGYEPGSKAYRLYDPVGGRAHVSRDVVFDENTFWQWNDVIEADRDPNQFTVEYLVTEPEEGG
AQHQETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAARELEEVAELHAVSADEPNTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDM
PPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHK
ALYGLRQAPRAWNAKLDGTLLSLNFKRCASEHGMYTYGHGYPTVTEFQTLCL