| GenBank top hits | e value | %identity | Alignment |
|---|
| EEF44287.1 conserved hypothetical protein [Ricinus communis] | 8.2e-90 | 50.93 | Show/hide |
Query: MARKLRRSPRPLPRRTVDYAS--DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPL
M RK + S R+++ ++S DY S SPSQS Y SN+DD + + QP +S + L S+ S SNS PN + YIN+APL
Subjt: MARKLRRSPRPLPRRTVDYAS--DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPL
Query: PAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYF
P FHG +ECP HLSRF KVCRANNA+S DMMMRIFPVTLE EAALWYDLNI+PYP +SW+E+ SFL+AY +I+L DQLRS+LM +NQ +E+VRSYF
Subjt: PAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYF
Query: LRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKK
+RLQ ILK+WP + LSD +LK IF+DGL FK+W+IP KP+SLNEALRLAF FEQV+S+R + K++ ++CGFCEG HEE C VRE+MR+L+++ +KK
Subjt: LRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKK
Query: KAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKK--KSQCQCWKHQCGMKKLDRNLSIVSRNS
+ S+ EA A + + + G DK+ M+ K KS CQC KH C MKK +R+ S+ +RNS
Subjt: KAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKK--KSQCQCWKHQCGMKKLDRNLSIVSRNS
|
|
| EXB78111.1 hypothetical protein L484_004813 [Morus notabilis] | 9.4e-86 | 50.51 | Show/hide |
Query: RRSPRPLPRRTVDYASDYD-------ASPSPSQSL-YASNEDD--YDASESINFQPTDPKSKAQEIKGSDLLTSAE---SASNSPSYF--QSPNAAETLF
RR+P P DY+S YD SP+ S N+DD DAS++ T+P S Q S+ + + + SAS+SP Q P +
Subjt: RRSPRPLPRRTVDYASDYD-------ASPSPSQSL-YASNEDD--YDASESINFQPTDPKSKAQEIKGSDLLTSAE---SASNSPSYF--QSPNAAETLF
Query: PYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQ
Y+NIA P F GG +ECP HLSRFAKVCRANN +S+DMMM+IFPVTLE EAALWYDLN+EPY +SWEE+KSSF AY KIELT+QLRS+LMTINQ
Subjt: PYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQ
Query: EENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQ
E+VRSYFLRLQ ILKKWP + LSD LLK +F+DGLR +F+EWM PQKP SLN+ALRLAF FEQV+S+R + ++CGFC G HEE CEVRERMR+
Subjt: EENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQ
Query: LWKSREKKKAVD--------LAESDGREAATATATAELVRSVSAISRNEAGVDKDG----GEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN
LW K + + +S+G + + + RS + +N+ V++DG E+ KK+SQCQC KHQC K ++RN S VS N
Subjt: LWKSREKKKAVD--------LAESDGREAATATATAELVRSVSAISRNEAGVDKDG----GEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN
|
|
| KAF3973299.1 hypothetical protein CMV_003263 [Castanea mollissima] | 4.2e-86 | 55.94 | Show/hide |
Query: NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMM
N+D Y SES P D S + + + +L T+A S SN P Q P+ L YINIAP P FHG +ECP H+SRFAKVC ANN ++ DMM
Subjt: NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMM
Query: MRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEF
M IFPVTLE EAALWYDLNI+PYP ++WEE+KSSFL AY+KI++ DQLRSELM INQ EE+VRSYFLRLQ ILK+WP + + DGLLK +F+DGLREEF
Subjt: MRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEF
Query: KEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSV-----S
++W+ PQKP SL+EALRLAF FEQV+S+R K+ L+CGFC+G HEE CEVRERMR+LW+ S+EK++ V LA+S + ELVRSV S
Subjt: KEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSV-----S
Query: AISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVS
++ +N G ++GG M G KK+Q Q K+Q MKKL+RN S++S
Subjt: AISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVS
|
|
| KAF3973300.1 hypothetical protein CMV_003263 [Castanea mollissima] | 4.2e-86 | 55.94 | Show/hide |
Query: NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMM
N+D Y SES P D S + + + +L T+A S SN P Q P+ L YINIAP P FHG +ECP H+SRFAKVC ANN ++ DMM
Subjt: NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMM
Query: MRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEF
M IFPVTLE EAALWYDLNI+PYP ++WEE+KSSFL AY+KI++ DQLRSELM INQ EE+VRSYFLRLQ ILK+WP + + DGLLK +F+DGLREEF
Subjt: MRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEF
Query: KEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSV-----S
++W+ PQKP SL+EALRLAF FEQV+S+R K+ L+CGFC+G HEE CEVRERMR+LW+ S+EK++ V LA+S + ELVRSV S
Subjt: KEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSV-----S
Query: AISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVS
++ +N G ++GG M G KK+Q Q K+Q MKKL+RN S++S
Subjt: AISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVS
|
|
| KAG6604769.1 hypothetical protein SDJN03_02086, partial [Cucurbita argyrosperma subsp. sororia] | 4.6e-157 | 79.41 | Show/hide |
Query: MARKLRRSPRPLPRRTVDYASDYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPA
MA KLRRSP PL RR +YA+DYDA S SQSL ASNEDDYDASES NFQ + KSK+ EI + ESA+NSP+ QSPNAA T+FPYINIAPLP
Subjt: MARKLRRSPRPLPRRTVDYASDYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPA
Query: FHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLR
FHGG DECPA HLSRFAKVCRANNAASV++MMRIFPVTL+GEA LWYDLNIEPYPPISWEELKSSFLDAYNKIEL +QLRSELMTI+Q+ EENVRSYFLR
Subjt: FHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLR
Query: LQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKA
LQLILKKWP GNELSDG LKAIFMDGLREEFKEWMIPQKP SLNEALRLAFG EQV +RTSG KRFL+CGFCEG HEEL+CEVRERMR+LWKSREKK
Subjt: LQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKA
Query: VDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRNSK
D+AES+G TAELVRSVSAISRNEA V KDGGEMVGLKKK QCQCWKHQCGMKKLDRNLS++S+ SK
Subjt: VDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRNSK
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A6J5UDI4 Retrotrans_gag domain-containing protein | 1.7e-85 | 52.08 | Show/hide |
Query: DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQ----EIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFA
D D S SQS N+ Y ASES P+D S +Q + S+ + + S + F P +T + YI IAPLP F GG +ECP HL+RFA
Subjt: DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQ----EIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFA
Query: KVCRAN-NAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSD
K+CRAN + +VD+M+RIFPVTLE EAALWYDLNI+PYP +SWEE++S F AY++I DQLRSEL I Q ++E VRSYFLRLQ ILK+WP + L D
Subjt: KVCRAN-NAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSD
Query: GLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVR--TSGKKRFLQCGFCEGPHEELLCEVRERMRQLW-KSREKKKAVDLAESDGREAAT
+LK +F+DGLR+EFK+W++ +KPSSLN+ALRLAFGFE+V+SVR T+ K++ ++CGFC G HEE CEVRERMR+LW KS+E+
Subjt: GLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVR--TSGKKRFLQCGFCEGPHEELLCEVRERMRQLW-KSREKKKAVDLAESDGREAAT
Query: ATATAELVRSVSAI-SRNEAGVDK-DGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN
LVR VS + R E GV++ + GE+V LKKK QCQCWKHQC KKL+R+ S+V N
Subjt: ATATAELVRSVSAI-SRNEAGVDK-DGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN
|
|
| A0A7N2R9A7 Retrotrans_gag domain-containing protein | 6.2e-91 | 57.77 | Show/hide |
Query: NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMM
N+D Y +SES P D S + + + +L T+A S SN P Q P+ L Y+NIAP+P FHG +ECP H+SRFAKVC ANN ++ DMM
Subjt: NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMM
Query: MRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEF
MRIFPVTLE EAALWYDLNIEPYP ++WEE+KSSFL AY+KIE+ DQLRSELM INQ EE+VRSYFLRLQ ILK+WP + +SDGLLK +F+DGLREEF
Subjt: MRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEF
Query: KEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSVSAISRN
+ W+IPQKP SL+EALRLAFGFEQV+S+R K+ L+CGFC+G HEE CEVRERMR+LW+ S+EK++AV LA+S G + ELVRSVS + +
Subjt: KEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSVSAISRN
Query: EAGVDKDGGEMVGLK-KKSQCQCWKHQCGMKKLDRNLSIVS
G + +G E + KK+Q Q K+Q MKKL+RN S++S
Subjt: EAGVDKDGGEMVGLK-KKSQCQCWKHQCGMKKLDRNLSIVS
|
|
| A5C7E6 Retrotrans_gag domain-containing protein | 5.1e-85 | 52.08 | Show/hide |
Query: YASDYDASPSPSQSLYASNEDDYDA----SESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLF--PYINIAPLPAFHGGVDECPAMH
+ DY SPSQS Y +E++ D +++ + T+ + + + +S S S S N+ YINIAPLP F G DECP H
Subjt: YASDYDASPSPSQSLYASNEDDYDA----SESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLF--PYINIAPLPAFHGGVDECPAMH
Query: LSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGN
LSRF KVCRANN +SV+M+MRIFPVTL+GEAALWYDLNIEPY +SWEE+KSSFL AY++ LTD+LRSELM INQ EE+VRSYFLRLQ ILK+WP +
Subjt: LSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGN
Query: ELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKAVDLAESDGREAA
L DGLL+ IF+DGLR++F++W+IPQKPSSLNEALRLAF +E+V+S+R +K +CGFC G H+E CE+RERMR LW + KK+ D + GR
Subjt: ELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKAVDLAESDGREAA
Query: TATATAELVR--SVSAISRNEAGVDKDGGE-MVGLKKKSQCQCWKHQCGMKKLDRNLSIVS
E R SV SR+ +++G E +G KKKSQCQC KHQC KKL+RN S+++
Subjt: TATATAELVR--SVSAISRNEAGVDKDGGE-MVGLKKKSQCQCWKHQCGMKKLDRNLSIVS
|
|
| B9RWN5 Retrotrans_gag domain-containing protein | 4.0e-90 | 50.93 | Show/hide |
Query: MARKLRRSPRPLPRRTVDYAS--DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPL
M RK + S R+++ ++S DY S SPSQS Y SN+DD + + QP +S + L S+ S SNS PN + YIN+APL
Subjt: MARKLRRSPRPLPRRTVDYAS--DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPL
Query: PAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYF
P FHG +ECP HLSRF KVCRANNA+S DMMMRIFPVTLE EAALWYDLNI+PYP +SW+E+ SFL+AY +I+L DQLRS+LM +NQ +E+VRSYF
Subjt: PAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYF
Query: LRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKK
+RLQ ILK+WP + LSD +LK IF+DGL FK+W+IP KP+SLNEALRLAF FEQV+S+R + K++ ++CGFCEG HEE C VRE+MR+L+++ +KK
Subjt: LRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKK
Query: KAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKK--KSQCQCWKHQCGMKKLDRNLSIVSRNS
+ S+ EA A + + + G DK+ M+ K KS CQC KH C MKK +R+ S+ +RNS
Subjt: KAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKK--KSQCQCWKHQCGMKKLDRNLSIVSRNS
|
|
| W9R9S0 Retrotrans_gag domain-containing protein | 4.6e-86 | 50.51 | Show/hide |
Query: RRSPRPLPRRTVDYASDYD-------ASPSPSQSL-YASNEDD--YDASESINFQPTDPKSKAQEIKGSDLLTSAE---SASNSPSYF--QSPNAAETLF
RR+P P DY+S YD SP+ S N+DD DAS++ T+P S Q S+ + + + SAS+SP Q P +
Subjt: RRSPRPLPRRTVDYASDYD-------ASPSPSQSL-YASNEDD--YDASESINFQPTDPKSKAQEIKGSDLLTSAE---SASNSPSYF--QSPNAAETLF
Query: PYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQ
Y+NIA P F GG +ECP HLSRFAKVCRANN +S+DMMM+IFPVTLE EAALWYDLN+EPY +SWEE+KSSF AY KIELT+QLRS+LMTINQ
Subjt: PYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQ
Query: EENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQ
E+VRSYFLRLQ ILKKWP + LSD LLK +F+DGLR +F+EWM PQKP SLN+ALRLAF FEQV+S+R + ++CGFC G HEE CEVRERMR+
Subjt: EENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQ
Query: LWKSREKKKAVD--------LAESDGREAATATATAELVRSVSAISRNEAGVDKDG----GEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN
LW K + + +S+G + + + RS + +N+ V++DG E+ KK+SQCQC KHQC K ++RN S VS N
Subjt: LWKSREKKKAVD--------LAESDGREAATATATAELVRSVSAISRNEAGVDKDG----GEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN
|
|