; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002966 (gene) of Snake gourd v1 genome

Gene IDTan0002966
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNA-directed DNA polymerase (reverse transcriptase)-related family protein
Genome locationLG10:4021590..4031971
RNA-Seq ExpressionTan0002966
SyntenyTan0002966
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582484.1 hypothetical protein SDJN03_22486, partial [Cucurbita argyrosperma subsp. sororia]2.7e-19085.65Show/hide
Query:  MPESMELEATPSVSPSLDLQAVRS---ELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAE
        MPESM  EATPSVS SLDLQAVRS   ELEELQRSL EDEA +TDS GSEKLLKECALHLESRLQQ+LSECS+VDSFLGIDDLDAYVE MKEELVAVEAE
Subjt:  MPESMELEATPSVSPSLDLQAVRS---ELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAE

Query:  SSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVV
        SSKI+NEIEVLKRTNIE SNKL++DLE    SLDRFTSQD E  TFN CS+NGEDQMN+IV+ ECNAFEVLELDS IEKNKRILKSLQE+DE+FKS+DVV
Subjt:  SSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVV

Query:  EQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDR
        EQVEDTIGGLKVI VADNFIRLSL THIPNLEDFSSLQ+LEGMIEPSE++HELL+EVLEGTMELKNAEI PGDVHLH+IINASKS+SNSLEWFV+KVQDR
Subjt:  EQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDR

Query:  IVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAV
        IVLCTLRRFVVKSANKSSHSF+Y+DQDETI+C MIGGIDA IKVSQGWPLADSPL L+SLKSSDHYTKG SLSL+CKVEKMANSL+ R+RQNLSSFADAV
Subjt:  IVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAV

Query:  EKILKEQIYLELQSDSAL
        EKILKEQ++LELQ+D  L
Subjt:  EKILKEQIYLELQSDSAL

XP_022924674.1 uncharacterized protein LOC111432106 isoform X3 [Cucurbita moschata]2.3e-18985.17Show/hide
Query:  MPESMELEATPSVSPSLDLQAVRS---ELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAE
        MPESM  EATPSVS SLDLQAVRS   ELEELQRSL EDEA +TDS GSEKLLKECALHLESRLQQ+LSECS+VDSFLGIDDLDAYVE MKEELVAVEAE
Subjt:  MPESMELEATPSVSPSLDLQAVRS---ELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAE

Query:  SSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVV
        SS+I+NEIEVLKRTNIE SNKL+++LE    SLDRFTSQDPE  TFN CS+NGEDQMN+IV+RE NAFEVLELDS IEKNKRILKSLQE+DE+FKS+DVV
Subjt:  SSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVV

Query:  EQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDR
        EQVEDTIGGLKVI VADNFIRLSL THIPNLEDFSSLQ+LEGMIEPSE++HELL+EVLEGTMELKNAEI PGDVHLH+IINASKS+SNSLEWFV+KVQDR
Subjt:  EQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDR

Query:  IVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAV
        IVLCTLRRFVVKSANKSSHSF+Y+DQDETI+C MIGGIDA IKVSQGWPLADSPL L+SLKSSDHYTKG SLSL+CKVEKMANSL+ R+RQNLSSFADAV
Subjt:  IVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAV

Query:  EKILKEQIYLELQSDSAL
        EKILKEQ++LEL++D  L
Subjt:  EKILKEQIYLELQSDSAL

XP_023528067.1 uncharacterized protein LOC111791098 isoform X1 [Cucurbita pepo subsp. pepo]5.7e-19386.6Show/hide
Query:  MPESMELEATPSVSPSLDLQAVR---SELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAE
        MPESM  EATPSVS SLDLQAVR   SELEELQRSLEEDEA  TDS GSEKLLKECALHLESRLQQ+LSECS+VDSFL IDDLDAYVE MKEELVAVEAE
Subjt:  MPESMELEATPSVSPSLDLQAVR---SELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAE

Query:  SSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVV
        SSKI+NEIEVLKRTNIE SNKL++DLE    SLDRFTSQDPE  TFN CS+NGEDQMN+IV+ ECNAFEVLELDS IEKNKRILKSLQE+DE+FKS+DVV
Subjt:  SSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVV

Query:  EQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDR
        EQVEDTIGGLKVIDVADNFIRLSL THIPNLEDFSSLQKLEGMIEPSE++HELL+EVLEGTMELKNAEI PGDVHLH+IINASKS+SNSLEWFV+KVQDR
Subjt:  EQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDR

Query:  IVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAV
        IVLCTLRRFVVKSANKSSHSF+Y+DQDETI+C+MIGGIDA IKVSQGWPLADSPL L+SLKSSDHYTKG SLSL+CKVEKMANSL+ R+RQNLSSFADAV
Subjt:  IVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAV

Query:  EKILKEQIYLELQSDSAL
        EKILKEQ++LELQ+DS L
Subjt:  EKILKEQIYLELQSDSAL

XP_023528068.1 uncharacterized protein LOC111791098 isoform X2 [Cucurbita pepo subsp. pepo]1.4e-19487.23Show/hide
Query:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSK
        MPESM  EATPSVS SLDLQAVRSELEELQRSLEEDEA  TDS GSEKLLKECALHLESRLQQ+LSECS+VDSFL IDDLDAYVE MKEELVAVEAESSK
Subjt:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSK

Query:  ITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQV
        I+NEIEVLKRTNIE SNKL++DLE    SLDRFTSQDPE  TFN CS+NGEDQMN+IV+ ECNAFEVLELDS IEKNKRILKSLQE+DE+FKS+DVVEQV
Subjt:  ITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQV

Query:  EDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDRIVL
        EDTIGGLKVIDVADNFIRLSL THIPNLEDFSSLQKLEGMIEPSE++HELL+EVLEGTMELKNAEI PGDVHLH+IINASKS+SNSLEWFV+KVQDRIVL
Subjt:  EDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDRIVL

Query:  CTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEKI
        CTLRRFVVKSANKSSHSF+Y+DQDETI+C+MIGGIDA IKVSQGWPLADSPL L+SLKSSDHYTKG SLSL+CKVEKMANSL+ R+RQNLSSFADAVEKI
Subjt:  CTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEKI

Query:  LKEQIYLELQSDSAL
        LKEQ++LELQ+DS L
Subjt:  LKEQIYLELQSDSAL

XP_038897559.1 uncharacterized protein LOC120085576 isoform X1 [Benincasa hispida]3.9e-18986.71Show/hide
Query:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSK
        MPESM  EATPSV PSLDLQ+VRSELEELQRSLEE+EA T DS GSEKLLKECALHLESRLQQILSE S+VDSFLGIDDLDAYVE MKEELVAVEAESSK
Subjt:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSK

Query:  ITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQV
        I+NEIEVLKRTNIEDSNKLKMDLE  K SLDRFTSQDPE ATFNC S+NGEDQMN IV RECNAFEVLEL+ QIE+NK+ILKSLQE+D++FKS+DV+EQV
Subjt:  ITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQV

Query:  EDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSN-SLEWFVRKVQDRIV
        EDTIGG+KVIDVADNFIRLSL THIPNLEDFS+LQ+LEG+IE S +DHELL+EVLEGTMELKNAEI PGDVHLH+IINASKS+SN SLEWFVRKVQDRIV
Subjt:  EDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSN-SLEWFVRKVQDRIV

Query:  LCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEK
        LCTLRRFVVKSANKSSHSFEY DQDE IIC+MIGGIDA IKVSQGWPLADSPL LISLKSSDHYTKGVSLSLICKVEKMANSL+ R+ +NLSSFADAVEK
Subjt:  LCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEK

Query:  ILKEQIYLELQSDS
        ILKEQ++LELQ+DS
Subjt:  ILKEQIYLELQSDS

TrEMBL top hitse value%identityAlignment
A0A0A0L6Q3 Uncharacterized protein2.4e-18985.27Show/hide
Query:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSK
        MPESM  EATPSV PSLDLQAVRSELEELQRSLEE+E STTDS GSEKLL+ECALHLESR+QQ+LSE S+VDSFLGIDDLDAYVE MKEELVAVEAESSK
Subjt:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSK

Query:  ITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQV
        I+NEIEVLKRTNIEDSNKLKMDLE  K SLDRF SQDPE ATFNC S+NGED MN+IV RECNAFEVLEL+SQIEKNK+ILKSLQE+DE+FKS+DV+EQV
Subjt:  ITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQV

Query:  EDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSN-SLEWFVRKVQDRIV
        E TIGG+KVIDVADN IRLSLHTHIPN+EDFS+LQ+LEG+IE SE+DHEL++EVL+GTMELKNAEI P DVHLH+IINASKS+SN SLEWFVRKVQDRIV
Subjt:  EDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSN-SLEWFVRKVQDRIV

Query:  LCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEK
        LCTLRRF VKSANKS HSFEYLDQDE I+C+MIGGIDA IKVSQGWPLADSPL LISLKSSDHYTKGVSLSLICKVEKMANSL+  +R+NLSSFADAVEK
Subjt:  LCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEK

Query:  ILKEQIYLELQSDS
        ILKEQ++LELQ+DS
Subjt:  ILKEQIYLELQSDS

A0A5A7U6L2 Uncharacterized protein7.8e-18884.54Show/hide
Query:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSK
        MPESME+  TPSV PSLDLQAVRSELEELQRSLEE+E S+ DS GSEKLL+ECALHLESR+QQ+LSE S+VDSFLGIDDLDAYVE MKEELVAVEAESSK
Subjt:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSK

Query:  ITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQV
        I+NEIEVLKRT IEDSNKLKMDLE  K SLDRF SQDPE ATFNC S+NGED+MN+IV+RECNAFEVLEL+SQIEKNK+ILKSLQE+DE+FKS+DV+EQV
Subjt:  ITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQV

Query:  EDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSN-SLEWFVRKVQDRIV
        E TIGG+KVIDVADN IRLSLHTHIPN+EDFS+LQ+LEG+IE SE+DHEL++EV  GTMELKNAEI P DVHLH+IINASKS+SN SLEWFVRKVQDRIV
Subjt:  EDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSN-SLEWFVRKVQDRIV

Query:  LCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEK
        LCTLRRF VKSANKSSHSFEYLDQDE I+C+MIGGIDA IKVSQGWPLADSPL LISLKSSDHYTKG+SLSLICKVEKMANSL+ R+RQNLSSFADAVEK
Subjt:  LCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEK

Query:  ILKEQIYLELQSDS
        ILKEQ++LELQ+DS
Subjt:  ILKEQIYLELQSDS

A0A6J1E9M5 uncharacterized protein LOC111432106 isoform X16.0e-18884.36Show/hide
Query:  MPESMELEATPSVSPSLDLQAVRS---ELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDD----LDAYVERMKEELVA
        MPESM  EATPSVS SLDLQAVRS   ELEELQRSL EDEA +TDS GSEKLLKECALHLESRLQQ+LSECS+VDSFLGIDD    LDAYVE MKEELVA
Subjt:  MPESMELEATPSVSPSLDLQAVRS---ELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDD----LDAYVERMKEELVA

Query:  VEAESSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKS
        VEAESS+I+NEIEVLKRTNIE SNKL+++LE    SLDRFTSQDPE  TFN CS+NGEDQMN+IV+RE NAFEVLELDS IEKNKRILKSLQE+DE+FKS
Subjt:  VEAESSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKS

Query:  MDVVEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRK
        +DVVEQVEDTIGGLKVI VADNFIRLSL THIPNLEDFSSLQ+LEGMIEPSE++HELL+EVLEGTMELKNAEI PGDVHLH+IINASKS+SNSLEWFV+K
Subjt:  MDVVEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRK

Query:  VQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSF
        VQDRIVLCTLRRFVVKSANKSSHSF+Y+DQDETI+C MIGGIDA IKVSQGWPLADSPL L+SLKSSDHYTKG SLSL+CKVEKMANSL+ R+RQNLSSF
Subjt:  VQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSF

Query:  ADAVEKILKEQIYLELQSDSAL
        ADAVEKILKEQ++LEL++D  L
Subjt:  ADAVEKILKEQIYLELQSDSAL

A0A6J1E9V8 uncharacterized protein LOC111432106 isoform X31.1e-18985.17Show/hide
Query:  MPESMELEATPSVSPSLDLQAVRS---ELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAE
        MPESM  EATPSVS SLDLQAVRS   ELEELQRSL EDEA +TDS GSEKLLKECALHLESRLQQ+LSECS+VDSFLGIDDLDAYVE MKEELVAVEAE
Subjt:  MPESMELEATPSVSPSLDLQAVRS---ELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAE

Query:  SSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVV
        SS+I+NEIEVLKRTNIE SNKL+++LE    SLDRFTSQDPE  TFN CS+NGEDQMN+IV+RE NAFEVLELDS IEKNKRILKSLQE+DE+FKS+DVV
Subjt:  SSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVV

Query:  EQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDR
        EQVEDTIGGLKVI VADNFIRLSL THIPNLEDFSSLQ+LEGMIEPSE++HELL+EVLEGTMELKNAEI PGDVHLH+IINASKS+SNSLEWFV+KVQDR
Subjt:  EQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDR

Query:  IVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAV
        IVLCTLRRFVVKSANKSSHSF+Y+DQDETI+C MIGGIDA IKVSQGWPLADSPL L+SLKSSDHYTKG SLSL+CKVEKMANSL+ R+RQNLSSFADAV
Subjt:  IVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAV

Query:  EKILKEQIYLELQSDSAL
        EKILKEQ++LEL++D  L
Subjt:  EKILKEQIYLELQSDSAL

A0A6J1ED63 uncharacterized protein LOC111432106 isoform X25.4e-18984.73Show/hide
Query:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDD----LDAYVERMKEELVAVEA
        MPESM  EATPSVS SLDLQAVR ELEELQRSL EDEA +TDS GSEKLLKECALHLESRLQQ+LSECS+VDSFLGIDD    LDAYVE MKEELVAVEA
Subjt:  MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDD----LDAYVERMKEELVAVEA

Query:  ESSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDV
        ESS+I+NEIEVLKRTNIE SNKL+++LE    SLDRFTSQDPE  TFN CS+NGEDQMN+IV+RE NAFEVLELDS IEKNKRILKSLQE+DE+FKS+DV
Subjt:  ESSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDV

Query:  VEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQD
        VEQVEDTIGGLKVI VADNFIRLSL THIPNLEDFSSLQ+LEGMIEPSE++HELL+EVLEGTMELKNAEI PGDVHLH+IINASKS+SNSLEWFV+KVQD
Subjt:  VEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQD

Query:  RIVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADA
        RIVLCTLRRFVVKSANKSSHSF+Y+DQDETI+C MIGGIDA IKVSQGWPLADSPL L+SLKSSDHYTKG SLSL+CKVEKMANSL+ R+RQNLSSFADA
Subjt:  RIVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADA

Query:  VEKILKEQIYLELQSDSAL
        VEKILKEQ++LEL++D  L
Subjt:  VEKILKEQIYLELQSDSAL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G23910.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT3G24255.2)4.2e-8542.75Show/hide
Query:  SLDLQAVRSELEELQ---RSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSKITNEIEVLKRTN
        SLDLQ +R  ++EL    R+  E+   +  S     ++++  L  E ++++I+ E  DVD  L ++D DAY+E ++ EL +VEAES+K++ EIE L +++
Subjt:  SLDLQAVRSELEELQ---RSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSKITNEIEVLKRTN

Query:  IEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQVEDTIGGLKVIDV
         +DS++L+ DLE    SLD  +SQD E +  N  S +  +   +I   + + F++ EL++Q+E+ + ILKSL++LD + K  D  EQVED + GLKV++ 
Subjt:  IEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQVEDTIGGLKVIDV

Query:  ADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSL------------SNSLEWFVRKVQDRIVL
          NFIRL L T+I  L+ F    K + + EPSE+ HELL+ + + T E+   E+ P D+++ +II A+ S              +S++W V KVQD+I+ 
Subjt:  ADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSL------------SNSLEWFVRKVQDRIVL

Query:  CTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEKI
         TLR+++V S+    ++FEY D+DETI+  + GGIDA +KVS GWPL ++PL L SLK+SD+ +KG+SLSLICKVE++ANSL++  RQNLS F DA+EKI
Subjt:  CTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEKI

Query:  LKEQIYLELQSDSA
        L EQ   ELQS+ +
Subjt:  LKEQIYLELQSDSA

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-7344.51Show/hide
Query:  DAYVERMKEELVAVEAESSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRI
        DAY+E ++ EL +VEAES+K++ EIE L +++  DS++L+ DLE    SLD  +SQD E +  N  S +  +   +I   + + F++ EL++Q+E+ + I
Subjt:  DAYVERMKEELVAVEAESSKITNEIEVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRI

Query:  LKSLQELDEMFKSMDVVEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINAS
        LKSL++LD + K  D  EQVED + GLKV++   NFIRL L T+I  L+ F    K + + EPSE+ HELL+ + + T E+   E+ P D+++ +II A+
Subjt:  LKSLQELDEMFKSMDVVEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINAS

Query:  KSL------------SNSLEWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVS
         S              +S++W V KVQD+I+  TLR+  V S+    ++FEY D+DETI+  + GGIDA +KVS GWPL ++PL L SLK+SD+ +KG S
Subjt:  KSL------------SNSLEWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVS

Query:  LSLICKVEKMANSLNVRLRQNLSSFADAVEKILKEQIYLELQSDSA
        LSLI K+E++ANSL++  RQNLS F DAVEKIL +Q   EL+S+ +
Subjt:  LSLICKVEKMANSLNVRLRQNLSSFADAVEKILKEQIYLELQSDSA

AT3G24255.2 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.5e-7740.86Show/hide
Query:  SLDLQAVRSELEELQ---RSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDD-------LDAYVERMKEELVAVEAESSKITNEI
        SLDLQ +R  ++E     R+  E+   +  S     ++++  L  E ++++I+ +  DVD  L +D         DAY+E ++ EL +VEAES+K++ EI
Subjt:  SLDLQAVRSELEELQ---RSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDD-------LDAYVERMKEELVAVEAESSKITNEI

Query:  EVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQVEDTIG
        E L +++  DS++L+ DLE    SLD  +SQD E +  N  S +  +   +I   + + F++ EL++Q+E+ + ILKSL++LD + K  D  EQVED + 
Subjt:  EVLKRTNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQVEDTIG

Query:  GLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSL------------SNSLEWFVRK
        GLKV++   NFIRL L T+I  L+ F    K + + EPSE+ HELL+ + + T E+   E+ P D+++ +II A+ S              +S++W V K
Subjt:  GLKVIDVADNFIRLSLHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSL------------SNSLEWFVRK

Query:  VQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSF
        VQD+I+  TLR+  V S+    ++FEY D+DETI+  + GGIDA +KVS GWPL ++PL L SLK+SD+ +KG SLSLI K+E++ANSL++  RQNLS F
Subjt:  VQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSF

Query:  ADAVEKILKEQIYLELQSDSA
         DAVEKIL +Q   EL+S+ +
Subjt:  ADAVEKILKEQIYLELQSDSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAGAATCGATGGAACTGGAAGCTACTCCGTCTGTATCTCCAAGCCTCGATCTCCAAGCAGTTCGCAGCGAGCTAGAAGAGTTGCAGAGATCTTTGGAGGAAGATGA
AGCTTCTACGACGGATTCATCAGGTTCTGAGAAGTTACTGAAGGAGTGTGCTCTCCATCTAGAGAGTCGACTGCAGCAAATTCTGTCAGAATGCTCTGACGTTGATAGTT
TCTTGGGGATTGATGATTTAGATGCGTATGTTGAACGTATGAAAGAGGAACTTGTTGCGGTGGAAGCTGAAAGCAGCAAAATCACCAATGAGATCGAGGTTCTTAAGAGA
ACCAATATAGAAGATTCTAATAAATTAAAGATGGATCTTGAAGCATTTAAATGGTCGTTAGATCGTTTTACATCACAGGATCCAGAAACGGCAACATTTAATTGCTGCTC
TGTGAATGGTGAAGATCAAATGAACATGATAGTCGAGCGTGAATGCAATGCCTTTGAGGTGTTGGAACTTGATAGTCAGATTGAGAAGAACAAAAGAATTCTAAAATCTT
TGCAGGAATTAGATGAGATGTTTAAAAGTATGGATGTTGTTGAACAGGTTGAGGACACAATTGGTGGTCTGAAGGTCATTGACGTTGCTGATAATTTCATTAGATTGTCA
TTACATACACATATTCCGAACTTAGAAGATTTTTCAAGCTTACAGAAACTAGAAGGTATGATTGAGCCATCTGAAGTGGATCACGAGTTGCTAGTAGAAGTTTTGGAGGG
GACCATGGAGCTAAAGAATGCTGAGATCATTCCGGGTGATGTCCACTTGCACAATATCATCAATGCTTCAAAGTCACTCAGCAATTCTTTGGAATGGTTTGTGAGAAAAG
TACAAGATAGGATTGTTTTGTGTACTCTAAGGCGATTTGTTGTGAAGAGTGCAAACAAATCGAGTCATTCCTTCGAGTATTTAGATCAAGACGAAACGATAATATGTACT
ATGATTGGAGGAATTGATGCACTTATTAAGGTGTCTCAAGGTTGGCCATTAGCTGATTCTCCTTTGACACTCATATCTCTTAAGAGCTCAGACCATTATACAAAAGGAGT
TTCTCTAAGCCTCATTTGCAAGGTGGAGAAAATGGCAAATTCGTTGAACGTTCGCCTTCGCCAAAATCTATCAAGCTTTGCAGATGCTGTTGAAAAAATATTGAAGGAGC
AAATTTATTTAGAACTCCAATCTGACAGCGCTCTATGA
mRNA sequenceShow/hide mRNA sequence
GCGGATACCGAAGAATTCTCCCAACTTCTCTCTCAGTTTGAATTTCGCGCAGACGAATAACTCGAGGAATCTCGTACGGATTCTGTGCAAATTCCGGCGAAGGAAGCAGA
AAATAATGCCAGAATCGATGGAACTGGAAGCTACTCCGTCTGTATCTCCAAGCCTCGATCTCCAAGCAGTTCGCAGCGAGCTAGAAGAGTTGCAGAGATCTTTGGAGGAA
GATGAAGCTTCTACGACGGATTCATCAGGTTCTGAGAAGTTACTGAAGGAGTGTGCTCTCCATCTAGAGAGTCGACTGCAGCAAATTCTGTCAGAATGCTCTGACGTTGA
TAGTTTCTTGGGGATTGATGATTTAGATGCGTATGTTGAACGTATGAAAGAGGAACTTGTTGCGGTGGAAGCTGAAAGCAGCAAAATCACCAATGAGATCGAGGTTCTTA
AGAGAACCAATATAGAAGATTCTAATAAATTAAAGATGGATCTTGAAGCATTTAAATGGTCGTTAGATCGTTTTACATCACAGGATCCAGAAACGGCAACATTTAATTGC
TGCTCTGTGAATGGTGAAGATCAAATGAACATGATAGTCGAGCGTGAATGCAATGCCTTTGAGGTGTTGGAACTTGATAGTCAGATTGAGAAGAACAAAAGAATTCTAAA
ATCTTTGCAGGAATTAGATGAGATGTTTAAAAGTATGGATGTTGTTGAACAGGTTGAGGACACAATTGGTGGTCTGAAGGTCATTGACGTTGCTGATAATTTCATTAGAT
TGTCATTACATACACATATTCCGAACTTAGAAGATTTTTCAAGCTTACAGAAACTAGAAGGTATGATTGAGCCATCTGAAGTGGATCACGAGTTGCTAGTAGAAGTTTTG
GAGGGGACCATGGAGCTAAAGAATGCTGAGATCATTCCGGGTGATGTCCACTTGCACAATATCATCAATGCTTCAAAGTCACTCAGCAATTCTTTGGAATGGTTTGTGAG
AAAAGTACAAGATAGGATTGTTTTGTGTACTCTAAGGCGATTTGTTGTGAAGAGTGCAAACAAATCGAGTCATTCCTTCGAGTATTTAGATCAAGACGAAACGATAATAT
GTACTATGATTGGAGGAATTGATGCACTTATTAAGGTGTCTCAAGGTTGGCCATTAGCTGATTCTCCTTTGACACTCATATCTCTTAAGAGCTCAGACCATTATACAAAA
GGAGTTTCTCTAAGCCTCATTTGCAAGGTGGAGAAAATGGCAAATTCGTTGAACGTTCGCCTTCGCCAAAATCTATCAAGCTTTGCAGATGCTGTTGAAAAAATATTGAA
GGAGCAAATTTATTTAGAACTCCAATCTGACAGCGCTCTATGATGATTAAGAAGAACTGTGGTTCTTCATCATACAGTTTGATGGTTTCTACCTACTTCAATCAGATCTC
AACTGGATGTTTCAAATTTACAACCACACCAATAGAACAAATCGAGATCCTGTTAACAGGTTAGAAAACTGAACATCTATCTATCTACTTGTACAAGTGCCATGCATTCC
AAGTAAAATTTTAGCTGATGATTTTTCATGCTGAAAATTTTAGCTCGATTATTAATTACTATTTTTCCCCCTTTTGTCTGTATATAGACTAAAGACTTTGTTGCTCTGTT
TATTTATTTATCATTATTATTTTTGCTAATTAGGATATCAAAAGCCCTATTCTTTGCATTTTGCTCATTGTTCATTCCAGTAATACCCAGAGAACTTAGG
Protein sequenceShow/hide protein sequence
MPESMELEATPSVSPSLDLQAVRSELEELQRSLEEDEASTTDSSGSEKLLKECALHLESRLQQILSECSDVDSFLGIDDLDAYVERMKEELVAVEAESSKITNEIEVLKR
TNIEDSNKLKMDLEAFKWSLDRFTSQDPETATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEMFKSMDVVEQVEDTIGGLKVIDVADNFIRLS
LHTHIPNLEDFSSLQKLEGMIEPSEVDHELLVEVLEGTMELKNAEIIPGDVHLHNIINASKSLSNSLEWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICT
MIGGIDALIKVSQGWPLADSPLTLISLKSSDHYTKGVSLSLICKVEKMANSLNVRLRQNLSSFADAVEKILKEQIYLELQSDSAL