; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi08G001477 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi08G001477
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:52378227..52380591
RNA-Seq ExpressionBhi08G001477
SyntenyBhi08G001477
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011648413.2 probable serine/threonine-protein kinase DDB_G0280111 [Cucumis sativus]1.5e-6148.39Show/hide
Query:  RGIMSP--------KRSRFSPKMTKKGTPQPAKPAKNKHPR---NVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTR---SQ
        RGI  P        ++   SP  TKK T    K + NKHP    N   S         P P  AA N +PN     S S PSSP PPPTP PTTR   SQ
Subjt:  RGIMSP--------KRSRFSPKMTKKGTPQPAKPAKNKHPR---NVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTR---SQ

Query:  PTTPS-------SSNTNVVRQGGLSFDNPSVSK---------SFHSPKDYHKSSPGSWLGL-QKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVA
        P TPS       S   +  R+       PS S          + +SPK   K SP S     +K V T+  P+SPASS  ++D+  RLLQ LSF+GKD+ 
Subjt:  PTTPS-------SSNTNVVRQGGLSFDNPSVSK---------SFHSPKDYHKSSPGSWLGL-QKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVA

Query:  DILQGKSIYDIMGS-NKKEETS-QSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQ-CSSGGAIQNVSELTERQKVVQFLVGLND
        DIL+G SI D+MGS NKKEE+S +++  L +LQIY++IASHRQ NL VE YF+KL  LW+++  Y +D AQ  SS G I   SELTER KV+QF +GLND
Subjt:  DILQGKSIYDIMGS-NKKEETS-QSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQ-CSSGGAIQNVSELTERQKVVQFLVGLND

Query:  SYATICGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNA
         Y+ IC QILV +PFPTVEEAYSEII EEKRREL  AL  +AA+VIQS++   + N+  N N GIDQE+D +
Subjt:  SYATICGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNA

XP_011652531.1 protein ENL [Cucumis sativus]1.9e-6447.68Show/hide
Query:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPR---NVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSE--PSSPIPPPTP--TPTTRSQPTTP
        M+ RGIM PKRS+FSPK  +K     AK +  K+P+   N   S         P P  AATNP+ N +KT   SE   S+P PPPTP  TP  +SQP TP
Subjt:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPR---NVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSE--PSSPIPPPTP--TPTTRSQPTTP

Query:  SSS-------NTNVVRQGGLSFDNPSVSKSF---------HSPKDYHKSSPGSWLG-LQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQ
        SS        N +  R+       PS +            +S K + K+SP S     QK V T+ DP SP  S ++ D+ DRLLQRLS +GKD+ DIL+
Subjt:  SSS-------NTNVVRQGGLSFDNPSVSKSF---------HSPKDYHKSSPGSWLG-LQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQ

Query:  GKSIYDIMGSN--KKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATI
        G +I D+MGSN  K+E +S+++  L +LQIY++IASHRQ NL VE YF+KL  LW+++  Y ++  +      I   SELTER KV+QF +GLND Y+ I
Subjt:  GKSIYDIMGSN--KKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATI

Query:  CGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNA
        C QILV +PFPTVEEAYSEII EEKRREL  AL TVAA+VIQS++   + N+  N N GIDQE+D +
Subjt:  CGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNA

XP_038895148.1 proline-rich receptor-like protein kinase PERK2 isoform X1 [Benincasa hispida]7.9e-188100Show/hide
Query:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNTNV
        MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNTNV
Subjt:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNTNV

Query:  VRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQGKSIYDIMGSNKKEETSQSIDGLR
        VRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQGKSIYDIMGSNKKEETSQSIDGLR
Subjt:  VRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQGKSIYDIMGSNKKEETSQSIDGLR

Query:  VLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKR
        VLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKR
Subjt:  VLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKR

Query:  RELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNANLGSPM
        RELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNANLGSPM
Subjt:  RELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNANLGSPM

XP_038895149.1 proline-rich receptor-like protein kinase PERK2 isoform X2 [Benincasa hispida]4.8e-18599.42Show/hide
Query:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNTNV
        MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNTNV
Subjt:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNTNV

Query:  VRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQGKSIYDIMGSNKKEETSQSIDGLR
        VRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFD  DVADILQGKSIYDIMGSNKKEETSQSIDGLR
Subjt:  VRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQGKSIYDIMGSNKKEETSQSIDGLR

Query:  VLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKR
        VLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKR
Subjt:  VLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKR

Query:  RELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNANLGSPM
        RELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNANLGSPM
Subjt:  RELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNANLGSPM

XP_038895286.1 hybrid signal transduction histidine kinase L-like isoform X2 [Benincasa hispida]5.0e-4941.67Show/hide
Query:  RGIMSPKRSRFS------------PKMTKKGTPQPAKPAKNKHPRNVTG---SKIEVTPIPNPT----------PPRAATNPS-----PNHSKTQSRSEP
        RGI+SP R++FS            P    K   +  KP   +  +N  G   S    T   NP+          P  A +NP+       +S T+  S+P
Subjt:  RGIMSPKRSRFS------------PKMTKKGTPQPAKPAKNKHPRNVTG---SKIEVTPIPNPT----------PPRAATNPS-----PNHSKTQSRSEP

Query:  SSPIPPPTPTPTTRSQPTTPSSSNTNVVRQGGLSFDNPSVSKS--FH----SPKDY---HKSSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRLLQRL
        SSP  P   T  TR  P +  +    +    G   +  S  K+  FH    SP D    H+ S G +  L  D++ S                   LQRL
Subjt:  SSPIPPPTPTPTTRSQPTTPSSSNTNVVRQGGLSFDNPSVSKS--FH----SPKDY---HKSSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRLLQRL

Query:  SFDGKDVAD-ILQGKSIYDIMGSNKKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQF
        S DGKD+A  +L   SIY+ MGS+ KE +S   +  R+ QIYKEIA HRQEN  +  YF KL ALWDEL+ + TDL QCS GGA + +SE  ER+KV+QF
Subjt:  SFDGKDVAD-ILQGKSIYDIMGSNKKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQF

Query:  LVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWL-LKNQNSRS----NNNRGIDQEVDNAN
        LVGLNDSY+ IC QIL+  PFPT+E+AYS +I EEK RELV  LE+VA KVIQ+NWL L+NQN+ S    +NN G+ Q VD++N
Subjt:  LVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWL-LKNQNSRS----NNNRGIDQEVDNAN

TrEMBL top hitse value%identityAlignment
A0A0A0LRE6 Uncharacterized protein9.2e-6547.68Show/hide
Query:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPR---NVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSE--PSSPIPPPTP--TPTTRSQPTTP
        M+ RGIM PKRS+FSPK  +K     AK +  K+P+   N   S         P P  AATNP+ N +KT   SE   S+P PPPTP  TP  +SQP TP
Subjt:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPR---NVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSE--PSSPIPPPTP--TPTTRSQPTTP

Query:  SSS-------NTNVVRQGGLSFDNPSVSKSF---------HSPKDYHKSSPGSWLG-LQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQ
        SS        N +  R+       PS +            +S K + K+SP S     QK V T+ DP SP  S ++ D+ DRLLQRLS +GKD+ DIL+
Subjt:  SSS-------NTNVVRQGGLSFDNPSVSKSF---------HSPKDYHKSSPGSWLG-LQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQ

Query:  GKSIYDIMGSN--KKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATI
        G +I D+MGSN  K+E +S+++  L +LQIY++IASHRQ NL VE YF+KL  LW+++  Y ++  +      I   SELTER KV+QF +GLND Y+ I
Subjt:  GKSIYDIMGSN--KKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATI

Query:  CGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNA
        C QILV +PFPTVEEAYSEII EEKRREL  AL TVAA+VIQS++   + N+  N N GIDQE+D +
Subjt:  CGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNA

A0A0A0LU31 Uncharacterized protein7.3e-6248.39Show/hide
Query:  RGIMSP--------KRSRFSPKMTKKGTPQPAKPAKNKHPR---NVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTR---SQ
        RGI  P        ++   SP  TKK T    K + NKHP    N   S         P P  AA N +PN     S S PSSP PPPTP PTTR   SQ
Subjt:  RGIMSP--------KRSRFSPKMTKKGTPQPAKPAKNKHPR---NVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTR---SQ

Query:  PTTPS-------SSNTNVVRQGGLSFDNPSVSK---------SFHSPKDYHKSSPGSWLGL-QKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVA
        P TPS       S   +  R+       PS S          + +SPK   K SP S     +K V T+  P+SPASS  ++D+  RLLQ LSF+GKD+ 
Subjt:  PTTPS-------SSNTNVVRQGGLSFDNPSVSK---------SFHSPKDYHKSSPGSWLGL-QKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVA

Query:  DILQGKSIYDIMGS-NKKEETS-QSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQ-CSSGGAIQNVSELTERQKVVQFLVGLND
        DIL+G SI D+MGS NKKEE+S +++  L +LQIY++IASHRQ NL VE YF+KL  LW+++  Y +D AQ  SS G I   SELTER KV+QF +GLND
Subjt:  DILQGKSIYDIMGS-NKKEETS-QSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQ-CSSGGAIQNVSELTERQKVVQFLVGLND

Query:  SYATICGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNA
         Y+ IC QILV +PFPTVEEAYSEII EEKRREL  AL  +AA+VIQS++   + N+  N N GIDQE+D +
Subjt:  SYATICGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNA

A0A6J1C5Z8 uncharacterized protein LOC1110085889.5e-4640.51Show/hide
Query:  RGIMSPKRSRFSPKMTK---KGTPQ-PAKP-----------AKNKH-----------PRNVTGSKIEVTPIPNPTPPRA-ATNPSPNHSKTQSR------
        RG++SP RSR SP+  +    G P  P++P           A+ +H            R    S   + P P PT P   +  P+PN    Q +      
Subjt:  RGIMSPKRSRFSPKMTK---KGTPQ-PAKP-----------AKNKH-----------PRNVTGSKIEVTPIPNPTPPRA-ATNPSPNHSKTQSR------

Query:  -----SEPSSP-----------IPPPTPTPTTRSQPTTPSSSNTNVVRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTAS
             ++PSSP            PP  PTP +++  TTP       +     S D+P+ +    SP   H  SPGS      D+       SP S  T +
Subjt:  -----SEPSSP-----------IPPPTPTPTTRSQPTTPSSSNTNVVRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTAS

Query:  DLSD----RLLQRLSFDGKDVAD-ILQGKSIYDIMGSNKKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQ
         L+D      LQRLS DGKD+A  IL   SIY+ +GS+  EE+ QS +  R+ QIYK+IASHRQEN  V  YF KL  LWDEL  Y  D+ QC S GA++
Subjt:  DLSD----RLLQRLSFDGKDVAD-ILQGKSIYDIMGSNKKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQ

Query:  NVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVD
         +S   ER+KV+QFL+GLN+SY+TIC QIL+ +PFPT+E+AYS II EEKR ELV +LE VAAKV+++ WLL+N  S +  + GI +EV+
Subjt:  NVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVD

A0A6J1C7L7 uncharacterized protein LOC1110089864.9e-2636.71Show/hide
Query:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPT----TPSSS
        M GRG++SP +SRFS   +       A P    +  + T  +     + NP   +  T+  P       R+  +S  P PTP   +R +PT     P+  
Subjt:  MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPT----TPSSS

Query:  NTNVVRQGGLSFDNPSVSKSFHSPKDYHK----------SSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRL--LQRLSFDGKDVAD-ILQGKSIYDI
        N N   +  ++    S + + +  +   +          SS GS  G  +D N +        S T       +  LQ+LS DGK  A  + +  S+ + 
Subjt:  NTNVVRQGGLSFDNPSVSKSFHSPKDYHK----------SSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRL--LQRLSFDGKDVAD-ILQGKSIYDI

Query:  MGSNKKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAI-QNVSELTERQKVVQFLVGLNDSYATICGQILVKR
        +G   KEE S   +  R+L+IYK+IASHRQ N  +  YF KL  LW+EL  Y +DL QC S  A  Q  S+L ER+KV+QFLVGLNDSY+TIC QIL+ R
Subjt:  MGSNKKEETSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAI-QNVSELTERQKVVQFLVGLNDSYATICGQILVKR

Query:  PFPTVEEAYSEIIGEE
        PFPTVE+AYS II +E
Subjt:  PFPTVEEAYSEIIGEE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like2.7e-4041.88Show/hide
Query:  MSPKRSRFSP-----KMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPI-PNP--TPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNT
        MSP+R   +P       T +  PQP    +   P + + +   + P  PN    P R A  PSPN  K  +++ P +     +P PT   +P TP S + 
Subjt:  MSPKRSRFSP-----KMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPI-PNP--TPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNT

Query:  NVVRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKD---VNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADI-LQGKSIYDIMGSNKKEE--T
             G  S  + S +K   S K   K+     L  Q+D   V +  D S  A   T SD     L +LS D KD+A+I L    +Y+ + S  KEE  +
Subjt:  NVVRQGGLSFDNPSVSKSFHSPKDYHKSSPGSWLGLQKD---VNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADI-LQGKSIYDIMGSNKKEE--T

Query:  SQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYS
        SQ  +  R+ QIYKEIASH Q N  +  Y  KL ALWDEL  YI D  +CS  G+ +  SE  ER+KV+QFL+GLNDSY+TIC QIL  +PFPTVE+A  
Subjt:  SQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYS

Query:  EIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNANL
         I+ EEKRRELV +LE VAAKVIQ+NWLL+N +S + +N    +EVD+ NL
Subjt:  EIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNNNRGIDQEVDNANL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).6.0e-0830.39Show/hide
Query:  LRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGG----AIQNVSELTERQKVVQFLVG--LNDSYATICGQILVKRPFPTVEEAYS
        L++ Q+ + +A+ RQ    VE YF KL+ +W ELS Y   + +C  GG      +   E  E+++  +FL+G  LN  +  +  +I+ ++P P++ EA++
Subjt:  LRVLQIYKEIASHRQENLFVEPYFRKLNALWDELSFYITDLAQCSSGG----AIQNVSELTERQKVVQFLVG--LNDSYATICGQILVKRPFPTVEEAYS

Query:  EI
         +
Subjt:  EI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGAAGGGGAATTATGAGTCCCAAAAGATCCCGATTTTCTCCGAAAATGACCAAAAAGGGTACTCCACAGCCAGCAAAACCAGCCAAGAACAAACATCCCAGAAA
TGTTACTGGATCCAAAATAGAAGTCACTCCAATTCCAAACCCAACGCCACCAAGGGCAGCCACAAATCCTAGTCCAAACCACAGCAAAACTCAGTCCAGATCGGAACCAT
CGTCCCCGATTCCACCTCCAACTCCAACTCCGACGACAAGAAGTCAACCCACCACCCCTTCTTCTTCAAATACCAATGTTGTAAGACAAGGTGGACTCTCCTTTGATAAC
CCTTCTGTTTCAAAATCCTTTCATTCTCCAAAAGATTATCACAAAAGTTCTCCGGGCTCTTGGTTGGGTCTTCAGAAAGATGTCAACACTAGTCCTGATCCTTCTTCCCC
TGCTTCTTCTCATACTGCTTCTGATCTTAGTGATCGCTTGTTACAACGCCTTTCTTTTGACGGTAAAGATGTTGCGGACATCCTTCAAGGAAAGTCAATATATGATATAA
TGGGCTCAAATAAAAAGGAAGAAACATCTCAAAGCATTGATGGTTTGAGAGTACTTCAAATTTACAAGGAAATTGCATCTCATCGCCAAGAAAACTTATTCGTTGAACCT
TACTTCAGAAAGCTCAATGCATTATGGGATGAACTCTCATTCTATATTACTGATTTGGCTCAATGTTCCAGCGGCGGTGCAATTCAAAATGTTAGTGAGCTTACGGAGAG
ACAAAAGGTTGTGCAATTTCTTGTGGGATTAAATGATTCTTATGCCACAATTTGCGGCCAAATCCTTGTTAAGAGGCCATTTCCGACGGTGGAGGAAGCCTATTCTGAAA
TAATTGGAGAAGAAAAACGTAGGGAATTGGTTGCTGCATTAGAAACTGTGGCTGCAAAAGTAATCCAAAGCAATTGGCTTCTTAAAAATCAAAATAGTCGATCCAATAAT
AATCGTGGTATTGATCAAGAAGTTGATAATGCTAACCTTGGATCTCCTATGTGA
mRNA sequenceShow/hide mRNA sequence
CAAGCTTCGTAGTTGGGGATAACCAGTGAAGGTTTGGGCATTTGTCCATTTCCCATCTGATCAAAGAATTGAGAGGTGAAGGACCAAGCGAGTGAGAGAGAAATGGAGGG
AAGGGGAATTATGAGTCCCAAAAGATCCCGATTTTCTCCGAAAATGACCAAAAAGGGTACTCCACAGCCAGCAAAACCAGCCAAGAACAAACATCCCAGAAATGTTACTG
GATCCAAAATAGAAGTCACTCCAATTCCAAACCCAACGCCACCAAGGGCAGCCACAAATCCTAGTCCAAACCACAGCAAAACTCAGTCCAGATCGGAACCATCGTCCCCG
ATTCCACCTCCAACTCCAACTCCGACGACAAGAAGTCAACCCACCACCCCTTCTTCTTCAAATACCAATGTTGTAAGACAAGGTGGACTCTCCTTTGATAACCCTTCTGT
TTCAAAATCCTTTCATTCTCCAAAAGATTATCACAAAAGTTCTCCGGGCTCTTGGTTGGGTCTTCAGAAAGATGTCAACACTAGTCCTGATCCTTCTTCCCCTGCTTCTT
CTCATACTGCTTCTGATCTTAGTGATCGCTTGTTACAACGCCTTTCTTTTGACGGTAAAGATGTTGCGGACATCCTTCAAGGAAAGTCAATATATGATATAATGGGCTCA
AATAAAAAGGAAGAAACATCTCAAAGCATTGATGGTTTGAGAGTACTTCAAATTTACAAGGAAATTGCATCTCATCGCCAAGAAAACTTATTCGTTGAACCTTACTTCAG
AAAGCTCAATGCATTATGGGATGAACTCTCATTCTATATTACTGATTTGGCTCAATGTTCCAGCGGCGGTGCAATTCAAAATGTTAGTGAGCTTACGGAGAGACAAAAGG
TTGTGCAATTTCTTGTGGGATTAAATGATTCTTATGCCACAATTTGCGGCCAAATCCTTGTTAAGAGGCCATTTCCGACGGTGGAGGAAGCCTATTCTGAAATAATTGGA
GAAGAAAAACGTAGGGAATTGGTTGCTGCATTAGAAACTGTGGCTGCAAAAGTAATCCAAAGCAATTGGCTTCTTAAAAATCAAAATAGTCGATCCAATAATAATCGTGG
TATTGATCAAGAAGTTGATAATGCTAACCTTGGATCTCCTATGTGAAGTTGAAGATTTTCAGCTCAATGAAAATTCAGGGTAAACTAGTCCAAACTATTGATCATCATGG
TGAGACTTCACGAGGAAAATGTGTTACCTCTACAATGGTCGAGTCAGATAATGTATTTTACTTCTTCATTATGTGTGGCTTGTTTCAAGTGAATAAGAGTATTAATGTGA
CTCATTTATTCGAGCCCTTCTTG
Protein sequenceShow/hide protein sequence
MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNVTGSKIEVTPIPNPTPPRAATNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNTNVVRQGGLSFDN
PSVSKSFHSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTASDLSDRLLQRLSFDGKDVADILQGKSIYDIMGSNKKEETSQSIDGLRVLQIYKEIASHRQENLFVEP
YFRKLNALWDELSFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYSEIIGEEKRRELVAALETVAAKVIQSNWLLKNQNSRSNN
NRGIDQEVDNANLGSPM