CuGenDBv2

Cmc04g0104411 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0104411
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr04:22114240..22115376
RNA-Seq Expression: Cmc04g0104411
Synteny: Cmc04g0104411
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
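The genome location above is written as `sequence:start..end`. The following is a minimal Python sketch for splitting such a string and computing the span. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("CMiso1.1chr04:22114240..22115376")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus spans 1,137 bp.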


Homology
GenBank top hits
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] | e value: 2.8e-170 | % identity: 84.02
Query:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGFI
        VD P+  KPIGCKWIYKRKRDQAGKVQTFKA+LV K YTQKEGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDV TAFLN  L+ESIYMVQPEGFI
Subjt:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGFI

Query:  -----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLG
             QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYG EQNVDEPCVYK+I+NS VAFLILYVDDILLIGNDV +LTD+KKWL TQFQMKDLG AQ +LG
Subjt:  -----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLG

Query:  IQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCD
        IQIV+NRKNKTLAMSQ SYIDK+LSRYKMQNSK G L + + IHL KEQC KTPQ+VEDM NIPY+S V SLMYAMLCTRPDICYSVG+VSRYQS PG D
Subjt:  IQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCD

Query:  HWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        HWT +KNI KYLRRT++YML+Y +KDLILTGY DSDFQSDKDARKSTSGSVFTLNGGAVVWR+
Subjt:  HWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 3.7e-162 | % identity: 77.84
Query:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM
        F    + VD P G KPIGCKWIYKRKRD AGKVQTFKA+LV K YTQ+EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDV TAFLN  L+ESI+M
Subjt:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM

Query:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG
         QPEGFI     QKVCKL +SIYGLKQASRSWNIRFDTAIKSYG +QNVDEPCVYK+I    VAFL+LYVDDILLIGNDVG+LTD+K WLA QFQMKDLG
Subjt:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG

Query:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY
         AQ VLGIQI+++RKNKTLA+SQ +YIDK+L RY MQNSK GLL + + +HL KEQ  KTPQ+VEDM  IPYAS V SLMYAMLCTRPDICY+VG+VSRY
Subjt:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY

Query:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        QS PG DHWT +K + KYLRRT+DYML+Y +KDLILTGY DSDFQ+DKD+RKSTSGSVFTLNGGAVVWR+
Subjt:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 3.7e-162 | % identity: 77.84
Query:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM
        F    + VD P G KPIGCKWIYKRKRD AGKVQTFKA+LV K YTQ+EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDV TAFLN  L+ESI+M
Subjt:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM

Query:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG
         QPEGFI     QKVCKL +SIYGLKQASRSWNIRFDTAIKSYG +QNVDEPCVYK+I    VAFL+LYVDDILLIGNDVG+LTD+K WLA QFQMKDLG
Subjt:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG

Query:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY
         AQ VLGIQI+++RKNKTLA+SQ +YIDK+L RY MQNSK GLL + + +HL KEQ  KTPQ+VEDM  IPYAS V SLMYAMLCTRPDICY+VG+VSRY
Subjt:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY

Query:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        QS PG DHWT +K + KYLRRT+DYML+Y +KDLILTGY DSDFQ+DKD+RKSTSGSVFTLNGGAVVWR+
Subjt:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 6.5e-159 | % identity: 76.76
Query:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM
        F    D VDQP+G KPIGCKWIYKRKR   GKVQTFKA+LV K YTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDV TAFLN  L+E+IYM
Subjt:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM

Query:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG
         QPEGFI     QK+CKL +SIYGLKQASRSWNIRFDTAIKSYG  Q VDEPCVYKRIIN +VAFL+LYVDDILLIGND+G LTDIK+WLATQFQMKDLG
Subjt:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG

Query:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY
         AQ VLGIQI ++RKNK LA+SQ SYIDK++ +Y MQNSK GLL + + + L KEQC KTPQ VE+M +IPYAS V+SLMYAMLCTRPDICY+VG+VSRY
Subjt:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY

Query:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        QS PG  HWT +K I KYLRRT+DYML+Y SKDLILTGY DSDFQ+D+D+RKSTSGSVFTLNGGAVVWR+
Subjt:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 3.8e-167 | % identity: 83.47
Query:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGFI
        VDQPN  KPIGCKWIYKRKRDQAGKVQTFKA+LV K YTQKEG+DYEE FS  AM+KSIRILLSIATFYDYEIWQMDV T FLN  L+ESIYMVQPE FI
Subjt:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGFI

Query:  -----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLG
             QK+CKLQKSIYGLKQASRS NIRFDTAIKSYG EQNVDEPCVYKRI+NSTVAFL+LYVDDILLIGNDVGHL DIKKWLA QFQMKDLGNAQ VLG
Subjt:  -----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLG

Query:  IQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCD
        +QIV+NRKNKTLAMSQTSYIDKMLSRYKM NSK GLL Y Y IHL KEQC KTPQ+VEDMSNIPYAS V SLMY MLCTRP+ICYSVG+VSR QS PG D
Subjt:  IQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCD

Query:  HWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        HWT +KNI KYLRRT+DYML+Y SKDLILTGY D  FQ+DKDARKSTSG VFT+NGGAVVWR+
Subjt:  HWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

TrEMBL top hits
A0A5A7TZD0 Gag/pol protein | e value: 1.8e-162 | % identity: 77.84
Query:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM
        F    + VD P G KPIGCKWIYKRKRD AGKVQTFKA+LV K YTQ+EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDV TAFLN  L+ESI+M
Subjt:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM

Query:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG
         QPEGFI     QKVCKL +SIYGLKQASRSWNIRFDTAIKSYG +QNVDEPCVYK+I    VAFL+LYVDDILLIGNDVG+LTD+K WLA QFQMKDLG
Subjt:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG

Query:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY
         AQ VLGIQI+++RKNKTLA+SQ +YIDK+L RY MQNSK GLL + + +HL KEQ  KTPQ+VEDM  IPYAS V SLMYAMLCTRPDICY+VG+VSRY
Subjt:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY

Query:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        QS PG DHWT +K + KYLRRT+DYML+Y +KDLILTGY DSDFQ+DKD+RKSTSGSVFTLNGGAVVWR+
Subjt:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

A0A5A7UYE8 Gag/pol protein | e value: 1.8e-162 | % identity: 77.84
Query:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM
        F    + VD P G KPIGCKWIYKRKRD AGKVQTFKA+LV K YTQ+EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDV TAFLN  L+ESI+M
Subjt:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM

Query:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG
         QPEGFI     QKVCKL +SIYGLKQASRSWNIRFDTAIKSYG +QNVDEPCVYK+I    VAFL+LYVDDILLIGNDVG+LTD+K WLA QFQMKDLG
Subjt:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG

Query:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY
         AQ VLGIQI+++RKNKTLA+SQ +YIDK+L RY MQNSK GLL + + +HL KEQ  KTPQ+VEDM  IPYAS V SLMYAMLCTRPDICY+VG+VSRY
Subjt:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY

Query:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        QS PG DHWT +K + KYLRRT+DYML+Y +KDLILTGY DSDFQ+DKD+RKSTSGSVFTLNGGAVVWR+
Subjt:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

A0A5A7V4M1 Gag/pol protein | e value: 3.1e-159 | % identity: 76.76
Query:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM
        F    D VDQP+G KPIGCKWIYKRKR   GKVQTFKA+LV K YTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDV TAFLN  L+E+IYM
Subjt:  FQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYM

Query:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG
         QPEGFI     QK+CKL +SIYGLKQASRSWNIRFDTAIKSYG  Q VDEPCVYKRIIN +VAFL+LYVDDILLIGND+G LTDIK+WLATQFQMKDLG
Subjt:  VQPEGFI-----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLG

Query:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY
         AQ VLGIQI ++RKNK LA+SQ SYIDK++ +Y MQNSK GLL + + + L KEQC KTPQ VE+M +IPYAS V+SLMYAMLCTRPDICY+VG+VSRY
Subjt:  NAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRY

Query:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        QS PG  HWT +K I KYLRRT+DYML+Y SKDLILTGY DSDFQ+D+D+RKSTSGSVFTLNGGAVVWR+
Subjt:  QSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

A0A5D3BX45 Gag/pol protein | e value: 1.8e-167 | % identity: 83.47
Query:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGFI
        VDQPN  KPIGCKWIYKRKRDQAGKVQTFKA+LV K YTQKEG+DYEE FS  AM+KSIRILLSIATFYDYEIWQMDV T FLN  L+ESIYMVQPE FI
Subjt:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGFI

Query:  -----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLG
             QK+CKLQKSIYGLKQASRS NIRFDTAIKSYG EQNVDEPCVYKRI+NSTVAFL+LYVDDILLIGNDVGHL DIKKWLA QFQMKDLGNAQ VLG
Subjt:  -----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLG

Query:  IQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCD
        +QIV+NRKNKTLAMSQTSYIDKMLSRYKM NSK GLL Y Y IHL KEQC KTPQ+VEDMSNIPYAS V SLMY MLCTRP+ICYSVG+VSR QS PG D
Subjt:  IQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCD

Query:  HWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        HWT +KNI KYLRRT+DYML+Y SKDLILTGY D  FQ+DKDARKSTSG VFT+NGGAVVWR+
Subjt:  HWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

E2GK51 Gag/pol protein (Fragment) | e value: 1.4e-170 | % identity: 84.02
Query:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGFI
        VD P+  KPIGCKWIYKRKRDQAGKVQTFKA+LV K YTQKEGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDV TAFLN  L+ESIYMVQPEGFI
Subjt:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGFI

Query:  -----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLG
             QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYG EQNVDEPCVYK+I+NS VAFLILYVDDILLIGNDV +LTD+KKWL TQFQMKDLG AQ +LG
Subjt:  -----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLG

Query:  IQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCD
        IQIV+NRKNKTLAMSQ SYIDK+LSRYKMQNSK G L + + IHL KEQC KTPQ+VEDM NIPY+S V SLMYAMLCTRPDICYSVG+VSRYQS PG D
Subjt:  IQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCD

Query:  HWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        HWT +KNI KYLRRT++YML+Y +KDLILTGY DSDFQSDKDARKSTSGSVFTLNGGAVVWR+
Subjt:  HWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

SwissProt top hits
P04146 Copia protein | e value: 7.4e-57 | % identity: 36.12
Query:  QPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGF---
        +P     +  +W++  K ++ G    +KA+LV + +TQK  +DYEETF+PVA + S R +LS+   Y+ ++ QMDV TAFLN  LKE IYM  P+G    
Subjt:  QPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGF---

Query:  IQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVY---KRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLGI
           VCKL K+IYGLKQA+R W   F+ A+K      +  + C+Y   K  IN  + +++LYVDD+++   D+  + + K++L  +F+M DL   +  +GI
Subjt:  IQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVY---KRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLGI

Query:  QIVQNRKNKTLAMSQTSYIDKMLSRYKMQNS---KMGLLSYI-YEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIP
        +I    +   + +SQ++Y+ K+LS++ M+N       L S I YE+    E C           N P  S++  LMY MLCTRPD+  +V ++SRY S  
Subjt:  QIVQNRKNKTLAMSQTSYIDKMLSRYKMQNS---KMGLLSYI-YEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIP

Query:  GCDHWTVIKNIQKYLRRTQDYMLLYDSKDLI----LTGYIDSDFQSDKDARKSTSGSVFTL-NGGAVVWRT
          + W  +K + +YL+ T D  L++  K+L     + GY+DSD+   +  RKST+G +F + +   + W T
Subjt:  GCDHWTVIKNIQKYLRRTQDYMLLYDSKDLI----LTGYIDSDFQSDKDARKSTSGSVFTL-NGGAVVWRT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 6.2e-88 | % identity: 45.33
Query:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGF-
        V+ P G +P+ CKW++K K+D   K+  +KA+LV K + QK+G+D++E FSPV  + SIR +LS+A   D E+ Q+DV TAFL+  L+E IYM QPEGF 
Subjt:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKESIYMVQPEGF-

Query:  ----IQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVY-KRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVL
               VCKL KS+YGLKQA R W ++FD+ +KS    +   +PCVY KR   +    L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +L
Subjt:  ----IQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVY-KRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVL

Query:  GIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGC
        G++IV+ R ++ L +SQ  YI+++L R+ M+N+K         + L K+ C  T ++  +M+ +PY+S V SLMYAM+CTRPDI ++VG+VSR+   PG 
Subjt:  GIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGC

Query:  DHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        +HW  +K I +YLR T    L +   D IL GY D+D   D D RKS++G +FT +GGA+ W++
Subjt:  DHWTVIKNIQKYLRRTQDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

P25600 Putative transposon Ty5-1 protein YCL074W | e value: 2.5e-28 | % identity: 31.71
Query:  MDVNTAFLNEYLKESIYMVQPEGFIQK-----VCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGH
        MDV+TAFLN  + E IY+ QP GF+ +     V +L   +YGLKQA   WN   +  +K  G  ++  E  +Y R  +    ++ +YVDD+L+       
Subjt:  MDVNTAFLNEYLKESIYMVQPEGFIQK-----VCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGH

Query:  LTDIKKWLATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYA
           +K+ L   + MKDLG     LG+ I Q+  N  + +S   YI K  S  ++   K+          L +     T   ++D++  PY S+V  L++ 
Subjt:  LTDIKKWLATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYA

Query:  MLCTRPDICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDYMLLYDS-KDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVW
            RPDI Y V ++SR+   P   H    + + +YL  T+   L Y S   L LT Y D+   +  D   ST G V  L G  V W
Subjt:  MLCTRPDICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDYMLLYDS-KDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.1e-49 | % identity: 33.95
Query:  SLGPRIGIYVFQFCMDSV-DQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAF
        ++G  I   +     D V   P+    +GC+WI+ +K +  G +  +KA+LV K Y Q+ G+DY ETFSPV    SIRI+L +A    + I Q+DVN AF
Subjt:  SLGPRIGIYVFQFCMDSV-DQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAF

Query:  LNEYLKESIYMVQPEGFIQK-----VCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKW
        L   L + +YM QP GFI K     VCKL+K++YGLKQA R+W +     + + G   +V +  ++      ++ ++++YVDDIL+ GND   L +    
Subjt:  LNEYLKESIYMVQPEGFIQK-----VCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKW

Query:  LATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPD
        L+ +F +KD       LGI+    R    L +SQ  YI  +L+R  M  +K           L      K     E      Y  +V SL Y +  TRPD
Subjt:  LATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPD

Query:  ICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDY-MLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVW
        I Y+V  +S++  +P  +H   +K I +YL  T ++ + L     L L  Y D+D+  DKD   ST+G +  L    + W
Subjt:  ICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDY-MLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 2.9e-53 | % identity: 34.12
Query:  QSLGPRIGIYVFQFCMDSVDQPNGAKPI-GCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTA
        Q++G  I   +     D V  P  +  I GC+WI+ +K +  G +  +KA+LV K Y Q+ G+DY ETFSPV    SIRI+L +A    + I Q+DVN A
Subjt:  QSLGPRIGIYVFQFCMDSVDQPNGAKPI-GCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTA

Query:  FLNEYLKESIYMVQPEGFIQK-----VCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKK
        FL   L + +YM QP GF+ K     VC+L+K+IYGLKQA R+W +   T + + G   ++ +  ++      ++ ++++YVDDIL+ GND   L     
Subjt:  FLNEYLKESIYMVQPEGFIQK-----VCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKK

Query:  WLATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRP
         L+ +F +K+  +    LGI+    R  + L +SQ  Y   +L+R  M  +K           L      K P   E      Y  +V SL Y +  TRP
Subjt:  WLATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRP

Query:  DICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDY-MLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVW
        D+ Y+V  +S+Y  +P  DHW  +K + +YL  T D+ + L     L L  Y D+D+  D D   ST+G +  L    + W
Subjt:  DICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDY-MLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVW

Arabidopsis top hits
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 8.7e-53 | % identity: 31.19
Query:  MDQSLGPRIGIYVFQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNT
        MD  +G     + ++ C      P   KPIGCKW+YK K +  G ++ +KA+LV K YTQ+EG+D+ ETFSPV  L S++++L+I+  Y++ + Q+D++ 
Subjt:  MDQSLGPRIGIYVFQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNT

Query:  AFLNEYLKESIYMVQPEGFIQK---------VCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHL
        AFLN  L E IYM  P G+  +         VC L+KSIYGLKQASR W ++F   +  +G  Q+  +   + +I  +    +++YVDDI++  N+   +
Subjt:  AFLNEYLKESIYMVQPEGFIQK---------VCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHL

Query:  TDIKKWLATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAM
         ++K  L + F+++DLG  +  LG++I   R    + + Q  Y   +L    +   K         + +            + +    Y  ++  LMY  
Subjt:  TDIKKWLATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAM

Query:  LCTRPDICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSK-DLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
        + TR DI ++V  +S++   P   H   +  I  Y++ T    L Y S+ ++ L  + D+ FQS KD R+ST+G    L    + W++
Subjt:  LCTRPDICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDYMLLYDSK-DLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 2.3e-13 | % identity: 30.24
Query:  FLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQV
        +L+LYVDDILL G+    L  +   L++ F MKDLG     LGIQI  +     L +SQT Y +++L+   M + K      +     LK     +  + 
Subjt:  FLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLGIQIVQNRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQV

Query:  EDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDY-MLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNG
         D S+  + S+V +L Y  L TRPDI Y+V +V +    P    + ++K + +Y++ T  + + ++ +  L +  + DSD+      R+ST+G    L  
Subjt:  EDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRTQDY-MLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNG

Query:  GAVVW
          + W
Subjt:  GAVVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 2.9e-08 | % identity: 39.39
Query:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIA
        V  P     +GCKW++K K    G +   KA+LV K + Q+EG+ + ET+SPV    +IR +L++A
Subjt:  VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIA
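The %identity figures in the hit lists can be cross-checked against the alignment blocks. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and takes the aligned query row, gaps included, as the alignment length. `percent_identity` is a hypothetical helper, and the strings are copied from the ATMG00820.1 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters in the midline) over the aligned query length."""
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)

# Query row and midline of the ATMG00820.1 alignment.
query = "VDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIA"
midline = "V  P     +GCKW++K K    G +   KA+LV K + Q+EG+ + ET+SPV    +IR +L++A"

print(round(percent_identity(query, midline), 2))
```

For this alignment the midline carries 26 letters over 66 aligned positions, which reproduces the listed 39.39.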


Sequences
CDS sequence
ATGGATCAAAGCCTTGGACCTCGAATTGGAATCTATGTATTCCAATTCTGTATGGACTCTGTAGATCAACCAAATGGTGCAAAACCTATTGGTTGTAAATGGATCTACAA
GAGAAAACGAGATCAAGCTGGTAAAGTACAGACTTTCAAAGCTCAACTAGTGGAAAAATGTTATACACAAAAGGAGGGAGTGGATTATGAAGAAACTTTCTCTCCTGTTG
CCATGCTAAAGTCGATTAGAATACTCTTATCCATTGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAATACAGCCTTTTTGAACGAATATCTTAAGGAGAGT
ATCTATATGGTCCAACCAGAGGGGTTTATACAAAAGGTTTGTAAGCTTCAAAAATCCATATATGGATTAAAACAAGCATCAAGATCCTGGAATATAAGGTTTGATACTGC
CATCAAATCTTATGGTTGTGAACAAAATGTTGATGAACCTTGTGTTTATAAAAGGATCATCAATTCTACTGTAGCATTCTTAATTTTATATGTAGATGACATACTACTCA
TTGGGAATGATGTAGGTCATCTAACTGATATTAAGAAATGGCTAGCTACGCAATTCCAAATGAAAGATTTGGGAAATGCTCAAGACGTTCTTGGTATCCAAATAGTTCAA
AATCGAAAGAACAAAACACTAGCCATGTCTCAAACATCTTATATAGACAAAATGTTGTCAAGATATAAGATGCAGAATTCCAAAATGGGTCTGTTGTCGTACATATATGA
AATTCATTTATTAAAAGAACAATGTCTAAAGACACCTCAACAAGTTGAGGATATGAGTAACATTCCCTATGCTTCTGTTGTTGAGAGCCTGATGTATGCAATGTTATGTA
CTAGACCTGACATTTGTTATTCAGTAGGGATGGTTAGTAGATATCAGTCCATTCCTGGATGTGATCATTGGACAGTCATTAAGAATATTCAAAAATATCTTAGAAGAACA
CAAGACTACATGCTTCTGTATGATTCTAAGGATCTGATCCTTACTGGATACATTGACTCTGATTTTCAATCTGATAAAGATGCTAGAAAGTCTACATCGGGATCAGTTTT
CACTCTGAATGGAGGAGCGGTAGTGTGGAGAACATAA
mRNA sequence
ATGGATCAAAGCCTTGGACCTCGAATTGGAATCTATGTATTCCAATTCTGTATGGACTCTGTAGATCAACCAAATGGTGCAAAACCTATTGGTTGTAAATGGATCTACAA
GAGAAAACGAGATCAAGCTGGTAAAGTACAGACTTTCAAAGCTCAACTAGTGGAAAAATGTTATACACAAAAGGAGGGAGTGGATTATGAAGAAACTTTCTCTCCTGTTG
CCATGCTAAAGTCGATTAGAATACTCTTATCCATTGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAATACAGCCTTTTTGAACGAATATCTTAAGGAGAGT
ATCTATATGGTCCAACCAGAGGGGTTTATACAAAAGGTTTGTAAGCTTCAAAAATCCATATATGGATTAAAACAAGCATCAAGATCCTGGAATATAAGGTTTGATACTGC
CATCAAATCTTATGGTTGTGAACAAAATGTTGATGAACCTTGTGTTTATAAAAGGATCATCAATTCTACTGTAGCATTCTTAATTTTATATGTAGATGACATACTACTCA
TTGGGAATGATGTAGGTCATCTAACTGATATTAAGAAATGGCTAGCTACGCAATTCCAAATGAAAGATTTGGGAAATGCTCAAGACGTTCTTGGTATCCAAATAGTTCAA
AATCGAAAGAACAAAACACTAGCCATGTCTCAAACATCTTATATAGACAAAATGTTGTCAAGATATAAGATGCAGAATTCCAAAATGGGTCTGTTGTCGTACATATATGA
AATTCATTTATTAAAAGAACAATGTCTAAAGACACCTCAACAAGTTGAGGATATGAGTAACATTCCCTATGCTTCTGTTGTTGAGAGCCTGATGTATGCAATGTTATGTA
CTAGACCTGACATTTGTTATTCAGTAGGGATGGTTAGTAGATATCAGTCCATTCCTGGATGTGATCATTGGACAGTCATTAAGAATATTCAAAAATATCTTAGAAGAACA
CAAGACTACATGCTTCTGTATGATTCTAAGGATCTGATCCTTACTGGATACATTGACTCTGATTTTCAATCTGATAAAGATGCTAGAAAGTCTACATCGGGATCAGTTTT
CACTCTGAATGGAGGAGCGGTAGTGTGGAGAACATAA
Protein sequence
MDQSLGPRIGIYVFQFCMDSVDQPNGAKPIGCKWIYKRKRDQAGKVQTFKAQLVEKCYTQKEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVNTAFLNEYLKES
IYMVQPEGFIQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGCEQNVDEPCVYKRIINSTVAFLILYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQDVLGIQIVQ
NRKNKTLAMSQTSYIDKMLSRYKMQNSKMGLLSYIYEIHLLKEQCLKTPQQVEDMSNIPYASVVESLMYAMLCTRPDICYSVGMVSRYQSIPGCDHWTVIKNIQKYLRRT
QDYMLLYDSKDLILTGYIDSDFQSDKDARKSTSGSVFTLNGGAVVWRT
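The CDS and protein records can be checked against each other by translating the CDS. This is a minimal Python sketch that assumes the standard nuclear genetic code, which the page does not state explicitly. `translate` is a hypothetical helper, and only the first 30 nucleotides of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGGATCAAAGCCTTGGACCTCGAATTGGA"
print(translate(cds_start))
```

The result, MDQSLGPRIG, matches the first ten residues of the protein sequence. Passing the full CDS to the same function would allow the whole record to be compared.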