; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0104161 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0104161
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:21580174..21581505
RNA-Seq ExpressionCmc04g0104161
SyntenyCmc04g0104161
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]4.5e-19680.41Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKSNRKIEKKRKGKTPKQNKGK
        IDEANQVSFIL+SL KSF+PFQTNASLNKIEFNLT LLNELQRFQNLT  KGKEVEANVA TKRKF RGSSSK+K GPSK+  K  KK KGK P  +K K
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKSNRKIEKKRKGKTPKQNKGK

Query:  KTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMVSAKAVGD
        K A++GKC+HC ++GHW RNCPKYLA+KKAEK  QGK DLL + TCLVE + STWILDSGATNHICFSFQETSSWK L EGEITLKVGTGE+VSA+AVGD
Subjt:  KTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMVSAKAVGD

Query:  LKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVSFNAFLWH
        L LFF DRY+ILK+VLY P MKRNLISI+C+LEH+Y ISFE+NE FIL KGI ICS I ENNLYKLRPTRAN VLNTEMFRT+ET NK+QKVS NA+LWH
Subjt:  LKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVSFNAFLWH

Query:  LRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVYLIQNKFDSF
        LRL HINLNRI RLVKSG+L+QLEDNSLP C+SCLEGKMTKRSF GKGLRAK+PLELVHSDLCGPMNVKA GGYEYFI+FIDD+SRYGHVYL+ +K +SF
Subjt:  LRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVYLIQNKFDSF

Query:  EKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        EKFKEYKAEV+NE GKTIKT RSDRGGEYMD +FQDYLI
Subjt:  EKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]7.6e-15163.84Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG
        IDEA+QVSFILESL +SF+ F++NA +NKI + LT LLNELQ F++L K KG++ EANVAT+ RKF RGS+S +K+ PS S N+K +KK+ G+  K N  
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG

Query:  -----KKT-AEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV
             KKT A +G C+HC + GHW RNCPKYLA+KK  K +QGK DLL L TCLVEN++S WI+DSGATNH+C SFQ  SSW+ L  GE+T++VGTG +V
Subjt:  -----KKT-AEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV

Query:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS
        SA AVG L+L     +++L+NV   P +KRNLIS+ CLLE  Y ++F +N+ FI + G+ ICS  LENNLY LR   +  +LNTEMF+T  T NKR K+S
Subjt:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS

Query:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY
           NA LWHLRL HINLNRI RLVK+GLLS+LE+NSLP+C+SCLEGKMTKR F GKG RAK PLELVHSDLCGPMNVKA GG+EYFITF DDYSRYG+VY
Subjt:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY

Query:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        L+Q+K ++ EKFKEYKAEV+N   KTIKTFRSDRGGEYMDL+FQ+YL+
Subjt:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]9.9e-15163.17Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG
        IDEA+QVSFILESL +SF+ F++NA +NKI + LT LLNELQ F++L K KG++ EANVAT+ RKF RGS+S +K+ PS S N+K +KK+ G+  K N  
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG

Query:  ------KKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV
              K  A +G C+HC + GHW RNCPKYLA+KK  K +QGK DLL L TCLVEN++S WI+DSGATNH+C SFQ  SSW+ L  GE+T++VGTG +V
Subjt:  ------KKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV

Query:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS
        SA AVG L+L+    +++L+NV   P +KRNLIS+ CLLE  Y ++F +N+ FI + G+ ICS  LENNLY LR   +  +LNTEMF+T  T NKR K+S
Subjt:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS

Query:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY
           NA LWHLRL HINLNRI RLVK+GLLS+LE+NSLP+C+SCLEGKMTKR F GKG RAK PLELVHS+LCGPMNVKA GG+EYFITF DDYSRYG+VY
Subjt:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY

Query:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        L+Q+K ++ EKFKEYKAEV+N   KTIKTFRSDRGGEYMDL+FQ+YL+
Subjt:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]9.9e-15163.39Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG
        IDEA+QVSFILESL +SF+ F++NA +NKI + LT LLNELQ F++L K KG++ EANVAT+ RKF RGS+S +K+ PS S N+K +KK+ G+  K N  
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG

Query:  ------KKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV
              K  A +G C+HC + GHW RNCPKYLA+KK  K +QGK DLL L TCLVEN++S WI+DSGATNH+C SFQ  SSW+ L  GE+T++VGTG +V
Subjt:  ------KKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV

Query:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS
        SA AVG L+L     +++L+NV   P +KRNLIS+ CLLE  Y ++F +N+ FI + G+ ICS  LENNLY LR   +  +LNTEMF+T  T NKR K+S
Subjt:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS

Query:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY
           NA LWHLRL HINLNRI RLVK+GLLS+LE+NSLP+C+SCLEGKMTKR F GKG RAK PLELVHSDLCGPMNVKA GG+EYFITF DDYSRYG+VY
Subjt:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY

Query:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        L+Q+K ++ EKFKEYKAEV+N   KTIKTFRSDRGGEYMDL+FQ+YL+
Subjt:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]7.6e-15163.84Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG
        IDEA+QVSFILESL +SF+ F++NA +NKI + LT LLNELQ F++L K KG++ EANVAT+ RKF RGS+S +K+ PS S N+K +KK+ G+  K N  
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG

Query:  -----KKT-AEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV
             KKT A +G C+HC + GHW RNCPKYLA+KK  K +QGK DLL L TCLVEN++S WI+DSGATNH+C SFQ  SSW+ L  GE+T++VGTG +V
Subjt:  -----KKT-AEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV

Query:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS
        SA AVG L+L     +++L+NV   P +KRNLIS+ CLLE  Y ++F +N+ FI + G+ ICS  LENNLY LR   +  +LNTEMF+T  T NKR K+S
Subjt:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS

Query:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY
           NA LWHLRL HINLNRI RLVK+GLLS+LE+NSLP+C+SCLEGKMTKR F GKG RAK PLELVHSDLCGPMNVKA GG+EYFITF DDYSRYG+VY
Subjt:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY

Query:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        L+Q+K ++ EKFKEYKAEV+N   KTIKTFRSDRGGEYMDL+FQ+YL+
Subjt:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.8e-15163.39Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG
        IDEA+QVSFILESL +SF+ F++NA +NKI + LT LLNELQ F++L K KG++ EANVAT+ RKF RGS+S +K+ PS S N+K +KK+ G+  K N  
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG

Query:  ------KKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV
              K  A +G C+HC + GHW RNCPKYLA+KK  K +QGK DLL L TCLVEN++S WI+DSGATNH+C SFQ  SSW+ L  GE+T++VGTG +V
Subjt:  ------KKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV

Query:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS
        SA AVG L+L     +++L+NV   P +KRNLIS+ CLLE  Y ++F +N+ FI + G+ ICS  LENNLY LR   +  +LNTEMF+T  T NKR K+S
Subjt:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS

Query:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY
           NA LWHLRL HINLNRI RLVK+GLLS+LE+NSLP+C+SCLEGKMTKR F GKG RAK PLELVHSDLCGPMNVKA GG+EYFITF DDYSRYG+VY
Subjt:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY

Query:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        L+Q+K ++ EKFKEYKAEV+N   KTIKTFRSDRGGEYMDL+FQ+YL+
Subjt:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

A0A5A7V4M1 Gag/pol protein4.8e-15163.39Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG
        IDEA+QVSFILESL +SF+ F++NA +NKI + LT LLNELQ F++L K KG++ EANVAT+ RKF RGS+S +K+ PS S N+K +KK+ G+  K N  
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG

Query:  ------KKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV
              K  A +G C+HC + GHW RNCPKYLA+KK  K +QGK DLL L TCLVEN++S WI+DSGATNH+C SFQ  SSW+ L  GE+T++VGTG +V
Subjt:  ------KKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV

Query:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS
        SA AVG L+L     +++L+NV   P +KRNLIS+ CLLE  Y ++F +N+ FI + G+ ICS  LENNLY LR   +  +LNTEMF+T  T NKR K+S
Subjt:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS

Query:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY
           NA LWHLRL HINLNRI RLVK+GLLS+LE+NSLP+C+SCLEGKMTKR F GKG RAK PLELVHSDLCGPMNVKA GG+EYFITF DDYSRYG+VY
Subjt:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY

Query:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        L+Q+K ++ EKFKEYKAEV+N   KTIKTFRSDRGGEYMDL+FQ+YL+
Subjt:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

A0A5D3CPJ6 Gag/pol protein3.7e-15163.84Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG
        IDEA+QVSFILESL +SF+ F++NA +NKI + LT LLNELQ F++L K KG++ EANVAT+ RKF RGS+S +K+ PS S N+K +KK+ G+  K N  
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG

Query:  -----KKT-AEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV
             KKT A +G C+HC + GHW RNCPKYLA+KK  K +QGK DLL L TCLVEN++S WI+DSGATNH+C SFQ  SSW+ L  GE+T++VGTG +V
Subjt:  -----KKT-AEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV

Query:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS
        SA AVG L+L     +++L+NV   P +KRNLIS+ CLLE  Y ++F +N+ FI + G+ ICS  LENNLY LR   +  +LNTEMF+T  T NKR K+S
Subjt:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS

Query:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY
           NA LWHLRL HINLNRI RLVK+GLLS+LE+NSLP+C+SCLEGKMTKR F GKG RAK PLELVHSDLCGPMNVKA GG+EYFITF DDYSRYG+VY
Subjt:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY

Query:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        L+Q+K ++ EKFKEYKAEV+N   KTIKTFRSDRGGEYMDL+FQ+YL+
Subjt:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

A0A5D3CSZ6 Gag/pol protein3.7e-15163.84Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG
        IDEA+QVSFILESL +SF+ F++NA +NKI + LT LLNELQ F++L K KG++ EANVAT+ RKF RGS+S +K+ PS S N+K +KK+ G+  K N  
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKS-NRKIEKKRKGKTPKQNKG

Query:  -----KKT-AEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV
             KKT A +G C+HC + GHW RNCPKYLA+KK  K +QGK DLL L TCLVEN++S WI+DSGATNH+C SFQ  SSW+ L  GE+T++VGTG +V
Subjt:  -----KKT-AEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMV

Query:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS
        SA AVG L+L     +++L+NV   P +KRNLIS+ CLLE  Y ++F +N+ FI + G+ ICS  LENNLY LR   +  +LNTEMF+T  T NKR K+S
Subjt:  SAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVS

Query:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY
           NA LWHLRL HINLNRI RLVK+GLLS+LE+NSLP+C+SCLEGKMTKR F GKG RAK PLELVHSDLCGPMNVKA GG+EYFITF DDYSRYG+VY
Subjt:  --FNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVY

Query:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        L+Q+K ++ EKFKEYKAEV+N   KTIKTFRSDRGGEYMDL+FQ+YL+
Subjt:  LIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

E2GK51 Gag/pol protein (Fragment)2.2e-19680.41Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKSNRKIEKKRKGKTPKQNKGK
        IDEANQVSFIL+SL KSF+PFQTNASLNKIEFNLT LLNELQRFQNLT  KGKEVEANVA TKRKF RGSSSK+K GPSK+  K  KK KGK P  +K K
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKSNRKIEKKRKGKTPKQNKGK

Query:  KTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMVSAKAVGD
        K A++GKC+HC ++GHW RNCPKYLA+KKAEK  QGK DLL + TCLVE + STWILDSGATNHICFSFQETSSWK L EGEITLKVGTGE+VSA+AVGD
Subjt:  KTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMVSAKAVGD

Query:  LKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVSFNAFLWH
        L LFF DRY+ILK+VLY P MKRNLISI+C+LEH+Y ISFE+NE FIL KGI ICS I ENNLYKLRPTRAN VLNTEMFRT+ET NK+QKVS NA+LWH
Subjt:  LKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVSFNAFLWH

Query:  LRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVYLIQNKFDSF
        LRL HINLNRI RLVKSG+L+QLEDNSLP C+SCLEGKMTKRSF GKGLRAK+PLELVHSDLCGPMNVKA GGYEYFI+FIDD+SRYGHVYL+ +K +SF
Subjt:  LRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVYLIQNKFDSF

Query:  EKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        EKFKEYKAEV+NE GKTIKT RSDRGGEYMD +FQDYLI
Subjt:  EKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.3e-2427.81Show/hide
Query:  KKRKGKTPKQNKGKKTAEEGKCYHCGKNGHWLRNCPKY---LAQKKAEKKEQGKCDLLFLVTCLVENENST-------WILDSGATNHICFSFQETSSWK
        K R  K  K  KG  +  + KC+HCG+ GH  ++C  Y   L  K  E ++Q +      +  +V+  N+T       ++LDSGA++H+     + S + 
Subjt:  KKRKGKTPKQNKGKKTAEEGKCYHCGKNGHWLRNCPKY---LAQKKAEKKEQGKCDLLFLVTCLVENENST-------WILDSGATNHICFSFQETSSWK

Query:  MLSEGEITLKVGT---GEMVSAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANF
           E    LK+     GE + A   G ++L  ND  I L++VL+  +   NL+S+  L E    I F+       + G+    TI +N L  ++ +    
Subjt:  MLSEGEITLKVGT---GEMVSAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANF

Query:  VLNTEMFRTVETHNKRQKVSFNAFLWHLRLDHIN------LNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSF--IGKGLRAKMPLELVHSDLCGP
        +LN       + ++   K   N  LWH R  HI+      + R        LL+ LE  S  +C+ CL GK  +  F  +      K PL +VHSD+CGP
Subjt:  VLNTEMFRTVETHNKRQKVSFNAFLWHLRLDHIN------LNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSF--IGKGLRAKMPLELVHSDLCGP

Query:  MNVKAWGGYEYFITFIDDYSRYGHVYLIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI
        +         YF+ F+D ++ Y   YLI+ K D F  F+++ A+ +      +     D G EY+    + + +
Subjt:  MNVKAWGGYEYFITFIDDYSRYGHVYLIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYLI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-3427.71Show/hide
Query:  VQIDEANQVSFILESLSKSFIPFQTNASLNKIEFNL-----TILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKSNRKIEKKRKGKT
        V+I+E ++   +L SL  S+    T     K    L      +LLNE  R +   +G+    E       R ++R S++  ++G            +GK+
Subjt:  VQIDEANQVSFILESLSKSFIPFQTNASLNKIEFNL-----TILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKSNRKIEKKRKGKT

Query:  PKQNKGKKTAEEGKCYHCGKNGHWLRNCP---KYLAQKKAEKKEQGKCDL--------LFL---VTCL-VENENSTWILDSGATNHICFSFQETSSWKML
          ++K +       CY+C + GH+ R+CP   K   +   +K +     +        LF+     C+ +    S W++D+ A++H   +      +   
Subjt:  PKQNKGKKTAEEGKCYHCGKNGHWLRNCP---KYLAQKKAEKKEQGKCDL--------LFL---VTCL-VENENSTWILDSGATNHICFSFQETSSWKML

Query:  SEGEI-TLKVGTGEMVSAKAVGDLKLFFN-DRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKG-IHICSTILENNLYKLRPTRANFVL
          G+  T+K+G         +GD+ +  N    ++LK+V + P ++ NLIS   L    Y  S+  N+ + L KG + I   +    LY+          
Subjt:  SEGEI-TLKVGTGEMVSAKAVGDLKLFFN-DRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKG-IHICSTILENNLYKLRPTRANFVL

Query:  NTEMFRTVETHNKRQKVSFNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYE
        N E+ +  E +  + ++S +  LWH R+ H++   +  L K  L+S  +  ++  CD CL GK  + SF     R    L+LV+SD+CGPM +++ GG +
Subjt:  NTEMFRTVETHNKRQKVSFNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYE

Query:  YFITFIDDYSRYGHVYLIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDY
        YF+TFIDD SR   VY+++ K   F+ F+++ A V+ E+G+ +K  RSD GGEY    F++Y
Subjt:  YFITFIDDYSRYGHVYLIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDY

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein1.2e-0521.49Show/hide
Query:  VQIDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSK--SKAGPSKSNRKIEKKRKGKT---
        + + +      IL+ LS  F  +  N    K    L+ L  E+Q    L   + K +  N  +   ++K+ S  K  S+  P+ +N K+  +   +T   
Subjt:  VQIDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSK--SKAGPSKSNRKIEKKRKGKT---

Query:  -PKQNKGKKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENST---WILDSGATNHICFSFQETSSWKMLSEGEITLKVGT
         P+  K    A   K +    N H   +           +   G+       T  +++ +      ++DSGA+  +  S             EI +    
Subjt:  -PKQNKGKKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENST---WILDSGATNHICFSFQETSSWKMLSEGEITLKVGT

Query:  GEMVSAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFR-TVETHNK
         + +   A+G+L   F +        L+ P +  +L+S+S L        F  N       G  +   +   + Y L      +++ + + + T+   NK
Subjt:  GEMVSAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFR-TVETHNK

Query:  RQKVSFNAF-LWHLRLDHINLNRIGRLVKSGLLSQLEDNSLP-------LCDSCLEGKMTKRSFIGKGLRAKM-----PLELVHSDLCGPMNVKAWGGYE
         + V+   + L H  L H N   I + +K   ++ L+++ +         C  CL GK TK   I KG R K      P + +H+D+ GP++        
Subjt:  RQKVSFNAF-LWHLRLDHINLNRIGRLVKSGLLSQLEDNSLP-------LCDSCLEGKMTKRSFIGKGLRAKM-----PLELVHSDLCGPMNVKAWGGYE

Query:  YFITFIDDYSRYGHVYLIQNKFDS--FEKFKEYKAEVKNESGKTIKTFRSDRGGEY
        YFI+F D+ +R+  VY + ++ +      F    A +KN+    +   + DRG EY
Subjt:  YFITFIDDYSRYGHVYLIQNKFDS--FEKFKEYKAEVKNESGKTIKTFRSDRGGEY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.1e-1823.41Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLT----ILLNELQRFQNLTKGKGKEVEANV-----ATTKRKFKRGSSSKSKAGPSKSNRKIEKKRKG
        +D   QV  +LE+L + + P     +       LT     LLN   +   ++      + AN       TT      G+ +      + +N     ++  
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLT----ILLNELQRFQNLTKGKGKEVEANV-----ATTKRKFKRGSSSKSKAGPSKSNRKIEKKRKG

Query:  KTPKQNKGKKTAEEGKCYHCGKNGHWLRNCPK---YLAQKKAEKKE------QGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGE
             N  +     GKC  CG  GH  + C +   +L+   +++        Q + +L           ++ W+LDSGAT+HI   F   S  +  + G+
Subjt:  KTPKQNKGKKTAEEGKCYHCGKNGHWLRNCPK---YLAQKKAEKKE------QGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGE

Query:  ITLKVGTGEMVSAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLE------HMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLN
          + V  G  +     G   L    R + L N+LY P + +NLIS+  L          +  SF++ +   L  G+ +     ++ LY+     +  V  
Subjt:  ITLKVGTGEMVSAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLE------HMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLN

Query:  TEMFRTVETHNKRQKVSFNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPL-CDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYE
             +  TH+           WH RL H   + +  ++ +  LS L  +   L C  CL  K  K  F    + +  PLE ++SD+     + +   Y 
Subjt:  TEMFRTVETHNKRQKVSFNAFLWHLRLDHINLNRIGRLVKSGLLSQLEDNSLPL-CDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYE

Query:  YFITFIDDYSRYGHVYLIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDL
        Y++ F+D ++RY  +Y ++ K    E F  +K  ++N     I TF SD GGE++ L
Subjt:  YFITFIDDYSRYGHVYLIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.2e-1923.58Show/hide
Query:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLT----ILLNELQRFQNLTKGKGKEVEANVAT---TKRKFKRGSSSKSKAGPSKSNRKIEKKRKGKT
        +D   QV  +LE+L   + P     +      +LT     L+N   +   L   +   + ANV T   T     + +   ++   + +NR    +     
Subjt:  IDEANQVSFILESLSKSFIPFQTNASLNKIEFNLT----ILLNELQRFQNLTKGKGKEVEANVAT---TKRKFKRGSSSKSKAGPSKSNRKIEKKRKGKT

Query:  PKQNKGKKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDL------LFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKV
         + +  +     G+C  C   GH  + CP+    +    ++Q             L      N N+ W+LDSGAT+HI   F   S  +  + G+  + +
Subjt:  PKQNKGKKTAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDL------LFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKV

Query:  GTGEMVSAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLE------HMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFR
          G  +     G   L  + R + L  VLY P + +NLIS+  L          +  SF++ +   L  G+ +     ++ LY+     +  V       
Subjt:  GTGEMVSAKAVGDLKLFFNDRYIILKNVLYAPQMKRNLISISCLLE------HMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFR

Query:  TVETHNKRQKVSFNAFLWHLRLDHINLNRIGRLVKSGLLSQLE-DNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITF
        +  TH+           WH RL H +L  +  ++ +  L  L   + L  C  C   K  K  F    + +  PLE ++SD+     + +   Y Y++ F
Subjt:  TVETHNKRQKVSFNAFLWHLRLDHINLNRIGRLVKSGLLSQLE-DNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITF

Query:  IDDYSRYGHVYLIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYL
        +D ++RY  +Y ++ K    + F  +K+ V+N     I T  SD GGE++ LR  DYL
Subjt:  IDDYSRYGHVYLIQNKFDSFEKFKEYKAEVKNESGKTIKTFRSDRGGEYMDLRFQDYL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGTGCAAATCGATGAGGCTAATCAAGTTAGCTTTATCTTAGAGTCTCTTTCGAAGAGCTTCATACCATTTCAAACGAATGCGTCCCTGAATAAGATAGAG
TTTAACCTAACAATTCTTCTGAACGAGCTCCAGCGTTTCCAAAATCTTACCAAAGGTAAAGGAAAAGAAGTAGAAGCAAATGTTGCTACTACAAAACGAAAATTT
AAAAGAGGATCGTCCTCTAAATCCAAAGCTGGACCCTCAAAATCTAATAGAAAGATTGAGAAGAAGAGAAAGGGGAAAACCCCCAAGCAGAACAAGGGTAAGAAG
ACTGCAGAAGAAGGTAAATGTTACCACTGTGGTAAAAACGGGCACTGGCTGAGAAACTGCCCAAAATACCTTGCTCAGAAAAAGGCAGAGAAAAAAGAACAAGGT
AAATGTGATTTACTTTTCCTTGTAACATGTTTAGTGGAAAATGAAAATTCCACCTGGATATTAGATTCAGGAGCCACTAATCATATTTGTTTCTCATTTCAGGAA
ACTAGTTCTTGGAAAATGCTTTCTGAGGGCGAGATCACTCTCAAAGTTGGAACTGGAGAAATGGTCTCGGCTAAAGCAGTGGGAGATTTGAAGTTGTTTTTTAAT
GATAGATACATTATACTCAAGAATGTCTTGTATGCACCTCAAATGAAGAGGAATTTGATATCTATCTCTTGTTTATTAGAACATATGTATAGAATATCTTTTGAA
ATTAATGAAGGGTTCATTTTAAGAAAAGGTATTCACATTTGCTCTACTATACTTGAAAACAACTTATATAAGTTACGACCAACACGAGCAAATTTTGTCTTAAAT
ACTGAGATGTTTAGAACAGTCGAAACTCATAATAAAAGACAAAAAGTTTCTTTCAATGCCTTCTTATGGCACTTAAGACTCGATCACATTAATCTCAATAGGATT
GGGAGATTGGTTAAGAGTGGACTTCTGAGTCAGTTAGAAGATAACTCTCTACCTCTATGTGATTCCTGTCTTGAAGGAAAGATGACAAAAAGATCTTTTATTGGA
AAAGGTCTTAGAGCCAAAATGCCTTTGGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAGGCTTGGGGAGGATACGAATATTTCATTACTTTCATT
GATGATTATTCGAGGTATGGTCATGTTTACCTAATTCAAAACAAGTTTGATTCTTTTGAAAAGTTCAAAGAATATAAGGCTGAAGTCAAAAATGAGTCAGGTAAA
ACAATAAAGACATTTCGATCAGATCGAGGTGGAGAGTATATGGACTTGCGATTCCAAGACTATTTGATATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCGTGCAAATCGATGAGGCTAATCAAGTTAGCTTTATCTTAGAGTCTCTTTCGAAGAGCTTCATACCATTTCAAACGAATGCGTCCCTGAATAAGATAGAG
TTTAACCTAACAATTCTTCTGAACGAGCTCCAGCGTTTCCAAAATCTTACCAAAGGTAAAGGAAAAGAAGTAGAAGCAAATGTTGCTACTACAAAACGAAAATTT
AAAAGAGGATCGTCCTCTAAATCCAAAGCTGGACCCTCAAAATCTAATAGAAAGATTGAGAAGAAGAGAAAGGGGAAAACCCCCAAGCAGAACAAGGGTAAGAAG
ACTGCAGAAGAAGGTAAATGTTACCACTGTGGTAAAAACGGGCACTGGCTGAGAAACTGCCCAAAATACCTTGCTCAGAAAAAGGCAGAGAAAAAAGAACAAGGT
AAATGTGATTTACTTTTCCTTGTAACATGTTTAGTGGAAAATGAAAATTCCACCTGGATATTAGATTCAGGAGCCACTAATCATATTTGTTTCTCATTTCAGGAA
ACTAGTTCTTGGAAAATGCTTTCTGAGGGCGAGATCACTCTCAAAGTTGGAACTGGAGAAATGGTCTCGGCTAAAGCAGTGGGAGATTTGAAGTTGTTTTTTAAT
GATAGATACATTATACTCAAGAATGTCTTGTATGCACCTCAAATGAAGAGGAATTTGATATCTATCTCTTGTTTATTAGAACATATGTATAGAATATCTTTTGAA
ATTAATGAAGGGTTCATTTTAAGAAAAGGTATTCACATTTGCTCTACTATACTTGAAAACAACTTATATAAGTTACGACCAACACGAGCAAATTTTGTCTTAAAT
ACTGAGATGTTTAGAACAGTCGAAACTCATAATAAAAGACAAAAAGTTTCTTTCAATGCCTTCTTATGGCACTTAAGACTCGATCACATTAATCTCAATAGGATT
GGGAGATTGGTTAAGAGTGGACTTCTGAGTCAGTTAGAAGATAACTCTCTACCTCTATGTGATTCCTGTCTTGAAGGAAAGATGACAAAAAGATCTTTTATTGGA
AAAGGTCTTAGAGCCAAAATGCCTTTGGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAGGCTTGGGGAGGATACGAATATTTCATTACTTTCATT
GATGATTATTCGAGGTATGGTCATGTTTACCTAATTCAAAACAAGTTTGATTCTTTTGAAAAGTTCAAAGAATATAAGGCTGAAGTCAAAAATGAGTCAGGTAAA
ACAATAAAGACATTTCGATCAGATCGAGGTGGAGAGTATATGGACTTGCGATTCCAAGACTATTTGATATAG
Protein sequenceShow/hide protein sequence
MVVQIDEANQVSFILESLSKSFIPFQTNASLNKIEFNLTILLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKAGPSKSNRKIEKKRKGKTPKQNKGKK
TAEEGKCYHCGKNGHWLRNCPKYLAQKKAEKKEQGKCDLLFLVTCLVENENSTWILDSGATNHICFSFQETSSWKMLSEGEITLKVGTGEMVSAKAVGDLKLFFN
DRYIILKNVLYAPQMKRNLISISCLLEHMYRISFEINEGFILRKGIHICSTILENNLYKLRPTRANFVLNTEMFRTVETHNKRQKVSFNAFLWHLRLDHINLNRI
GRLVKSGLLSQLEDNSLPLCDSCLEGKMTKRSFIGKGLRAKMPLELVHSDLCGPMNVKAWGGYEYFITFIDDYSRYGHVYLIQNKFDSFEKFKEYKAEVKNESGK
TIKTFRSDRGGEYMDLRFQDYLI