; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0051761 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0051761
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr02:18589151..18590350
RNA-Seq ExpressionCmc02g0051761
SyntenyCmc02g0051761
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73521.1 gag-pol polyprotein [Glycine max]9.8e-15867.25Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV SAFLNGYLNEE YV QPKGF D  HP HVY+L KALY +KQAPRAWYERLT +L  +GY KG IDKTLF+ + ++ L++AQIYVDDI+FGG   ++
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        + +F+  MQSEFEMS+VGEL+  L LQ+KQ  D IF+SQ +YAKN+VKKFG+E A +KR  A TH+KL++D  GT VD  LYRS++GSLLYLTASR DI 
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YAVG+C RYQ +P+I HL  VK+ILKYV+ TSD+G++Y + + P L+GYCD DWAGS DDRKSTSGGCF+LGNNLISW SKK N VSLSTAE EYIAAGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVC
         C+QL+WMK ML EY+ +QD+MTLYCDNMSAI+  KNPVQHSRTKHI+IRHH+IR+LV+DKVI L H+ +  Q+ DIFTK LDAN FE LR  LG+C
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVC

AAO73529.1 gag-pol polyprotein [Glycine max]5.8e-15867.76Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV SAFLNGYLNEEAYV QPKGFVD  HP HVY+L KALY +KQAPRAWYERLT +L  +GY KG IDKTLF+ + ++ L++AQIYVDDI+FGG   ++
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        + +F+  MQSEFEMS+VGEL+  L LQ+KQ  D IF+SQ KYAKN+VKKFG+E A +KR  A TH+KL++D  GT VD  LYRS++GSLLYLTASR DI 
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YAVG+C RYQ +P+I HL  VK+ILKYV+ TSD+G++Y + +   L+GYCD DWAGS DDRKSTSGGCF+LGNNLISW SKK N VSLSTAE EYIAAGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVC
         C+QL+WMK ML EY+ +QD+MTLYCDNMSAI+  KNPVQHSRTKHI+IRHH+IRELV+DKVI L+H+ +  Q+ DIFTK LDA  FE LR  LG+C
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVC

KAA0066405.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.4e-15877.36Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV S FLNGYLNEE YVAQPKGFVDSEHPKH+YK NKALY +KQA RAWY+ LTVYLRGKGYS+GEIDKTLFI+RKSD+LLV QIYVDDIIFGGFPQDL
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        VNNFINIMQSEF+MSMVGELSC L LQIKQ NDDIFISQEKY +NMVKKFGLEQARNKR  A THVKLT+DT+  EVDHKLYRSI+GSLLYLTASR DIA
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        Y VGIC RYQVDP I HL AVK ILKYVH TSDFGM+Y YDTT TL+GYCD DW GS DDRK+T                        S  E EYIAAGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSN
        GCTQLIW KNML EY FDQD MTLYCDNMSAID   NPVQHSRT+HI+IRHHFI ELV+DKVI+LDHI SN
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSN

KAA0066740.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.8e-17378.48Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV S FLNGYLNEE YVAQPKGFVDSEH KHVYKLNKALY +KQAPRAWY+ LTVYLRGKGYS+GEIDKTLFIHRKSD+LLVAQIYVDDIIFGGFPQDL
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        VNNFINIMQSEFEMSMVGEL C L LQI+QKNDDIFISQ+KYA+N+VKKFGLEQARNKR  A THVKLT+D +G EVDHKLYRSIVGSLLYLTASR DIA
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YA+GI  RYQV PRI HLEA+K+ILKYVH+T DFGM+Y YDTTPTL+GYCD DWAGS DDRK                             E EYIAAGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLG
        GCTQLIWMKN+LHEY FDQD MTLYC+NMSAID  KN VQHSRTKHI+IRHHFIRE VE+KVI+LDHIRSNLQL +IFTKPLDA+SFE+L AGLG
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLG

TYK23188.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.0e-18686.02Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV SAFLNGYLNEE YVAQPK FVDSEHPKHVYKLNKALY +KQAPR WYERLTVYLRGKGYS+GEIDKTLFIHRKSD+LLVAQIYVDDIIFGGFP DL
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        VNNFINIMQSEFEMSMVGELSC L  QIKQKNDDI ISQ+KYAKNM KKFGLEQARNKR  AATHVKLTRD DG EVDHKLYRSIV +LLYLTASR DIA
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YAVGIC RYQ DPRI HLEAVK+ILKYVH T+DFGM+Y YDTTPTL+GYCD DWAG  DDRKSTSGGCFFLGNNLI WLSKK N VSLST E EYI AGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFT
        GCTQLIWM+N+L EY FDQ  +TLY DNMSAID  KNPVQHSR KHI+IRHHFIRELVEDKVIRLDHIRSNLQL DIFT
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFT

TrEMBL top hitse value%identityAlignment
A0A5D3BPB5 Gag-pol polyprotein2.1e-15877.36Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV S FLNGYLNEE YVAQPKGFVDSEHPKH+YK NKALY +KQA RAWY+ LTVYLRGKGYS+GEIDKTLFI+RKSD+LLV QIYVDDIIFGGFPQDL
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        VNNFINIMQSEF+MSMVGELSC L LQIKQ NDDIFISQEKY +NMVKKFGLEQARNKR  A THVKLT+DT+  EVDHKLYRSI+GSLLYLTASR DIA
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        Y VGIC RYQVDP I HL AVK ILKYVH TSDFGM+Y YDTT TL+GYCD DW GS DDRK+T                        S  E EYIAAGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSN
        GCTQLIW KNML EY FDQD MTLYCDNMSAID   NPVQHSRT+HI+IRHHFI ELV+DKVI+LDHI SN
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSN

A0A5D3DI97 Gag-pol polyprotein4.9e-18786.02Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV SAFLNGYLNEE YVAQPK FVDSEHPKHVYKLNKALY +KQAPR WYERLTVYLRGKGYS+GEIDKTLFIHRKSD+LLVAQIYVDDIIFGGFP DL
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        VNNFINIMQSEFEMSMVGELSC L  QIKQKNDDI ISQ+KYAKNM KKFGLEQARNKR  AATHVKLTRD DG EVDHKLYRSIV +LLYLTASR DIA
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YAVGIC RYQ DPRI HLEAVK+ILKYVH T+DFGM+Y YDTTPTL+GYCD DWAG  DDRKSTSGGCFFLGNNLI WLSKK N VSLST E EYI AGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFT
        GCTQLIWM+N+L EY FDQ  +TLY DNMSAID  KNPVQHSR KHI+IRHHFIRELVEDKVIRLDHIRSNLQL DIFT
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFT

A0A5D3DWS6 Gag-pol polyprotein1.4e-17378.48Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV S FLNGYLNEE YVAQPKGFVDSEH KHVYKLNKALY +KQAPRAWY+ LTVYLRGKGYS+GEIDKTLFIHRKSD+LLVAQIYVDDIIFGGFPQDL
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        VNNFINIMQSEFEMSMVGEL C L LQI+QKNDDIFISQ+KYA+N+VKKFGLEQARNKR  A THVKLT+D +G EVDHKLYRSIVGSLLYLTASR DIA
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YA+GI  RYQV PRI HLEA+K+ILKYVH+T DFGM+Y YDTTPTL+GYCD DWAGS DDRK                             E EYIAAGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLG
        GCTQLIWMKN+LHEY FDQD MTLYC+NMSAID  KN VQHSRTKHI+IRHHFIRE VE+KVI+LDHIRSNLQL +IFTKPLDA+SFE+L AGLG
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLG

Q84VH6 Gag-pol polyprotein2.8e-15867.76Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV SAFLNGYLNEEAYV QPKGFVD  HP HVY+L KALY +KQAPRAWYERLT +L  +GY KG IDKTLF+ + ++ L++AQIYVDDI+FGG   ++
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        + +F+  MQSEFEMS+VGEL+  L LQ+KQ  D IF+SQ KYAKN+VKKFG+E A +KR  A TH+KL++D  GT VD  LYRS++GSLLYLTASR DI 
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YAVG+C RYQ +P+I HL  VK+ILKYV+ TSD+G++Y + +   L+GYCD DWAGS DDRKSTSGGCF+LGNNLISW SKK N VSLSTAE EYIAAGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVC
         C+QL+WMK ML EY+ +QD+MTLYCDNMSAI+  KNPVQHSRTKHI+IRHH+IRELV+DKVI L+H+ +  Q+ DIFTK LDA  FE LR  LG+C
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVC

Q84VI4 Gag-pol polyprotein4.8e-15867.25Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV SAFLNGYLNEE YV QPKGF D  HP HVY+L KALY +KQAPRAWYERLT +L  +GY KG IDKTLF+ + ++ L++AQIYVDDI+FGG   ++
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        + +F+  MQSEFEMS+VGEL+  L LQ+KQ  D IF+SQ +YAKN+VKKFG+E A +KR  A TH+KL++D  GT VD  LYRS++GSLLYLTASR DI 
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YAVG+C RYQ +P+I HL  VK+ILKYV+ TSD+G++Y + + P L+GYCD DWAGS DDRKSTSGGCF+LGNNLISW SKK N VSLSTAE EYIAAGS
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVC
         C+QL+WMK ML EY+ +QD+MTLYCDNMSAI+  KNPVQHSRTKHI+IRHH+IR+LV+DKVI L H+ +  Q+ DIFTK LDAN FE LR  LG+C
Subjt:  GCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVC

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-5834.73Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKS--DKLLVAQIYVDDIIFGGFPQ
        MDV +AFLNG L EE Y+  P+G   S +  +V KLNKA+Y +KQA R W+E     L+   +    +D+ ++I  K   ++ +   +YVDD++      
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKS--DKLLVAQIYVDDIIFGGFPQ

Query:  DLVNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHV--KLTRDTDGTEVD-HKLYRSIVGSLLY-LTA
          +NNF   +  +F M+ + E+   + ++I+ + D I++SQ  Y K ++ KF +E      N  +T +  K+  +   ++ D +   RS++G L+Y +  
Subjt:  DLVNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHV--KLTRDTDGTEVD-HKLYRSIVGSLLY-LTA

Query:  SRSDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTT--PTLIGYCDVDWAGSVDDRKSTSGGCFFLGN-NLISWLSKKHNYVSLSTA
        +R D+  AV I  RY         + +K++L+Y+  T D  +I+  +      +IGY D DWAGS  DRKST+G  F + + NLI W +K+ N V+ S+ 
Subjt:  SRSDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTT--PTLIGYCDVDWAGSVDDRKSTSGGCFFLGN-NLISWLSKKHNYVSLSTA

Query:  ETEYIAAGSGCTQLIWMKNMLHEYDFD-QDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHL
        E EY+A      + +W+K +L   +   ++ + +Y DN   I    NP  H R KHI+I++HF RE V++ VI L++I +  QL DIFTKPL A  F  L
Subjt:  ETEYIAAGSGCTQLIWMKNMLHEYDFD-QDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHL

Query:  RAGLGV
        R  LG+
Subjt:  RAGLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-6936.95Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSD-KLLVAQIYVDDIIFGGFPQD
        +DV +AFL+G L EE Y+ QP+GF  +     V KLNK+LY +KQAPR WY +   +++ + Y K   D  ++  R S+   ++  +YVDD++  G  + 
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSD-KLLVAQIYVDDIIFGGFPQD

Query:  LVNNFINIMQSEFEMSMVGELSCLLDLQI--KQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHK------LYRSIVGSLLY
        L+      +   F+M  +G    +L ++I  ++ +  +++SQEKY + ++++F ++ A+      A H+KL++    T V+ K       Y S VGSL+Y
Subjt:  LVNNFINIMQSEFEMSMVGELSCLLDLQI--KQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHK------LYRSIVGSLLY

Query:  -LTASRSDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLST
         +  +R DIA+AVG+  R+  +P   H EAVK IL+Y+  T+    + F  + P L GY D D AG +D+RKS++G  F      ISW SK    V+LST
Subjt:  -LTASRSDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLST

Query:  AETEYIAAGSGCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHL
         E EYIAA     ++IW+K  L E    Q    +YCD+ SAID  KN + H+RTKHI++R+H+IRE+V+D+ +++  I +N    D+ TK +  N FE  
Subjt:  AETEYIAAGSGCTQLIWMKNMLHEYDFDQDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHL

Query:  RAGLGV
        +  +G+
Subjt:  RAGLGV

P25600 Putative transposon Ty5-1 protein YCL074W3.3e-3129.57Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        MDV++AFLN  ++E  YV QP GFV+  +P +V++L   +Y +KQAP  W E +   L+  G+ + E +  L+    SD  +   +YVDD++       +
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQ-KNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLY-LTASRSD
         +     +   + M  +G++   L L I Q  N DI +S + Y      +  +   +  +        L   T     D   Y+SIVG LL+     R D
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQ-KNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLY-LTASRSD

Query:  IAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKK-HNYVSLSTAETEYIA
        I+Y V +  R+  +PR  HLE+ +++L+Y++ T    + Y   +   L  YCD       D   ST G    L    ++W SKK    + + + E EYI 
Subjt:  IAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKK-HNYVSLSTAETEYIA

Query:  A
        A
Subjt:  A

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.4e-6836.34Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        +DVN+AFL G L ++ Y++QP GF+D + P +V KL KALY +KQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+  G    L
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        ++N ++ +   F +    EL   L ++ K+    + +SQ +Y  +++ +  +  A+      A   KL+  +     D   YR IVGSL YL  +R DI+
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YAV    ++   P   HL+A+K+IL+Y+  T + G+      T +L  Y D DWAG  DD  ST+G   +LG++ ISW SKK   V  S+ E EY +  +
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFDQDL-MTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVCR
          +++ W+ ++L E          +YCDN+ A     NPV HSR KHI I +HFIR  V+   +R+ H+ ++ QL D  TKPL   +F++  + +GV R
Subjt:  GCTQLIWMKNMLHEYDFDQDL-MTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVCR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.4e-6835.84Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL
        +DVN+AFL G L +E Y++QP GFVD + P +V +L KA+Y +KQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+  G    L
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA
        + + ++ +   F +    +L   L ++ K+    + +SQ +Y  +++ +  +  A+      AT  KLT  +     D   YR IVGSL YL  +R D++
Subjt:  VNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIA

Query:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS
        YAV    +Y   P   H  A+K++L+Y+  T D G+      T +L  Y D DWAG  DD  ST+G   +LG++ ISW SKK   V  S+ E EY +  +
Subjt:  YAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGS

Query:  GCTQLIWMKNMLHEYDFD-QDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVCR
          ++L W+ ++L E          +YCDN+ A     NPV HSR KHI + +HFIR  V+   +R+ H+ ++ QL D  TKPL   +F++    +GV +
Subjt:  GCTQLIWMKNMLHEYDFD-QDLMTLYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVCR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.4e-5335.18Show/hide
Query:  MDVNSAFLNGYLNEEAYVAQPKGFV----DSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGF
        +D+++AFLNG L+EE Y+  P G+     DS  P  V  L K++Y +KQA R W+ + +V L G G+ +   D T F+   +   L   +YVDDII    
Subjt:  MDVNSAFLNGYLNEEAYVAQPKGFV----DSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGF

Query:  PQDLVNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASR
            V+   + ++S F++  +G L   L L+I +    I I Q KYA +++ + GL   +         V  +  + G  VD K YR ++G L+YL  +R
Subjt:  PQDLVNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASR

Query:  SDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYI
         DI++AV    ++   PR+ H +AV +IL Y+  T   G+ Y       L  + D  +    D R+ST+G C FLG +LISW SKK   VS S+AE EY 
Subjt:  SDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYI

Query:  AAGSGCTQLIWMKNMLHEYDFDQDLMT-LYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRE
        A      +++W+     E        T L+CDN +AI    N V H RTKHI    H +RE
Subjt:  AAGSGCTQLIWMKNMLHEYDFDQDLMT-LYCDNMSAIDKWKNPVQHSRTKHINIRHHFIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.8e-0732.95Show/hide
Query:  LYLTASRSDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGC-----FFLG
        +YLT +R D+ +AV    ++    R   ++AV ++L YV  T   G+ Y   +   L  + D DWA   D R+S +G C     +FLG
Subjt:  LYLTASRSDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGC-----FFLG

ATMG00810.1 DNA/RNA polymerases superfamily protein4.0e-3235.87Show/hide
Query:  IYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEV-DHKLYRS
        +YVDDI+  G    L+N  I  + S F M  +G +   L +QIK     +F+SQ KYA+ ++   G+     K       +KL       +  D   +RS
Subjt:  IYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEV-DHKLYRS

Query:  IVGSLLYLTASRSDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHN
        IVG+L YLT +R DI+YAV I  +   +P +   + +K++L+YV  T   G+    ++   +  +CD DWAG    R+ST+G C FLG N+ISW +K+  
Subjt:  IVGSLLYLTASRSDIAYAVGICVRYQVDPRIPHLEAVKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHN

Query:  YVSLSTAETEYIAAGSGCTQLIW
         VS S+ ETEY A      +L W
Subjt:  YVSLSTAETEYIAAGSGCTQLIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTAAATAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGCTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCGAAGCATGTGTATAAGCTCAA
CAAAGCTTTATATGTGGTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTGTTTACTTAAGAGGTAAAGGATATTCTAAAGGAGAAATTGACAAGACCTTGTTTA
TACACAGGAAATCTGATAAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTCCTCAAGATCTTGTAAATAATTTCATTAACATCATGCAATCA
GAATTCGAAATGAGCATGGTTGGAGAACTTTCATGCCTTTTGGATCTTCAAATTAAGCAAAAGAATGACGACATATTCATATCTCAGGAAAAGTATGCCAAGAATATGGT
TAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGCGAAATCTAGCTGCGACACATGTTAAACTTACAAGAGATACTGATGGTACAGAAGTTGATCACAAACTCTACAGGA
GTATAGTAGGTAGCTTATTATATTTAACAGCAAGTCGATCTGACATAGCTTATGCTGTGGGAATATGTGTCCGTTATCAGGTTGATCCCCGCATCCCTCACTTAGAAGCT
GTTAAACAAATTCTTAAGTATGTTCATAAGACCAGTGACTTTGGAATGATCTATTTCTATGATACCACCCCCACTCTTATTGGATATTGTGATGTTGACTGGGCAGGTTC
GGTTGATGATCGTAAAAGTACGTCTGGAGGATGTTTCTTTTTAGGAAACAATCTAATTTCTTGGTTAAGTAAGAAGCATAACTATGTTTCTTTATCTACAGCTGAAACTG
AATATATAGCAGCAGGTAGTGGTTGTACACAGTTGATTTGGATGAAAAATATGTTGCATGAATATGACTTTGATCAGGACCTTATGACGTTGTATTGTGACAATATGAGC
GCAATTGATAAATGGAAGAATCCAGTTCAACATAGTCGAACAAAGCACATTAACATAAGACATCATTTTATTCGAGAACTTGTTGAAGATAAAGTGATTAGGCTTGATCA
TATTCGTTCCAACTTACAATTAACCGATATTTTCACTAAGCCTCTGGATGCAAATTCATTCGAACACTTACGTGCTGGTTTAGGAGTGTGTCGCACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTAAATAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGCTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCGAAGCATGTGTATAAGCTCAA
CAAAGCTTTATATGTGGTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTGTTTACTTAAGAGGTAAAGGATATTCTAAAGGAGAAATTGACAAGACCTTGTTTA
TACACAGGAAATCTGATAAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTCCTCAAGATCTTGTAAATAATTTCATTAACATCATGCAATCA
GAATTCGAAATGAGCATGGTTGGAGAACTTTCATGCCTTTTGGATCTTCAAATTAAGCAAAAGAATGACGACATATTCATATCTCAGGAAAAGTATGCCAAGAATATGGT
TAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGCGAAATCTAGCTGCGACACATGTTAAACTTACAAGAGATACTGATGGTACAGAAGTTGATCACAAACTCTACAGGA
GTATAGTAGGTAGCTTATTATATTTAACAGCAAGTCGATCTGACATAGCTTATGCTGTGGGAATATGTGTCCGTTATCAGGTTGATCCCCGCATCCCTCACTTAGAAGCT
GTTAAACAAATTCTTAAGTATGTTCATAAGACCAGTGACTTTGGAATGATCTATTTCTATGATACCACCCCCACTCTTATTGGATATTGTGATGTTGACTGGGCAGGTTC
GGTTGATGATCGTAAAAGTACGTCTGGAGGATGTTTCTTTTTAGGAAACAATCTAATTTCTTGGTTAAGTAAGAAGCATAACTATGTTTCTTTATCTACAGCTGAAACTG
AATATATAGCAGCAGGTAGTGGTTGTACACAGTTGATTTGGATGAAAAATATGTTGCATGAATATGACTTTGATCAGGACCTTATGACGTTGTATTGTGACAATATGAGC
GCAATTGATAAATGGAAGAATCCAGTTCAACATAGTCGAACAAAGCACATTAACATAAGACATCATTTTATTCGAGAACTTGTTGAAGATAAAGTGATTAGGCTTGATCA
TATTCGTTCCAACTTACAATTAACCGATATTTTCACTAAGCCTCTGGATGCAAATTCATTCGAACACTTACGTGCTGGTTTAGGAGTGTGTCGCACTTAA
Protein sequenceShow/hide protein sequence
MDVNSAFLNGYLNEEAYVAQPKGFVDSEHPKHVYKLNKALYVVKQAPRAWYERLTVYLRGKGYSKGEIDKTLFIHRKSDKLLVAQIYVDDIIFGGFPQDLVNNFINIMQS
EFEMSMVGELSCLLDLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRNLAATHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRSDIAYAVGICVRYQVDPRIPHLEA
VKQILKYVHKTSDFGMIYFYDTTPTLIGYCDVDWAGSVDDRKSTSGGCFFLGNNLISWLSKKHNYVSLSTAETEYIAAGSGCTQLIWMKNMLHEYDFDQDLMTLYCDNMS
AIDKWKNPVQHSRTKHINIRHHFIRELVEDKVIRLDHIRSNLQLTDIFTKPLDANSFEHLRAGLGVCRT