; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0023701 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0023701
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr01:23658065..23659234
RNA-Seq ExpressionCmc01g0023701
SyntenyCmc01g0023701
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035705.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.6e-16381.39Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        MQEELLQFKRN+VWTLVPKPD ANIIGTKWIF+NKTDES  VIRN+ARLVAQGYAQV+GV F++TFAP+ARLE I LLLS+S FRKFKL+QMD+KSAFLN
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK
        GYLNEEVYVAQ K F+DSEFPQYVYKLNKALYGLKQAPRAWYE LTMYL ++GYS+GETDKTLFIN+T+  LIVAQIYVDDIIFGGFPK LVNNFI+I+K
Subjt:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK

Query:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
        SEFEMSLVGELS FL LQIKQR+EGIFISQEKYAKN+VKKF LD SQ KR PAATHAKITKD++  AVDHKLYRSMIGSLLYL ASRPDI Y VGICARY
Subjt:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY

Query:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFTLNQSQ
        QSDPR SHLNAVKRIIKYVH TT+F ILY YDTSSE V YC+ADWAG++ +   T  +++
Subjt:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFTLNQSQ

KAA0042206.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.5e-21196.66Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        MQEELLQFKRNN+WTLVPKPDVANIIGTKWIFKNKTDESESVIRN+ARLVAQGYAQVKGVDFNKTFAP+ARLE IRLLLSISCFRKFKLFQMDVKSAFLN
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK
        GYLNEEVYVAQLKRFVDSEFPQYVYK NKALYGLKQAPRAWYEQLTMYLSERGYSRGE DKTLFINRTST LIVAQIYVDDIIFGGFPKTLVNNFINIMK
Subjt:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK

Query:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
        SEFEMSLVGELSCFLALQIKQRNEGIFISQEKY KNLVKKFGLDHSQHKRIPAATHAKI KDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
Subjt:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY

Query:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFTLNQSQLSSLILPRQLQAFNSSIFSTSLDHLTMNK
        QS+PRTSHLNAVKRIIKYV RTTDFGILYFYDTSSELVGYCNADWAGTSTNNL TLNQSQLSSLILP QLQAFNSSIFSTSLDHLTMNK
Subjt:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFTLNQSQLSSLILPRQLQAFNSSIFSTSLDHLTMNK

KAA0042877.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.8e-15575.77Show/hide
Query:  QFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYLNEE
        +FK NNVWTLVPKPD ANIIGTKWIFKNKTDES SVIRNKARLVAQGYAQV+GVD ++TFA +AR E I LL SI+CFRKFKLFQMDVKSAFLNGYLNEE
Subjt:  QFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYLNEE

Query:  VYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKSEFEMS
        VYVAQ + FVD EFPQYVYKLNKALYGLKQAPRAWY+ LTMYL ERGYSRGETDKTLFINRTST LIVAQIYVDDIIFGGFPKTLV   +   KSEFEMS
Subjt:  VYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKSEFEMS

Query:  LVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDT-------------------------------------------
        LVGELSCFL LQIKQR+EGIFISQEKYAKNLVKKFGLD SQHKR    THAKITKDT                                           
Subjt:  LVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDT-------------------------------------------

Query:  --VDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTS
          V  AVDHK YRSMIGSLLYLTASRPDIAYVVGI ARYQS+PRTSHLNAVKRIIKYVH TTDFGILY YDTSSELVGYC+ +WAG++
Subjt:  --VDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTS

KAA0053137.1 gag-pol polyprotein [Cucumis melo var. makuwa]9.3e-15376.22Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        MQEELLQF+RNNVWTL+ KP+  N+IGTKWIFKNKTDE+  V +NKARLVAQGY QV+GVDF++TFAP+ARLE IRLLL ISC +KFKL+Q+DVKS FLN
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK
        GYLNEEVYVAQ K FVDSE P++VYKLNKALYGLKQA RAWY++LT+YL  RGYSRGE DK LFI+R S  L+VAQIYVDDIIFGGFP  L+NNFINIM+
Subjt:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK

Query:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
        SEFEMS+VGELSCFL LQIKQ+N+GIFISQEKYA+N+VKKFGL  +++KR PAATH K+TKDT    VDHKLYRS++GSLLYLTASRPDIAYVVGICARY
Subjt:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY

Query:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTS
        Q+DPR + L  VKRI+KYVH T+DFG++Y YDT+S LVGYC+ADWAG++
Subjt:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTS

TYJ98295.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.9e-17088.08Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        MQEELLQFK NNVWTLVPKPD ANIIGTKWIFKNKTDES SV+RNKA LVAQGYAQV+GVDF++TFAP+ARLE IRLLL ISCFRKFKLFQMDVKSAFLN
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK
        GYLNEEVYVAQ K F+DSEFPQYVYK+NKALYGLKQAPRAWYE+L +YL ERGYS+GETDKTLFINRTST LIVAQIYVDDIIFGGFPKTLVNNFINI+K
Subjt:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK

Query:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
        SEFE+SLVG+LS FL LQIKQR++G+FISQEKYAKNLVKKFGLD SQ+KR  AATH KITKDTV  A+DHKLYRSMIGSLLYLTASRPDIAY VGICARY
Subjt:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY

Query:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNAD
        QSDPRTSHLNAVKRIIKYVH TTDFGILY YDTSSELVGYC+AD
Subjt:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNAD

TrEMBL top hitse value%identityAlignment
A0A5A7T2M1 Gag-pol polyprotein1.3e-16381.39Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        MQEELLQFKRN+VWTLVPKPD ANIIGTKWIF+NKTDES  VIRN+ARLVAQGYAQV+GV F++TFAP+ARLE I LLLS+S FRKFKL+QMD+KSAFLN
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK
        GYLNEEVYVAQ K F+DSEFPQYVYKLNKALYGLKQAPRAWYE LTMYL ++GYS+GETDKTLFIN+T+  LIVAQIYVDDIIFGGFPK LVNNFI+I+K
Subjt:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK

Query:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
        SEFEMSLVGELS FL LQIKQR+EGIFISQEKYAKN+VKKF LD SQ KR PAATHAKITKD++  AVDHKLYRSMIGSLLYL ASRPDI Y VGICARY
Subjt:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY

Query:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFTLNQSQ
        QSDPR SHLNAVKRIIKYVH TT+F ILY YDTSSE V YC+ADWAG++ +   T  +++
Subjt:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFTLNQSQ

A0A5D3BIP9 Gag-pol polyprotein2.4e-17088.08Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        MQEELLQFK NNVWTLVPKPD ANIIGTKWIFKNKTDES SV+RNKA LVAQGYAQV+GVDF++TFAP+ARLE IRLLL ISCFRKFKLFQMDVKSAFLN
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK
        GYLNEEVYVAQ K F+DSEFPQYVYK+NKALYGLKQAPRAWYE+L +YL ERGYS+GETDKTLFINRTST LIVAQIYVDDIIFGGFPKTLVNNFINI+K
Subjt:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK

Query:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
        SEFE+SLVG+LS FL LQIKQR++G+FISQEKYAKNLVKKFGLD SQ+KR  AATH KITKDTV  A+DHKLYRSMIGSLLYLTASRPDIAY VGICARY
Subjt:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY

Query:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNAD
        QSDPRTSHLNAVKRIIKYVH TTDFGILY YDTSSELVGYC+AD
Subjt:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNAD

A0A5D3BPB3 Gag-pol polyprotein4.5e-15376.22Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        MQEELLQF+RNNVWTL+ KP+  N+IGTKWIFKNKTDE+  V +NKARLVAQGY QV+GVDF++TFAP+ARLE IRLLL ISC +KFKL+Q+DVKS FLN
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK
        GYLNEEVYVAQ K FVDSE P++VYKLNKALYGLKQA RAWY++LT+YL  RGYSRGE DK LFI+R S  L+VAQIYVDDIIFGGFP  L+NNFINIM+
Subjt:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK

Query:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
        SEFEMS+VGELSCFL LQIKQ+N+GIFISQEKYA+N+VKKFGL  +++KR PAATH K+TKDT    VDHKLYRS++GSLLYLTASRPDIAYVVGICARY
Subjt:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY

Query:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTS
        Q+DPR + L  VKRI+KYVH T+DFG++Y YDT+S LVGYC+ADWAG++
Subjt:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTS

A0A5D3C1P5 Gag-pol polyprotein2.8e-15575.77Show/hide
Query:  QFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYLNEE
        +FK NNVWTLVPKPD ANIIGTKWIFKNKTDES SVIRNKARLVAQGYAQV+GVD ++TFA +AR E I LL SI+CFRKFKLFQMDVKSAFLNGYLNEE
Subjt:  QFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYLNEE

Query:  VYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKSEFEMS
        VYVAQ + FVD EFPQYVYKLNKALYGLKQAPRAWY+ LTMYL ERGYSRGETDKTLFINRTST LIVAQIYVDDIIFGGFPKTLV   +   KSEFEMS
Subjt:  VYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKSEFEMS

Query:  LVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDT-------------------------------------------
        LVGELSCFL LQIKQR+EGIFISQEKYAKNLVKKFGLD SQHKR    THAKITKDT                                           
Subjt:  LVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDT-------------------------------------------

Query:  --VDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTS
          V  AVDHK YRSMIGSLLYLTASRPDIAYVVGI ARYQS+PRTSHLNAVKRIIKYVH TTDFGILY YDTSSELVGYC+ +WAG++
Subjt:  --VDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTS

A0A5D3DSN1 Gag-pol polyprotein7.3e-21296.66Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        MQEELLQFKRNN+WTLVPKPDVANIIGTKWIFKNKTDESESVIRN+ARLVAQGYAQVKGVDFNKTFAP+ARLE IRLLLSISCFRKFKLFQMDVKSAFLN
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK
        GYLNEEVYVAQLKRFVDSEFPQYVYK NKALYGLKQAPRAWYEQLTMYLSERGYSRGE DKTLFINRTST LIVAQIYVDDIIFGGFPKTLVNNFINIMK
Subjt:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMK

Query:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
        SEFEMSLVGELSCFLALQIKQRNEGIFISQEKY KNLVKKFGLDHSQHKRIPAATHAKI KDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY
Subjt:  SEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARY

Query:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFTLNQSQLSSLILPRQLQAFNSSIFSTSLDHLTMNK
        QS+PRTSHLNAVKRIIKYV RTTDFGILYFYDTSSELVGYCNADWAGTSTNNL TLNQSQLSSLILP QLQAFNSSIFSTSLDHLTMNK
Subjt:  QSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFTLNQSQLSSLILPRQLQAFNSSIFSTSLDHLTMNK

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.7e-5433.8Show/hide
Query:  ELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYL
        EL   K NN WT+  +P+  NI+ ++W+F  K +E  + IR KARLVA+G+ Q   +D+ +TFAP+AR+ + R +LS+      K+ QMDVK+AFLNG L
Subjt:  ELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYL

Query:  NEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFI--NRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKS
         EE+Y+ +L + +       V KLNKA+YGLKQA R W+E     L E  +     D+ ++I         I   +YVDD++      T +NNF   +  
Subjt:  NEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFI--NRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKS

Query:  EFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVD-HKLYRSMIGSLLY-LTASRPDIAYVVGICAR
        +F M+ + E+  F+ ++I+ + + I++SQ  Y K ++ KF +++      P    +KI  + +++  D +   RS+IG L+Y +  +RPD+   V I +R
Subjt:  EFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVD-HKLYRSMIGSLLY-LTASRPDIAYVVGICAR

Query:  YQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSE--LVGYCNADWAGTSTNNLFT
        Y S   +     +KR+++Y+  T D  +++  + + E  ++GY ++DWAG+  +   T
Subjt:  YQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSE--LVGYCNADWAGTSTNNLFT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.5e-5734.39Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        MQEE+   ++N  + LV  P     +  KW+FK K D    ++R KARLV +G+ Q KG+DF++ F+P+ ++ +IR +LS++     ++ Q+DVK+AFL+
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTS-TGLIVAQIYVDDIIFGGFPKTLVNNFINIM
        G L EE+Y+ Q + F  +     V KLNK+LYGLKQAPR WY +   ++  + Y +  +D  ++  R S    I+  +YVDD++  G  K L+      +
Subjt:  GYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTS-TGLIVAQIYVDDIIFGGFPKTLVNNFINIM

Query:  KSEFEMSLVGELSCFLALQI--KQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHK------LYRSMIGSLLY-LTASRPDI
           F+M  +G     L ++I  ++ +  +++SQEKY + ++++F + +++    P A H K++K      V+ K       Y S +GSL+Y +  +RPDI
Subjt:  KSEFEMSLVGELSCFLALQI--KQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHK------LYRSMIGSLLY-LTASRPDI

Query:  AYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAG------TSTNNLFTLNQSQLS
        A+ VG+ +R+  +P   H  AVK I++Y+  TT    L F  +   L GY +AD AG      +ST  LFT +   +S
Subjt:  AYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAG------TSTNNLFTLNQSQLS

P25600 Putative transposon Ty5-1 protein YCL074W1.2e-3030.31Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTL
        MDV +AFLN  ++E +YV Q   FV+   P YV++L   +YGLKQAP  W E +   L + G+ R E +  L+   TS G I   +YVDD++       +
Subjt:  MDVKSAFLNGYLNEEVYVAQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTL

Query:  VNNFINIMKSEFEMSLVGELSCFLALQIKQRNEG-IFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLY-LTASRPD
         +     +   + M  +G++  FL L I Q + G I +S + Y      +  ++  +  + P      + + T  +  D   Y+S++G LL+     RPD
Subjt:  VNNFINIMKSEFEMSLVGELSCFLALQIKQRNEG-IFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLY-LTASRPD

Query:  IAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNA
        I+Y V + +R+  +PR  HL + +R+++Y++ T    + Y   +   L  YC+A
Subjt:  IAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.6e-6236.71Show/hide
Query:  NNVWTLV-PKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYLNEEVYV
        N+ W LV P P    I+G +WIF  K +   S+ R KARLVA+GY Q  G+D+ +TF+P+ +  +IR++L ++  R + + Q+DV +AFL G L ++VY+
Subjt:  NNVWTLV-PKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYLNEEVYV

Query:  AQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKSEFEMSLVG
        +Q   F+D + P YV KL KALYGLKQAPRAWY +L  YL   G+    +D +LF+ +    ++   +YVDDI+  G   TL++N ++ +   F +    
Subjt:  AQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKSEFEMSLVG

Query:  ELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARYQSDPRTSHL
        EL  FL ++ K+   G+ +SQ +Y  +L+ +  +  ++    P A   K++  +     D   YR ++GSL YL  +RPDI+Y V   +++   P   HL
Subjt:  ELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARYQSDPRTSHL

Query:  NAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFT
         A+KRI++Y+  T + GI      +  L  Y +ADWAG   + + T
Subjt:  NAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.2e-6336.71Show/hide
Query:  NNVWTLVPKPDVA-NIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYLNEEVYV
        N+ W LVP P  +  I+G +WIF  K +   S+ R KARLVA+GY Q  G+D+ +TF+P+ +  +IR++L ++  R + + Q+DV +AFL G L +EVY+
Subjt:  NNVWTLVPKPDVA-NIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYLNEEVYV

Query:  AQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKSEFEMSLVG
        +Q   FVD + P YV +L KA+YGLKQAPRAWY +L  YL   G+    +D +LF+ +    +I   +YVDDI+  G    L+ + ++ +   F +    
Subjt:  AQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKSEFEMSLVG

Query:  ELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARYQSDPRTSHL
        +L  FL ++ K+  +G+ +SQ +Y  +L+ +  +  ++    P AT  K+T  +     D   YR ++GSL YL  +RPD++Y V   ++Y   P   H 
Subjt:  ELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARYQSDPRTSHL

Query:  NAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFT
        NA+KR+++Y+  T D GI      +  L  Y +ADWAG + + + T
Subjt:  NAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAGTSTNNLFT

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-5434.1Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN
        M +E+   +  + W +   P     IG KW++K K +   ++ R KARLVA+GY Q +G+DF +TF+P+ +L +++L+L+IS    F L Q+D+ +AFLN
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLN

Query:  GYLNEEVYV----AQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFI
        G L+EE+Y+        R  DS  P  V  L K++YGLKQA R W+ + ++ L   G+ +  +D T F+  T+T  +   +YVDDII        V+   
Subjt:  GYLNEEVYV----AQLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFI

Query:  NIMKSEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGI
        + +KS F++  +G L  FL L+I +   GI I Q KYA +L+ + GL   +   +P       +  +  + VD K YR +IG L+YL  +R DI++ V  
Subjt:  NIMKSEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGI

Query:  CARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADW
         +++   PR +H  AV +I+ Y+  T   G+ Y      +L  + +A +
Subjt:  CARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADW

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.6e-0631.82Show/hide
Query:  LYLTASRPDIAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWA
        +YLT +RPD+ + V   +++ S  RT+ + AV +++ YV  T   G+ Y   +  +L  + ++DWA
Subjt:  LYLTASRPDIAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWA

ATMG00810.1 DNA/RNA polymerases superfamily protein3.3e-2333.33Show/hide
Query:  IYVDDIIFGGFPKTLVNNFINIMKSEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSM
        +YVDDI+  G   TL+N  I  + S F M  +G +  FL +QIK    G+F+SQ KYA+ ++   G+   +    P       +  T     D   +RS+
Subjt:  IYVDDIIFGGFPKTLVNNFINIMKSEFEMSLVGELSCFLALQIKQRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSM

Query:  IGSLLYLTASRPDIAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAG-TSTNNLFT
        +G+L YLT +RPDI+Y V I  +   +P  +  + +KR+++YV  T   G+    ++   +  +C++DWAG TST    T
Subjt:  IGSLLYLTASRPDIAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYFYDTSSELVGYCNADWAG-TSTNNLFT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.3e-1546.34Show/hide
Query:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSIS
        MQEEL    RN  W LVP P   NI+G KW+FK K     ++ R KARLVA+G+ Q +G+ F +T++P+ R  TIR +L+++
Subjt:  MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGAAGAGTTATTACAGTTCAAGCGTAACAACGTTTGGACTTTGGTTCCTAAACCTGATGTGGCGAACATCATAGGAACTAAGTGGATTTTTAAAAATAAAACTGA
TGAATCTGAGAGTGTAATAAGGAACAAGGCCCGTTTGGTGGCTCAAGGTTATGCACAGGTAAAAGGTGTTGATTTTAATAAAACTTTTGCACCTATGGCTAGACTTGAAA
CTATCCGCCTTTTGCTCAGTATATCATGTTTCCGAAAATTTAAATTGTTTCAAATGGACGTTAAAAGTGCCTTCCTGAATGGATACTTAAATGAGGAAGTCTATGTAGCA
CAACTTAAAAGGTTTGTTGATTCTGAATTTCCTCAGTATGTCTACAAGCTAAATAAAGCTCTATATGGGTTAAAGCAAGCTCCTCGGGCTTGGTATGAACAACTAACAAT
GTATCTTAGTGAAAGAGGATATTCCAGGGGTGAGACTGACAAGACACTATTCATAAATAGAACCAGCACTGGTCTCATTGTAGCTCAAATTTATGTTGATGACATCATCT
TTGGTGGATTTCCTAAAACACTTGTTAATAATTTCATTAACATAATGAAATCAGAATTCGAAATGAGCCTAGTAGGTGAACTGTCCTGCTTTCTGGCATTGCAGATCAAA
CAGAGAAATGAGGGAATATTTATATCACAAGAGAAGTATGCCAAGAACTTAGTCAAGAAGTTTGGTCTGGATCATTCACAACACAAAAGGATTCCAGCTGCGACTCATGC
TAAAATTACAAAGGATACGGTAGATAATGCAGTCGATCACAAATTGTACAGAAGCATGATTGGAAGCCTTTTATATTTGACAGCAAGCAGACCTGATATTGCCTATGTTG
TGGGAATATGTGCTCGGTATCAGTCAGATCCACGTACCTCTCATTTAAATGCAGTTAAACGAATAATAAAGTATGTTCACCGAACAACTGATTTCGGGATTCTGTACTTC
TACGATACATCTTCTGAACTAGTGGGATATTGTAATGCCGACTGGGCAGGTACTTCTACAAATAACCTCTTTACACTTAACCAAAGTCAATTATCAAGCCTTATTCTTCC
TCGCCAACTTCAAGCCTTCAACTCTTCCATTTTTTCCACTTCTCTTGACCACCTAACAATGAACAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGAAGAGTTATTACAGTTCAAGCGTAACAACGTTTGGACTTTGGTTCCTAAACCTGATGTGGCGAACATCATAGGAACTAAGTGGATTTTTAAAAATAAAACTGA
TGAATCTGAGAGTGTAATAAGGAACAAGGCCCGTTTGGTGGCTCAAGGTTATGCACAGGTAAAAGGTGTTGATTTTAATAAAACTTTTGCACCTATGGCTAGACTTGAAA
CTATCCGCCTTTTGCTCAGTATATCATGTTTCCGAAAATTTAAATTGTTTCAAATGGACGTTAAAAGTGCCTTCCTGAATGGATACTTAAATGAGGAAGTCTATGTAGCA
CAACTTAAAAGGTTTGTTGATTCTGAATTTCCTCAGTATGTCTACAAGCTAAATAAAGCTCTATATGGGTTAAAGCAAGCTCCTCGGGCTTGGTATGAACAACTAACAAT
GTATCTTAGTGAAAGAGGATATTCCAGGGGTGAGACTGACAAGACACTATTCATAAATAGAACCAGCACTGGTCTCATTGTAGCTCAAATTTATGTTGATGACATCATCT
TTGGTGGATTTCCTAAAACACTTGTTAATAATTTCATTAACATAATGAAATCAGAATTCGAAATGAGCCTAGTAGGTGAACTGTCCTGCTTTCTGGCATTGCAGATCAAA
CAGAGAAATGAGGGAATATTTATATCACAAGAGAAGTATGCCAAGAACTTAGTCAAGAAGTTTGGTCTGGATCATTCACAACACAAAAGGATTCCAGCTGCGACTCATGC
TAAAATTACAAAGGATACGGTAGATAATGCAGTCGATCACAAATTGTACAGAAGCATGATTGGAAGCCTTTTATATTTGACAGCAAGCAGACCTGATATTGCCTATGTTG
TGGGAATATGTGCTCGGTATCAGTCAGATCCACGTACCTCTCATTTAAATGCAGTTAAACGAATAATAAAGTATGTTCACCGAACAACTGATTTCGGGATTCTGTACTTC
TACGATACATCTTCTGAACTAGTGGGATATTGTAATGCCGACTGGGCAGGTACTTCTACAAATAACCTCTTTACACTTAACCAAAGTCAATTATCAAGCCTTATTCTTCC
TCGCCAACTTCAAGCCTTCAACTCTTCCATTTTTTCCACTTCTCTTGACCACCTAACAATGAACAAATAA
Protein sequenceShow/hide protein sequence
MQEELLQFKRNNVWTLVPKPDVANIIGTKWIFKNKTDESESVIRNKARLVAQGYAQVKGVDFNKTFAPMARLETIRLLLSISCFRKFKLFQMDVKSAFLNGYLNEEVYVA
QLKRFVDSEFPQYVYKLNKALYGLKQAPRAWYEQLTMYLSERGYSRGETDKTLFINRTSTGLIVAQIYVDDIIFGGFPKTLVNNFINIMKSEFEMSLVGELSCFLALQIK
QRNEGIFISQEKYAKNLVKKFGLDHSQHKRIPAATHAKITKDTVDNAVDHKLYRSMIGSLLYLTASRPDIAYVVGICARYQSDPRTSHLNAVKRIIKYVHRTTDFGILYF
YDTSSELVGYCNADWAGTSTNNLFTLNQSQLSSLILPRQLQAFNSSIFSTSLDHLTMNK