CuGenDBv2

Pay0010918 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0010918
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag-pol polyprotein
Genome location: chr06:24570624..24577285
RNA-Seq Expression: Pay0010918
Synteny: Pay0010918
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
AAO73521.1 gag-pol polyprotein [Glycine max]    e value: 1.8e-306    % identity: 53.53
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKII  G +  + LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    ++ S TC  ++ ++  +WH+  G + +RG++KII   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T++VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K +T E+ K L L+LQREK   I +I+SDHG+EF++     FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK +VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+      N++DA    ++ + S   ++E    + +   S  ++K HP   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEEVYV Q KGF D  HP HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL+ F GLQ+KQ  D IF+SQ +YAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD  LYRS++GSLLYLTASRPDI YAV +CARYQ                YV+ TSD+G+MY   +   LVGY DA+WAGS DDRKST
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------
        SGG F+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQHSRTKHI IRHH+           
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------

Query:  ---------ILDIFTKPLDANSFEYLHAGLGVC
                 I DIFTK LDAN FE L   LG+C
Subjt:  ---------ILDIFTKPLDANSFEYLHAGLGVC

AAO73523.1 gag-pol polyprotein [Glycine max]    e value: 8.2e-307    % identity: 53.53
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKII  G +  + LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    ++ S TC  ++ ++  +WH+  G + +RG++KI+   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T++VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K  T E+ K L L+LQREK   I +I+SDHG+EF++  F  FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK +VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+      N++DA    ++ + S   ++E    + +   S  ++K HP   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEEVYV Q KGF D  HP HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL+ F GLQ+KQ  D IF+SQ +YAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD K YRS++GSLLYLTASRPDI YAV +CARYQ                YV+ TSD+G+MY   ++  LVGY DA+WAGS DDRKST
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------
        SGG F+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQHSRTKHI IRHH+           
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------

Query:  ---------ILDIFTKPLDANSFEYLHAGLGVC
                 I DIFTK LDAN FE L   LG+C
Subjt:  ---------ILDIFTKPLDANSFEYLHAGLGVC

AAO73525.1 gag-pol polyprotein [Glycine max]    e value: 8.0e-302    % identity: 53.05
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKI   G +  + LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    ++ S TC  ++ ++  +WH+  G + +RG++KII   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T+ VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K DT E+ K L L+LQREK   I +I+SDHG+EF++  F  FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE  RVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK  VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+      N++D    +++ +KS   ++E    + +  P   ++K  P   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEE YV Q KGFVD  H  HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL  F GLQ+KQ  D IF+SQ KYAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD  LYRS++GSLLYLTASRPDI +AV +CARYQ                YV+ TSD+G+MY   +   LVGY DA+WAGS DDRK T
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI----------
        SGG F+LG NLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQH+RTKHI IRHH+I          
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI----------

Query:  ----------LDIFTKPLDANSFEYLHAGLGVC
                   DIFTK LDAN FE L   LG C
Subjt:  ----------LDIFTKPLDANSFEYLHAGLGVC

AAO73527.1 gag-pol polyprotein [Glycine max]    e value: 3.7e-307    % identity: 53.63
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKII  G +  + LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    ++ S TC  ++ ++  +WH+  G + +RG++KII   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T++VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K +T E+ K L L+LQREK   I +I+SDHG+EF++  F  FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK +VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEEDES--PNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+  +   N++DA    ++ + S   ++E    + +   S  ++K HP   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEEDES--PNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEEVYV Q KGF D  HP HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL+ F GLQ+KQ  D IF+SQ +YAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD  LYRS++GSLLYLTASRPDI YAV +CARYQ                YV+ TSD+G+MY   +   LVGY DA+WAGS DDRKST
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------
        SGG F+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQHSRTKHI IRHH+           
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------

Query:  ---------ILDIFTKPLDANSFEYLHAGLGVC
                 I DIFTK LDAN FE L   LG+C
Subjt:  ---------ILDIFTKPLDANSFEYLHAGLGVC

AAO73529.1 gag-pol polyprotein [Glycine max]    e value: 2.8e-307    % identity: 53.73
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKI   G +    LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    S+ S TC  ++ ++  +WH+  G + +RG++KII   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T++VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K DT E+ K L L+LQREK   I +I+SDHG+EF++  F  FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK  VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+      N++D    +++ + S   ++E    + +  PS  ++K HP   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEE YV Q KGFVD  HP HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL+ F GLQ+KQ  D IF+SQ KYAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD  LYRS++GSLLYLTASRPDI YAV +CARYQ                YV+ TSD+G+MY   +   LVGY DA+WAGS DDRKST
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------
        SGG F+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQHSRTKHI IRHH+           
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------

Query:  ---------ILDIFTKPLDANSFEYLHAGLGVC
                 I DIFTK LDA  FE L   LG+C
Subjt:  ---------ILDIFTKPLDANSFEYLHAGLGVC

TrEMBL top hits (e value, % identity, alignment)
Q84VH6 Gag-pol polyprotein    e value: 1.4e-307    % identity: 53.73
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKI   G +    LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    S+ S TC  ++ ++  +WH+  G + +RG++KII   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T++VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K DT E+ K L L+LQREK   I +I+SDHG+EF++  F  FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK  VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+      N++D    +++ + S   ++E    + +  PS  ++K HP   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEE YV Q KGFVD  HP HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL+ F GLQ+KQ  D IF+SQ KYAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD  LYRS++GSLLYLTASRPDI YAV +CARYQ                YV+ TSD+G+MY   +   LVGY DA+WAGS DDRKST
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------
        SGG F+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQHSRTKHI IRHH+           
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------

Query:  ---------ILDIFTKPLDANSFEYLHAGLGVC
                 I DIFTK LDA  FE L   LG+C
Subjt:  ---------ILDIFTKPLDANSFEYLHAGLGVC

Q84VH8 Gag-pol polyprotein    e value: 1.8e-307    % identity: 53.63
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKII  G +  + LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    ++ S TC  ++ ++  +WH+  G + +RG++KII   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T++VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K +T E+ K L L+LQREK   I +I+SDHG+EF++  F  FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK +VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEEDES--PNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+  +   N++DA    ++ + S   ++E    + +   S  ++K HP   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEEDES--PNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEEVYV Q KGF D  HP HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL+ F GLQ+KQ  D IF+SQ +YAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD  LYRS++GSLLYLTASRPDI YAV +CARYQ                YV+ TSD+G+MY   +   LVGY DA+WAGS DDRKST
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------
        SGG F+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQHSRTKHI IRHH+           
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------

Query:  ---------ILDIFTKPLDANSFEYLHAGLGVC
                 I DIFTK LDAN FE L   LG+C
Subjt:  ---------ILDIFTKPLDANSFEYLHAGLGVC

Q84VI0 Gag-pol polyprotein    e value: 3.9e-302    % identity: 53.05
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKI   G +  + LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    ++ S TC  ++ ++  +WH+  G + +RG++KII   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T+ VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K DT E+ K L L+LQREK   I +I+SDHG+EF++  F  FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE  RVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK  VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+      N++D    +++ +KS   ++E    + +  P   ++K  P   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEE YV Q KGFVD  H  HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL  F GLQ+KQ  D IF+SQ KYAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD  LYRS++GSLLYLTASRPDI +AV +CARYQ                YV+ TSD+G+MY   +   LVGY DA+WAGS DDRK T
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI----------
        SGG F+LG NLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQH+RTKHI IRHH+I          
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI----------

Query:  ----------LDIFTKPLDANSFEYLHAGLGVC
                   DIFTK LDAN FE L   LG C
Subjt:  ----------LDIFTKPLDANSFEYLHAGLGVC

Q84VI2 Gag-pol polyprotein    e value: 4.0e-307    % identity: 53.53
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKII  G +  + LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    ++ S TC  ++ ++  +WH+  G + +RG++KI+   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T++VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K  T E+ K L L+LQREK   I +I+SDHG+EF++  F  FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK +VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+      N++DA    ++ + S   ++E    + +   S  ++K HP   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEEVYV Q KGF D  HP HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL+ F GLQ+KQ  D IF+SQ +YAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD K YRS++GSLLYLTASRPDI YAV +CARYQ                YV+ TSD+G+MY   ++  LVGY DA+WAGS DDRKST
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------
        SGG F+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQHSRTKHI IRHH+           
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------

Query:  ---------ILDIFTKPLDANSFEYLHAGLGVC
                 I DIFTK LDAN FE L   LG+C
Subjt:  ---------ILDIFTKPLDANSFEYLHAGLGVC

Q84VI4 Gag-pol polyprotein    8.9e-307    53.53
Query:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC
        ++  TS++ +A + WY DSGCSRHM G + +  N++ C T +VTFGDG+KGKII  G +  + LP                         + V+F    C
Subjt:  MIAFTSVQ-TADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKNNLPR------------------------YKVSFDDIGC

Query:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY
        +V N+++++ M G R  +NCY W    ++ S TC  ++ ++  +WH+  G + +RG++KII   A+ GIP+L +     CG+CQIGKQ + +H+ L+   
Subjt:  VVMNKENQICMSGKRQTNNCYHW---NSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCY

Query:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP
        T++VLELLHMDLMG MQ ESL GKRY  VVVDD+SR+TWV F++ K +T E+ K L L+LQREK   I +I+SDHG+EF++     FC  EGI HEFSA 
Subjt:  TNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAP

Query:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS
        ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA      NTAC+IHNRVT+R  T  TLYE+WK RK +VK+FH+FGS CYILADRE R+K D +S
Subjt:  ITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARS

Query:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD
          GIFLGYS NSRAYRVFN+R  +VM++INVV++DL  A K   +E+      N++DA    ++ + S   ++E    + +   S  ++K HP   IIGD
Subjt:  KQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVNDEE--DESPNMSDARTMSKSLKKS---SEEIITKKSELIPSAHVKKNHPASSIIGD

Query:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT
        P+ G+ TR +E    +++V + C++S IEP  V  AL DE+W+N MQEEL QF+RN VW L                  K +E   +T+NK RLVAQGYT
Subjt:  PSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYT

Query:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS
        Q+EG+DFDETFAPVA+LE+IRLLLG++CI KFKLYQMDVKSAFLNGYLNEEVYV Q KGF D  HP HV++L K LYGLKQAPRAWY+RLT +L  +GY 
Subjt:  QVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYS

Query:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT
         G  DKTLF+ + ++ L++AQIYVDDI+FGG S +++ +F+  MQSEFEMS+VGEL+ F GLQ+KQ  D IF+SQ +YAKN+VKKFG+E A +KRTPA T
Subjt:  GGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAAT

Query:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST
        H+KL+ D  G  VD  LYRS++GSLLYLTASRPDI YAV +CARYQ                YV+ TSD+G+MY   +   LVGY DA+WAGS DDRKST
Subjt:  HVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQ----------------YVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKST

Query:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------
        SGG F+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+               Q  MTLYCDNMSAI+I KNPVQHSRTKHI IRHH+           
Subjt:  SGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCT---------------QTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHF-----------

Query:  ---------ILDIFTKPLDANSFEYLHAGLGVC
                 I DIFTK LDAN FE L   LG+C
Subjt:  ---------ILDIFTKPLDANSFEYLHAGLGVC

SwissProt top hits    e value    % identity    Alignment
P04146 Copia protein    2.7e-90    26.8
Query:  VTFGDGAKGKIIAKGNIDKNNLPRYKVSFDDIGCVVMNKENQICMSGKRQTNNCYHWNSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVG-
        V F   A G +++   + +  +    + FD  G V ++K   + +      NN    N   + +      N   LWH+  G +S   L +I +       
Subjt:  VTFGDGAKGKIIAKGNIDKNNLPRYKVSFDDIGCVVMNKENQICMSGKRQTNNCYHWNSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVG-

Query:  --IPDLDVNGKFFCGDCQIGKQTRSTHKSLK-KCYTNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKG
          + +L+++ +  C  C  GKQ R   K LK K +  + L ++H D+ G +   +L  K Y ++ VD ++ Y     +K K D   + ++   K +    
Subjt:  --IPDLDVNGKFFCGDCQIGKQTRSTHKSLK-KCYTNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKG

Query:  KKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIR--TRTIVTL
         K++ +  D+G+E+ S     FC+ +GI +  + P TPQ NGV ER  RT+ E AR M+    L   FW EA +     TA ++ NR+  R    +  T 
Subjt:  KKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIR--TRTIVTL

Query:  YELWKERKLNVKYFHVFGSTCYILADREYRQKWDARSKQGIFLGYSKNS-RAYRVFNNRF---------------GSVMKTINVVINDLDSA--------
        YE+W  +K  +K+  VFG+T Y+   +  + K+D +S + IF+GY  N  + +   N +F                  +K   V + D   +        
Subjt:  YELWKERKLNVKYFHVFGSTCYILADREYRQKWDARSKQGIFLGYSKNS-RAYRVFNNRF---------------GSVMKTINVVINDLDSA--------

Query:  ------------------IKHVNDEED----------------ESPN----------MSDARTMSK-----SLKKSSEEIITKKS---------ELIPSA
                          I+ + D ++                E PN          + D++  +K     S K+  ++ + +           E   + 
Subjt:  ------------------IKHVNDEED----------------ESPN----------MSDARTMSK-----SLKKSSEEIITKKS---------ELIPSA

Query:  HVKK---NHPASS----IIGDPSAGMQTR-----RKEKIDYMKMVVDLCYISTIEPSTVDS-ALKDE--YWLNVMQEELLQFRRNNVWTL----------
        H+K+   ++P  +    II   S  ++T+      +E     K+V++   I    P++ D    +D+   W   +  EL   + NN WT+          
Subjt:  HVKK---NHPASS----IIGDPSAGMQTR-----RKEKIDYMKMVVDLCYISTIEPSTVDS-ALKDE--YWLNVMQEELLQFRRNNVWTL----------

Query:  --------KIDEARCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHK
                K +E     + K RLVA+G+TQ   ID++ETFAPVA++ + R +L +      K++QMDVK+AFLNG L EE+Y+   +G   S +  +V K
Subjt:  --------KIDEARCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHK

Query:  LNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKS--DQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKND
        LNK +YGLKQA R W++     L++  +     D+ ++I  K   ++ +   +YVDD++        +NNF   +  +F M+ + E+  F G++I+ + D
Subjt:  LNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKS--DQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKND

Query:  DIFISQEKYAKNMVKKFGLEQARNKRTPAATHVK---LTTDTDGVEVDHKLYRSIVGSLLY-LTASRPDIAYAVEICARY----------------QYVH
         I++SQ  Y K ++ KF +E      TP  + +    L +D D     +   RS++G L+Y +  +RPD+  AV I +RY                +Y+ 
Subjt:  DIFISQEKYAKNMVKKFGLEQARNKRTPAATHVK---LTTDTDGVEVDHKLYRSIVGSLLY-LTASRPDIAYAVEICARY----------------QYVH

Query:  ETSDFGMMYSCDTTL--TLVGYYDANWAGSVDDRKSTSGGRFFLGN-NLISWLSKKQNCVSLSTAEAEYIAIGSGC----------------TQTTMTLY
         T D  +++  +      ++GY D++WAGS  DRKST+G  F + + NLI W +K+QN V+ S+ EAEY+A+                     +  + +Y
Subjt:  ETSDFGMMYSCDTTL--TLVGYYDANWAGSVDDRKSTSGGRFFLGN-NLISWLSKKQNCVSLSTAEAEYIAIGSGC----------------TQTTMTLY

Query:  CDNMSAIDILKNPVQHSRTKHIKIRHHF--------------------ILDIFTKPLDANSFEYLHAGLGV
         DN   I I  NP  H R KHI I++HF                    + DIFTKPL A  F  L   LG+
Subjt:  CDNMSAIDILKNPVQHSRTKHIKIRHHF--------------------ILDIFTKPLDANSFEYLHAGLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    7.4e-109    30.15
Query:  LWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCYTNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFL
        LWHK +G +S +GL+ + K   I       V     C  C  GKQ R + ++  +   N +L+L++ D+ G M+ ES+ G +Y +  +DD SR  WV  L
Subjt:  LWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKCYTNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFL

Query:  KGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQ
        K K    ++ +     ++RE G+K+ +++SD+G E+ S  F  +C   GI HE + P TPQ NGV ER NRT+ E  R M+    LP  FW EA      
Subjt:  KGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQ

Query:  NTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARSKQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVIN--------D
         TAC++ NR             +W  ++++  +  VFG   +    +E R K D +S   IF+GY      YR+++     V+++ +VV          D
Subjt:  NTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARSKQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVIN--------D

Query:  LDSAIKH-------VNDEEDESPNMSDARTMSKSLKKSSEEIITKKSELIPSAHVKKNHPASS------IIGDPSAGMQTRRKEKIDYMKMVVDLCYIST
        +   +K+              +P  +++ T   S +      + ++ E +     +  HP         +       +++RR    +Y+ +  D      
Subjt:  LDSAIKH-------VNDEEDESPNMSDARTMSKSLKKSSEEIITKKSELIPSAHVKKNHPASS------IIGDPSAGMQTRRKEKIDYMKMVVDLCYIST

Query:  IEPSTVDSAL---KDEYWLNVMQEELLQFRRNN------------------VWTLKIDEARCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLL
         EP ++   L   +    +  MQEE+   ++N                   V+ LK D    + + K RLV +G+ Q +GIDFDE F+PV ++ +IR +L
Subjt:  IEPSTVDSAL---KDEYWLNVMQEELLQFRRNN------------------VWTLKIDEARCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLL

Query:  GISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSD-QLLVAQIY
         ++     ++ Q+DVK+AFL+G L EE+Y+ Q +GF  +     V KLNK+LYGLKQAPR WY +   +++ + Y     D  ++  R S+   ++  +Y
Subjt:  GISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSD-QLLVAQIY

Query:  VDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQI--KQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHK-----
        VDD++  G  + L+      +   F+M  +G      G++I  ++ +  +++SQEKY + ++++F ++ A+   TP A H+KL+       V+ K     
Subjt:  VDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQI--KQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHK-----

Query:  -LYRSIVGSLLY-LTASRPDIAYAVEICARY----------------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISW
          Y S VGSL+Y +  +RPDIA+AV + +R+                +Y+  T+   + +     + L GY DA+ AG +D+RKS++G  F      ISW
Subjt:  -LYRSIVGSLLY-LTASRPDIAYAVEICARY----------------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISW

Query:  LSKKQNCVSLSTAEAEYIAIGS---------------GCTQTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFILDI
         SK Q CV+LST EAEYIA                  G  Q    +YCD+ SAID+ KN + H+RTKHI +R+H+I ++
Subjt:  LSKKQNCVSLSTAEAEYIAIGS---------------GCTQTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFILDI

P25600 Putative transposon Ty5-1 protein YCL074W    1.4e-27    29.43
Query:  MDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDL
        MDV +AFLN  ++E +YV Q  GFV+  +P +V +L   +YGLKQAP  W + +   L+  G+   + +  L+    SD  +   +YVDD++    S  +
Subjt:  MDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDL

Query:  VNNFINIMQSEFEMSMVGELSCFWGLQIKQ-KNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLY-LTASRPD
         +     +   + M  +G++  F GL I Q  N DI +S + Y      +  +   +  +TP      L   T     D   Y+SIVG LL+     RPD
Subjt:  VNNFINIMQSEFEMSMVGELSCFWGLQIKQ-KNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLY-LTASRPD

Query:  IAYAVEICARY----------------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISWLSKK-QNCVSLSTAEAEYI
        I+Y V + +R+                +Y++ T    + Y   + L L  Y DA+     D   ST G    L    ++W SKK +  + + + EAEYI
Subjt:  IAYAVEICARY----------------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISWLSKK-QNCVSLSTAEAEYI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    3.0e-86    26.85
Query:  NIDKNNLPRYK--------VSFDDIGCVVMNKENQICMSGKRQTNNCYHWNSNTSDTCQLTRSNQTWL----WHKMLGRVSMRGLEKIIKNEAIVGIPDL
        NI KN +  Y+        V F      V +    + +   +  +  Y W   +S    L  S  +      WH  LG  +   L  +I N     +  L
Subjt:  NIDKNNLPRYK--------VSFDDIGCVVMNKENQICMSGKRQTNNCYHWNSNTSDTCQLTRSNQTWL----WHKMLGRVSMRGLEKIIKNEAIVGIPDL

Query:  DVNGKFF-CGDCQIGKQTRSTHKSLKKCYTNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKI
        + + KF  C DC I K  +    S     + + LE ++ D+       S    RY ++ VD ++RYTW+  LK K    E        L+     +I   
Subjt:  DVNGKFF-CGDCQIGKQTRSTHKSLKKCYTNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKI

Query:  QSDHGKEFDS--EGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKE
         SD+G EF +  E F+      GI H  S P TP+ NG+ ERK+R + E    ++   ++P  +W  A        A ++ NR+      + + ++    
Subjt:  QSDHGKEFDS--EGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKE

Query:  RKLNVKYFHVFGSTCYILADREYRQKWDARSKQGIFLGYSKNSRAY--------RVFNNR----------FGSVMKTIN---------------------
           N     VFG  CY       + K D +S+Q +FLGYS    AY        R++ +R          F + + T++                     
Subjt:  RKLNVKYFHVFGSTCYILADREYRQKWDARSKQGIFLGYSKNSRAY--------RVFNNR----------FGSVMKTIN---------------------

Query:  -------------------------------VVINDLDSAIK------------------------------------HVNDEEDESPNM------SDAR
                                       V  ++LDS+                                        N+  +ESP+       + A+
Subjt:  -------------------------------VVINDLDSAIK------------------------------------HVNDEEDESPNM------SDAR

Query:  TMSKSLKKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSA------GMQTRRKEKI--DYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFR
        + S S   ++    +  S   PS  +    P + I+ + +        M TR K  I     K  + +   +  EP T   ALKDE W N M  E+    
Subjt:  TMSKSLKKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSA------GMQTRRKEKI--DYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFR

Query:  RNNVW-------------------TLKIDEARCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVY
         N+ W                   T K +    + + K RLVA+GY Q  G+D+ ETF+PV +  +IR++LG++  + + + Q+DV +AFL G L ++VY
Subjt:  RNNVW-------------------TLKIDEARCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVY

Query:  VAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMV
        ++Q  GF+D + P +V KL K LYGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+  G    L++N ++ +   F +   
Subjt:  VAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMV

Query:  GELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARY--------
         EL  F G++ K+    + +SQ +Y  +++ +  +  A+   TP A   KL+  +     D   YR IVGSL YL  +RPDI+YAV   +++        
Subjt:  GELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARY--------

Query:  --------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSG-------CTQTT----
                +Y+  T + G+      TL+L  Y DA+WAG  DD  ST+G   +LG++ ISW SKKQ  V  S+ EAEY ++ +        C+  T    
Subjt:  --------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSG-------CTQTT----

Query:  -----MTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI--------------------LDIFTKPLDANSFEYLHAGLGV
               +YCDN+ A  +  NPV HSR KHI I +HFI                     D  TKPL   +F+   + +GV
Subjt:  -----MTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI--------------------LDIFTKPLDANSFEYLHAGLGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    6.5e-89    26.87
Query:  KIIAKGNIDKN--------NLPRYKVSFDDIGCVVMNKENQICMSGKRQTNNCYHWNSNTSDTCQLTRSNQTWL----WHKMLGRVSMRGLEKIIKNEAI
        K++   NI KN        N  R  V F      V +    + +   +  +  Y W   +S    +  S  +      WH  LG  S+  L  +I N + 
Subjt:  KIIAKGNIDKN--------NLPRYKVSFDDIGCVVMNKENQICMSGKRQTNNCYHWNSNTSDTCQLTRSNQTWL----WHKMLGRVSMRGLEKIIKNEAI

Query:  VGIPDLDVNGKFF-CGDCQIGKQTRSTHK---SLKKCYTNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKI---DTVEICKNLCLK
          +P L+ + K   C DC I K    +HK   S     ++K LE ++ D+       S+   RY ++ VD ++RYTW+  LK K    DT  I K+L   
Subjt:  VGIPDLDVNGKFF-CGDCQIGKQTRSTHK---SLKKCYTNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKI---DTVEICKNLCLK

Query:  LQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRT
        ++     +I  + SD+G EF       +    GI H  S P TP+ NG+ ERK+R + EM   ++   ++P  +W  A      + A ++ NR+      
Subjt:  LQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRT

Query:  IVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARSKQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVN--------DEEDESP
        + + ++    +  N +   VFG  CY       R K + +SKQ  F+GYS    AY   +   G +  + +V  ++        N           D +P
Subjt:  IVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARSKQGIFLGYSKNSRAYRVFNNRFGSVMKTINVVINDLDSAIKHVN--------DEEDESP

Query:  N-----------------------------------------MSDARTMSKSLKKSSEEIITKKS-----------------------------------
        N                                         +S +   S S+   S    T  S                                   
Subjt:  N-----------------------------------------MSDARTMSKSLKKSSEEIITKKS-----------------------------------

Query:  ---------ELIPSAHV--------KKNHPASSIIGDP---------------------SAGMQTRRKEKI--DYMKMVVDLCYISTIEPSTVDSALKDE
                   I S H+        + N P+SS    P                     +  M TR K+ I     K        +  EP T   A+KD+
Subjt:  ---------ELIPSAHV--------KKNHPASSIIGDP---------------------SAGMQTRRKEKI--DYMKMVVDLCYISTIEPSTVDSALKDE

Query:  YWLNVMQEELLQFRRNNVW-------------------TLKIDEARCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDV
         W   M  E+     N+ W                   T K +    + + K RLVA+GY Q  G+D+ ETF+PV +  +IR++LG++  + + + Q+DV
Subjt:  YWLNVMQEELLQFRRNNVW-------------------TLKIDEARCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDV

Query:  KSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNN
         +AFL G L +EVY++Q  GFVD + P +V +L K +YGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+  G    L+ +
Subjt:  KSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNN

Query:  FINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAV
         ++ +   F +    +L  F G++ K+    + +SQ +Y  +++ +  +  A+   TP AT  KLT  +     D   YR IVGSL YL  +RPD++YAV
Subjt:  FINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAV

Query:  EICARY----------------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSG--
           ++Y                +Y+  T D G+      TL+L  Y DA+WAG  DD  ST+G   +LG++ ISW SKKQ  V  S+ EAEY ++ +   
Subjt:  EICARY----------------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSG--

Query:  -----CTQTT---------MTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI--------------------LDIFTKPLDANSFEYLHAGLGV
             C+  T           +YCDN+ A  +  NPV HSR KHI + +HFI                     D  TKPL   +F+     +GV
Subjt:  -----CTQTT---------MTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI--------------------LDIFTKPLDANSFEYLHAGLGV

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    1.8e-65    33.74
Query:  EKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVW------------------TLKIDEARCVTKNKVRLVAQGYTQVEGIDFDET
        EK+  +     +C     EPST + A +   W   M +E+      + W                   +K +    + + K RLVA+GYTQ EGIDF ET
Subjt:  EKIDYMKMVVDLCYISTIEPSTVDSALKDEYWLNVMQEELLQFRRNNVW------------------TLKIDEARCVTKNKVRLVAQGYTQVEGIDFDET

Query:  FAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFV----DSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDK
        F+PV +L +++L+L IS I  F L+Q+D+ +AFLNG L+EE+Y+    G+     DS  P  V  L K++YGLKQA R W+ + ++ L   G+     D 
Subjt:  FAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFV----DSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDK

Query:  TLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTT
        T F+   +   L   +YVDDII    +   V+   + ++S F++  +G L  F GL+I +    I I Q KYA +++ + GL   +    P    V  + 
Subjt:  TLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTT

Query:  DTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARY----------------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFF
         + G  VD K YR ++G L+YL  +R DI++AV   +++                 Y+  T   G+ YS    + L  + DA++    D R+ST+G   F
Subjt:  DTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARY----------------QYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFF

Query:  LGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCTQTTM-----------------TLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI
        LG +LISW SKKQ  VS S+AEAEY A+ S  T   M                  L+CDN +AI I  N V H RTKHI+   H +
Subjt:  LGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCTQTTM-----------------TLYCDNMSAIDILKNPVQHSRTKHIKIRHHFI

ATMG00810.1 DNA/RNA polymerases superfamily protein    1.1e-27    35.75
Query:  IYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSI
        +YVDDI+  G S  L+N  I  + S F M  +G +  F G+QIK     +F+SQ KYA+ ++   G+   +   TP    +  +  T     D   +RSI
Subjt:  IYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSI

Query:  VGSLLYLTASRPDIAYAVEI-CAR---------------YQYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISWLSKKQNC
        VG+L YLT +RPDI+YAV I C R                +YV  T   G+    ++ L +  + D++WAG    R+ST+G   FLG N+ISW +K+Q  
Subjt:  VGSLLYLTASRPDIAYAVEI-CAR---------------YQYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRKSTSGGRFFLGNNLISWLSKKQNC

Query:  VSLSTAEAEYIAIGSGCTQTT
        VS S+ E EY A+     + T
Subjt:  VSLSTAEAEYIAIGSGCTQTT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    6.2e-10    35.2
Query:  MQTRRKEKIDYMKMVVDLCYISTI--EPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYTQV
        M TR K  I+ +     L   +TI  EP +V  ALKD  W   MQEEL    RN  W L                  K+     + + K RLVA+G+ Q 
Subjt:  MQTRRKEKIDYMKMVVDLCYISTI--EPSTVDSALKDEYWLNVMQEELLQFRRNNVWTL------------------KIDEARCVTKNKVRLVAQGYTQV

Query:  EGIDFDETFAPVAQLEAIRLLLGIS
        EGI F ET++PV +   IR +L ++
Subjt:  EGIDFDETFAPVAQLEAIRLLLGIS


Sequences
CDS sequence
ATGGTGACTATTAATAAAACTAATCATCCGAGCTGTGCGCAGCTCCCCAGATGTAGGCAATATTGGCCGAACTGGGTTATCAAAGTCTCATGTGTTAATCTACTCTATTG
TTTCACTAGAACAATGAAAATTCTGATAGAAGAAGTGATGATTGCCTTTACATCCGTTCAGACTGCAGATGATGCGTGGTATTTTGATAGTGGGTGCTCCAGACATATGA
TTGGAAACAGATCCTACTTTACGAACTTAAAAGACTGTGTCACTGGACATGTTACCTTTGGTGATGGTGCAAAAGGAAAAATTATAGCTAAAGGCAACATAGACAAAAAT
AACCTACCACGCTACAAAGTTAGTTTTGATGATATTGGTTGTGTTGTGATGAATAAAGAAAATCAGATTTGTATGAGTGGTAAACGACAGACTAATAACTGTTATCACTG
GAACTCAAATACGTCAGACACCTGTCAGTTGACAAGATCAAATCAAACATGGCTATGGCATAAAATGCTGGGGCGTGTCAGTATGAGAGGCTTGGAAAAAATTATTAAAA
ATGAAGCAATTGTGGGAATTCCTGATTTAGACGTAAATGGAAAATTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGACAAGGTCTACTCATAAAAGTCTGAAAAAATGT
TATACTAACAAAGTCTTGGAATTGTTACATATGGATCTCATGGGTCAAATGCAAACAGAAAGTCTGAGAGGAAAGAGGTATGTGCTGGTAGTAGTTGATGATTACTCAAG
ATATACTTGGGTTTGCTTTCTCAAAGGCAAAATAGATACTGTTGAAATATGCAAAAATCTGTGTTTGAAACTACAACGTGAAAAAGGGAAGAAAATAATCAAGATCCAAA
GTGATCATGGTAAAGAGTTTGATAGTGAAGGTTTTAACAGTTTTTGTCTGTTAGAAGGAATACACCATGAATTCTCTGCACCTATAACTCCTCAACAAAATGGTGTAGTA
GAAAGAAAGAACAGGACGTTACAAGAAATGGCACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTAAAATATCGGGTTTACAAAATACTGC
TTGTCACATTCATAACAGGGTAACTATTAGAACTAGAACGATTGTTACACTGTATGAACTTTGGAAAGAGAGAAAGTTGAATGTTAAATACTTCCATGTGTTTGGAAGTA
CATGTTATATCTTAGCTGACAGGGAATATCGTCAGAAATGGGATGCTAGGTCAAAACAAGGAATCTTTCTCGGGTACTCTAAGAACAGTCGGGCCTATAGAGTCTTCAAT
AACAGATTTGGGAGTGTCATGAAAACAATCAATGTAGTTATAAATGATCTCGATTCAGCTATCAAACATGTGAATGATGAGGAAGATGAGTCTCCAAACATGTCTGATGC
TAGAACTATGAGCAAAAGTTTGAAAAAATCATCAGAAGAAATTATCACTAAGAAATCAGAACTAATTCCGTCTGCTCATGTGAAGAAAAATCATCCAGCAAGCTCTATTA
TAGGTGATCCGTCAGCTGGGATGCAAACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGTTGATTTATGTTATATTTCCACCATTGAACCTTCTACTGTTGAC
TCTGCTCTCAAGGATGAGTATTGGTTAAATGTTATGCAAGAGGAGTTACTCCAATTCAGACGAAACAATGTCTGGACGTTAAAGATTGATGAAGCTAGATGTGTGACAAA
AAATAAAGTCAGATTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTATTGACTTTGATGAAACGTTTGCTCCTGTTGCTCAACTTGAAGCCATTCGATTATTACTTGGTA
TATCATGCATACAGAAATTTAAATTGTATCAAATGGATGTAAAGAGTGCCTTCTTAAATGGATATTTGAATGAGGAGGTTTATGTTGCTCAACTAAAAGGTTTTGTTGAT
TCTGAGCACCCGAAGCATGTGCATAAGCTCAACAAAACTTTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGATCGGCTAACTATTTACTTGAGAGATAAAGGATA
TTCTGGAGGAAAATTTGACAAGACCTTGTTTATACACAGAAAATCTGATCAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTTCTCAAGATC
TAGTAAATAATTTCATTAACATTATGCAGTCAGAATTTGAAATGAGCATGGTTGGAGAACTTTCATGCTTTTGGGGACTTCAAATTAAGCAAAAGAATGACGACATCTTC
ATATCTCAAGAAAAGTATGCCAAGAATATGGTTAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGCGGACTCCAGCTGCGACACATGTTAAACTTACAACAGACACCGA
TGGTGTTGAAGTTGATCACAAACTCTACAGGAGTATAGTAGGCAGCTTATTATACTTAACAGCAAGTCGACCTGACATAGCTTATGCTGTGGAAATATGTGCTCGTTATC
AGTATGTTCATGAAACCAGTGACTTTGGAATGATGTATTCCTGTGATACCACCCTCACTCTTGTTGGATATTATGATGCTAACTGGGCAGGTTCAGTTGATGATCGTAAA
AGTACGTCTGGAGGACGTTTCTTTTTAGGAAACAATTTAATTTCTTGGTTAAGTAAGAAGCAAAATTGTGTCTCTTTATCTACAGCTGAAGCTGAATATATAGCAATAGG
TAGTGGTTGCACACAGACGACTATGACGTTGTATTGTGACAATATGAGCGCAATTGATATTTTAAAAAATCCTGTTCAACATAGTCGAACAAAGCACATTAAGATAAGAC
ATCACTTTATTCTCGATATTTTCACTAAACCTCTGGATGCAAACTCATTCGAATACTTACATGCTGGTCTAGGAGTGTGTCACACTTAA
mRNA sequence
ATGGTGACTATTAATAAAACTAATCATCCGAGCTGTGCGCAGCTCCCCAGATGTAGGCAATATTGGCCGAACTGGGTTATCAAAGTCTCATGTGTTAATCTACTCTATTG
TTTCACTAGAACAATGAAAATTCTGATAGAAGAAGTGATGATTGCCTTTACATCCGTTCAGACTGCAGATGATGCGTGGTATTTTGATAGTGGGTGCTCCAGACATATGA
TTGGAAACAGATCCTACTTTACGAACTTAAAAGACTGTGTCACTGGACATGTTACCTTTGGTGATGGTGCAAAAGGAAAAATTATAGCTAAAGGCAACATAGACAAAAAT
AACCTACCACGCTACAAAGTTAGTTTTGATGATATTGGTTGTGTTGTGATGAATAAAGAAAATCAGATTTGTATGAGTGGTAAACGACAGACTAATAACTGTTATCACTG
GAACTCAAATACGTCAGACACCTGTCAGTTGACAAGATCAAATCAAACATGGCTATGGCATAAAATGCTGGGGCGTGTCAGTATGAGAGGCTTGGAAAAAATTATTAAAA
ATGAAGCAATTGTGGGAATTCCTGATTTAGACGTAAATGGAAAATTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGACAAGGTCTACTCATAAAAGTCTGAAAAAATGT
TATACTAACAAAGTCTTGGAATTGTTACATATGGATCTCATGGGTCAAATGCAAACAGAAAGTCTGAGAGGAAAGAGGTATGTGCTGGTAGTAGTTGATGATTACTCAAG
ATATACTTGGGTTTGCTTTCTCAAAGGCAAAATAGATACTGTTGAAATATGCAAAAATCTGTGTTTGAAACTACAACGTGAAAAAGGGAAGAAAATAATCAAGATCCAAA
GTGATCATGGTAAAGAGTTTGATAGTGAAGGTTTTAACAGTTTTTGTCTGTTAGAAGGAATACACCATGAATTCTCTGCACCTATAACTCCTCAACAAAATGGTGTAGTA
GAAAGAAAGAACAGGACGTTACAAGAAATGGCACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTAAAATATCGGGTTTACAAAATACTGC
TTGTCACATTCATAACAGGGTAACTATTAGAACTAGAACGATTGTTACACTGTATGAACTTTGGAAAGAGAGAAAGTTGAATGTTAAATACTTCCATGTGTTTGGAAGTA
CATGTTATATCTTAGCTGACAGGGAATATCGTCAGAAATGGGATGCTAGGTCAAAACAAGGAATCTTTCTCGGGTACTCTAAGAACAGTCGGGCCTATAGAGTCTTCAAT
AACAGATTTGGGAGTGTCATGAAAACAATCAATGTAGTTATAAATGATCTCGATTCAGCTATCAAACATGTGAATGATGAGGAAGATGAGTCTCCAAACATGTCTGATGC
TAGAACTATGAGCAAAAGTTTGAAAAAATCATCAGAAGAAATTATCACTAAGAAATCAGAACTAATTCCGTCTGCTCATGTGAAGAAAAATCATCCAGCAAGCTCTATTA
TAGGTGATCCGTCAGCTGGGATGCAAACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGTTGATTTATGTTATATTTCCACCATTGAACCTTCTACTGTTGAC
TCTGCTCTCAAGGATGAGTATTGGTTAAATGTTATGCAAGAGGAGTTACTCCAATTCAGACGAAACAATGTCTGGACGTTAAAGATTGATGAAGCTAGATGTGTGACAAA
AAATAAAGTCAGATTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTATTGACTTTGATGAAACGTTTGCTCCTGTTGCTCAACTTGAAGCCATTCGATTATTACTTGGTA
TATCATGCATACAGAAATTTAAATTGTATCAAATGGATGTAAAGAGTGCCTTCTTAAATGGATATTTGAATGAGGAGGTTTATGTTGCTCAACTAAAAGGTTTTGTTGAT
TCTGAGCACCCGAAGCATGTGCATAAGCTCAACAAAACTTTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGATCGGCTAACTATTTACTTGAGAGATAAAGGATA
TTCTGGAGGAAAATTTGACAAGACCTTGTTTATACACAGAAAATCTGATCAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTTCTCAAGATC
TAGTAAATAATTTCATTAACATTATGCAGTCAGAATTTGAAATGAGCATGGTTGGAGAACTTTCATGCTTTTGGGGACTTCAAATTAAGCAAAAGAATGACGACATCTTC
ATATCTCAAGAAAAGTATGCCAAGAATATGGTTAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGCGGACTCCAGCTGCGACACATGTTAAACTTACAACAGACACCGA
TGGTGTTGAAGTTGATCACAAACTCTACAGGAGTATAGTAGGCAGCTTATTATACTTAACAGCAAGTCGACCTGACATAGCTTATGCTGTGGAAATATGTGCTCGTTATC
AGTATGTTCATGAAACCAGTGACTTTGGAATGATGTATTCCTGTGATACCACCCTCACTCTTGTTGGATATTATGATGCTAACTGGGCAGGTTCAGTTGATGATCGTAAA
AGTACGTCTGGAGGACGTTTCTTTTTAGGAAACAATTTAATTTCTTGGTTAAGTAAGAAGCAAAATTGTGTCTCTTTATCTACAGCTGAAGCTGAATATATAGCAATAGG
TAGTGGTTGCACACAGACGACTATGACGTTGTATTGTGACAATATGAGCGCAATTGATATTTTAAAAAATCCTGTTCAACATAGTCGAACAAAGCACATTAAGATAAGAC
ATCACTTTATTCTCGATATTTTCACTAAACCTCTGGATGCAAACTCATTCGAATACTTACATGCTGGTCTAGGAGTGTGTCACACTTAA
Protein sequence
MVTINKTNHPSCAQLPRCRQYWPNWVIKVSCVNLLYCFTRTMKILIEEVMIAFTSVQTADDAWYFDSGCSRHMIGNRSYFTNLKDCVTGHVTFGDGAKGKIIAKGNIDKN
NLPRYKVSFDDIGCVVMNKENQICMSGKRQTNNCYHWNSNTSDTCQLTRSNQTWLWHKMLGRVSMRGLEKIIKNEAIVGIPDLDVNGKFFCGDCQIGKQTRSTHKSLKKC
YTNKVLELLHMDLMGQMQTESLRGKRYVLVVVDDYSRYTWVCFLKGKIDTVEICKNLCLKLQREKGKKIIKIQSDHGKEFDSEGFNSFCLLEGIHHEFSAPITPQQNGVV
ERKNRTLQEMARVMIHAKNLPLCFWAEAKISGLQNTACHIHNRVTIRTRTIVTLYELWKERKLNVKYFHVFGSTCYILADREYRQKWDARSKQGIFLGYSKNSRAYRVFN
NRFGSVMKTINVVINDLDSAIKHVNDEEDESPNMSDARTMSKSLKKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKEKIDYMKMVVDLCYISTIEPSTVD
SALKDEYWLNVMQEELLQFRRNNVWTLKIDEARCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVD
SEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIF
ISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQYVHETSDFGMMYSCDTTLTLVGYYDANWAGSVDDRK
STSGGRFFLGNNLISWLSKKQNCVSLSTAEAEYIAIGSGCTQTTMTLYCDNMSAIDILKNPVQHSRTKHIKIRHHFILDIFTKPLDANSFEYLHAGLGVCHT