; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G019820 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G019820
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag/pol protein
Genome locationCmo_Chr04:10222837..10233255
RNA-Seq ExpressionCmoCh04G019820
SyntenyCmoCh04G019820
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]8.5e-18487.5Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        MNLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEESIYM QPEGFI QD EQ+VCKL++SIYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVYK+IVNS VAFL+LYVDDILLIGNDV  LTD+K WL 
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
        TQFQMKDLGEAQ++LGIQI+RNRKNKTLA+SQASYIDK+L RYKMQ+SKKG LPFRHG+HLSKEQ PKTPQEVEDM++IPY+SAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR
        Y+VGIVSRYQSNPGR HWTAVKNILKYLR TR+YML+YGAKDLILTGYTDSDFQ+D + R
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]8.5e-18488.61Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        M+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAIKSYGF QNVDEPCVYK+I    VAFLVLYVDDILLIGNDVG LTD+K WLA
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
         QFQMKDLGEAQ+VLGIQIIR+RKNKTLALSQA+YIDK+L+RY MQ+SKKGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR
        YAVGIVSRYQSNPG  HWTAVK +LKYLR TRDYML+YGAKDLILTGYTDSDFQTD + R
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-18085.03Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        M+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAIKSYGF QNVDEPCVYK+I    VAFLVLYVDDILLIGNDVG LTD+K WLA
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
         QFQMKDLGE Q+VLGIQIIR+RKNKTLALSQA+YIDK+L+RY MQ+SKKGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCE-RRALLRSLTRLQSEA
        YAVGIVSRYQSNPG  HWTAVK ILKYLR TRDYML+YGAKDLILTGYT+SDFQTD + R++  RS+  L   A
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCE-RRALLRSLTRLQSEA

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]4.4e-18086.11Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        MNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEE+IYM QPEGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPCVYKRI+N +VAFLVLYVDDILLIGND+G+LTDIK WLA
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
        TQFQMKDLGEAQFVLGIQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LSKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR
        YAVGIVSRYQSNPG AHWTAVK ILKYLR TRDYML+YG+KDLILTGYTDSDFQTD + R
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]8.5e-18488.61Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        M+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAIKSYGF QNVDEPCVYK+I    VAFLVLYVDDILLIGNDVG LTD+K WLA
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
         QFQMKDLGEAQ+VLGIQIIR+RKNKTLALSQA+YIDK+L+RY MQ+SKKGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR
        YAVGIVSRYQSNPG  HWTAVK +LKYLR TRDYML+YGAKDLILTGYTDSDFQTD + R
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein5.6e-18185.03Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        M+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAIKSYGF QNVDEPCVYK+I    VAFLVLYVDDILLIGNDVG LTD+K WLA
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
         QFQMKDLGE Q+VLGIQIIR+RKNKTLALSQA+YIDK+L+RY MQ+SKKGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCE-RRALLRSLTRLQSEA
        YAVGIVSRYQSNPG  HWTAVK ILKYLR TRDYML+YGAKDLILTGYT+SDFQTD + R++  RS+  L   A
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCE-RRALLRSLTRLQSEA

A0A5A7TWB9 Gag/pol protein2.1e-18086.11Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        MNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEE+IYM QPEGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPCVYKRI+N +VAFLVLYVDDILLIGND+G+LTDIK WLA
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
        TQFQMKDLGEAQFVLGIQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LSKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR
        YAVGIVSRYQSNPG AHWTAVK ILKYLR TRDYML+YG+KDLILTGYTDSDFQTD + R
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR

A0A5A7TZD0 Gag/pol protein4.1e-18488.61Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        M+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAIKSYGF QNVDEPCVYK+I    VAFLVLYVDDILLIGNDVG LTD+K WLA
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
         QFQMKDLGEAQ+VLGIQIIR+RKNKTLALSQA+YIDK+L+RY MQ+SKKGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR
        YAVGIVSRYQSNPG  HWTAVK +LKYLR TRDYML+YGAKDLILTGYTDSDFQTD + R
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR

A0A5A7UYE8 Gag/pol protein4.1e-18488.61Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        M+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAIKSYGF QNVDEPCVYK+I    VAFLVLYVDDILLIGNDVG LTD+K WLA
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
         QFQMKDLGEAQ+VLGIQIIR+RKNKTLALSQA+YIDK+L+RY MQ+SKKGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR
        YAVGIVSRYQSNPG  HWTAVK +LKYLR TRDYML+YGAKDLILTGYTDSDFQTD + R
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR

E2GK51 Gag/pol protein (Fragment)4.1e-18487.5Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        MNLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA
        GNLEESIYM QPEGFI QD EQ+VCKL++SIYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVYK+IVNS VAFL+LYVDDILLIGNDV  LTD+K WL 
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLA

Query:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC
        TQFQMKDLGEAQ++LGIQI+RNRKNKTLA+SQASYIDK+L RYKMQ+SKKG LPFRHG+HLSKEQ PKTPQEVEDM++IPY+SAVGSLMYAMLCTRPDIC
Subjt:  TQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR
        Y+VGIVSRYQSNPGR HWTAVKNILKYLR TR+YML+YGAKDLILTGYTDSDFQ+D + R
Subjt:  YAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-6135.93Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        +N E+ +   N+ W +  +P+    +  +W++  K ++ G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---KRIVNSTVAFLVLYVDDILLIGNDVGILTDIKH
        G L+E IYM  P+G     +   VCKL ++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K  +N  + +++LYVDD+++   D+  + + K 
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---KRIVNSTVAFLVLYVDDILLIGNDVGILTDIKH

Query:  WLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRP
        +L  +F+M DL E +  +GI+I    +   + LSQ++Y+ K+L ++ M++      P    ++     S       ++  + P  S +G LMY MLCTRP
Subjt:  WLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRP

Query:  DICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYG---AKDLILTGYTDSDF
        D+  AV I+SRY S      W  +K +L+YL+ T D  L++    A +  + GY DSD+
Subjt:  DICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYG---AKDLILTGYTDSDF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.1e-9949.31Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        M  EMES+  N  ++LV+ P G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY-KRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWL
        G+LEE IYM QPEGF     +  VCKL +S+YGLKQA R W +KFD+ +KS  + +   +PCVY KR   +    L+LYVDD+L++G D G++  +K  L
Subjt:  GNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY-KRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWL

Query:  ATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDI
        +  F MKDLG AQ +LG++I+R R ++ L LSQ  YI+++L R+ M+++K    P    + LSK+  P T +E  +M  +PY+SAVGSLMYAM+CTRPDI
Subjt:  ATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDI

Query:  CYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR
         +AVG+VSR+  NPG+ HW AVK IL+YLR T    L +G  D IL GYTD+D   D + R
Subjt:  CYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERR

P25600 Putative transposon Ty5-1 protein YCL074W7.1e-3233.72Show/hide
Query:  MDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGI
        MDV TAFLN  ++E IY+ QP GF+ + +   V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y R  +    ++ +YVDD+L+      I
Subjt:  MDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGI

Query:  LTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYA
           +K  L   + MKDLG+    LG+  I    N  + LS   YI K     ++   K    P  +   L +  SP     ++D+   PY S VG L++ 
Subjt:  LTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMY-GAKDLILTGYTDS
            RPDI Y V ++SR+   P   H  + + +L+YL TTR   L Y     L LT Y D+
Subjt:  MLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMY-GAKDLILTGYTDS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-6037.99Show/hide
Query:  MNLEMESMYFNSVWELVDQPDG-VKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL
        M  E+ +   N  W+LV  P   V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL
Subjt:  MNLEMESMYFNSVWELVDQPDG-VKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL

Query:  NGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWL
         G L + +YM+QP GFI++D    VCKL++++YGLKQA R+W ++    + + GF  +V +  ++      ++ ++++YVDDIL+ GND  +L +    L
Subjt:  NGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWL

Query:  ATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDI
        + +F +KD  E  + LGI+    R    L LSQ  YI  +L R  M  +K    P      LS     K     E      Y   VGSL Y +  TRPDI
Subjt:  ATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDI

Query:  CYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDY-MLMYGAKDLILTGYTDSDFQTD
         YAV  +S++   P   H  A+K IL+YL  T ++ + +     L L  Y+D+D+  D
Subjt:  CYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDY-MLMYGAKDLILTGYTDSDFQTD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-6236.94Show/hide
Query:  MNLEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL
        M  E+ +   N  W+LV   P  V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL
Subjt:  MNLEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL

Query:  NGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWL
         G L + +YM+QP GF+++D    VC+L+++IYGLKQA R+W ++  T + + GF  ++ +  ++      ++ ++++YVDDIL+ GND  +L      L
Subjt:  NGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWL

Query:  ATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDI
        + +F +K+  +  + LGI+    R  + L LSQ  Y   +L R  M  +K    P      L+     K P   E      Y   VGSL Y +  TRPD+
Subjt:  ATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDI

Query:  CYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDY-MLMYGAKDLILTGYTDSDFQTDCE
         YAV  +S+Y   P   HW A+K +L+YL  T D+ + +     L L  Y+D+D+  D +
Subjt:  CYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDY-MLMYGAKDLILTGYTDSDFQTDCE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.7e-6135.83Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN
        M+ E+ +M     WE+   P   KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L+I+  Y++ + Q+D+  AFLN
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLN

Query:  GNLEESIYMAQPEGFIEQDHE----QRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIK
        G+L+E IYM  P G+  +  +      VC LK+SIYGLKQASR W +KF   +  +GF Q+  +   + +I  +    +++YVDDI++  N+   + ++K
Subjt:  GNLEESIYMAQPEGFIEQDHE----QRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIK

Query:  HWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTR
          L + F+++DLG  ++ LG++I R+     + + Q  Y   +L    +   K   +P    V  S         +  D K   Y   +G LMY  + TR
Subjt:  HWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTR

Query:  PDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAK-DLILTGYTDSDFQT
         DI +AV  +S++   P  AH  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+
Subjt:  PDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDYMLMYGAK-DLILTGYTDSDFQT

ATMG00810.1 DNA/RNA polymerases superfamily protein2.1e-1534.25Show/hide
Query:  FLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL G+   +L  +   L++ F MKDLG   + LGIQI  +     L LSQ  Y +++L    M D K    P    ++ S   + K P   
Subjt:  FLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEV

Query:  EDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDY-MLMYGAKDLILTGYTDSDF
        +      + S VG+L Y  L TRPDI YAV IV +    P  A +  +K +L+Y++ T  + + ++    L +  + DSD+
Subjt:  EDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRTTRDY-MLMYGAKDLILTGYTDSDF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.5e-1341.46Show/hide
Query:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        M  E++++  N  W LV  P     +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAA
CGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGGGTGGACTATGAAGAAACCTTCTCTCCTGTTGCT
ATGCTTAAATCAATAAGAATACTCTTGTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGACAGCTTTTCTAAACGGCAATCTTGAAGAG
AGTATCTATATGGCTCAACCAGAGGGGTTCATTGAACAGGATCACGAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCAAGCATCTCGATCC
TGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTG
GTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTCTGACTGACATTAAGCATTGGCTGGCGACACAATTCCAAATGAAAGATTTGGGAGAG
GCTCAGTTTGTTCTTGGAATCCAAATTATTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATG
CAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCC
TATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACCCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGT
GCTCATTGGACTGCCGTTAAGAATATCCTCAAGTATCTTAGGACAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGAC
TCAGATTTTCAGACCGATTGTGAAAGAAGAGCTCTGTTGCGTTCCTTAACGAGGCTGCAAAGTGAGGCCTCGGAAAATGACGTCTTCTTTCAATTTGAAGATGTT
AAAGCATCGGAAAAGATCATCAATGAAGTTCTGTGTACGGAAAAGGCGGAAAAAGAAGGCGTAAGTGGACCCATATGGTACTATGGAGAAGAATTCGAAGTGAGC
GTTAAGAGGGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAA
CGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGGGTGGACTATGAAGAAACCTTCTCTCCTGTTGCT
ATGCTTAAATCAATAAGAATACTCTTGTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGACAGCTTTTCTAAACGGCAATCTTGAAGAG
AGTATCTATATGGCTCAACCAGAGGGGTTCATTGAACAGGATCACGAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCAAGCATCTCGATCC
TGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTG
GTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTCTGACTGACATTAAGCATTGGCTGGCGACACAATTCCAAATGAAAGATTTGGGAGAG
GCTCAGTTTGTTCTTGGAATCCAAATTATTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATG
CAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCC
TATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACCCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGT
GCTCATTGGACTGCCGTTAAGAATATCCTCAAGTATCTTAGGACAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGAC
TCAGATTTTCAGACCGATTGTGAAAGAAGAGCTCTGTTGCGTTCCTTAACGAGGCTGCAAAGTGAGGCCTCGGAAAATGACGTCTTCTTTCAATTTGAAGATGTT
AAAGCATCGGAAAAGATCATCAATGAAGTTCTGTGTACGGAAAAGGCGGAAAAAGAAGGCGTAAGTGGACCCATATGGTACTATGGAGAAGAATTCGAAGTGAGC
GTTAAGAGGGTGTGA
Protein sequenceShow/hide protein sequence
MNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEE
SIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGE
AQFVLGIQIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGR
AHWTAVKNILKYLRTTRDYMLMYGAKDLILTGYTDSDFQTDCERRALLRSLTRLQSEASENDVFFQFEDVKASEKIINEVLCTEKAEKEGVSGPIWYYGEEFEVS
VKRV