CuGenDBv2

Pay0001743 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0001743
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr09:14717420..14722398
RNA-Seq Expression: Pay0001743
Synteny: Pay0001743
Gene Ontology terms:
    GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
    GO:0016021 - integral component of membrane (cellular component)
    GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR025558 - Domain of unknown function DUF4283
    IPR026960 - Reverse transcriptase zinc-binding domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily
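
The Genome location field above uses a `chrom:start..end` notation. As a minimal sketch, the string could be split into its parts as shown here; the names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'chrom:start..end' string such as the Genome location field."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


loc = parse_location("chr09:14717420..14722398")
print(loc.chrom, loc.start, loc.end, loc.length)
```

If the coordinates turn out to be 0-based or half-open, only the `length` property would need to change.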


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0046247.1 uncharacterized protein E6C27_scaffold284G00450 [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 77.78)
Query:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
        MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
Subjt:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ

Query:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
        NGINKQ IDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPE+VIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
Subjt:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF

Query:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
        RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
Subjt:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA

Query:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
        EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
Subjt:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT

Query:  LVTRRKSELVSVRDRGKS-----MEVDEGTDILNGV----------SSSIG--------------PKGLPTLNNSHDY---YSNSGVGRILVMWKKNRFS
        LVTRRKSELVSVRDRGK      ++VDEGTDILNG+          SSS+G                     +NS DY   YSNSGVGRI VMWKKNRFS
Subjt:  LVTRRKSELVSVRDRGKS-----MEVDEGTDILNGV----------SSSIG--------------PKGLPTLNNSHDY---YSNSGVGRILVMWKKNRFS

Query:  FSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPI
        FSTNVMDEQFITVEIT AWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGN                                
Subjt:  FSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPI

Query:  LFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA
             FQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLM+NLHHLKPIL GRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA
Subjt:  LFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA

Query:  SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTPSEECCQA
        SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHR VYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMA                          A
Subjt:  SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTPSEECCQA

Query:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAFIPGRSIIENILLCQELVG
        LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK  VNATAITLIPKHNGAERLEDFRPISC                                      
Subjt:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAFIPGRSIIENILLCQELVG

Query:  VDLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KF
         DLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK                               KF
Subjt:  VDLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KF

Query:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV
        GELSGLFANP+KSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLV SVLRSLQVYWASV
Subjt:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV

Query:  FVLPAYVHN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC
        FVLPAYVHN                                 EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC
Subjt:  FVLPAYVHN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC

Query:  LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH-SFC
        LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQ     A   R  R+   +     +L  R   S+  V+  +   G    +  +  G G I ++  F 
Subjt:  LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH-SFC

Query:  SWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ
            +K  +G       W   +        GG+         VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ
Subjt:  SWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ

Query:  ARDPVVLFHLICS
        ARDPVVLFHLICS
Subjt:  ARDPVVLFHLICS
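
In each alignment block, the unlabeled middle row marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. The following is a rough sketch of how per-block identity could be tallied from the Query and middle rows alone; the function name `midline_stats` is hypothetical, and the figure it yields applies to a single block, whereas the % identity reported for a hit covers the whole alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one alignment block."""
    # Scraped text often loses trailing spaces, so pad the middle row
    # to the width of the query row before counting.
    width = len(query)
    midline = midline.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positive = identical + midline.count("+")        # '+' marks conservative substitutions
    return {
        "columns": width,
        "identical": identical,
        "positive": positive,
        "percent_identity": 100.0 * identical / width,
    }


# Second block of the KAA0046247.1 alignment, copied from the rows above.
query = "NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF"
midline = "NGINKQ IDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPE+VIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF"
stats = midline_stats(query, midline)
print(stats)
```

Gap columns (shown as `-` in the Query or Subjt rows) are counted in the block width here, so they lower the identity figure; that is a choice of this sketch and may differ from how the page computes its values.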

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 66.25)
Query:  YSNSGVGRILVMWKKNRFSFSTNVMDEQFI---------------------------------TVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEM
        YSNSGVGRI VMWKK RFSF T+VMDE+F+                                   EIT AWSS GVVMGDFNAIRV+SEAFGGSPIQGEM
Subjt:  YSNSGVGRILVMWKKNRFSFSTNVMDEQFI---------------------------------TVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEM

Query:  EDFDLAIRDVDLVEPLVQGNWFTWT----------------MNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVA
        E+FDLAIRD DLVEP VQGNWFTWT                +ND+WL AWPTM +NVLPWGISDHSPILFYPSFQ+NS+VVSFRFFNHWVE+PSFIEVVA
Subjt:  EDFDLAIRDVDLVEPLVQGNWFTWT----------------MNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVA

Query:  RRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ
        R WSRHEGVS LVSLMRNLHHLKPIL  +FGRHIKSL+E+V IAK AMDIAQR+VERNP+SDVLSRQASLATETFWTAVRLEEASLRQKS++RWL LGDQ
Subjt:  RRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ

Query:  NTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP----------SEECCQALQLPISREEVRRVLLTMDSGKA
        NTAFFHRSV SR+SRNSLLSLVDS  DGSRVSSHDGV  MAVNYFSNSLGSQEI YREL+P          SEECCQALQLPISREEVRRVL +MDSGKA
Subjt:  NTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP----------SEECCQALQLPISREEVRRVLLTMDSGKA

Query:  PGPDGFSLGFFK--------------------------VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLR-----------SAFIPGRS
        PGPDGFS+GF+K                          VNATAITLIPKH GAERLEDFRPISCCNVLYKCISKILADRLR           SAFIPGRS
Subjt:  PGPDGFSLGFFK--------------------------VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLR-----------SAFIPGRS

Query:  IIENILLCQELVG--------------VDLQKAYDSVNWDFLFGLLIAIGTLLK-----------------------------KGVRQGDPLSPFLFVMV
        IIENILLCQELVG              VDLQKAYDSVNWDFLFGLLIAIGT LK                             KG+RQGDPLSPFLFVMV
Subjt:  IIENILLCQELVG--------------VDLQKAYDSVNWDFLFGLLIAIGTLLK-----------------------------KGVRQGDPLSPFLFVMV

Query:  MEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYL
        MEVLSRMLNKIPQSF+FHHRCEK                               KFGE SGLFANP+KSSIFV GVNNE ASHLAAC+G      S   L
Subjt:  MEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYL

Query:  GLPLLT-GRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------E
          P  +   L S DCAPLIQRITS+IRSWTARVLSFAGR+QLVRSVLRSLQVYWASVFVLPAYVHN                                 E
Subjt:  GLPLLT-GRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------E

Query:  GGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSW
        GGLGIRDGPSWNIA+TLKIL   LTN GSLWVAW+EAYILK +SLWDVDSRVGRSWCLRAILRKREK+KHHV                  +RV+YDA S 
Subjt:  GGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSW

Query:  REARLSDFIDPDGEWLWPR--------------------------------GGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTR
        REA+LSDFIDP+GEWLWPR                                GGFSIAS WEAI PRGGRVLWD LLWGGGNIPKHSFC+WLAIKDRL TR
Subjt:  REARLSDFIDPDGEWLWPR--------------------------------GGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTR

Query:  DRLHRWDSSVPLSCILRQGGVESCNHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQAR
        DRLHRWDSS+PLSCIL QGGVES +HLFFS          V +IM SSHRIGHWGVELSWICH+GIGKGVRRKLWRVLW ATIYFIWNE NHRLHGG+AR
Subjt:  DRLHRWDSSVPLSCILRQGGVESCNHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQAR

Query:  DPVVLFHLICS
        DP++LFHLIC+
Subjt:  DPVVLFHLICS

TYK18951.1 uncharacterized protein E5676_scaffold418G00380 [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 77.57)
Query:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
        MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
Subjt:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ

Query:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
        NGINKQ IDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
Subjt:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF

Query:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
        RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
Subjt:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA

Query:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
        EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
Subjt:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT

Query:  LVTRRKSELVSVRDRGKS-----MEVDEGTDILNGV----------SSSIG--------------PKGLPTLNNSHDY---YSNSGVGRILVMWKKNRFS
        LVTRRKSELVSVRDRGK      ++VDEGTDILNG+          SSS+G                     +NS DY   YSNSGVGRI VMWKKNRFS
Subjt:  LVTRRKSELVSVRDRGKS-----MEVDEGTDILNGV----------SSSIG--------------PKGLPTLNNSHDY---YSNSGVGRILVMWKKNRFS

Query:  FSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPI
        FSTNVMDEQFIT          GVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWT                           
Subjt:  FSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPI

Query:  LFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA
             FQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLM+NLHHLKPIL GRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA
Subjt:  LFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA

Query:  SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTPSEECCQA
        SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHR VYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMA                          A
Subjt:  SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTPSEECCQA

Query:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAFIPGRSIIENILLCQELVG
        LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK  VNATAITLIPKHNGAERLEDFRPISC                                      
Subjt:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAFIPGRSIIENILLCQELVG

Query:  VDLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KF
         DLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK                               KF
Subjt:  VDLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KF

Query:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV
        GELSGLFANP+KSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLV SVLRSLQVYWASV
Subjt:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV

Query:  FVLPAYVHN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC
        FVLPAYVHN                                 EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC
Subjt:  FVLPAYVHN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC

Query:  LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH-SFC
        LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQ     A   R  R+   +     +L  R   S+  V+  +   G    +  +  G G I ++  F 
Subjt:  LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH-SFC

Query:  SWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ
            +K  +G       W   +        GG+         VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ
Subjt:  SWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ

Query:  ARDPVVLFHLICS
        ARDPVVLFHLICS
Subjt:  ARDPVVLFHLICS

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 55.74)
Query:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
        MDGLNVVGP L G  VE +  RGS  AGR  +G   FGPR                   I+S   ++              V+SG +L NQ ANV+N V+
Subjt:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ

Query:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
        N +NK   DSKSTWASLFGTSSEESL YT PK IGDKIVV PPEEVIDQGI+VWENSLVGQLID+KLPY VIQ L                         
Subjt:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF

Query:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
                                                                            KPISLD ATK+RRRLSYARVCVELE GSNM A
Subjt:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA

Query:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
        EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHS S CSRSVESKTIQEEVVHKGDDVD E CGEVVLESFKQ+E+GEIR+SPNRH+SQVEKGVGKSDEFT
Subjt:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT

Query:  LVTRRKSELVSVRDRGKSME--------------------------------VDEGTDILNGVSSSIGPKGLPTL-----------------NNSHDY--
        LVTR+KSELVS+RDRGKSME                                VDEGTD+L+G+SSSI   G   L                 +NS DY  
Subjt:  LVTRRKSELVSVRDRGKSME--------------------------------VDEGTDILNGVSSSIGPKGLPTL-----------------NNSHDY--

Query:  -YSNSGVGRILVMWKKNRFSFSTNVMDEQFIT---------------------------------VEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGE
         YSNSGVGRI VMWKKNRFSFST+VMDEQF+T                                 VEIT AWSSPGVVM DFNAIRV+SEAF GSPIQGE
Subjt:  -YSNSGVGRILVMWKKNRFSFSTNVMDEQFIT---------------------------------VEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGE

Query:  MEDFDLAIRDVDLVEPLVQGNWFTWT----------------MNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVV
        MEDF+LAIRD DLVEP VQGNWFTWT                +NDDWL  WPTMLVNVLPWGISDH PILFYPSFQ ++KVVSFRFFNHWVEDPSFIEVV
Subjt:  MEDFDLAIRDVDLVEPLVQGNWFTWT----------------MNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVV

Query:  ARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGD
         R WSRHEGVSPLV LMRNLH LKPIL  RFGRHIK L+E+VRI K AMDIAQR+                                             
Subjt:  ARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGD

Query:  QNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP----------SEECCQALQLPISREEVRRVLLTMDSGK
                                                MAVNYF NSLGSQEI YREL+P          SEECCQALQLPISREEVRRVL +MDSGK
Subjt:  QNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP----------SEECCQALQLPISREEVRRVLLTMDSGK

Query:  APGPDGFSLGFFKVNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLR-----------SAFIPGRSIIENILLCQELVGVDLQKAYDSVNW
        APGPDGFS+    +NATAITLIPKHNGAERLEDF PISC NVLYKCISKILADRLR           SAFIPGRSIIENILLCQEL+   +         
Subjt:  APGPDGFSLGFFKVNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLR-----------SAFIPGRSIIENILLCQELVGVDLQKAYDSVNW

Query:  DFLFGLLIAIGTLL-----KKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLF
          +F ++I  G+L      +KGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK                               KFGELSGLF
Subjt:  DFLFGLLIAIGTLL-----KKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLF

Query:  ANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYV
        ANP+KSSIFVAGVNNENASHLA CMGF RGNL VRYLGLPLLTGRL SNDCAPLIQRITSQIRSW ARVLSFAGR+QLVRSVLRSLQVYWASVFVLPAYV
Subjt:  ANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYV

Query:  HN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWCLRAILRK
        HN                                 EGG GIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILK RSLWDVDSRVGRSWCL AILR 
Subjt:  HN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWCLRAILRK

Query:  REKLKHHVRMKVGNGNRC-RVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPR---GGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLA
                    G  +R   VWL P +   + D    R   +S  +      +W R   GGFSI+S WEAIRPRGGRVLWD                   
Subjt:  REKLKHHVRMKVGNGNRC-RVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPR---GGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLA

Query:  IKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKL
                                  GGVES +HLFFS          VLRIMASS+RIGHWGVELSWICHQGIGKGVRRKL
Subjt:  IKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKL

XP_008463187.1 PREDICTED: uncharacterized protein LOC103501395 [Cucumis melo] (e value: 0.0e+00; % identity: 94.35)
Query:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
        MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
Subjt:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ

Query:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
        NGINKQ IDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
Subjt:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF

Query:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
        RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
Subjt:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA

Query:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
        EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
Subjt:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT

Query:  LVTRRKSELVSVRDRGKSMEVDEGTDILNGVSSSIGPKGLPTLNNSHDYYSNSGVGRILVMWKKNRFSFSTNVMDEQFITVEITFAWSSPGVVMGDFNAI
        LVTRRKSELVSVRDRGKSMEVDEGTDILNGVSSSIGPKGLPTLNNSHDYYSNSGVGRI VMWKKNRFSFSTNVMDEQFITVEIT AWSSPGVVMGDFNAI
Subjt:  LVTRRKSELVSVRDRGKSMEVDEGTDILNGVSSSIGPKGLPTLNNSHDYYSNSGVGRILVMWKKNRFSFSTNVMDEQFITVEITFAWSSPGVVMGDFNAI

Query:  RVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVA
        RVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGN                                     FQLNSKVVSFRFFNHWVEDPSFIEVVA
Subjt:  RVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVA

Query:  RRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ
        RRWSRHEGVSPLVSLM+NLHHLKPIL GRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ
Subjt:  RRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ

Query:  NTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP
        NTAFFHR VYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP
Subjt:  NTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A1S3CJ11 uncharacterized protein LOC103501395 (e value: 0.0e+00; % identity: 94.35)
Query:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
        MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
Subjt:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ

Query:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
        NGINKQ IDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
Subjt:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF

Query:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
        RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
Subjt:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA

Query:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
        EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
Subjt:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT

Query:  LVTRRKSELVSVRDRGKSMEVDEGTDILNGVSSSIGPKGLPTLNNSHDYYSNSGVGRILVMWKKNRFSFSTNVMDEQFITVEITFAWSSPGVVMGDFNAI
        LVTRRKSELVSVRDRGKSMEVDEGTDILNGVSSSIGPKGLPTLNNSHDYYSNSGVGRI VMWKKNRFSFSTNVMDEQFITVEIT AWSSPGVVMGDFNAI
Subjt:  LVTRRKSELVSVRDRGKSMEVDEGTDILNGVSSSIGPKGLPTLNNSHDYYSNSGVGRILVMWKKNRFSFSTNVMDEQFITVEITFAWSSPGVVMGDFNAI

Query:  RVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVA
        RVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGN                                     FQLNSKVVSFRFFNHWVEDPSFIEVVA
Subjt:  RVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVA

Query:  RRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ
        RRWSRHEGVSPLVSLM+NLHHLKPIL GRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ
Subjt:  RRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ

Query:  NTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP
        NTAFFHR VYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP
Subjt:  NTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP

A0A5A7TWG5 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 77.78)
Query:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
        MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
Subjt:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ

Query:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
        NGINKQ IDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPE+VIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
Subjt:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF

Query:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
        RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
Subjt:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA

Query:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
        EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
Subjt:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT

Query:  LVTRRKSELVSVRDRGKS-----MEVDEGTDILNGV----------SSSIG--------------PKGLPTLNNSHDY---YSNSGVGRILVMWKKNRFS
        LVTRRKSELVSVRDRGK      ++VDEGTDILNG+          SSS+G                     +NS DY   YSNSGVGRI VMWKKNRFS
Subjt:  LVTRRKSELVSVRDRGKS-----MEVDEGTDILNGV----------SSSIG--------------PKGLPTLNNSHDY---YSNSGVGRILVMWKKNRFS

Query:  FSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPI
        FSTNVMDEQFITVEIT AWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGN                                
Subjt:  FSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPI

Query:  LFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA
             FQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLM+NLHHLKPIL GRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA
Subjt:  LFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA

Query:  SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTPSEECCQA
        SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHR VYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMA                          A
Subjt:  SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTPSEECCQA

Query:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAFIPGRSIIENILLCQELVG
        LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK  VNATAITLIPKHNGAERLEDFRPISC                                      
Subjt:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAFIPGRSIIENILLCQELVG

Query:  VDLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KF
         DLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK                               KF
Subjt:  VDLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KF

Query:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV
        GELSGLFANP+KSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLV SVLRSLQVYWASV
Subjt:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV

Query:  FVLPAYVHN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC
        FVLPAYVHN                                 EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC
Subjt:  FVLPAYVHN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC

Query:  LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH-SFC
        LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQ     A   R  R+   +     +L  R   S+  V+  +   G    +  +  G G I ++  F 
Subjt:  LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH-SFC

Query:  SWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ
            +K  +G       W   +        GG+         VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ
Subjt:  SWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ

Query:  ARDPVVLFHLICS
        ARDPVVLFHLICS
Subjt:  ARDPVVLFHLICS

A0A5A7TZS0 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 66.25)
Query:  YSNSGVGRILVMWKKNRFSFSTNVMDEQFI---------------------------------TVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEM
        YSNSGVGRI VMWKK RFSF T+VMDE+F+                                   EIT AWSS GVVMGDFNAIRV+SEAFGGSPIQGEM
Subjt:  YSNSGVGRILVMWKKNRFSFSTNVMDEQFI---------------------------------TVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEM

Query:  EDFDLAIRDVDLVEPLVQGNWFTWT----------------MNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVA
        E+FDLAIRD DLVEP VQGNWFTWT                +ND+WL AWPTM +NVLPWGISDHSPILFYPSFQ+NS+VVSFRFFNHWVE+PSFIEVVA
Subjt:  EDFDLAIRDVDLVEPLVQGNWFTWT----------------MNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVA

Query:  RRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ
        R WSRHEGVS LVSLMRNLHHLKPIL  +FGRHIKSL+E+V IAK AMDIAQR+VERNP+SDVLSRQASLATETFWTAVRLEEASLRQKS++RWL LGDQ
Subjt:  RRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQ

Query:  NTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP----------SEECCQALQLPISREEVRRVLLTMDSGKA
        NTAFFHRSV SR+SRNSLLSLVDS  DGSRVSSHDGV  MAVNYFSNSLGSQEI YREL+P          SEECCQALQLPISREEVRRVL +MDSGKA
Subjt:  NTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP----------SEECCQALQLPISREEVRRVLLTMDSGKA

Query:  PGPDGFSLGFFK--------------------------VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLR-----------SAFIPGRS
        PGPDGFS+GF+K                          VNATAITLIPKH GAERLEDFRPISCCNVLYKCISKILADRLR           SAFIPGRS
Subjt:  PGPDGFSLGFFK--------------------------VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLR-----------SAFIPGRS

Query:  IIENILLCQELVG--------------VDLQKAYDSVNWDFLFGLLIAIGTLLK-----------------------------KGVRQGDPLSPFLFVMV
        IIENILLCQELVG              VDLQKAYDSVNWDFLFGLLIAIGT LK                             KG+RQGDPLSPFLFVMV
Subjt:  IIENILLCQELVG--------------VDLQKAYDSVNWDFLFGLLIAIGTLLK-----------------------------KGVRQGDPLSPFLFVMV

Query:  MEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYL
        MEVLSRMLNKIPQSF+FHHRCEK                               KFGE SGLFANP+KSSIFV GVNNE ASHLAAC+G      S   L
Subjt:  MEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYL

Query:  GLPLLT-GRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------E
          P  +   L S DCAPLIQRITS+IRSWTARVLSFAGR+QLVRSVLRSLQVYWASVFVLPAYVHN                                 E
Subjt:  GLPLLT-GRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------E

Query:  GGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSW
        GGLGIRDGPSWNIA+TLKIL   LTN GSLWVAW+EAYILK +SLWDVDSRVGRSWCLRAILRKREK+KHHV                  +RV+YDA S 
Subjt:  GGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSW

Query:  REARLSDFIDPDGEWLWPR--------------------------------GGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTR
        REA+LSDFIDP+GEWLWPR                                GGFSIAS WEAI PRGGRVLWD LLWGGGNIPKHSFC+WLAIKDRL TR
Subjt:  REARLSDFIDPDGEWLWPR--------------------------------GGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTR

Query:  DRLHRWDSSVPLSCILRQGGVESCNHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQAR
        DRLHRWDSS+PLSCIL QGGVES +HLFFS          V +IM SSHRIGHWGVELSWICH+GIGKGVRRKLWRVLW ATIYFIWNE NHRLHGG+AR
Subjt:  DRLHRWDSSVPLSCILRQGGVESCNHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQAR

Query:  DPVVLFHLICS
        DP++LFHLIC+
Subjt:  DPVVLFHLICS

A0A5A7UP65 Reverse transcriptase (e value: 0.0e+00; % identity: 70.8)
Query:  NSHDY---YSNSGVGRILVMWKKNRFSFSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWT
        NS DY   YSNSGVGRI VMWKKNRFSFST+V DEQFIT           VVM DFNAIR +SEA GGSPIQGEMEDFD+AIRD DLVEP VQGNWFTWT
Subjt:  NSHDY---YSNSGVGRILVMWKKNRFSFSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWT

Query:  ----------------MNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPI
                        +NDDWL AWPTMLVNVLPWGISDHSPIL YPSFQ NSKVVSFR FNHWV+DPSF+                             
Subjt:  ----------------MNDDWLFAWPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPI

Query:  LHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSD
           RFGRHI+SL+E+VRIAK AMDIAQR+VERNPMSDVLSRQASLATETFWTAVRLE+                            R  RN L  +VDS 
Subjt:  LHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSD

Query:  GDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP----------SEECCQALQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK-------------
           SRVSSHDGV  MAVNYFSNSLGSQEI YRELTP          SEECCQALQ+PISREEVRRVL +MDSGKAPGPDGFS+GFFK             
Subjt:  GDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTP----------SEECCQALQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK-------------

Query:  -------------VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLR-----------SAFIPGRSIIENILLCQELVG------------
                     VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLR           SAFI GRSIIENILLCQELVG            
Subjt:  -------------VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLR-----------SAFIPGRSIIENILLCQELVG------------

Query:  --VDLQKAYDSVNWDFLFGLLIAIGTLLK-----------------------------KGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKF
          VDLQKAYDSVNWDFLFGL I+I T LK                             KGVRQGDPLS FLFVMVMEVLSRMLNKIPQSFQFHHRCEK+F
Subjt:  --VDLQKAYDSVNWDFLFGLLIAIGTLLK-----------------------------KGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKF

Query:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV
        GELSGLFANP+KSSIF+AGVNNENAS LAACMGFVRGNL VRYLGLPLLTGRL SNDC PLIQRITS+IRS +ARVLSFAGR+QLV SVL SLQVYWA V
Subjt:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV

Query:  FVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPW
        FVLPAYVHN EGGLGIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LK RSLWDVDSRVGRSWCLRAILRK+EKLK HVRMKVGNGNRCRVWLDPW
Subjt:  FVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPW

Query:  LQ----------RVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTRDRLHRWDSSVP
        LQ          RV+YDA S REA LS+FI PDGEWLWPRGGFSIAS WEAIRPRGGRVLWD LLWGGGNIPKHSFC+WLAIKDRLGTRDR HRWDSSVP
Subjt:  LQ----------RVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTRDRLHRWDSSVP

Query:  LSCILRQGGVESCNHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQARDPVVLFHLICS
        LSCIL +GG+ES +HLFFS          VLRIMASSHRIGHWGVELSWICHQGI KGVRRKLWRVLW ATIYFIWNE NHRLHGGQA DP+V+FHLIC+
Subjt:  LSCILRQGGVESCNHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQARDPVVLFHLICS

A0A5D3D5X6 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 77.57)
Query:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
        MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ
Subjt:  MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQ

Query:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
        NGINKQ IDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF
Subjt:  NGINKQPIDSKSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQF

Query:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
        RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA
Subjt:  RRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPA

Query:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
        EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT
Subjt:  EITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFT

Query:  LVTRRKSELVSVRDRGKS-----MEVDEGTDILNGV----------SSSIG--------------PKGLPTLNNSHDY---YSNSGVGRILVMWKKNRFS
        LVTRRKSELVSVRDRGK      ++VDEGTDILNG+          SSS+G                     +NS DY   YSNSGVGRI VMWKKNRFS
Subjt:  LVTRRKSELVSVRDRGKS-----MEVDEGTDILNGV----------SSSIG--------------PKGLPTLNNSHDY---YSNSGVGRILVMWKKNRFS

Query:  FSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPI
        FSTNVMDEQFIT          GVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWT                           
Subjt:  FSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAWPTMLVNVLPWGISDHSPI

Query:  LFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA
             FQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLM+NLHHLKPIL GRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA
Subjt:  LFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQA

Query:  SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTPSEECCQA
        SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHR VYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMA                          A
Subjt:  SLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTPSEECCQA

Query:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAFIPGRSIIENILLCQELVG
        LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK  VNATAITLIPKHNGAERLEDFRPISC                                      
Subjt:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--VNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAFIPGRSIIENILLCQELVG

Query:  VDLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KF
         DLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK                               KF
Subjt:  VDLQKAYDSVNWDFLFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KF

Query:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV
        GELSGLFANP+KSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLV SVLRSLQVYWASV
Subjt:  GELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASV

Query:  FVLPAYVHN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC
        FVLPAYVHN                                 EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC
Subjt:  FVLPAYVHN---------------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRSWC

Query:  LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH-SFC
        LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQ     A   R  R+   +     +L  R   S+  V+  +   G    +  +  G G I ++  F 
Subjt:  LRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH-SFC

Query:  SWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ
            +K  +G       W   +        GG+         VLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ
Subjt:  SWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQ

Query:  ARDPVVLFHLICS
        ARDPVVLFHLICS
Subjt:  ARDPVVLFHLICS

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 6.2e-18; % identity: 22.77)
Query:  SEECCQALQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--------------------------VNATAITLIPK-HNGAERLEDFRPISCCNVLYK
        ++E  ++L  PI+  E+  ++ ++ + K+PGPDGF+  F++                              +I LIPK      + E+FRPIS  N+  K
Subjt:  SEECCQALQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--------------------------VNATAITLIPK-HNGAERLEDFRPISCCNVLYK

Query:  CISKILADRLRS-----------AFIPGR----------SIIENILLCQE----LVGVDLQKAYDSVNWDFLFGLLIAIGT-------------------
         ++KILA+R++             FIPG           ++I++I   ++    ++ +D +KA+D +   F+   L  +G                    
Subjt:  CISKILADRLRS-----------AFIPGR----------SIIENILLCQE----LVGVDLQKAYDSVNWDFLFGLLIAIGT-------------------

Query:  ----------LLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQ-----------------------------SFQFHHRCEKKFGELSGLFANPKKSSIF
                   LK G RQG PLSP LF +V+EVL+R + +  +                             S Q   +    F ++SG   N +KS  F
Subjt:  ----------LLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQ-----------------------------SFQFHHRCEKKFGELSGLFANPKKSSIF

Query:  VAGVNNENASHLAACMGFVRGNLSVRYLGLPLL--TGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVY
        +   N +  S +   + F   +  ++YLG+ L      L   +  PL++ I      W     S+ GR+ +V+  +    +Y
Subjt:  VAGVNNENASHLAACMGFVRGNLSVRYLGLPLL--TGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVY

P08548 LINE-1 reverse transcriptase homolog (e value: 3.6e-10; % identity: 24.21)
Query:  FFKVNATAITLIPK-HNGAERLEDFRPISCCNVLYKCISKILADRLRS-----------AFIPGR----------SIIENILLCQE----LVGVDLQKAY
        F++ N   ITLIPK      R E++RPIS  N+  K ++KIL +R++             FIPG           ++I++I   +     ++ +D +KA+
Subjt:  FFKVNATAITLIPK-HNGAERLEDFRPISCCNVLYKCISKILADRLRS-----------AFIPGR----------SIIENILLCQE----LVGVDLQKAY

Query:  DSVNWDFLFGLLIAI---GTLLK--------------------------KGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCE------------
        D++   F+   L  I   GT LK                           G RQG PLSP LF +VMEVL+  + +       H   E            
Subjt:  DSVNWDFLFGLLIAI---GTLLK--------------------------KGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCE------------

Query:  -----------------KKFGELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLL--TGRLCSNDCAPLIQRITSQIRSWTARVLS
                         K++  +SG   N  KS  F+   NN+    +   + F      ++YLG+ L      L   +   L + I   +  W     S
Subjt:  -----------------KKFGELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLL--TGRLCSNDCAPLIQRITSQIRSWTARVLS

Query:  FAGRMQLVRSVLRSLQVY
        + GR+ +V+  +    +Y
Subjt:  FAGRMQLVRSVLRSLQVY

P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 7.3e-11; % identity: 26.18)
Query:  LPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGG
        +P+L  R+  +    +++R++S++  W  + LSFAGR+ L ++VL S+ V+  S  +LP  + N                                 EGG
Subjt:  LPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGG

Query:  LGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVE-AYILK--RRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPWL
        LG+R   S N A   K+ W +L    SLW   ++  Y +   R S W +      S      +  R+ + H V    G+G + R W D W+
Subjt:  LGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVE-AYILK--RRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPWL

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 5.2e-17; % identity: 24.8)
Query:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFF------------------KVNAT--------AITLIPK-HNGAERLEDFRPISCCNVLYKCISKILA
        L  PIS +E+  V+ ++ + K+PGPDGFS  F+                  +V  T         ITLIPK      ++E+FRPIS  N+  K ++KILA
Subjt:  LQLPISREEVRRVLLTMDSGKAPGPDGFSLGFF------------------KVNAT--------AITLIPK-HNGAERLEDFRPISCCNVLYKCISKILA

Query:  DRLRS-----------AFIPGR----------SIIENILLCQE----LVGVDLQKAYDSVNWDFLFGLLIAIGTL-------------------------
        +R++             FIPG           ++I  I   ++    ++ +D +KA+D +   F+  +L   G                           
Subjt:  DRLRS-----------AFIPGR----------SIIENILLCQE----LVGVDLQKAYDSVNWDFLFGLLIAIGTL-------------------------

Query:  ----LKKGVRQGDPLSPFLFVMVMEVLSRML--NKIPQSFQFHHRCEK---------------------------KFGELSGLFANPKKSSIFVAGVNNE
            LK G RQG PLSP+LF +V+EVL+R +   K  +  Q      K                            FGE+ G   N  KS  F+   N +
Subjt:  ----LKKGVRQGDPLSPFLFVMVMEVLSRML--NKIPQSFQFHHRCEK---------------------------KFGELSGLFANPKKSSIFVAGVNNE

Query:  NASHLAACMGFVRGNLSVRYLGLPLL--TGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVY
            +     F     +++YLG+ L      L   +   L + I   +R W     S+ GR+ +V+  +    +Y
Subjt:  NASHLAACMGFVRGNLSVRYLGLPLL--TGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVY

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 5.6e-11; % identity: 21.64)
Query:  SDHSPILFYPSFQLN-SKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLH-----HLKPI-------LHGRFGRHIKSLNEKVRIAKLAMD
        SDH+ +    S   +  K   + F N  +ED  F + V   W          + +         HLK +       + G+    I++LN +V      +D
Subjt:  SDHSPILFYPSFQLN-SKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLH-----HLKPI-------LHGRFGRHIKSLNEKVRIAKLAMD

Query:  IAQR---KVERNPMSDVLSRQASLATETFWTAVRLEEASLR---QKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVN
        + QR     ++    + L R+ +L          +E+   R    +SR++ L   D+ + FF+     + +R  +  L     DG+ +   + +   A +
Subjt:  IAQR---KVERNPMSDVLSRQASLATETFWTAVRLEEASLR---QKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVN

Query:  YFSNSLGSQEIS---YREL-----TPSEECCQALQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--------------------------VNATAIT
        ++ N      IS     EL       SE   + L+ PI+ +E+ + L  M   K+PG DG ++ FF+                               ++
Subjt:  YFSNSLGSQEIS---YREL-----TPSEECCQALQLPISREEVRRVLLTMDSGKAPGPDGFSLGFFK--------------------------VNATAIT

Query:  LIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAF-----------IPGRSIIENILLCQEL-------------VGVDLQKAYDSVNWDFLFGLL
        L+PK      ++++RP+S  +  YK ++K ++ RL+S             +PGR+I +N+ L ++L             + +D +KA+D V+  +L G L
Subjt:  LIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAF-----------IPGRSIIENILLCQEL-------------VGVDLQKAYDSVNWDFLFGLL

Query:  IA
         A
Subjt:  IA

Arabidopsis top hits (e value, % identity, alignment)
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.0e-20; % identity: 28.69)
Query:  LKRRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPW-----LQRVIYDAGSWREARLSDFI-----DPDGEWLW------PRGGFS
        L +R+ W ++S    SW  R + + RE  +  V   VG+G   + W D W     L  ++   G        D +       D  ++W      P   FS
Subjt:  LKRRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPW-----LQRVIYDAGSWREARLSDFI-----DPDGEWLW------PRGGFS

Query:  IASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFF------SVLRIMASSHRI---GHWGV
         A    A+ P+   V W   +W   ++PKH+F  W+   +RL TRDRL  W  S+P  C+L     ES  HLFF      +V R       +        
Subjt:  IASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFF------SVLRIMASSHRI---GHWGV

Query:  ELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQAR
         L W+ +    K     + R+ + A +Y IW E N  LH G AR
Subjt:  ELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQAR

AT1G43760.1 DNAse I-like superfamily protein (e value: 6.3e-34; % identity: 27.74)
Query:  YSNSGVGRILVMWKKNRFSFSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFG----GSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWT----
        Y  S +GRI ++W             +  ++V +        +++GDF+ I   S+ +       P++G +E+F   +RD DLV+   +G  +TW+    
Subjt:  YSNSGVGRILVMWKKNRFSFSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFG----GSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWT----

Query:  ------------MNDDWLFAWPTMLVNVLPWGISDHSP-ILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGV-SPLVSLMRNLHHLKPILH
                     N DW  ++P+ +      G+SDHSP I+   +    SK   FR+F+     P+F+  +   W     V S + SL  +L   K    
Subjt:  ------------MNDDWLFAWPTMLVNVLPWGISDHSP-ILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGV-SPLVSLMRNLHHLKPILH

Query:  GRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGD
            +   ++  K + A  +++  Q ++  NP SD L R   +A + +       E+  RQKSRI+WL+ GD NT FFH+ + +  ++N L+  +  D D
Subjt:  GRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPMSDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGD

Query:  GSRVSSHDGVVHMAVNYFSNSLGSQE--------ISYRELTP---SEECCQALQLPISREEVRRVLLTMDSGKAPGPDGFSLGFF---------------
          RV +   V  M V Y+++ LGS             +++ P   ++     L    S +E+   +  M   KAPGPD F+  FF               
Subjt:  GSRVSSHDGVVHMAVNYFSNSLGSQE--------ISYRELTP---SEECCQALQLPISREEVRRVLLTMDSGKAPGPDGFSLGFF---------------

Query:  -----------KVNATAITLIPKHNGAERLEDFRPISCCNVLYKCIS
                   + NATAITLIPK  G ++L  FRP+SCC V+YK I+
Subjt:  -----------KVNATAITLIPKHNGAERLEDFRPISCCNVLYKCIS

AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 3.7e-18; % identity: 32.28)
Query:  GFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFS-----------VLRIMASSHRI
        GFS A  W+AIRPR   + W   +W  G +PKH+F  W++  DRL TR RL  W       C L     ES +HL FS             R+       
Subjt:  GFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFS-----------VLRIMASSHRI

Query:  GHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQARDPVVLFHLI
          W   LSW+  +         L +V   A IY IW + N+ LH      P+++F ++
Subjt:  GHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQARDPVVLFHLI

AT2G01050.1 zinc ion binding; nucleic acid binding (e value: 7.5e-19; % identity: 24.76)
Query:  VVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQFRRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVF
        V+T  EEV++    +W+  ++ +++ S++P +V+   + ++W    +  +  L       +F   +     L+ GPW +    +L++ W+    P     
Subjt:  VVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQFRRSKSVEWILSRGPWHLDDKSMLLRKWTPGIVPEFFVF

Query:  NSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCS
         + PVW+RL  +P   +    L  +A  +G+P+ +D+ T    +  +ARVC+E+      P + TV + G  + V+  YE   + C+ C  +GH   +C 
Subjt:  NSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSNCS

Query:  RSVESK
        R+V  K
Subjt:  RSVESK

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 2.4e-33; % identity: 28.18)
Query:  VAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYVHNE----GG
        +AGV + + + +     F  G L VRYLGLPLLT ++ ++D  PL+++I  +I  WTAR LSFAGR+QL+ SV+ SL  +W S F LP+    E      
Subjt:  VAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCSNDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYVHNE----GG

Query:  LGIRDGPSWNIASTLKILWLML---TNSGSLWVAWVEAYILKRRSLWDVDSRVG-RSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQ-------
          +  GP  N     K+ W  +    + G L +  ++     + S W +       SW  + IL+ R      V+  + NG+    W D W +       
Subjt:  LGIRDGPSWNIASTLKILWLML---TNSGSLWVAWVEAYILKRRSLWDVDSRVG-RSWCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQ-------

Query:  ---RVIYDAG-----SWREA---------------RLSDFI---------DPDGEWLWPRGG------FSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH
           R   D G     S  EA               R+ D I           +    W   G      F+    W A R    +V W   +W     PK+
Subjt:  ---RVIYDAG-----SWREA---------------RLSDFI---------DPDGEWLWPRGG------FSIASVWEAIRPRGGRVLWDDLLWGGGNIPKH

Query:  SFCSWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLH
        S  +W+AIK+RL T DR+  W++    SC+L    VE+ +HLFF+             +  E+ +             L R  +  T++ +W E N R H
Subjt:  SFCSWLAIKDRLGTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLH

Query:  G
        G
Subjt:  G


Sequences
CDS sequence
ATGGACGGGCTGAACGTAGTTGGGCCTGACTTAATTGGGCCTGATGTTGAATCAGAAGGAAAACGTGGATCTAAAGAAGCTGGGCGGCGAGACGATGGGCTGGGTTCTTT
TGGGCCGCGAAATGAAACTGTAATTGGAAGATGTGGGTCTTCGGTGTGTGGGCAGAAGCTGGATGATATTTCAAGTGGGTCGCGAATTGAATTTGGTGATTTTTTTAATT
CTGGAGAGGATGAGCCGAGCAACGTTTCGTCTGGTTGCGTTTTGAATAATCAGAATGCAAATGTCTCTAATGTTGTTCAGAATGGAATTAATAAGCAGCCTATTGACTCT
AAATCCACATGGGCTTCGCTATTTGGGACCTCTTCTGAAGAATCCCTGCTCTATACTCTGCCCAAAATTATTGGGGATAAAATTGTAGTCACTCCACCTGAAGAGGTTAT
CGATCAAGGAATTAAAGTGTGGGAAAATTCTTTGGTAGGTCAGCTGATTGATTCCAAGCTACCTTATACCGTTATTCAGCATTTGGTTGAGAAAATTTGGGGGAAGATTG
AAATGCCCATTATCACTATTTTGGAAAACGATCTCATTTGCTTTCAATTTAGACGGTCAAAATCAGTGGAGTGGATTCTTTCGCGTGGACCATGGCATCTCGATGACAAG
TCTATGCTCCTCAGAAAATGGACTCCAGGTATTGTTCCTGAATTCTTTGTGTTTAATTCAGTTCCGGTTTGGATTAGATTGGGGAAATTACCTATGGAGTTATGGACAGA
AGCTGGGTTGGCTGTTGTAGCGAGTGCTGTGGGGAAACCTATATCTTTAGACTTGGCCACTAAGGAGCGTCGTAGGCTCTCATATGCTCGTGTTTGTGTTGAATTAGAGG
CAGGGTCAAATATGCCTGCTGAAATTACTGTTAGTCTCAGAGGAGTAGATTTTAATGTTTCGGTTAATTATGAGTGGAAACCACGGAAGTGTAATTTGTGTTGTGCCTTT
GGGCATTCTGGTAGTAATTGTTCTAGAAGTGTGGAGAGTAAAACCATTCAGGAGGAGGTTGTGCACAAGGGGGATGATGTAGACAGTGAACCTTGTGGGGAAGTTGTTCT
TGAATCGTTCAAACAGTTAGAGGAAGGTGAAATTAGGAACTCTCCTAATAGACATAACAGCCAAGTGGAGAAGGGGGTGGGTAAAAGTGATGAATTTACCCTTGTAACTC
GCAGGAAGAGTGAGTTGGTCTCTGTTAGAGATCGTGGAAAGAGTATGGAGGTTGATGAAGGTACTGATATTCTTAATGGTGTTAGTTCTTCTATTGGTCCTAAAGGATTA
CCTACACTTAATAATTCTCATGATTATTACAGTAATAGTGGTGTTGGTCGGATTTTGGTGATGTGGAAGAAGAATCGTTTTTCTTTCTCTACTAATGTGATGGATGAGCA
GTTTATTACAGTTGAGATTACTTTTGCTTGGTCGAGCCCAGGGGTTGTCATGGGAGATTTTAATGCTATTAGAGTTTATTCTGAAGCATTTGGGGGATCTCCTATTCAGG
GTGAGATGGAGGATTTTGATCTGGCTATTCGCGATGTTGATTTAGTGGAGCCTTTGGTGCAGGGGAACTGGTTTACTTGGACTATGAATGATGATTGGTTATTTGCATGG
CCTACCATGTTGGTAAATGTGCTTCCATGGGGTATTTCTGATCATTCTCCAATTTTATTTTATCCTAGCTTTCAGCTAAATAGCAAAGTGGTGTCTTTTCGGTTCTTCAA
TCATTGGGTGGAGGATCCATCCTTTATTGAGGTGGTTGCTAGGAGGTGGAGTCGTCATGAGGGTGTCTCTCCGCTAGTGAGCCTCATGAGAAACCTTCATCATCTCAAAC
CTATTCTCCATGGACGGTTTGGTAGACACATCAAGAGTCTTAATGAGAAGGTGCGCATTGCTAAGTTGGCCATGGATATAGCTCAGAGAAAGGTAGAACGTAACCCAATG
TCGGATGTTTTGAGTCGCCAAGCAAGTCTTGCTACTGAGACTTTCTGGACAGCAGTTAGATTGGAGGAAGCCTCGCTTCGGCAGAAATCCAGAATTCGATGGTTAAAGCT
GGGTGATCAGAATACGGCTTTTTTCCATCGTTCCGTCTACTCTCGTATGAGTCGTAATTCACTACTTTCTCTAGTTGATTCTGATGGTGATGGATCCAGGGTGTCTTCAC
ATGATGGGGTGGTTCATATGGCAGTTAATTATTTTAGTAACAGTTTGGGGTCCCAGGAGATTAGCTATAGAGAATTGACCCCATCTGAGGAGTGTTGTCAGGCGTTACAG
TTACCTATTAGTAGGGAGGAAGTTAGGAGGGTCTTATTAACTATGGATAGTGGAAAGGCTCCCGGTCCTGATGGGTTCTCTTTAGGTTTCTTCAAAGTTAATGCTACTGC
TATCACCCTCATTCCTAAACATAATGGGGCTGAGCGTCTGGAGGACTTTCGTCCTATTTCTTGTTGTAATGTGTTATATAAATGCATTTCTAAAATTCTGGCTGATAGAC
TTCGTTCAGCTTTTATACCTGGGAGGAGTATTATCGAGAACATCCTGCTTTGTCAGGAATTGGTAGGAGTTGATCTTCAAAAAGCTTATGACTCTGTTAATTGGGATTTT
CTGTTTGGTTTGTTGATTGCTATTGGTACTCTGCTGAAGAAGGGTGTAAGACAAGGTGATCCTTTATCTCCTTTTCTCTTTGTTATGGTGATGGAAGTTCTTTCTCGTAT
GCTGAACAAGATTCCTCAGAGTTTTCAATTTCACCATCGTTGTGAAAAGAAGTTTGGTGAGCTTTCAGGTTTGTTTGCAAATCCTAAGAAAAGCTCTATCTTTGTTGCAG
GAGTTAATAATGAGAATGCTTCTCATCTGGCTGCTTGTATGGGTTTTGTCCGTGGAAATCTCTCTGTTCGTTATCTTGGCCTTCCTCTTCTTACTGGTCGGTTATGTTCT
AATGATTGTGCTCCTCTGATTCAGCGTATCACTAGCCAAATTCGTTCTTGGACTGCTCGAGTTCTTTCGTTTGCTGGTAGAATGCAGCTTGTTCGTTCTGTGCTTCGTAG
CCTTCAAGTTTACTGGGCTAGTGTGTTTGTTCTTCCTGCGTATGTGCATAATGAGGGTGGTCTTGGTATTCGAGATGGCCCTTCTTGGAATATTGCGAGTACTTTGAAGA
TTTTGTGGCTTATGTTGACAAATTCGGGTTCTCTTTGGGTGGCTTGGGTGGAGGCTTATATATTAAAGAGGAGGTCTTTGTGGGATGTGGATAGTAGAGTGGGTCGATCT
TGGTGTCTTCGGGCGATCTTACGTAAGCGAGAGAAGCTGAAACATCATGTAAGGATGAAGGTAGGGAATGGCAATAGATGTAGAGTTTGGCTTGATCCGTGGTTGCAGAG
GGTGATTTATGATGCAGGTAGTTGGAGGGAGGCTAGACTTTCTGACTTTATTGACCCAGATGGAGAATGGCTTTGGCCACGAGGTGGTTTCTCTATTGCAAGTGTATGGG
AAGCTATTCGTCCTAGGGGTGGTAGGGTTCTTTGGGATGATTTATTGTGGGGTGGGGGAAATATCCCAAAACATTCCTTTTGTTCGTGGTTGGCCATTAAAGATAGGTTG
GGCACTAGAGATAGATTACATAGGTGGGATAGTTCGGTACCGTTGTCGTGCATTCTACGTCAGGGGGGTGTGGAGTCTTGCAATCACTTATTTTTTTCAGTTCTTCGGAT
CATGGCTTCATCACATAGGATTGGGCATTGGGGGGTTGAGTTGTCTTGGATTTGTCATCAGGGTATTGGGAAGGGTGTGAGGAGGAAGCTGTGGCGTGTTCTTTGGTGGG
CAACTATCTATTTTATTTGGAACGAGTGGAATCATCGGTTACATGGTGGTCAAGCTCGTGATCCTGTTGTCCTTTTCCATCTTATTTGTTCGTGA
mRNA sequence
ATGGACGGGCTGAACGTAGTTGGGCCTGACTTAATTGGGCCTGATGTTGAATCAGAAGGAAAACGTGGATCTAAAGAAGCTGGGCGGCGAGACGATGGGCTGGGTTCTTT
TGGGCCGCGAAATGAAACTGTAATTGGAAGATGTGGGTCTTCGGTGTGTGGGCAGAAGCTGGATGATATTTCAAGTGGGTCGCGAATTGAATTTGGTGATTTTTTTAATT
CTGGAGAGGATGAGCCGAGCAACGTTTCGTCTGGTTGCGTTTTGAATAATCAGAATGCAAATGTCTCTAATGTTGTTCAGAATGGAATTAATAAGCAGCCTATTGACTCT
AAATCCACATGGGCTTCGCTATTTGGGACCTCTTCTGAAGAATCCCTGCTCTATACTCTGCCCAAAATTATTGGGGATAAAATTGTAGTCACTCCACCTGAAGAGGTTAT
CGATCAAGGAATTAAAGTGTGGGAAAATTCTTTGGTAGGTCAGCTGATTGATTCCAAGCTACCTTATACCGTTATTCAGCATTTGGTTGAGAAAATTTGGGGGAAGATTG
AAATGCCCATTATCACTATTTTGGAAAACGATCTCATTTGCTTTCAATTTAGACGGTCAAAATCAGTGGAGTGGATTCTTTCGCGTGGACCATGGCATCTCGATGACAAG
TCTATGCTCCTCAGAAAATGGACTCCAGGTATTGTTCCTGAATTCTTTGTGTTTAATTCAGTTCCGGTTTGGATTAGATTGGGGAAATTACCTATGGAGTTATGGACAGA
AGCTGGGTTGGCTGTTGTAGCGAGTGCTGTGGGGAAACCTATATCTTTAGACTTGGCCACTAAGGAGCGTCGTAGGCTCTCATATGCTCGTGTTTGTGTTGAATTAGAGG
CAGGGTCAAATATGCCTGCTGAAATTACTGTTAGTCTCAGAGGAGTAGATTTTAATGTTTCGGTTAATTATGAGTGGAAACCACGGAAGTGTAATTTGTGTTGTGCCTTT
GGGCATTCTGGTAGTAATTGTTCTAGAAGTGTGGAGAGTAAAACCATTCAGGAGGAGGTTGTGCACAAGGGGGATGATGTAGACAGTGAACCTTGTGGGGAAGTTGTTCT
TGAATCGTTCAAACAGTTAGAGGAAGGTGAAATTAGGAACTCTCCTAATAGACATAACAGCCAAGTGGAGAAGGGGGTGGGTAAAAGTGATGAATTTACCCTTGTAACTC
GCAGGAAGAGTGAGTTGGTCTCTGTTAGAGATCGTGGAAAGAGTATGGAGGTTGATGAAGGTACTGATATTCTTAATGGTGTTAGTTCTTCTATTGGTCCTAAAGGATTA
CCTACACTTAATAATTCTCATGATTATTACAGTAATAGTGGTGTTGGTCGGATTTTGGTGATGTGGAAGAAGAATCGTTTTTCTTTCTCTACTAATGTGATGGATGAGCA
GTTTATTACAGTTGAGATTACTTTTGCTTGGTCGAGCCCAGGGGTTGTCATGGGAGATTTTAATGCTATTAGAGTTTATTCTGAAGCATTTGGGGGATCTCCTATTCAGG
GTGAGATGGAGGATTTTGATCTGGCTATTCGCGATGTTGATTTAGTGGAGCCTTTGGTGCAGGGGAACTGGTTTACTTGGACTATGAATGATGATTGGTTATTTGCATGG
CCTACCATGTTGGTAAATGTGCTTCCATGGGGTATTTCTGATCATTCTCCAATTTTATTTTATCCTAGCTTTCAGCTAAATAGCAAAGTGGTGTCTTTTCGGTTCTTCAA
TCATTGGGTGGAGGATCCATCCTTTATTGAGGTGGTTGCTAGGAGGTGGAGTCGTCATGAGGGTGTCTCTCCGCTAGTGAGCCTCATGAGAAACCTTCATCATCTCAAAC
CTATTCTCCATGGACGGTTTGGTAGACACATCAAGAGTCTTAATGAGAAGGTGCGCATTGCTAAGTTGGCCATGGATATAGCTCAGAGAAAGGTAGAACGTAACCCAATG
TCGGATGTTTTGAGTCGCCAAGCAAGTCTTGCTACTGAGACTTTCTGGACAGCAGTTAGATTGGAGGAAGCCTCGCTTCGGCAGAAATCCAGAATTCGATGGTTAAAGCT
GGGTGATCAGAATACGGCTTTTTTCCATCGTTCCGTCTACTCTCGTATGAGTCGTAATTCACTACTTTCTCTAGTTGATTCTGATGGTGATGGATCCAGGGTGTCTTCAC
ATGATGGGGTGGTTCATATGGCAGTTAATTATTTTAGTAACAGTTTGGGGTCCCAGGAGATTAGCTATAGAGAATTGACCCCATCTGAGGAGTGTTGTCAGGCGTTACAG
TTACCTATTAGTAGGGAGGAAGTTAGGAGGGTCTTATTAACTATGGATAGTGGAAAGGCTCCCGGTCCTGATGGGTTCTCTTTAGGTTTCTTCAAAGTTAATGCTACTGC
TATCACCCTCATTCCTAAACATAATGGGGCTGAGCGTCTGGAGGACTTTCGTCCTATTTCTTGTTGTAATGTGTTATATAAATGCATTTCTAAAATTCTGGCTGATAGAC
TTCGTTCAGCTTTTATACCTGGGAGGAGTATTATCGAGAACATCCTGCTTTGTCAGGAATTGGTAGGAGTTGATCTTCAAAAAGCTTATGACTCTGTTAATTGGGATTTT
CTGTTTGGTTTGTTGATTGCTATTGGTACTCTGCTGAAGAAGGGTGTAAGACAAGGTGATCCTTTATCTCCTTTTCTCTTTGTTATGGTGATGGAAGTTCTTTCTCGTAT
GCTGAACAAGATTCCTCAGAGTTTTCAATTTCACCATCGTTGTGAAAAGAAGTTTGGTGAGCTTTCAGGTTTGTTTGCAAATCCTAAGAAAAGCTCTATCTTTGTTGCAG
GAGTTAATAATGAGAATGCTTCTCATCTGGCTGCTTGTATGGGTTTTGTCCGTGGAAATCTCTCTGTTCGTTATCTTGGCCTTCCTCTTCTTACTGGTCGGTTATGTTCT
AATGATTGTGCTCCTCTGATTCAGCGTATCACTAGCCAAATTCGTTCTTGGACTGCTCGAGTTCTTTCGTTTGCTGGTAGAATGCAGCTTGTTCGTTCTGTGCTTCGTAG
CCTTCAAGTTTACTGGGCTAGTGTGTTTGTTCTTCCTGCGTATGTGCATAATGAGGGTGGTCTTGGTATTCGAGATGGCCCTTCTTGGAATATTGCGAGTACTTTGAAGA
TTTTGTGGCTTATGTTGACAAATTCGGGTTCTCTTTGGGTGGCTTGGGTGGAGGCTTATATATTAAAGAGGAGGTCTTTGTGGGATGTGGATAGTAGAGTGGGTCGATCT
TGGTGTCTTCGGGCGATCTTACGTAAGCGAGAGAAGCTGAAACATCATGTAAGGATGAAGGTAGGGAATGGCAATAGATGTAGAGTTTGGCTTGATCCGTGGTTGCAGAG
GGTGATTTATGATGCAGGTAGTTGGAGGGAGGCTAGACTTTCTGACTTTATTGACCCAGATGGAGAATGGCTTTGGCCACGAGGTGGTTTCTCTATTGCAAGTGTATGGG
AAGCTATTCGTCCTAGGGGTGGTAGGGTTCTTTGGGATGATTTATTGTGGGGTGGGGGAAATATCCCAAAACATTCCTTTTGTTCGTGGTTGGCCATTAAAGATAGGTTG
GGCACTAGAGATAGATTACATAGGTGGGATAGTTCGGTACCGTTGTCGTGCATTCTACGTCAGGGGGGTGTGGAGTCTTGCAATCACTTATTTTTTTCAGTTCTTCGGAT
CATGGCTTCATCACATAGGATTGGGCATTGGGGGGTTGAGTTGTCTTGGATTTGTCATCAGGGTATTGGGAAGGGTGTGAGGAGGAAGCTGTGGCGTGTTCTTTGGTGGG
CAACTATCTATTTTATTTGGAACGAGTGGAATCATCGGTTACATGGTGGTCAAGCTCGTGATCCTGTTGTCCTTTTCCATCTTATTTGTTCGTGA
Protein sequence
MDGLNVVGPDLIGPDVESEGKRGSKEAGRRDDGLGSFGPRNETVIGRCGSSVCGQKLDDISSGSRIEFGDFFNSGEDEPSNVSSGCVLNNQNANVSNVVQNGINKQPIDS
KSTWASLFGTSSEESLLYTLPKIIGDKIVVTPPEEVIDQGIKVWENSLVGQLIDSKLPYTVIQHLVEKIWGKIEMPIITILENDLICFQFRRSKSVEWILSRGPWHLDDK
SMLLRKWTPGIVPEFFVFNSVPVWIRLGKLPMELWTEAGLAVVASAVGKPISLDLATKERRRLSYARVCVELEAGSNMPAEITVSLRGVDFNVSVNYEWKPRKCNLCCAF
GHSGSNCSRSVESKTIQEEVVHKGDDVDSEPCGEVVLESFKQLEEGEIRNSPNRHNSQVEKGVGKSDEFTLVTRRKSELVSVRDRGKSMEVDEGTDILNGVSSSIGPKGL
PTLNNSHDYYSNSGVGRILVMWKKNRFSFSTNVMDEQFITVEITFAWSSPGVVMGDFNAIRVYSEAFGGSPIQGEMEDFDLAIRDVDLVEPLVQGNWFTWTMNDDWLFAW
PTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVARRWSRHEGVSPLVSLMRNLHHLKPILHGRFGRHIKSLNEKVRIAKLAMDIAQRKVERNPM
SDVLSRQASLATETFWTAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVYSRMSRNSLLSLVDSDGDGSRVSSHDGVVHMAVNYFSNSLGSQEISYRELTPSEECCQALQ
LPISREEVRRVLLTMDSGKAPGPDGFSLGFFKVNATAITLIPKHNGAERLEDFRPISCCNVLYKCISKILADRLRSAFIPGRSIIENILLCQELVGVDLQKAYDSVNWDF
LFGLLIAIGTLLKKGVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPKKSSIFVAGVNNENASHLAACMGFVRGNLSVRYLGLPLLTGRLCS
NDCAPLIQRITSQIRSWTARVLSFAGRMQLVRSVLRSLQVYWASVFVLPAYVHNEGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKRRSLWDVDSRVGRS
WCLRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQRVIYDAGSWREARLSDFIDPDGEWLWPRGGFSIASVWEAIRPRGGRVLWDDLLWGGGNIPKHSFCSWLAIKDRL
GTRDRLHRWDSSVPLSCILRQGGVESCNHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRRKLWRVLWWATIYFIWNEWNHRLHGGQARDPVVLFHLICS