; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0021288 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0021288
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionReverse transcriptase
Genome locationchr08:26549720..26558561
RNA-Seq ExpressionIVF0021288
SyntenyIVF0021288
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035138.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]3.19e-27360.67Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRK +SRSLIS+LKAEKLLRKGC AFLAHV+VVQREKLKPEDV VVKEFLDVF D L GLPPDREIEFTIELLPGTAPISQAPY+MAPSELKELKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM
        LV                   +   G +    +++QLNKV IRNKYPLPRIDDLF+QLRGA+LFSKIDL+SGYH+LKVRESDI KTAFRTRYGHY+FRVM
Subjt:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM

Query:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEHRGF-------------------------FYKLYMMNSCTLSSATVSSGW----
        PF LTNAP +FMDLMN+IFH+YLDQFVI+FIDDIL YSVD+E+HEEH                            F +L +     L+   V   W    
Subjt:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEHRGF-------------------------FYKLYMMNSCTLSSATVSSGW----

Query:  -------------------------------------NKRRW----------------------------------------------------------
                                              +RRW                                                          
Subjt:  -------------------------------------NKRRW----------------------------------------------------------

Query:  -------KSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGG
               KSKKGLEVEFELRT GAI+KQGRLCV NISELKN ILEEAH+SAY MHPGSTKMYRTLKKTYWW G+K+EIAEYVDR LICQQVKPVRQRP G
Subjt:  -------KSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGG

Query:  FLNPLPVPEWKWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGL
        FLNPLPVPEWKWEHITMDFLFGLP TSSGHD                                               D+D RFTSKFWPSLQKAMGTGL
Subjt:  FLNPLPVPEWKWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGL

Query:  KFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESES---PELVQIMTNNIKLTRE
        KFSTSFHPQ DGQSERT+QTLEDMLRACVLQLKGSWDTHLPLMEF YNN+YQSSIGM+ YEALYGRPCRTP C  +        PELVQI TNNIKL RE
Subjt:  KFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESES---PELVQIMTNNIKLTRE

Query:  NLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG
        NLRIAQDRQKSY DKRRRNLEFQVGDQVFLKLSPW+GVIRFGRKG
Subjt:  NLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG

KAA0061889.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]4.79e-27360.03Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRK +SRSLIS+LKAEKLLRKGC AFLAH++VVQREKLKPEDV VVKEFLDVFPD L GLPPDREIEFTIELLPGTAPISQAPY++APSELKELKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM
        LV                   +   G +    +++QLNKV I NKYPLPRIDDLFDQLRGA+LFSKIDL+SGYH+LKVRESDI KTAFRTRYGHY+FRVM
Subjt:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM

Query:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEH-----------------------------------------------------
        PF LTNAP +FMDLMN+IFH+YLDQF+I+FIDDIL YSVD+E+HEEH                                                     
Subjt:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEH-----------------------------------------------------

Query:  ---------RGFFYKLYMMNS-------------------------------CTLS--------------------------------------------
                  G  Y++Y   S                               CT+                                             
Subjt:  ---------RGFFYKLYMMNS-------------------------------CTLS--------------------------------------------

Query:  --SATVSSGWNKRRWKSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVK
          S ++ + + K+  KSKKGLEVEFELRT GAIVKQ RLCV NISELKNAILEEAH+SAY MHPGSTKMYRTLKKTYWW G+K++IAEYVDR LICQQVK
Subjt:  --SATVSSGWNKRRWKSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVK

Query:  PVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSL
        PVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP TSSGHD                                               D+D RFTSKFWPSL
Subjt:  PVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSL

Query:  QKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESES---PELVQIMT
        QKAMGTGLKFSTSFHPQ DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNN+YQSSIGM+ YEALYGRPCRTP C  +        PELVQI T
Subjt:  QKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESES---PELVQIMT

Query:  NNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG
        NNIKL RENLR AQDRQKSY DKRRRNLEFQVGDQVFLKLSPW+GVIRFGRKG
Subjt:  NNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG

KAA0065520.1 retrotransposon protein, putative, Ty3-gypsy subclass, expressed [Cucumis melo var. makuwa]0.053.22Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDV VVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYK APSELKELKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LVTRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI
        LV +   G +        LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI
Subjt:  LVTRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI

Query:  FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSATVSS----------------------------------------GWNKRRWK-
        FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYM+NSCTLSSAT                                           G+  R+ K 
Subjt:  FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSATVSS----------------------------------------GWNKRRWK-

Query:  ---------------------------------------------------------------------------------SKK---------GLEV---
                                                                                         S+K         G+ V   
Subjt:  ---------------------------------------------------------------------------------SKK---------GLEV---

Query:  --------------------EFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR
                            +F++R+   +V+ GRLCVLNIS+LKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPG+K+EIAEYVDRYLICQQVKPVR
Subjt:  --------------------EFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR

Query:  QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD--DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL
        QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD  DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL
Subjt:  QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD--DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL

Query:  PLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR
        PLMEFAYNNSYQSSIG                         PELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPW+GVIRFGR
Subjt:  PLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR

Query:  KG-LS-------EIIRRESDL------------------------------HVLK---------------------------------------------
        KG LS       +II R   +                              HVL+                                             
Subjt:  KG-LS-------EIIRRESDL------------------------------HVLK---------------------------------------------

Query:  -------------------PRGNRSDIHIGKLVQD-------------------------------------------C---------------------
                           PRGNRSDIHIGKLVQD                                           C                     
Subjt:  -------------------PRGNRSDIHIGKLVQD-------------------------------------------C---------------------

Query:  --------TSSFGTHTPRTLVYIPDIVPHPFSQSIGWEASSLEEELRCNPKQSKRWPMLVQRVANARPESASENASDQYFYRRVKSVPTRVCLFCVGHKR
                TSSFGTHTPRTLVYIPDIVPHPFSQSIGWEASSLEEELRCNPKQSKRWPMLVQRVANARPESASENASDQYFYRRVKSVPTRVCLFCVGHKR
Subjt:  --------TSSFGTHTPRTLVYIPDIVPHPFSQSIGWEASSLEEELRCNPKQSKRWPMLVQRVANARPESASENASDQYFYRRVKSVPTRVCLFCVGHKR

Query:  FELRDSYPRTLVYIPGIVPHPFSQSIGGEASASRYGTQLRVLQSIFLASSLSRFPKYRSGSKIIEGRTKVQPETIEMMANALPTSGERSSQRRKWAKLKN
        FELRDSYPRTLVYIPGIVPHPFSQSI   A+    G         F       FPKYRS S+IIEGRTKVQPETIEMMANALPTSGERSSQ         
Subjt:  FELRDSYPRTLVYIPGIVPHPFSQSIGGEASASRYGTQLRVLQSIFLASSLSRFPKYRSGSKIIEGRTKVQPETIEMMANALPTSGERSSQRRKWAKLKN

Query:  FWCLSQKSASENSLHSVFYRGESANENALDSVFLSRGKKRSYNGFLVLCCIGHKRFKLQDSYSEDFSVHS--------------------------W--H
              KSASENSLHSVF               LSRGKKR  N F V CC+  KRF++ DSYSE F +HS                          W   
Subjt:  FWCLSQKSASENSLHSVFYRGESANENALDSVFLSRGKKRSYNGFLVLCCIGHKRFKLQDSYSEDFSVHS--------------------------W--H

Query:  RPSLIFPK---------------YRSGSELV--------------------------------------------QGRTKVQPETIENMANTLPRVANAR
        R S++ P+                ++G  +V                                            +G  +  P+  +       RVANAR
Subjt:  RPSLIFPK---------------YRSGSELV--------------------------------------------QGRTKVQPETIENMANTLPRVANAR

Query:  PMSGMIKHAFQQWKCAKLKNFWCSSQESVSENAPDSVFLSPDKKR---------------------------------PAMVSWFY------------VV
        P+SGMIKHAFQQWKCAKLKNFWCSSQES SENAPDSVFLSPDKKR                                 P+ V   Y            V 
Subjt:  PMSGMIKHAFQQWKCAKLKNFWCSSQESVSENAPDSVFLSPDKKR---------------------------------PAMVSWFY------------VV

Query:  LV------------------------------ISTSSFRTHTLRTSVYIRGNVRHPFFKSIGREASSFEGELR
        LV                              I++S FRT+TL+TSVYI G V H F + IGRE+SS +GELR
Subjt:  LV------------------------------ISTSSFRTHTLRTSVYIRGNVRHPFFKSIGREASSFEGELR

TYJ98798.1 retrotransposon protein, putative, Ty3-gypsy subclass, expressed [Cucumis melo var. makuwa]4.79e-30469.09Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LVTRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI
        LV +   G +        LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI
Subjt:  LVTRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI

Query:  FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSATVSS----------------------------------------GWNKRRWK-
        FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSAT                                           G+  R+ K 
Subjt:  FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSATVSS----------------------------------------GWNKRRWK-

Query:  ---------------------------------------------------------------------------------SKK---------GLEV---
                                                                                         S+K         G+ V   
Subjt:  ---------------------------------------------------------------------------------SKK---------GLEV---

Query:  --------------------EFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR
                            +F++R+   +V+ GRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR
Subjt:  --------------------EFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR

Query:  QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD--DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL
        QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD  DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL
Subjt:  QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD--DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL

Query:  PLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR
        PLMEFAYNNSYQSSIG                         PELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR
Subjt:  PLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR

Query:  KG
        KG
Subjt:  KG

TYK30559.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]7.94e-27764.87Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRK +SRSLIS+LKA KLLRKGCIAFLAH++VVQREKLK EDV VVKEFLDVFPD L GLPPDREIEFTIELLPGT PISQAPY+M+PSELK+LKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM
        LV                   +   G +    +++QLNKV IRNKYPLPRIDDLFDQLRGA+LFSKIDL+SGYH+LKVRESDI KT F+TRYGHY+FRVM
Subjt:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM

Query:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFY------KLY---------------------------------------
        PF LTN P +FMDLMN IFH+YLDQFVI+FIDDIL YS+D+E+HEEH           KLY                                       
Subjt:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFY------KLY---------------------------------------

Query:  ------------------------------MMNSCTLSSATVS----------------------------SGWNKRRWKSKKGLEVEFELRTYGAIVKQ
                                      ++N    S A V+                            S   K+  KSKKGLEVEFELRT G I KQ
Subjt:  ------------------------------MMNSCTLSSATVS----------------------------SGWNKRRWKSKKGLEVEFELRTYGAIVKQ

Query:  GRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSS
        GRLCV NISELKNAILEEAH+SAY M+PGSTKMYRTLKKTYWW G+K+EIAEYVDR LICQQVKPVRQR GGFLNPLP+PEWKWEHITMDFLFGLP TSS
Subjt:  GRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSS

Query:  GHD----------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSS
        GHD                D+D RFTSKFWPSLQKAMGTGLKFSTSFHPQ DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNN+YQSSIGM+ 
Subjt:  GHD----------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSS

Query:  YEALYGRPCRTPACGMKWESES---PELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG
        YEALYGRPCRTP C  +        PELVQI TNNIKL RENLR AQDRQKSY DKRRRNLEFQVGDQVFLKLSPW+GVIRFGRKG
Subjt:  YEALYGRPCRTPACGMKWESES---PELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG

TrEMBL top hitse value%identityAlignment
A0A5A7SX06 DNA/RNA polymerases superfamily protein5.0e-23360.56Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRK +SRSLIS+LKAEKLLRKGC AFLAHV+VVQREKLKPEDV VVKEFLDVF D L GLPPDREIEFTIELLPGTAPISQAPY+MAPSELKELKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM
        LV                   +   G +    +++QLNKV IRNKYPLPRIDDLF+QLRGA+LFSKIDL+SGYH+LKVRESDI KTAFRTRYGHY+FRVM
Subjt:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM

Query:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEHRGF-------------------------FYKLYMMNSCTLSSATVSSGW----
        PF LTNAP +FMDLMN+IFH+YLDQFVI+FIDDIL YSVD+E+HEEH                            F +L +     L+   V   W    
Subjt:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEHRGF-------------------------FYKLYMMNSCTLSSATVSSGW----

Query:  -------------------------------------NKRRW----------------------------------------------------------
                                              +RRW                                                          
Subjt:  -------------------------------------NKRRW----------------------------------------------------------

Query:  -------KSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGG
               KSKKGLEVEFELRT GAI+KQGRLCV NISELKN ILEEAH+SAY MHPGSTKMYRTLKKTYWW G+K+EIAEYVDR LICQQVKPVRQRP G
Subjt:  -------KSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGG

Query:  FLNPLPVPEWKWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGL
        FLNPLPVPEWKWEHITMDFLFGLP TSSGHD                                               D+D RFTSKFWPSLQKAMGTGL
Subjt:  FLNPLPVPEWKWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGL

Query:  KFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWES------ESPELVQIMTNNIKL
        KFSTSFHPQ DGQSERT+QTLEDMLRACVLQLKGSWDTHLPLMEF YNN+YQSSIGM+ YEALYGRPCRTP C   W          PELVQI TNNIKL
Subjt:  KFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWES------ESPELVQIMTNNIKL

Query:  TRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG
         RENLRIAQDRQKSY DKRRRNLEFQVGDQVFLKLSPW+GVIRFGRKG
Subjt:  TRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG

A0A5A7V873 DNA/RNA polymerases superfamily protein2.5e-23259.92Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRK +SRSLIS+LKAEKLLRKGC AFLAH++VVQREKLKPEDV VVKEFLDVFPD L GLPPDREIEFTIELLPGTAPISQAPY++APSELKELKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM
        LV                   +   G +    +++QLNKV I NKYPLPRIDDLFDQLRGA+LFSKIDL+SGYH+LKVRESDI KTAFRTRYGHY+FRVM
Subjt:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM

Query:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEH-----------------------------------------------------
        PF LTNAP +FMDLMN+IFH+YLDQF+I+FIDDIL YSVD+E+HEEH                                                     
Subjt:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEH-----------------------------------------------------

Query:  ---------RGFFYKLYMMNS-------------------------------CTL---------------------------------------------
                  G  Y++Y   S                               CT+                                             
Subjt:  ---------RGFFYKLYMMNS-------------------------------CTL---------------------------------------------

Query:  -SSATVSSGWNKRRWKSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVK
          S ++ + + K+  KSKKGLEVEFELRT GAIVKQ RLCV NISELKNAILEEAH+SAY MHPGSTKMYRTLKKTYWW G+K++IAEYVDR LICQQVK
Subjt:  -SSATVSSGWNKRRWKSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVK

Query:  PVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSL
        PVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP TSSGHD                                               D+D RFTSKFWPSL
Subjt:  PVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSL

Query:  QKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWES------ESPELVQ
        QKAMGTGLKFSTSFHPQ DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNN+YQSSIGM+ YEALYGRPCRTP C   W          PELVQ
Subjt:  QKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWES------ESPELVQ

Query:  IMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG
        I TNNIKL RENLR AQDRQKSY DKRRRNLEFQVGDQVFLKLSPW+GVIRFGRKG
Subjt:  IMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG

A0A5A7VEB1 Retrotransposon protein, putative, Ty3-gypsy subclass, expressed0.0e+0052.89Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDV VVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYK APSELKELKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LVTRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI
        LV +   G +        LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI
Subjt:  LVTRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI

Query:  FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSAT----------------------------------------------------
        FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYM+NSCTLSSAT                                                    
Subjt:  FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSAT----------------------------------------------------

Query:  ----------------------------------------------VSSGWNKRRW--------------------------------------------
                                                            +R+W                                            
Subjt:  ----------------------------------------------VSSGWNKRRW--------------------------------------------

Query:  ------------KSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR
                    +    L  +F++R+   +V+ GRLCVLNIS+LKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPG+K+EIAEYVDRYLICQQVKPVR
Subjt:  ------------KSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR

Query:  QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD--DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL
        QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD  DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL
Subjt:  QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD--DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL

Query:  PLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR
        PLMEFAYNNSYQSSIG                         PELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPW+GVIRFGR
Subjt:  PLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR

Query:  KG-LS-------EIIRRESDL------------------------------HVL----------------------------------------------
        KG LS       +II R   +                              HVL                                              
Subjt:  KG-LS-------EIIRRESDL------------------------------HVL----------------------------------------------

Query:  ------------------KPRGNRSDIHIGKLVQD-------------------------------------------C---------------------
                          +PRGNRSDIHIGKLVQD                                           C                     
Subjt:  ------------------KPRGNRSDIHIGKLVQD-------------------------------------------C---------------------

Query:  --------TSSFGTHTPRTLVYIPDIVPHPFSQSIGWEASSLEEELRCNPKQSKRWPMLVQRVANARPESASENASDQYFYRRVKSVPTRVCLFCVGHKR
                TSSFGTHTPRTLVYIPDIVPHPFSQSIGWEASSLEEELRCNPKQSKRWPMLVQRVANARPESASENASDQYFYRRVKSVPTRVCLFCVGHKR
Subjt:  --------TSSFGTHTPRTLVYIPDIVPHPFSQSIGWEASSLEEELRCNPKQSKRWPMLVQRVANARPESASENASDQYFYRRVKSVPTRVCLFCVGHKR

Query:  FELRDSYPRTLVYIPGIVPHPFSQSIGGEASASRYGTQLRVLQSIFLASSLSRFPKYRSGSKIIEGRTKVQPETIEMMANALPTSGERSSQRRKWAKLKN
        FELRDSYPRTLVYIPGIVPHPFSQSI   A+    G         F       FPKYRS S+IIEGRTKVQPETIEMMANALPTSGERS           
Subjt:  FELRDSYPRTLVYIPGIVPHPFSQSIGGEASASRYGTQLRVLQSIFLASSLSRFPKYRSGSKIIEGRTKVQPETIEMMANALPTSGERSSQRRKWAKLKN

Query:  FWCLSQKSASENSLHSVFYRGESANENALDSVFLSRGKKRSYNGFLVLCCIGHKRFKLQDSYSEDFSVHS--------------------------W--H
            SQKSASENSLH               SVFLSRGKKR  N F V CC+  KRF++ DSYSE F +HS                          W   
Subjt:  FWCLSQKSASENSLHSVFYRGESANENALDSVFLSRGKKRSYNGFLVLCCIGHKRFKLQDSYSEDFSVHS--------------------------W--H

Query:  RPSLIFPK---------------YRSGSELV--------------------------------------------QGRTKVQPETIENMANTLPRVANAR
        R S++ P+                ++G  +V                                            +G  +  P+  +       RVANAR
Subjt:  RPSLIFPK---------------YRSGSELV--------------------------------------------QGRTKVQPETIENMANTLPRVANAR

Query:  PMSGMIKHAFQQWKCAKLKNFWCSSQESVSENAPDSVFLSPDKKRP-------------------------AMVSW--------------------FYVV
        P+SGMIKHAFQQWKCAKLKNFWCSSQES SENAPDSVFLSPDKKR                          ++ SW                      V 
Subjt:  PMSGMIKHAFQQWKCAKLKNFWCSSQESVSENAPDSVFLSPDKKRP-------------------------AMVSW--------------------FYVV

Query:  LV------------------------------ISTSSFRTHTLRTSVYIRGNVRHPFFKSIGREASSFEGELR
        LV                              I++S FRT+TL+TSVYI G V H F + IGRE+SS +GELR
Subjt:  LV------------------------------ISTSSFRTHTLRTSVYIRGNVRHPFFKSIGREASSFEGELR

A0A5D3BKS7 Retrotransposon protein, putative, Ty3-gypsy subclass, expressed2.6e-25068.52Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LVTRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI
        LV +   G +        LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI
Subjt:  LVTRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKI

Query:  FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSAT----------------------------------------------------
        FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSAT                                                    
Subjt:  FHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFYKLYMMNSCTLSSAT----------------------------------------------------

Query:  ----------------------------------------------VSSGWNKRRW--------------------------------------------
                                                            +R+W                                            
Subjt:  ----------------------------------------------VSSGWNKRRW--------------------------------------------

Query:  ------------KSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR
                    +    L  +F++R+   +V+ GRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR
Subjt:  ------------KSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVR

Query:  QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD--DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL
        QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD  DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL
Subjt:  QRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHD--DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHL

Query:  PLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR
        PLMEFAYNNSYQSSIG                         PELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR
Subjt:  PLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGR

Query:  KG
        KG
Subjt:  KG

A0A5D3E424 Transposon Ty3-G Gag-Pol polyprotein5.3e-23564.73Show/hide
Query:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE
        MRK +SRSLIS+LKA KLLRKGCIAFLAH++VVQREKLK EDV VVKEFLDVFPD L GLPPDREIEFTIELLPGT PISQAPY+M+PSELK+LKMQLQE
Subjt:  MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQE

Query:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM
        LV                   +   G +    +++QLNKV IRNKYPLPRIDDLFDQLRGA+LFSKIDL+SGYH+LKVRESDI KT F+TRYGHY+FRVM
Subjt:  LV------------------TRDTSGLVYRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVM

Query:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFY------KLY---------------------------------------
        PF LTN P +FMDLMN IFH+YLDQFVI+FIDDIL YS+D+E+HEEH           KLY                                       
Subjt:  PFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYSVDKEAHEEHRGFFY------KLY---------------------------------------

Query:  ------------------------------MMNSCTLSSATVS----------------------------SGWNKRRWKSKKGLEVEFELRTYGAIVKQ
                                      ++N    S A V+                            S   K+  KSKKGLEVEFELRT G I KQ
Subjt:  ------------------------------MMNSCTLSSATVS----------------------------SGWNKRRWKSKKGLEVEFELRTYGAIVKQ

Query:  GRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSS
        GRLCV NISELKNAILEEAH+SAY M+PGSTKMYRTLKKTYWW G+K+EIAEYVDR LICQQVKPVRQR GGFLNPLP+PEWKWEHITMDFLFGLP TSS
Subjt:  GRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSS

Query:  GHD----------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSS
        GHD                D+D RFTSKFWPSLQKAMGTGLKFSTSFHPQ DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNN+YQSSIGM+ 
Subjt:  GHD----------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSS

Query:  YEALYGRPCRTPACGMKWES------ESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG
        YEALYGRPCRTP C   W          PELVQI TNNIKL RENLR AQDRQKSY DKRRRNLEFQVGDQVFLKLSPW+GVIRFGRKG
Subjt:  YEALYGRPCRTPACGMKWES------ESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.8e-2826.85Show/hide
Query:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW
        K +E   +L+    I  + ++ + N ++L   I+++ H     +HPG   +   + + + W GI+K+I EYV     CQ  K    +P G L P+P  E 
Subjt:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW

Query:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI
         WE ++MDF+  LP  SSG++                                               D D  FTS+ W          +KFS  + PQ 
Subjt:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI

Query:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR
        DGQ+ERT QT+E +LR        +W  H+ L++ +YNN+  S+  M+ +E ++       P   P+   K +  S E +Q+     +  +E+L     +
Subjt:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR

Query:  QKSYVDKRRRNL-EFQVGDQVFLK
         K Y D + + + EFQ GD V +K
Subjt:  QKSYVDKRRRNL-EFQVGDQVFLK

P0CT34 Transposon Tf2-1 polyprotein4.2e-1929.38Show/hide
Query:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ
        +PE   + KEF D+  +     LP P + +EF +EL      +    Y + P +++    E+   L+  + R++               G +  + +++ 
Subjt:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ

Query:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA
        LNK    N YPLP I+ L  +++G+++F+K+DLKS YH ++VR+ D  K AFR   G +++ VMP+ ++ AP  F   +N I  +  +  V+ ++DDIL 
Subjt:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA

Query:  YSVDKEAHEEH
        +S  +  H +H
Subjt:  YSVDKEAHEEH

P0CT35 Transposon Tf2-2 polyprotein3.8e-2826.85Show/hide
Query:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW
        K +E   +L+    I  + ++ + N ++L   I+++ H     +HPG   +   + + + W GI+K+I EYV     CQ  K    +P G L P+P  E 
Subjt:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW

Query:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI
         WE ++MDF+  LP  SSG++                                               D D  FTS+ W          +KFS  + PQ 
Subjt:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI

Query:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR
        DGQ+ERT QT+E +LR        +W  H+ L++ +YNN+  S+  M+ +E ++       P   P+   K +  S E +Q+     +  +E+L     +
Subjt:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR

Query:  QKSYVDKRRRNL-EFQVGDQVFLK
         K Y D + + + EFQ GD V +K
Subjt:  QKSYVDKRRRNL-EFQVGDQVFLK

P0CT35 Transposon Tf2-2 polyprotein4.2e-1929.38Show/hide
Query:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ
        +PE   + KEF D+  +     LP P + +EF +EL      +    Y + P +++    E+   L+  + R++               G +  + +++ 
Subjt:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ

Query:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA
        LNK    N YPLP I+ L  +++G+++F+K+DLKS YH ++VR+ D  K AFR   G +++ VMP+ ++ AP  F   +N I  +  +  V+ ++DDIL 
Subjt:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA

Query:  YSVDKEAHEEH
        +S  +  H +H
Subjt:  YSVDKEAHEEH

P0CT36 Transposon Tf2-3 polyprotein3.8e-2826.85Show/hide
Query:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW
        K +E   +L+    I  + ++ + N ++L   I+++ H     +HPG   +   + + + W GI+K+I EYV     CQ  K    +P G L P+P  E 
Subjt:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW

Query:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI
         WE ++MDF+  LP  SSG++                                               D D  FTS+ W          +KFS  + PQ 
Subjt:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI

Query:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR
        DGQ+ERT QT+E +LR        +W  H+ L++ +YNN+  S+  M+ +E ++       P   P+   K +  S E +Q+     +  +E+L     +
Subjt:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR

Query:  QKSYVDKRRRNL-EFQVGDQVFLK
         K Y D + + + EFQ GD V +K
Subjt:  QKSYVDKRRRNL-EFQVGDQVFLK

P0CT36 Transposon Tf2-3 polyprotein4.2e-1929.38Show/hide
Query:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ
        +PE   + KEF D+  +     LP P + +EF +EL      +    Y + P +++    E+   L+  + R++               G +  + +++ 
Subjt:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ

Query:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA
        LNK    N YPLP I+ L  +++G+++F+K+DLKS YH ++VR+ D  K AFR   G +++ VMP+ ++ AP  F   +N I  +  +  V+ ++DDIL 
Subjt:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA

Query:  YSVDKEAHEEH
        +S  +  H +H
Subjt:  YSVDKEAHEEH

P0CT41 Transposon Tf2-12 polyprotein3.8e-2826.85Show/hide
Query:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW
        K +E   +L+    I  + ++ + N ++L   I+++ H     +HPG   +   + + + W GI+K+I EYV     CQ  K    +P G L P+P  E 
Subjt:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW

Query:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI
         WE ++MDF+  LP  SSG++                                               D D  FTS+ W          +KFS  + PQ 
Subjt:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI

Query:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR
        DGQ+ERT QT+E +LR        +W  H+ L++ +YNN+  S+  M+ +E ++       P   P+   K +  S E +Q+     +  +E+L     +
Subjt:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR

Query:  QKSYVDKRRRNL-EFQVGDQVFLK
         K Y D + + + EFQ GD V +K
Subjt:  QKSYVDKRRRNL-EFQVGDQVFLK

P0CT41 Transposon Tf2-12 polyprotein4.2e-1929.38Show/hide
Query:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ
        +PE   + KEF D+  +     LP P + +EF +EL      +    Y + P +++    E+   L+  + R++               G +  + +++ 
Subjt:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ

Query:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA
        LNK    N YPLP I+ L  +++G+++F+K+DLKS YH ++VR+ D  K AFR   G +++ VMP+ ++ AP  F   +N I  +  +  V+ ++DDIL 
Subjt:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA

Query:  YSVDKEAHEEH
        +S  +  H +H
Subjt:  YSVDKEAHEEH

Q9UR07 Transposon Tf2-11 polyprotein3.8e-2826.85Show/hide
Query:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW
        K +E   +L+    I  + ++ + N ++L   I+++ H     +HPG   +   + + + W GI+K+I EYV     CQ  K    +P G L P+P  E 
Subjt:  KGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIAEYVDRYLICQQVKPVRQRPGGFLNPLPVPEW

Query:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI
         WE ++MDF+  LP  SSG++                                               D D  FTS+ W          +KFS  + PQ 
Subjt:  KWEHITMDFLFGLPHTSSGHD-----------------------------------------------DKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQI

Query:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR
        DGQ+ERT QT+E +LR        +W  H+ L++ +YNN+  S+  M+ +E ++       P   P+   K +  S E +Q+     +  +E+L     +
Subjt:  DGQSERTIQTLEDMLRACVLQLKGSWDTHLPLMEFAYNNSYQSSIGMSSYEALYG-----RPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDR

Query:  QKSYVDKRRRNL-EFQVGDQVFLK
         K Y D + + + EFQ GD V +K
Subjt:  QKSYVDKRRRNL-EFQVGDQVFLK

Q9UR07 Transposon Tf2-11 polyprotein2.7e-1828.91Show/hide
Query:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ
        +PE   + KEF D+  +     LP P + +EF +EL      +    Y + P +++    E+   L+  + R++               G +  + +++ 
Subjt:  KPEDVSVVKEFLDVFPD-YLLGLP-PDREIEFTIELLPGTAPISQAPYKMAPSELK----ELKMQLQELVTRDT--------------SGLVYRLREHQQ

Query:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA
        LNK    N YPLP I+ L  +++G+++F+K+DLKS YH ++VR+ D  K AFR   G +++ VMP+ ++ AP  F   +N I  +  +  V+ ++D+IL 
Subjt:  LNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILA

Query:  YSVDKEAHEEH
        +S  +  H +H
Subjt:  YSVDKEAHEEH

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGGTCATTTCTAGAAGTTTAATCTCAATTTTGAAAGCTGAGAAATTGTTGAGGAAAGGTTGCATAGCGTTTCTTGCACACGTCATAGTAGTGCAGAGAGAAAA
GCTGAAGCCAGAAGATGTTTCTGTGGTGAAAGAGTTTCTTGATGTATTTCCAGATTATCTGTTAGGTTTGCCACCTGATAGAGAGATTGAGTTCACTATTGAATTATTAC
CAGGAACAGCACCTATTTCACAGGCACCATATAAAATGGCTCCAAGCGAGCTTAAAGAATTGAAGATGCAGTTACAAGAACTGGTGACAAGGGATACATCAGGCCTAGTG
TATCGCCTTAGGGAGCACCAACAGTTAAATAAGGTTAAAATACGTAACAAGTATCCTTTACCACGCATTGATGACTTATTTGATCAACTAAGGGGAGCATCATTGTTCTC
TAAGATTGACTTAAAGTCAGGATACCACAAGTTGAAGGTTAGAGAATCAGATATTGATAAGACAGCATTCAGAACGAGGTATGGGCATTATAAGTTTCGAGTTATGCCAT
TCGTGTTAACGAATGCGCCAACGATTTTCATGGATCTCATGAACAAGATCTTCCATCAATATTTAGATCAGTTTGTGATCATGTTCATTGATGATATATTAGCTTACTCG
GTTGACAAAGAAGCTCATGAGGAACATCGAGGATTTTTCTACAAACTCTACATGATGAACAGTTGTACGCTAAGTTCAGCAACTGTGAGTTCTGGTTGGAACAAAAGAAG
ATGGAAATCCAAGAAAGGCTTAGAGGTGGAGTTTGAACTGAGAACATATGGAGCCATTGTTAAACAGGGAAGATTATGTGTTCTGAATATCAGTGAGCTTAAGAATGCTA
TTCTAGAAGAAGCTCACAATTCAGCTTACGATATGCATCCAGGTAGCACCAAGATGTACAGAACTTTAAAGAAGACTTATTGGTGGCCTGGAATAAAGAAAGAGATAGCT
GAATATGTTGATAGATATTTGATTTGTCAACAGGTTAAACCAGTGAGACAGAGGCCAGGAGGATTTCTTAATCCTTTGCCAGTGCCAGAGTGGAAGTGGGAGCATATTAC
TATGGATTTCCTGTTTGGATTACCTCATACATCCAGTGGACATGATGATAAGGATCTGAGGTTTACTTCTAAATTTTGGCCTAGTTTGCAAAAAGCAATGGGAACAGGGC
TAAAGTTTAGTACATCATTTCATCCCCAAATAGATGGTCAGTCTGAGAGGACCATCCAAACTTTAGAGGACATGTTGAGAGCATGTGTCCTTCAACTTAAAGGAAGTTGG
GATACCCACTTGCCACTTATGGAGTTTGCTTATAATAATAGCTATCAGTCTAGTATCGGTATGTCATCATATGAGGCCTTATACGGAAGACCATGCAGAACTCCTGCGTG
TGGAATGAAGTGGGAGAGTGAAAGTCCTGAGTTGGTTCAGATTATGACAAACAATATTAAGTTAACTAGAGAAAACCTGAGGATAGCCCAAGATCGGCAGAAAAGTTATG
TGGATAAGCGACGAAGAAACTTAGAATTTCAAGTTGGAGATCAAGTTTTCTTAAAGTTATCTCCATGGCAAGGTGTTATTCGTTTTGGAAGGAAAGGGTTATCTGAGATT
ATTCGGAGGGAATCCGATTTACATGTCCTGAAGCCAAGAGGAAATAGGTCAGATATACACATCGGAAAACTAGTCCAAGACTGCACTTCAAGCTTCGGAACTCATACTCC
AAGGACCTTAGTCTACATTCCTGACATTGTCCCTCACCCATTTTCTCAAAGTATAGGTTGGGAAGCGAGCTCGTTGGAGGAAGAACTAAGGTGCAACCCGAAACAATCGA
AAAGATGGCCAATGCTCGTTCAACGAGTGGCGAATGCTCGTCCGGAAAGTGCGAGCGAGAATGCCTCAGATCAGTATTTCTATCGCAGGGTAAAAAGCGTTCCTACAAGG
GTTTGTTTGTTTTGTGTTGGTCATAAGCGCTTCGAGCTTCGGGACTCATATCCGAGAACTTTAGTCTACATTCCTGGCATCGTCCCTCACCCGTTTTCCCAAAGTATAGG
TGGGGAAGCGAGCGCTTCGCGCTACGGGACTCAACTCAGAGTACTTCAGTCTATATTTCTGGCATCGTCCCTCTCCCGTTTTCCCAAGTATAGGTCGGGAAGTAAGATCA
TTGAAGGGAGAACTAAGGTGCAACCCGAAACGATCGAAATGATGGCCAATGCTCTTCCAACTAGTGGAGAACGCTCGTCCCAACGAAGAAAATGGGCTAAGCTCAAGAAC
TTCTGGTGTTTGTCTCAGAAAAGTGCGAGCGAGAATTCCCTGCACTCAGTTTTCTATCGCGGGGAAAGTGCGAACGAGAATGCCCTGGACTCAGTTTTTCTATCTCGGGG
TAAAAAACGGTCCTACAATGGTTTCTTGGTTTTGTGTTGTATTGGTCATAAGCGCTTCAAGCTTCAGGACTCATACTCTGAGGACTTCAGTGTACATTCCTGGCATCGTC
CGTCACTCATTTTCCCAAAGTATAGGTCGGGAAGCGAGCTCGTTCAAGGGAGAACAAAGGTGCAACCCGAAACGATCGAAAATATGGCCAATACTCTTCCAAGAGTGGCG
AACGCTCGTCCGATGTCCGGAATGATTAAACATGCATTTCAGCAATGGAAATGTGCTAAGCTTAAGAACTTCTGGTGTTCGTCCCAGGAAAGTGTGAGCGAGAATGCCCC
GGACTCAGTTTTTCTATCGCCGGATAAAAAGCGTCCTGCGATGGTTTCTTGGTTTTATGTTGTGTTGGTCATAAGCACTTCGAGCTTCAGGACTCATACTCTGAGGACTT
CAGTCTACATTCGTGGCAATGTCCGTCACCCGTTTTTCAAAAGTATAGGACGGGAAGCAAGCTCGTTCGAGGGAGAACTAAGGTGCAACTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAAGGTCATTTCTAGAAGTTTAATCTCAATTTTGAAAGCTGAGAAATTGTTGAGGAAAGGTTGCATAGCGTTTCTTGCACACGTCATAGTAGTGCAGAGAGAAAA
GCTGAAGCCAGAAGATGTTTCTGTGGTGAAAGAGTTTCTTGATGTATTTCCAGATTATCTGTTAGGTTTGCCACCTGATAGAGAGATTGAGTTCACTATTGAATTATTAC
CAGGAACAGCACCTATTTCACAGGCACCATATAAAATGGCTCCAAGCGAGCTTAAAGAATTGAAGATGCAGTTACAAGAACTGGTGACAAGGGATACATCAGGCCTAGTG
TATCGCCTTAGGGAGCACCAACAGTTAAATAAGGTTAAAATACGTAACAAGTATCCTTTACCACGCATTGATGACTTATTTGATCAACTAAGGGGAGCATCATTGTTCTC
TAAGATTGACTTAAAGTCAGGATACCACAAGTTGAAGGTTAGAGAATCAGATATTGATAAGACAGCATTCAGAACGAGGTATGGGCATTATAAGTTTCGAGTTATGCCAT
TCGTGTTAACGAATGCGCCAACGATTTTCATGGATCTCATGAACAAGATCTTCCATCAATATTTAGATCAGTTTGTGATCATGTTCATTGATGATATATTAGCTTACTCG
GTTGACAAAGAAGCTCATGAGGAACATCGAGGATTTTTCTACAAACTCTACATGATGAACAGTTGTACGCTAAGTTCAGCAACTGTGAGTTCTGGTTGGAACAAAAGAAG
ATGGAAATCCAAGAAAGGCTTAGAGGTGGAGTTTGAACTGAGAACATATGGAGCCATTGTTAAACAGGGAAGATTATGTGTTCTGAATATCAGTGAGCTTAAGAATGCTA
TTCTAGAAGAAGCTCACAATTCAGCTTACGATATGCATCCAGGTAGCACCAAGATGTACAGAACTTTAAAGAAGACTTATTGGTGGCCTGGAATAAAGAAAGAGATAGCT
GAATATGTTGATAGATATTTGATTTGTCAACAGGTTAAACCAGTGAGACAGAGGCCAGGAGGATTTCTTAATCCTTTGCCAGTGCCAGAGTGGAAGTGGGAGCATATTAC
TATGGATTTCCTGTTTGGATTACCTCATACATCCAGTGGACATGATGATAAGGATCTGAGGTTTACTTCTAAATTTTGGCCTAGTTTGCAAAAAGCAATGGGAACAGGGC
TAAAGTTTAGTACATCATTTCATCCCCAAATAGATGGTCAGTCTGAGAGGACCATCCAAACTTTAGAGGACATGTTGAGAGCATGTGTCCTTCAACTTAAAGGAAGTTGG
GATACCCACTTGCCACTTATGGAGTTTGCTTATAATAATAGCTATCAGTCTAGTATCGGTATGTCATCATATGAGGCCTTATACGGAAGACCATGCAGAACTCCTGCGTG
TGGAATGAAGTGGGAGAGTGAAAGTCCTGAGTTGGTTCAGATTATGACAAACAATATTAAGTTAACTAGAGAAAACCTGAGGATAGCCCAAGATCGGCAGAAAAGTTATG
TGGATAAGCGACGAAGAAACTTAGAATTTCAAGTTGGAGATCAAGTTTTCTTAAAGTTATCTCCATGGCAAGGTGTTATTCGTTTTGGAAGGAAAGGGTTATCTGAGATT
ATTCGGAGGGAATCCGATTTACATGTCCTGAAGCCAAGAGGAAATAGGTCAGATATACACATCGGAAAACTAGTCCAAGACTGCACTTCAAGCTTCGGAACTCATACTCC
AAGGACCTTAGTCTACATTCCTGACATTGTCCCTCACCCATTTTCTCAAAGTATAGGTTGGGAAGCGAGCTCGTTGGAGGAAGAACTAAGGTGCAACCCGAAACAATCGA
AAAGATGGCCAATGCTCGTTCAACGAGTGGCGAATGCTCGTCCGGAAAGTGCGAGCGAGAATGCCTCAGATCAGTATTTCTATCGCAGGGTAAAAAGCGTTCCTACAAGG
GTTTGTTTGTTTTGTGTTGGTCATAAGCGCTTCGAGCTTCGGGACTCATATCCGAGAACTTTAGTCTACATTCCTGGCATCGTCCCTCACCCGTTTTCCCAAAGTATAGG
TGGGGAAGCGAGCGCTTCGCGCTACGGGACTCAACTCAGAGTACTTCAGTCTATATTTCTGGCATCGTCCCTCTCCCGTTTTCCCAAGTATAGGTCGGGAAGTAAGATCA
TTGAAGGGAGAACTAAGGTGCAACCCGAAACGATCGAAATGATGGCCAATGCTCTTCCAACTAGTGGAGAACGCTCGTCCCAACGAAGAAAATGGGCTAAGCTCAAGAAC
TTCTGGTGTTTGTCTCAGAAAAGTGCGAGCGAGAATTCCCTGCACTCAGTTTTCTATCGCGGGGAAAGTGCGAACGAGAATGCCCTGGACTCAGTTTTTCTATCTCGGGG
TAAAAAACGGTCCTACAATGGTTTCTTGGTTTTGTGTTGTATTGGTCATAAGCGCTTCAAGCTTCAGGACTCATACTCTGAGGACTTCAGTGTACATTCCTGGCATCGTC
CGTCACTCATTTTCCCAAAGTATAGGTCGGGAAGCGAGCTCGTTCAAGGGAGAACAAAGGTGCAACCCGAAACGATCGAAAATATGGCCAATACTCTTCCAAGAGTGGCG
AACGCTCGTCCGATGTCCGGAATGATTAAACATGCATTTCAGCAATGGAAATGTGCTAAGCTTAAGAACTTCTGGTGTTCGTCCCAGGAAAGTGTGAGCGAGAATGCCCC
GGACTCAGTTTTTCTATCGCCGGATAAAAAGCGTCCTGCGATGGTTTCTTGGTTTTATGTTGTGTTGGTCATAAGCACTTCGAGCTTCAGGACTCATACTCTGAGGACTT
CAGTCTACATTCGTGGCAATGTCCGTCACCCGTTTTTCAAAAGTATAGGACGGGAAGCAAGCTCGTTCGAGGGAGAACTAAGGTGCAACTCGTAA
Protein sequenceShow/hide protein sequence
MRKVISRSLISILKAEKLLRKGCIAFLAHVIVVQREKLKPEDVSVVKEFLDVFPDYLLGLPPDREIEFTIELLPGTAPISQAPYKMAPSELKELKMQLQELVTRDTSGLV
YRLREHQQLNKVKIRNKYPLPRIDDLFDQLRGASLFSKIDLKSGYHKLKVRESDIDKTAFRTRYGHYKFRVMPFVLTNAPTIFMDLMNKIFHQYLDQFVIMFIDDILAYS
VDKEAHEEHRGFFYKLYMMNSCTLSSATVSSGWNKRRWKSKKGLEVEFELRTYGAIVKQGRLCVLNISELKNAILEEAHNSAYDMHPGSTKMYRTLKKTYWWPGIKKEIA
EYVDRYLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPHTSSGHDDKDLRFTSKFWPSLQKAMGTGLKFSTSFHPQIDGQSERTIQTLEDMLRACVLQLKGSW
DTHLPLMEFAYNNSYQSSIGMSSYEALYGRPCRTPACGMKWESESPELVQIMTNNIKLTRENLRIAQDRQKSYVDKRRRNLEFQVGDQVFLKLSPWQGVIRFGRKGLSEI
IRRESDLHVLKPRGNRSDIHIGKLVQDCTSSFGTHTPRTLVYIPDIVPHPFSQSIGWEASSLEEELRCNPKQSKRWPMLVQRVANARPESASENASDQYFYRRVKSVPTR
VCLFCVGHKRFELRDSYPRTLVYIPGIVPHPFSQSIGGEASASRYGTQLRVLQSIFLASSLSRFPKYRSGSKIIEGRTKVQPETIEMMANALPTSGERSSQRRKWAKLKN
FWCLSQKSASENSLHSVFYRGESANENALDSVFLSRGKKRSYNGFLVLCCIGHKRFKLQDSYSEDFSVHSWHRPSLIFPKYRSGSELVQGRTKVQPETIENMANTLPRVA
NARPMSGMIKHAFQQWKCAKLKNFWCSSQESVSENAPDSVFLSPDKKRPAMVSWFYVVLVISTSSFRTHTLRTSVYIRGNVRHPFFKSIGREASSFEGELRCNS