CuGenDBv2

PI0027630 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0027630
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr06:12664583..12666672
RNA-Seq Expression: PI0027630
Synteny: PI0027630
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
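The genome location in the summary above is written in `chrom:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, since this page does not state the convention), the string can be split into its parts and the locus span computed. The helper name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr06:12664583..12666672")
# Span length under the assumed 1-based inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be confirmed before the value is used downstream.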


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa] | e value: 3.2e-178 | % identity: 66.81
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EV CVYASN++ + R LWR L EIT  WS  GVVMGDFNAIRVHS+A GGSP+ G+MEEFD+AI DADLVEP+VQGNWFTWTSKV GSG LRRLDR+LV
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
        N++ L AWP++R++VLPWGISDHSP+L Y   +   R +SF FFNHW E+ +F ++V+ +W R  GVS LVSLMRNLH LKP+LR  FGRHI+ LSEEV 
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
         +KE MD AQREVER+P S   SR AS+ATE FW+A+R EEASL QKS+VRWL LGDQN AFFHRS+RSR+  NSLL +VDS+G +V+SH+ + Q+AVN+
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI
        F NSLGSQ +GYREL  + +++VQF+W+EECC ALQ  I REE+ RV        APGPDGFS GF+KGAW+ VGEDFC+AVLHFFETCYLP  VNATAI
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI

Query:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLL
        TLIPK  GAER++D+RPISCCNV+YK ISKILADRLR+WLPSFIS N +  + GR ++
Subjt:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLL

KAA0059841.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 2.1e-153 | % identity: 66.08
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EVFCVYASN+N++ R+LW  LVE T  WS PGVVMGDFNAIRVHS+A GGSP+ G+ME+FD+AI DADLVEP+VQGNWFTWTSKV GSG LRRLDR+LV
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
        N+  L AWP++ V+VLPWGISDHSP+L Y   +   + +SF FFNHW ED +F ++V+ +W R  GVSPLVSLMRNLH LKP LR  FGRHI+ LSEEV 
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
         +KE MDRAQR+VER+  S   SR AS+ATE FW+A+R EEASL QKSR+RWL+LGDQN  FFHRS+RSR+  NSLL +VDS+G +V+SH+ + Q+AVN+
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNAT
        FRNSLGSQ +GYREL  + ++++QF+W+EECC ALQ  I REE+ RV        APGPDGFS G FKG W+ VGEDFCD VLHFFETCYLP  VNAT
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNAT

KAA0062888.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa] | e value: 2.1e-161 | % identity: 53.7
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EVFCVYASN+N++ R+LWR LVEIT  WS P VVMGDFNAIRVH +A GGSP+ G+ME+FD+A  DADLVEP+VQGNWFTWTSKV GSG LRRLDRILV
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
        N++ L AWP+L               L+ Q +                ED +F ++V+ +W R  GVSPLVSLMRNL +LKP LR  FGRHI+ L+EEV 
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
         +KE+MDRAQREVE +P S   SR   +ATEAFW+A+R EEASL QKSR+RWLELGDQN AFFHR +RSR+  NSLL +VD++G +V+SH+ +VQ+AVN+
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI
        FRNSLGSQ +GYREL+ + +++VQF+W+EECC ALQ  I REE+ RV        APGPDGFS GFFKGAW+ V EDFCD VLHFFETCYLP  VNAT I
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI

Query:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLLITF------------------CFVR----------RW-----
        TLIPKR GAE+M+++RPISCCNV+YK ISKILADRLRVWLPSFI  N +  + GR ++                     C ++           W     
Subjt:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLLITF------------------CFVR----------RW-----

Query:  -----------WGIIMVLLDRL-----ASIGF------VRETLRQFGELLGLIANLDKSSMFVAGVDIEAATVLADSMGFVLGTLPVHYLAVPL
                   + ++M +L R+      S  F      V+ T   F + L +    D+ S+       EAA+ LA SMGFVLG LPV YL +PL
Subjt:  -----------WGIIMVLLDRL-----ASIGF------VRETLRQFGELLGLIANLDKSSMFVAGVDIEAATVLADSMGFVLGTLPVHYLAVPL

TYK28312.1 uncharacterized protein E5676_scaffold600G001370 [Cucumis melo var. makuwa] | e value: 2.4e-141 | % identity: 48.46
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EVF VYASN+N++ R+LW  LVEIT+ WS PG+VMGDFNAIRVHS+A GGSP+ G+ME+FD+AI DADLVEP+VQGNWFTWTSK               
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
                                           RR +SF FFNHW ED +F ++VS +W R  GVSPLVSL+RNL  LK  +R HFGRHI+ LSEEVR
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
        ++KE MDRAQREV+R+P S   SR A +ATEAFW+ +R EEASLHQK R+RWLELG+QN AFFHRS+ SR                              
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI
              SQ + YREL  + +++VQF+W+EECC ALQ  I REE+ RV        A G DGFS  FFKG W+ V EDFCD +LHFFETCYLP  VNAT I
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI

Query:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLL--------ITFCF-------------VRRWWGI-------IM
        TLIPKR GAE ++++RPIS CNV+YK ISKILADRL VWLPSFISGN +  + GR ++        + F F             VR+ + +       +M
Subjt:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLL--------ITFCF-------------VRRWWGI-------IM

Query:  VLLDRL------------------------------------ASIGFVRETLRQFGELLGLIANLDKSSMFVAGVDIEAATVLADSMGFVLGTLPVHYLA
          L R+                                     S+ F+RE+L++FGELLGL ANL KSS+FVAG   EAA+ LA SMGFVLG LPV YL 
Subjt:  VLLDRL------------------------------------ASIGFVRETLRQFGELLGLIANLDKSSMFVAGVDIEAATVLADSMGFVLGTLPVHYLA

Query:  VPLHSVGCVLRIVLRSS
        +PL +V      VL +S
Subjt:  VPLHSVGCVLRIVLRSS

XP_008466769.1 PREDICTED: uncharacterized protein LOC103504100 [Cucumis melo] | e value: 2.1e-153 | % identity: 66.08
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EVFCVYASN+N++ R+LW  LVE T  WS PGVVMGDFNAIRVHS+A GGSP+ G+ME+FD+AI DADLVEP+VQGNWFTWTSKV GSG LRRLDR+LV
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
        N+  L AWP++ V+VLPWGISDHSP+L Y   +   + +SF FFNHW ED +F ++V+ +W R  GVSPLVSLMRNLH LKP LR  FGRHI+ LSEEV 
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
         +KE MDRAQR+VER+  S   SR AS+ATE FW+A+R EEASL QKSR+RWL+LGDQN  FFHRS+RSR+  NSLL +VDS+G +V+SH+ + Q+AVN+
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNAT
        FRNSLGSQ +GYREL  + ++++QF+W+EECC ALQ  I REE+ RV        APGPDGFS G FKG W+ VGEDFCD VLHFFETCYLP  VNAT
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNAT

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CRZ6 uncharacterized protein LOC103504100 | e value: 1.0e-153 | % identity: 66.08
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EVFCVYASN+N++ R+LW  LVE T  WS PGVVMGDFNAIRVHS+A GGSP+ G+ME+FD+AI DADLVEP+VQGNWFTWTSKV GSG LRRLDR+LV
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
        N+  L AWP++ V+VLPWGISDHSP+L Y   +   + +SF FFNHW ED +F ++V+ +W R  GVSPLVSLMRNLH LKP LR  FGRHI+ LSEEV 
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
         +KE MDRAQR+VER+  S   SR AS+ATE FW+A+R EEASL QKSR+RWL+LGDQN  FFHRS+RSR+  NSLL +VDS+G +V+SH+ + Q+AVN+
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNAT
        FRNSLGSQ +GYREL  + ++++QF+W+EECC ALQ  I REE+ RV        APGPDGFS G FKG W+ VGEDFCD VLHFFETCYLP  VNAT
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNAT

A0A5A7TZS0 Reverse transcriptase domain-containing protein | e value: 1.6e-178 | % identity: 66.81
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EV CVYASN++ + R LWR L EIT  WS  GVVMGDFNAIRVHS+A GGSP+ G+MEEFD+AI DADLVEP+VQGNWFTWTSKV GSG LRRLDR+LV
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
        N++ L AWP++R++VLPWGISDHSP+L Y   +   R +SF FFNHW E+ +F ++V+ +W R  GVS LVSLMRNLH LKP+LR  FGRHI+ LSEEV 
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
         +KE MD AQREVER+P S   SR AS+ATE FW+A+R EEASL QKS+VRWL LGDQN AFFHRS+RSR+  NSLL +VDS+G +V+SH+ + Q+AVN+
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI
        F NSLGSQ +GYREL  + +++VQF+W+EECC ALQ  I REE+ RV        APGPDGFS GF+KGAW+ VGEDFC+AVLHFFETCYLP  VNATAI
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI

Query:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLL
        TLIPK  GAER++D+RPISCCNV+YK ISKILADRLR+WLPSFIS N +  + GR ++
Subjt:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLL

A0A5A7V275 Reverse transcriptase | e value: 1.0e-153 | % identity: 66.08
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EVFCVYASN+N++ R+LW  LVE T  WS PGVVMGDFNAIRVHS+A GGSP+ G+ME+FD+AI DADLVEP+VQGNWFTWTSKV GSG LRRLDR+LV
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
        N+  L AWP++ V+VLPWGISDHSP+L Y   +   + +SF FFNHW ED +F ++V+ +W R  GVSPLVSLMRNLH LKP LR  FGRHI+ LSEEV 
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
         +KE MDRAQR+VER+  S   SR AS+ATE FW+A+R EEASL QKSR+RWL+LGDQN  FFHRS+RSR+  NSLL +VDS+G +V+SH+ + Q+AVN+
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNAT
        FRNSLGSQ +GYREL  + ++++QF+W+EECC ALQ  I REE+ RV        APGPDGFS G FKG W+ VGEDFCD VLHFFETCYLP  VNAT
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNAT

A0A5A7V5J2 Non-LTR retroelement reverse transcriptase-like protein | e value: 1.0e-161 | % identity: 53.7
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EVFCVYASN+N++ R+LWR LVEIT  WS P VVMGDFNAIRVH +A GGSP+ G+ME+FD+A  DADLVEP+VQGNWFTWTSKV GSG LRRLDRILV
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
        N++ L AWP+L               L+ Q +                ED +F ++V+ +W R  GVSPLVSLMRNL +LKP LR  FGRHI+ L+EEV 
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
         +KE+MDRAQREVE +P S   SR   +ATEAFW+A+R EEASL QKSR+RWLELGDQN AFFHR +RSR+  NSLL +VD++G +V+SH+ +VQ+AVN+
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI
        FRNSLGSQ +GYREL+ + +++VQF+W+EECC ALQ  I REE+ RV        APGPDGFS GFFKGAW+ V EDFCD VLHFFETCYLP  VNAT I
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI

Query:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLLITF------------------CFVR----------RW-----
        TLIPKR GAE+M+++RPISCCNV+YK ISKILADRLRVWLPSFI  N +  + GR ++                     C ++           W     
Subjt:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLLITF------------------CFVR----------RW-----

Query:  -----------WGIIMVLLDRL-----ASIGF------VRETLRQFGELLGLIANLDKSSMFVAGVDIEAATVLADSMGFVLGTLPVHYLAVPL
                   + ++M +L R+      S  F      V+ T   F + L +    D+ S+       EAA+ LA SMGFVLG LPV YL +PL
Subjt:  -----------WGIIMVLLDRL-----ASIGF------VRETLRQFGELLGLIANLDKSSMFVAGVDIEAATVLADSMGFVLGTLPVHYLAVPL

A0A5D3DXQ8 Reverse transcriptase domain-containing protein | e value: 1.2e-141 | % identity: 48.46
Query:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV
        +EVF VYASN+N++ R+LW  LVEIT+ WS PG+VMGDFNAIRVHS+A GGSP+ G+ME+FD+AI DADLVEP+VQGNWFTWTSK               
Subjt:  MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILV

Query:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR
                                           RR +SF FFNHW ED +F ++VS +W R  GVSPLVSL+RNL  LK  +R HFGRHI+ LSEEVR
Subjt:  NEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVR

Query:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF
        ++KE MDRAQREV+R+P S   SR A +ATEAFW+ +R EEASLHQK R+RWLELG+QN AFFHRS+ SR                              
Subjt:  SSKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNF

Query:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI
              SQ + YREL  + +++VQF+W+EECC ALQ  I REE+ RV        A G DGFS  FFKG W+ V EDFCD +LHFFETCYLP  VNAT I
Subjt:  FRNSLGSQVVGYRELYLLREEVVQFKWTEECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAI

Query:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLL--------ITFCF-------------VRRWWGI-------IM
        TLIPKR GAE ++++RPIS CNV+YK ISKILADRL VWLPSFISGN +  + GR ++        + F F             VR+ + +       +M
Subjt:  TLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLL--------ITFCF-------------VRRWWGI-------IM

Query:  VLLDRL------------------------------------ASIGFVRETLRQFGELLGLIANLDKSSMFVAGVDIEAATVLADSMGFVLGTLPVHYLA
          L R+                                     S+ F+RE+L++FGELLGL ANL KSS+FVAG   EAA+ LA SMGFVLG LPV YL 
Subjt:  VLLDRL------------------------------------ASIGFVRETLRQFGELLGLIANLDKSSMFVAGVDIEAATVLADSMGFVLGTLPVHYLA

Query:  VPLHSVGCVLRIVLRSS
        +PL +V      VL +S
Subjt:  VPLHSVGCVLRIVLRSS

SwissProt top hits (e value, % identity, alignment)
P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 4.6e-10 | % identity: 19.9
Query:  RLDRILVNEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNH-WAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHI
        R+DRI ++   +    S  + + P+  SDH+ + +   +       ++  FN+   ED  F+  V   W          + +    D+  +       H+
Subjt:  RLDRILVNEQGLMAWPSLRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNH-WAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHI

Query:  RGLSEEVRSSKEDMDRAQREV---------ERDPGSVERSRDASV--ATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVD
        + L +E   S      A+ E          +R  GS +++         EA  +  +++      +SR++ L   D+   FF+   + +     +  +  
Subjt:  RGLSEEVRSSKEDMDRAQREV---------ERDPGSVERSRDASV--ATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVD

Query:  SEGIQVTSHERLVQVAVNFFRNSLGSQVV----------GYRELYLLREEVVQFKWT-EECCHALQALIRREEIMRVAPGPDGFSAGFFKGAWNTVGEDF
         +G  +   E +   A +F++N      +          G   +   R+E ++   T +E   AL+ +   +     +PG DG +  FF+  W+T+G DF
Subjt:  SEGIQVTSHERLVQVAVNFFRNSLGSQVV----------GYRELYLLREEVVQFKWT-EECCHALQALIRREEIMRVAPGPDGFSAGFFKGAWNTVGEDF

Query:  CDAVLHFFETCYLPPRVNATAITLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLLITFCFVR
           +   F+   LP       ++L+PK+     +K++RP+S  +  YK ++K ++ RL+  L   I  + +  + GR +      +R
Subjt:  CDAVLHFFETCYLPPRVNATAITLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNLTPLLLGRVLLITFCFVR

Arabidopsis top hits (e value, % identity, alignment)
AT1G40390.1 DNAse I-like superfamily protein | e value: 1.7e-12 | % identity: 24.61
Query:  NNNVDYRVLWRWLVEITFRWSI---PGVVMGDFN---AIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILVNEQ
        N   + R LW  +  ++    +   P +V+GDFN   ++  H      +     +E+    + D+DLV+   +G  +TW++    +  LR+LDR +VN  
Subjt:  NNNVDYRVLWRWLVEITFRWSI---PGVVMGDFN---AIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILVNEQ

Query:  GLMAWPSLRVSVLPWGISDHSP-MLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGV-SPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVRS
         L  +P+      P   SDH+  M+I        +  SF +F+  +    F   + + W + + V S + SL   L + K   RG   R    +  ++ S
Subjt:  GLMAWPSLRVSVLPWGISDHSP-MLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGV-SPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVRS

Query:  SKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGD
        +  D       V R     +     + A E+F+           QKSR++WL+ GD
Subjt:  SKEDMDRAQREVERDPGSVERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGD

AT1G43760.1 DNAse I-like superfamily protein | e value: 7.4e-40 | % identity: 28.68
Query:  VVMGDFNAIRV---HSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILVNEQGLMAWPSLRVSVLPWGISDHSPMLIYQ
        +++GDF+ I     H      S     +EEF   + D+DLV+   +G  +TW++    +  +R+LDR + N     ++PS        G+SDHSP +I  
Subjt:  VVMGDFNAIRV---HSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILVNEQGLMAWPSLRVSVLPWGISDHSPMLIYQ

Query:  GVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGV-SPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVRSSKEDMDRAQREVERDPGSVERSRDASVA
            +R    F +F+  +   TF   ++  W  ++ V S + SL  +L   K   +    +    +  + + + + ++  Q ++  +P S    R   VA
Subjt:  GVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGV-SPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVRSSKEDMDRAQREVERDPGSVERSRDASVA

Query:  TEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNFFRNSLGSQVVGYRELYLLR-EEVVQFKWT
         + +       E+   QKSR++WL+ GD N  FFH+ I +    N + F+   + ++V +  ++ ++ V ++ + LGS         + R +++  F+  
Subjt:  TEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNFFRNSLGSQVVGYRELYLLR-EEVVQFKWT

Query:  EECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAITLIPKRSGAERMKDYRPISCCNVVYKRI
        +     L AL   +EI           APGPD F+A FF  +W  V +    AV  FF T +L  R NATAITLIPK +G +++  +RP+SCC VVYK I
Subjt:  EECCHALQALIRREEIMRV--------APGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAITLIPKRSGAERMKDYRPISCCNVVYKRI

Query:  S
        +
Subjt:  S


Sequences
CDS sequence
ATGGAGGTGTTTTGTGTTTATGCCTCTAATAATAATGTGGACTATCGTGTGCTTTGGCGTTGGTTAGTTGAGATCACTTTTAGATGGTCGATTCCAGGTGTGGTTATGGG
TGATTTTAATGCAATTCGAGTGCACTCTAAAGCTTGTGGTGGGAGTCCAGTTACTGGTGATATGGAGGAGTTTGATATTGCAATTTGTGATGCTGACTTGGTTGAGCCAG
CTGTTCAGGGAAACTGGTTCACTTGGACTAGTAAGGTGTGTGGTTCAGGTTGGTTGCGTCGGCTTGATCGTATTTTGGTAAATGAGCAGGGGTTAATGGCTTGGCCTAGT
CTGCGTGTTTCAGTTTTGCCTTGGGGGATTTCTGATCATTCCCCTATGTTAATCTATCAGGGTGTTGAACAGCGGAGGCGTACTATTTCGTTTTGTTTCTTTAATCACTG
GGCGGAGGATACGACGTTTAGTGATATGGTGTCTTCGGTTTGGGTGAGAAGGTTGGGTGTGTCTCCGTTAGTGAGTCTTATGCGGAATTTGCATGACCTTAAACCTATGC
TTCGTGGACATTTTGGTAGGCATATAAGGGGCCTCAGTGAGGAGGTGCGCTCTTCAAAAGAGGATATGGATAGGGCCCAGCGGGAGGTTGAGCGGGATCCTGGGTCTGTG
GAGAGGAGCCGTGATGCTAGTGTTGCGACTGAGGCTTTTTGGTCAGCTATCCGACAGGAAGAAGCCTCTCTCCATCAGAAATCACGAGTTAGGTGGTTGGAGCTTGGGGA
TCAGAATTTTGCCTTTTTTCATCGCTCGATTCGTTCCCGTATTGGTTGTAATAGTTTGCTTTTTATTGTTGATTCTGAGGGTATTCAGGTGACATCCCATGAGAGGTTGG
TGCAGGTGGCTGTCAACTTTTTTCGTAATAGTCTTGGGTCCCAGGTGGTTGGTTATAGGGAGCTTTATCTTTTGAGGGAGGAGGTGGTTCAATTTAAGTGGACGGAGGAG
TGTTGTCATGCGTTACAGGCTCTGATTAGGCGTGAGGAGATTATGAGGGTGGCTCCTGGTCCTGATGGTTTTTCGGCAGGGTTCTTCAAAGGTGCTTGGAACACGGTTGG
TGAGGATTTTTGTGATGCTGTGCTGCATTTCTTTGAGACGTGTTATCTGCCTCCTAGGGTAAATGCTACTGCGATCACCCTCATTCCCAAACGTAGTGGAGCTGAACGTA
TGAAGGATTATAGGCCTATTTCGTGTTGTAATGTGGTTTACAAGCGTATTTCTAAGATTTTGGCGGATAGGCTTCGTGTGTGGCTTCCTTCTTTTATCAGTGGTAATCTA
ACGCCTTTGTTGTTGGGAAGAGTATTATTGATAACATTCTGCTTTGTTAGGAGATGGTGGGGGATTATCATGGTTCTTCTGGACCGCCTAGCTTCCATTGGCTTTGTGCG
TGAGACTCTTCGTCAGTTTGGGGAGTTATTGGGGTTAATTGCAAATCTAGATAAGAGCTCTATGTTTGTGGCGGGGGTTGACATTGAGGCTGCTACTGTGTTGGCGGATA
GTATGGGATTTGTGCTGGGTACTTTGCCTGTTCATTACCTAGCAGTTCCCTTACACTCGGTCGGCTGCGTTCTTCGGATTGTGCTCCGCTCATCCAGCGTATTACTAGTC
GGATTCGGTCTTGGTCGGCTAGAGTTTTATCCTTCGCTGGTAGGCTTCAGCTTGTGA
mRNA sequence
ATGGAGGTGTTTTGTGTTTATGCCTCTAATAATAATGTGGACTATCGTGTGCTTTGGCGTTGGTTAGTTGAGATCACTTTTAGATGGTCGATTCCAGGTGTGGTTATGGG
TGATTTTAATGCAATTCGAGTGCACTCTAAAGCTTGTGGTGGGAGTCCAGTTACTGGTGATATGGAGGAGTTTGATATTGCAATTTGTGATGCTGACTTGGTTGAGCCAG
CTGTTCAGGGAAACTGGTTCACTTGGACTAGTAAGGTGTGTGGTTCAGGTTGGTTGCGTCGGCTTGATCGTATTTTGGTAAATGAGCAGGGGTTAATGGCTTGGCCTAGT
CTGCGTGTTTCAGTTTTGCCTTGGGGGATTTCTGATCATTCCCCTATGTTAATCTATCAGGGTGTTGAACAGCGGAGGCGTACTATTTCGTTTTGTTTCTTTAATCACTG
GGCGGAGGATACGACGTTTAGTGATATGGTGTCTTCGGTTTGGGTGAGAAGGTTGGGTGTGTCTCCGTTAGTGAGTCTTATGCGGAATTTGCATGACCTTAAACCTATGC
TTCGTGGACATTTTGGTAGGCATATAAGGGGCCTCAGTGAGGAGGTGCGCTCTTCAAAAGAGGATATGGATAGGGCCCAGCGGGAGGTTGAGCGGGATCCTGGGTCTGTG
GAGAGGAGCCGTGATGCTAGTGTTGCGACTGAGGCTTTTTGGTCAGCTATCCGACAGGAAGAAGCCTCTCTCCATCAGAAATCACGAGTTAGGTGGTTGGAGCTTGGGGA
TCAGAATTTTGCCTTTTTTCATCGCTCGATTCGTTCCCGTATTGGTTGTAATAGTTTGCTTTTTATTGTTGATTCTGAGGGTATTCAGGTGACATCCCATGAGAGGTTGG
TGCAGGTGGCTGTCAACTTTTTTCGTAATAGTCTTGGGTCCCAGGTGGTTGGTTATAGGGAGCTTTATCTTTTGAGGGAGGAGGTGGTTCAATTTAAGTGGACGGAGGAG
TGTTGTCATGCGTTACAGGCTCTGATTAGGCGTGAGGAGATTATGAGGGTGGCTCCTGGTCCTGATGGTTTTTCGGCAGGGTTCTTCAAAGGTGCTTGGAACACGGTTGG
TGAGGATTTTTGTGATGCTGTGCTGCATTTCTTTGAGACGTGTTATCTGCCTCCTAGGGTAAATGCTACTGCGATCACCCTCATTCCCAAACGTAGTGGAGCTGAACGTA
TGAAGGATTATAGGCCTATTTCGTGTTGTAATGTGGTTTACAAGCGTATTTCTAAGATTTTGGCGGATAGGCTTCGTGTGTGGCTTCCTTCTTTTATCAGTGGTAATCTA
ACGCCTTTGTTGTTGGGAAGAGTATTATTGATAACATTCTGCTTTGTTAGGAGATGGTGGGGGATTATCATGGTTCTTCTGGACCGCCTAGCTTCCATTGGCTTTGTGCG
TGAGACTCTTCGTCAGTTTGGGGAGTTATTGGGGTTAATTGCAAATCTAGATAAGAGCTCTATGTTTGTGGCGGGGGTTGACATTGAGGCTGCTACTGTGTTGGCGGATA
GTATGGGATTTGTGCTGGGTACTTTGCCTGTTCATTACCTAGCAGTTCCCTTACACTCGGTCGGCTGCGTTCTTCGGATTGTGCTCCGCTCATCCAGCGTATTACTAGTC
GGATTCGGTCTTGGTCGGCTAGAGTTTTATCCTTCGCTGGTAGGCTTCAGCTTGTGA
Protein sequence
MEVFCVYASNNNVDYRVLWRWLVEITFRWSIPGVVMGDFNAIRVHSKACGGSPVTGDMEEFDIAICDADLVEPAVQGNWFTWTSKVCGSGWLRRLDRILVNEQGLMAWPS
LRVSVLPWGISDHSPMLIYQGVEQRRRTISFCFFNHWAEDTTFSDMVSSVWVRRLGVSPLVSLMRNLHDLKPMLRGHFGRHIRGLSEEVRSSKEDMDRAQREVERDPGSV
ERSRDASVATEAFWSAIRQEEASLHQKSRVRWLELGDQNFAFFHRSIRSRIGCNSLLFIVDSEGIQVTSHERLVQVAVNFFRNSLGSQVVGYRELYLLREEVVQFKWTEE
CCHALQALIRREEIMRVAPGPDGFSAGFFKGAWNTVGEDFCDAVLHFFETCYLPPRVNATAITLIPKRSGAERMKDYRPISCCNVVYKRISKILADRLRVWLPSFISGNL
TPLLLGRVLLITFCFVRRWWGIIMVLLDRLASIGFVRETLRQFGELLGLIANLDKSSMFVAGVDIEAATVLADSMGFVLGTLPVHYLAVPLHSVGCVLRIVLRSSSVLLV
GFGLGRLEFYPSLVGFSL