CuGenDBv2

PI0028329 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0028329
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr01:14992120..14994874
RNA-Seq Expression: PI0028329
Synteny: PI0028329
Gene Ontology terms: GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048521.1 uncharacterized protein E6C27_scaffold61G001420 [Cucumis melo var. makuwa] | e value 9.7e-155 | identity 47.56%
Query:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP
        MLL +W   IVPE+FVF  VPVWIKLGRIPMELWTE+G+ V+AS + KPL+LDLATKERCRLS+ARVC+E++  + +P+E+TV+L+GV+FIV V YEW+P
Subjt:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP

Query:  RMCNSCHSFSHSVGYF-TTVTRKKRELVLVRDKGKGMSMQPLQNSFG-----SLSELSEGENWALELQVNR----------PPPLQI-------------
        R CN C +F HS G    +V     + V+V     G   +    + G     S  +L EGE   +    NR          PPPLQ+             
Subjt:  RMCNSCHSFSHSVGYF-TTVTRKKRELVLVRDKGKGMSMQPLQNSFG-----SLSELSEGENWALELQVNR----------PPPLQI-------------

Query:  --------------------------VGSDGGMPIMSSDGGP--------------------------------LGICVD-------------------L
                                   G+        S+ G                                  G+ V+                   L
Subjt:  --------------------------VGSDGGMPIMSSDGGP--------------------------------LGICVD-------------------L

Query:  VDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------IWVRRLGVSPLV
        V+IT  WS  G+VMGDFNAIRVH EA GG+ I  +ME+FDLAIR+ADLVE SVQ NWFTWT                           +W R  GVSPLV
Subjt:  VDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------IWVRRLGVSPLV

Query:  SPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRI
        S +R+L  LKS +RRHFGRHI+ LS+EVR+AKEAMDRAQREV+R+P     SR A LATEAFW  +RLEEASL  K RIRWL+LG+QN+AF HRSV SR 
Subjt:  SPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRI

Query:  CRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAW
                                           SQ I Y ELSP+++++V+FRWSEEC  ALQ  I  +E+R+VLF+MDS KASG DGFS  FFK  W
Subjt:  CRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAW

Query:  NVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQEL
        +VV EDF DV+LHFF+TCYLP  VNAT ITLI KR GAE +E+FRPIS CNV+YKCISKILA RL VWLP FISGNQ AF+ GRSI +NILLCQEL
Subjt:  NVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQEL

KAA0059841.1 reverse transcriptase [Cucumis melo var. makuwa] | e value 8.5e-151 | identity 46.81%
Query:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP
        MLL +W  GIVPE+FVFN V VWI+LG+IPMELWTE+G+AV+ASA+ KP+SLDL TKER RLS+ARVCVE+EGG+++P+++TV+L GV+F V + YEW+P
Subjt:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP

Query:  RMCNSCHSFSHSVGYFTTVTRKK---RELV--LVRDKGKGMSMQP----LQNSFGSLSEL----------SEGENWALELQVNRPPPLQIVGSDGGMPIM
        R CN C +F HS    +     K    E+V  ++  K  G+  +P    +  SF  L E+          S+ + WAL +    PPPLQ+      +  +
Subjt:  RMCNSCHSFSHSVGYFTTVTRKK---RELV--LVRDKGKGMSMQP----LQNSFGSLSEL----------SEGENWALELQVNRPPPLQIVGSDGGMPIM

Query:  SSDGGP--------------------------------------------------LGICVD-------------------LVDITSRWSILGVVMGDFN
        SS  GP                                                   G+CV+                   LV+ TS WS  GVVMGDFN
Subjt:  SSDGGP--------------------------------------------------LGICVD-------------------LVDITSRWSILGVVMGDFN

Query:  AIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------------------------------------
        AIRVH EA GG+ I  +ME+FDLAIR+ADLVE SVQ NWFTWT                                                         
Subjt:  AIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------------------------------------

Query:  -------------------IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLE
                           +W R  GVSPLVS MR+LH LK  LRR FGRHI+ LS+EV  AKEAMDRAQR+VER+      SR A LATE FW A+RLE
Subjt:  -------------------IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLE

Query:  EASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIR
        EASLR KSRIRWLKLGDQN+ F HRSVRSR+ RN+LLS+VD +G RV+S+DG+ Q+A+N+FRNSLGSQ IGY ELSP+++++++F+WSEEC  ALQ  I 
Subjt:  EASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIR

Query:  HDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNAT
         +E+R+VLF+MDS KA GPDGFS G FK  W+VVGEDF DVVLHFF+TCYLP  VNAT
Subjt:  HDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNAT

KAA0062888.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa] | e value 2.0e-208 | identity 49.63%
Query:  MPTITILENGLICFQFRHPKSIEWIISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLS
        MPTITILEN LICFQFR P S+EWI+S GPWHLGGKPMLL +W LGIVPE+FVFN VPVWI+LGRIPMELWTE+ +A++AS + KP++LDLATKE  RLS
Subjt:  MPTITILENGLICFQFRHPKSIEWIISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLS

Query:  FARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEPRMCNSCHSFSHSVG-------------------------------------------------
        +ARVCV++EG  ++ +E+TV LRGV+F V V YEW+P+ CN C +  HS G                                                 
Subjt:  FARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEPRMCNSCHSFSHSVG-------------------------------------------------

Query:  ------------------YFTTVTRKKRELVLVRDKGKGMSMQPLQNSFGSLSELSEGENWALELQVNRPPPLQIVGSDGGMPIMSSDG-----------
                           FT VTRKK ELV VRD+GK + +  + NSFGSL E+ + + WAL +    PPPLQ+    G +  + S             
Subjt:  ------------------YFTTVTRKKRELVLVRDKGKGMSMQPLQNSFGSLSELSEGENWALELQVNRPPPLQIVGSDGGMPIMSSDG-----------

Query:  -------------------------------------------------------------GPL-----GICVD-------------------LVDITSR
                                                                     G L     G+CV+                   LV+ITS 
Subjt:  -------------------------------------------------------------GPL-----GICVD-------------------LVDITSR

Query:  WSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------------------------
        WS   VVMGDFNAIRVH+EA GG+ I  +ME+FDLA R+ADLVE SVQ NWFTWT                                             
Subjt:  WSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------------------------

Query:  -IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQ
         +W R  GVSPLVS MR+L +LK  LRR FGRHI+ L++EV  AKE MDRAQREVE +P     SR   LATEAFW A+RLEEASLR KSRIRWL+LGDQ
Subjt:  -IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQ

Query:  NSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASG
        N+AF HR VRSR+ RN+LLS+VD +G RV+S+DG+VQ+A+N+FRNSLGSQ IGY EL P+++++V+FRWSEEC  ALQ  I  +E+R+VLF+MDS KA G
Subjt:  NSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASG

Query:  PDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSIT
        PDGFS GFFK AW+VV EDF DVVLHFF+TCYLP  VNAT ITLI KR GAE+ME+FRPISCCNV+YKCISKILA RLRVWLP FI  NQ AF+ GRSI 
Subjt:  PDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSIT

Query:  NNILLCQELVGGYHGTSSPP
        +NILLCQELV GYH  S  P
Subjt:  NNILLCQELVGGYHGTSSPP

TYK28312.1 uncharacterized protein E5676_scaffold600G001370 [Cucumis melo var. makuwa] | e value 7.4e-155 | identity 47.56%
Query:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP
        MLL +W   IVPE+FVF  VPVWIKLGRIPMELWTE+G+ V+AS + KPL+LDLATKERCRLS+ARVC+E++  + +P+E+TV+L+GV+FIV V YEW+P
Subjt:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP

Query:  RMCNSCHSFSHSVGYF-TTVTRKKRELVLVRDKGKGMSMQPLQNSFG-----SLSELSEGENWALELQVNR----------PPPLQI-------------
        R CN C +F HS G    +V     + V+V     G   +    + G     S  +L EGE   +    NR          PPPLQ+             
Subjt:  RMCNSCHSFSHSVGYF-TTVTRKKRELVLVRDKGKGMSMQPLQNSFG-----SLSELSEGENWALELQVNR----------PPPLQI-------------

Query:  --------------------------VGSDGGMPIMSSDGGP--------------------------------LGICVD-------------------L
                                   G+        S+ G                                  G+ V+                   L
Subjt:  --------------------------VGSDGGMPIMSSDGGP--------------------------------LGICVD-------------------L

Query:  VDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------IWVRRLGVSPLV
        V+IT  WS  G+VMGDFNAIRVH EA GG+ I  +ME+FDLAIR+ADLVE SVQ NWFTWT                           +W R  GVSPLV
Subjt:  VDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------IWVRRLGVSPLV

Query:  SPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRI
        S +R+L  LKS +RRHFGRHI+ LS+EVR+AKEAMDRAQREV+R+P     SR A LATEAFW  +RLEEASL  K RIRWL+LG+QN+AF HRSV SR 
Subjt:  SPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRI

Query:  CRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAW
                                           SQ I Y ELSP+++++V+FRWSEEC  ALQ  I  +E+R+VLF+MDS KASG DGFS  FFK  W
Subjt:  CRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAW

Query:  NVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQEL
        +VV EDF DV+LHFF+TCYLP  VNAT ITLI KR GAE +E+FRPIS CNV+YKCISKILA RL VWLP FISGNQ AF+ GRSI +NILLCQEL
Subjt:  NVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQEL

XP_031745634.1 uncharacterized protein LOC116406053 [Cucumis sativus] | e value 1.4e-156 | identity 43.09%
Query:  LENGLICFQFRHPKSIEWIISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCV
        +ENGLICFQF+H KSIEWI+S GPWHLGGKPMLL +W  GIVPE+FVF+ V V IKLGRIP+ELWT++G+AV+ASAI KPLS+DLATKER RLS+AR+CV
Subjt:  LENGLICFQFRHPKSIEWIISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCV

Query:  EVEGGADLPSEVTVTLRGVEFIVPVTYEWEPRMCNSCHSFSHSVG-------------------------------------------------------
        E+   + +P+EVTV LRG EFIV VTYEW+P+ CN C SF HS                                                         
Subjt:  EVEGGADLPSEVTVTLRGVEFIVPVTYEWEPRMCNSCHSFSHSVG-------------------------------------------------------

Query:  -------------YFTTVTRKKRELVLVRDKGKGMSMQPLQNSFGSLSELSEGENWALELQVNRPPPLQIVGSDGGMPIMSSDGG--PLG----------
                      FT V RK R ++ + D+GK   +  + NSF +L E+ +G+ W L +    PPPL++   D  M ++S+ G   P+G          
Subjt:  -------------YFTTVTRKKRELVLVRDKGKGMSMQPLQNSFGSLSELSEGENWALELQVNRPPPLQIVGSDGGMPIMSSDGG--PLG----------

Query:  --------ICV--------------DLVDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT------------
                +CV               + +I++ W   G+V+GDFN IR+  EA GG                  L+  +VQ NWFTWT            
Subjt:  --------ICV--------------DLVDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT------------

Query:  --------------------------------IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLAR
                                         W +   VSP+V+ +R+L +LKS+LRRHFGRHIR +S++VR A + MDRA+RE+E +    E S  A 
Subjt:  --------------------------------IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLAR

Query:  LATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRW
        LAT +                                         N L S++D +G R+T++D + QV +N+F++SLGSQ I Y ELS  +EE+V+FRW
Subjt:  LATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRW

Query:  SEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKC
        +EEC  ALQ+ I   E+R+VLF+MD  KA GPDG+S GFFK AW VVGE F DVVLHFF+T Y P  VN TAITLI KR GA+R+EDF PISCC+V+YKC
Subjt:  SEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKC

Query:  ISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQELVGGYH
        IS+ILA RLRVWLP F+SGNQ AF+ GRSI +NILLCQELVG YH
Subjt:  ISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQELVGGYH

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TZS0 Reverse transcriptase domain-containing protein | e value 9.2e-143 | identity 58.86%
Query:  LVDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT--------------------------------------
        L +ITS WS LGVVMGDFNAIRVH EA GG+ I  +MEEFDLAIR+ADLVE SVQ NWFTWT                                      
Subjt:  LVDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT--------------------------------------

Query:  --------------------------------------IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVE
                                              +W R  GVS LVS MR+LH LK +LRR FGRHI+ LS+EV  AKEAMD AQREVER+P    
Subjt:  --------------------------------------IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVE

Query:  RSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEE
         SR A LATE FW A+RLEEASLR KS++RWL LGDQN+AF HRSVRSR+ RN+LLS+VD +G RV+S+DG+ Q+A+N+F NSLGSQ IGY ELSP++++
Subjt:  RSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEE

Query:  VVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCC
        +V+F+WSEEC  ALQ  I  +E+R+VLF+MDS KA GPDGFS GF+K AW+VVGEDF + VLHFF+TCYLP  VNATAITLI K  GAER+EDFRPISCC
Subjt:  VVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCC

Query:  NVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQELVGGYHGTSSPP
        NV+YKCISKILA RLR+WLP FIS NQ AF+ GRSI  NILLCQELVGGYH  S  P
Subjt:  NVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQELVGGYHGTSSPP

A0A5A7U4M4 Reverse transcriptase domain-containing protein | e value 4.7e-155 | identity 47.56%
Query:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP
        MLL +W   IVPE+FVF  VPVWIKLGRIPMELWTE+G+ V+AS + KPL+LDLATKERCRLS+ARVC+E++  + +P+E+TV+L+GV+FIV V YEW+P
Subjt:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP

Query:  RMCNSCHSFSHSVGYF-TTVTRKKRELVLVRDKGKGMSMQPLQNSFG-----SLSELSEGENWALELQVNR----------PPPLQI-------------
        R CN C +F HS G    +V     + V+V     G   +    + G     S  +L EGE   +    NR          PPPLQ+             
Subjt:  RMCNSCHSFSHSVGYF-TTVTRKKRELVLVRDKGKGMSMQPLQNSFG-----SLSELSEGENWALELQVNR----------PPPLQI-------------

Query:  --------------------------VGSDGGMPIMSSDGGP--------------------------------LGICVD-------------------L
                                   G+        S+ G                                  G+ V+                   L
Subjt:  --------------------------VGSDGGMPIMSSDGGP--------------------------------LGICVD-------------------L

Query:  VDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------IWVRRLGVSPLV
        V+IT  WS  G+VMGDFNAIRVH EA GG+ I  +ME+FDLAIR+ADLVE SVQ NWFTWT                           +W R  GVSPLV
Subjt:  VDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------IWVRRLGVSPLV

Query:  SPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRI
        S +R+L  LKS +RRHFGRHI+ LS+EVR+AKEAMDRAQREV+R+P     SR A LATEAFW  +RLEEASL  K RIRWL+LG+QN+AF HRSV SR 
Subjt:  SPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRI

Query:  CRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAW
                                           SQ I Y ELSP+++++V+FRWSEEC  ALQ  I  +E+R+VLF+MDS KASG DGFS  FFK  W
Subjt:  CRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAW

Query:  NVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQEL
        +VV EDF DV+LHFF+TCYLP  VNAT ITLI KR GAE +E+FRPIS CNV+YKCISKILA RL VWLP FISGNQ AF+ GRSI +NILLCQEL
Subjt:  NVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQEL

A0A5A7V275 Reverse transcriptase | e value 4.1e-151 | identity 46.81%
Query:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP
        MLL +W  GIVPE+FVFN V VWI+LG+IPMELWTE+G+AV+ASA+ KP+SLDL TKER RLS+ARVCVE+EGG+++P+++TV+L GV+F V + YEW+P
Subjt:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP

Query:  RMCNSCHSFSHSVGYFTTVTRKK---RELV--LVRDKGKGMSMQP----LQNSFGSLSEL----------SEGENWALELQVNRPPPLQIVGSDGGMPIM
        R CN C +F HS    +     K    E+V  ++  K  G+  +P    +  SF  L E+          S+ + WAL +    PPPLQ+      +  +
Subjt:  RMCNSCHSFSHSVGYFTTVTRKK---RELV--LVRDKGKGMSMQP----LQNSFGSLSEL----------SEGENWALELQVNRPPPLQIVGSDGGMPIM

Query:  SSDGGP--------------------------------------------------LGICVD-------------------LVDITSRWSILGVVMGDFN
        SS  GP                                                   G+CV+                   LV+ TS WS  GVVMGDFN
Subjt:  SSDGGP--------------------------------------------------LGICVD-------------------LVDITSRWSILGVVMGDFN

Query:  AIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------------------------------------
        AIRVH EA GG+ I  +ME+FDLAIR+ADLVE SVQ NWFTWT                                                         
Subjt:  AIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------------------------------------

Query:  -------------------IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLE
                           +W R  GVSPLVS MR+LH LK  LRR FGRHI+ LS+EV  AKEAMDRAQR+VER+      SR A LATE FW A+RLE
Subjt:  -------------------IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLE

Query:  EASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIR
        EASLR KSRIRWLKLGDQN+ F HRSVRSR+ RN+LLS+VD +G RV+S+DG+ Q+A+N+FRNSLGSQ IGY ELSP+++++++F+WSEEC  ALQ  I 
Subjt:  EASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIR

Query:  HDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNAT
         +E+R+VLF+MDS KA GPDGFS G FK  W+VVGEDF DVVLHFF+TCYLP  VNAT
Subjt:  HDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNAT

A0A5A7V5J2 Non-LTR retroelement reverse transcriptase-like protein | e value 9.6e-209 | identity 49.63%
Query:  MPTITILENGLICFQFRHPKSIEWIISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLS
        MPTITILEN LICFQFR P S+EWI+S GPWHLGGKPMLL +W LGIVPE+FVFN VPVWI+LGRIPMELWTE+ +A++AS + KP++LDLATKE  RLS
Subjt:  MPTITILENGLICFQFRHPKSIEWIISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLS

Query:  FARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEPRMCNSCHSFSHSVG-------------------------------------------------
        +ARVCV++EG  ++ +E+TV LRGV+F V V YEW+P+ CN C +  HS G                                                 
Subjt:  FARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEPRMCNSCHSFSHSVG-------------------------------------------------

Query:  ------------------YFTTVTRKKRELVLVRDKGKGMSMQPLQNSFGSLSELSEGENWALELQVNRPPPLQIVGSDGGMPIMSSDG-----------
                           FT VTRKK ELV VRD+GK + +  + NSFGSL E+ + + WAL +    PPPLQ+    G +  + S             
Subjt:  ------------------YFTTVTRKKRELVLVRDKGKGMSMQPLQNSFGSLSELSEGENWALELQVNRPPPLQIVGSDGGMPIMSSDG-----------

Query:  -------------------------------------------------------------GPL-----GICVD-------------------LVDITSR
                                                                     G L     G+CV+                   LV+ITS 
Subjt:  -------------------------------------------------------------GPL-----GICVD-------------------LVDITSR

Query:  WSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------------------------
        WS   VVMGDFNAIRVH+EA GG+ I  +ME+FDLA R+ADLVE SVQ NWFTWT                                             
Subjt:  WSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------------------------

Query:  -IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQ
         +W R  GVSPLVS MR+L +LK  LRR FGRHI+ L++EV  AKE MDRAQREVE +P     SR   LATEAFW A+RLEEASLR KSRIRWL+LGDQ
Subjt:  -IWVRRLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQ

Query:  NSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASG
        N+AF HR VRSR+ RN+LLS+VD +G RV+S+DG+VQ+A+N+FRNSLGSQ IGY EL P+++++V+FRWSEEC  ALQ  I  +E+R+VLF+MDS KA G
Subjt:  NSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASG

Query:  PDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSIT
        PDGFS GFFK AW+VV EDF DVVLHFF+TCYLP  VNAT ITLI KR GAE+ME+FRPISCCNV+YKCISKILA RLRVWLP FI  NQ AF+ GRSI 
Subjt:  PDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSIT

Query:  NNILLCQELVGGYHGTSSPP
        +NILLCQELV GYH  S  P
Subjt:  NNILLCQELVGGYHGTSSPP

A0A5D3DXQ8 Reverse transcriptase domain-containing protein | e value 3.6e-155 | identity 47.56%
Query:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP
        MLL +W   IVPE+FVF  VPVWIKLGRIPMELWTE+G+ V+AS + KPL+LDLATKERCRLS+ARVC+E++  + +P+E+TV+L+GV+FIV V YEW+P
Subjt:  MLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEP

Query:  RMCNSCHSFSHSVGYF-TTVTRKKRELVLVRDKGKGMSMQPLQNSFG-----SLSELSEGENWALELQVNR----------PPPLQI-------------
        R CN C +F HS G    +V     + V+V     G   +    + G     S  +L EGE   +    NR          PPPLQ+             
Subjt:  RMCNSCHSFSHSVGYF-TTVTRKKRELVLVRDKGKGMSMQPLQNSFG-----SLSELSEGENWALELQVNR----------PPPLQI-------------

Query:  --------------------------VGSDGGMPIMSSDGGP--------------------------------LGICVD-------------------L
                                   G+        S+ G                                  G+ V+                   L
Subjt:  --------------------------VGSDGGMPIMSSDGGP--------------------------------LGICVD-------------------L

Query:  VDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------IWVRRLGVSPLV
        V+IT  WS  G+VMGDFNAIRVH EA GG+ I  +ME+FDLAIR+ADLVE SVQ NWFTWT                           +W R  GVSPLV
Subjt:  VDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWT---------------------------IWVRRLGVSPLV

Query:  SPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRI
        S +R+L  LKS +RRHFGRHI+ LS+EVR+AKEAMDRAQREV+R+P     SR A LATEAFW  +RLEEASL  K RIRWL+LG+QN+AF HRSV SR 
Subjt:  SPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRI

Query:  CRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAW
                                           SQ I Y ELSP+++++V+FRWSEEC  ALQ  I  +E+R+VLF+MDS KASG DGFS  FFK  W
Subjt:  CRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAW

Query:  NVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQEL
        +VV EDF DV+LHFF+TCYLP  VNAT ITLI KR GAE +E+FRPIS CNV+YKCISKILA RL VWLP FISGNQ AF+ GRSI +NILLCQEL
Subjt:  NVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQEL

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e value 1.7e-08 | identity 24.37%
Query:  LHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGF
        L R ++ +  +N + +I + +G   T    +      ++++   +++    E+   L+     R ++E   +L   I   EI  ++ ++ + K+ GPDGF
Subjt:  LHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGF

Query:  SAGFFKDAWNVVGEDFSDVVLHFFDTC----YLPTEVNATAITLISK-RCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAG
        +A F++       E+    +L  F +      LP      +I LI K      + E+FRPIS  N+  K ++KILA R++  +   I  +Q  F+ G
Subjt:  SAGFFKDAWNVVGEDFSDVVLHFFDTC----YLPTEVNATAITLISK-RCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAG

P11369 LINE-1 retrotransposable element ORF2 protein | e value 6.8e-10 | identity 26.96%
Query:  DQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAI-NFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSK
        D+  A L +  R +I  N + +    E G +T+    +Q  I +F++    +++    E+   L+     + +++    L + I   EI  V+ ++ + K
Subjt:  DQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAI-NFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSK

Query:  ASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTC----YLPTEVNATAITLISK-RCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHA
        + GPDGFSA F++       ED   ++   F        LP       ITLI K +    ++E+FRPIS  N+  K ++KILA R++  +   I  +Q  
Subjt:  ASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTC----YLPTEVNATAITLISK-RCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHA

Query:  FVAG
        F+ G
Subjt:  FVAG

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value 4.6e-14 | identity 24.77%
Query:  KSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRK
        +SR++ L   D+ S F +   + +  R  +  +   +G  +   + +   A +F++N      I       L + +     SE     L+  I  DE+ +
Subjt:  KSRIRWLKLGDQNSAFLHRSVRSRICRNNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRK

Query:  VLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISG
         L  M  +K+ G DG +  FF+  W+ +G DF  V+   F    LP       ++L+ K+     ++++RP+S  +  YK ++K ++ RL+  L   I  
Subjt:  VLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISG

Query:  NQHAFVAGRSITNNILLCQELV
        +Q   V GR+I +N+ L ++L+
Subjt:  NQHAFVAGRSITNNILLCQELV

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein | e value 4.1e-26 | identity 30.95%
Query:  KSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDP--GFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLS
        K + R+ FG     +  + + A ++++  Q ++  +P         +AR     F  A+   E+  R KSRI+WL+ GD N+ F H+ + +   +N +  
Subjt:  KSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDP--GFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRICRNNLLS

Query:  IVDFEGGRVTSYDGLVQVAINFFRNSLGSQM-IGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGED
        +   +  RV +   + ++ + ++ + LGS   I   +    ++++  FR ++  +  L AL    EI   +F M  +KA GPD F+A FF ++W VV + 
Subjt:  IVDFEGGRVTSYDGLVQVAINFFRNSLGSQM-IGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGED

Query:  FSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCIS
            V  FF T +L    NATAITLI K  G +++  FRP+SCC VVYK I+
Subjt:  FSDVVLHFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCIS

AT2G01050.1 zinc ion binding;nucleic acid binding | e value 1.3e-11 | identity 25.6%
Query:  ITLFLIQLLIEKIWGKIEMPTITILENGLICFQFRHPKSIEWIISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASA
        I + ++   + ++W    + T+  L       +F   +     ++ GPW + G  +L+  W     P        PVW++L  IP   +    +  IA  
Subjt:  ITLFLIQLLIEKIWGKIEMPTITILENGLICFQFRHPKSIEWIISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASA

Query:  ICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEPRMCNSCHSFSHSV
        + +PL +D+ T    +  FARVC+EV      P + TV + G  +   V YE   ++C+SC  + H V
Subjt:  ICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEPRMCNSCHSFSHSV

AT2G07760.1 Zinc knuckle (CCHC-type) family protein | e value 2.9e-08 | identity 28.8%
Query:  IISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEV-TVTLR
        I   G WH+    M +  W      +    + +PVW+ L  IP  L++  GI+ IAS +  P++      +   +S A + VEVE     P  +  V  +
Subjt:  IISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEV-TVTLR

Query:  GVEFIVPVTYEWEPRMCNSCHSFSH
        G   +V V Y W P  C  C    H
Subjt:  GVEFIVPVTYEWEPRMCNSCHSFSH

AT5G28823.1 FUNCTIONS IN: molecular_function unknown | e value 4.0e-05 | identity 27.03%
Query:  WHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLR-GVEFIV
        WH+    M +  W      E      +P W+ L  IP +L++  GI  IAS I + +       +  ++  A++ VEV+     P  V +    G   +V
Subjt:  WHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGIAVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLR-GVEFIV

Query:  PVTYEWEPRMC
         V Y W P  C
Subjt:  PVTYEWEPRMC


Sequences
CDS sequence
ATGGAGACGACGAATTCACTCATTCCAGACACATGCCATAGCTTCCTCATCACACTCTTCCTCATTCAACTGCTTATTGAGAAAATTTGGGGGAAAATCGAAATGCCAAC
CATTACGATTCTTGAGAATGGGCTTATTTGCTTTCAATTTCGTCATCCCAAATCGATTGAGTGGATTATTTCCCATGGGCCATGGCATCTTGGAGGGAAACCTATGCTAC
TCCATAGATGGGTTTTAGGTATTGTCCCTGAAACCTTTGTCTTTAATTTTGTCCCTGTGTGGATTAAACTAGGCCGAATCCCTATGGAGTTGTGGACTGAGTCGGGTATT
GCAGTCATTGCTAGTGCTATTTGTAAACCTCTTTCTTTAGATTTGGCCACTAAAGAGAGATGTAGACTATCGTTTGCTAGGGTGTGTGTTGAAGTAGAGGGGGGTGCAGA
CTTGCCTTCTGAGGTCACAGTTACTCTGAGGGGTGTGGAATTTATTGTGCCAGTAACATATGAGTGGGAACCCCGAATGTGTAATTCCTGTCATTCTTTTAGTCATTCTG
TTGGATATTTCACTACTGTGACTCGTAAGAAGAGGGAGTTGGTATTAGTGAGAGACAAAGGGAAAGGAATGAGTATGCAGCCTTTGCAAAACTCCTTTGGTAGTCTTTCT
GAATTGAGTGAGGGAGAAAATTGGGCGTTGGAATTACAGGTTAATAGGCCCCCACCCTTACAGATTGTGGGTAGTGATGGTGGCATGCCTATTATGAGTTCGGATGGTGG
TCCGTTGGGGATATGTGTTGATTTAGTTGACATCACTTCTAGATGGTCGATCCTAGGTGTTGTCATGGGTGACTTTAATGCTATTCGTGTGCACTATGAAGCTTGTGGAG
GGAATTTGATTCCTGATGATATGGAGGAGTTTGATCTTGCTATTCGTGAGGCTGACTTGGTGGAGCTGTCTGTTCAGGATAACTGGTTCACTTGGACGATTTGGGTGAGA
AGATTAGGTGTTTCTCCGTTAGTGAGTCCTATGCGGAGTTTGCATGACCTTAAGTCTGTGCTTCGTAGACATTTTGGTAGGCATATCAGGGGCCTTAGTAAGGAGGTGCG
CTCTGCTAAAGAGGCTATGGATAGGGCCCAGCGTGAGGTTGAGAGGGATCCTGGGTTTGTTGAGAGGAGTCGTCTTGCTAGGCTAGCGACTGAGGCTTTTTGGTTGGCTA
TCCGTCTAGAAGAAGCCTCTCTTCGTCATAAATCTCGTATTAGGTGGTTGAAGCTTGGTGATCAAAATTCTGCCTTTCTTCACCGTTCGGTTCGTTCTCGTATTTGTCGT
AATAATCTACTTTCTATTGTGGATTTTGAGGGTGGTAGGGTGACGTCCTATGATGGGTTGGTTCAGGTGGCAATTAACTTCTTTCGTAATAGTTTGGGTTCCCAGATGAT
TGGCTATTGTGAGCTCTCTCCTTTGCTGGAGGAGGTGGTTAAGTTTAGGTGGTCTGAGGAGTGTTCTCATGCATTACAGGCTCTTATTAGGCATGATGAGATTAGGAAGG
TGTTATTCACTATGGATAGTAGCAAGGCTTCTGGTCCTGATGGTTTTTCGGCGGGGTTCTTCAAAGATGCTTGGAATGTGGTGGGTGAGGATTTTAGTGATGTTGTGTTG
CATTTCTTTGACACTTGTTATCTGCCTACTGAGGTTAATGCTACTGCGATAACCCTTATCTCCAAACGTTGTGGGGCTGAGCGTATGGAGGATTTTAGGCCTATTTCGTG
TTGTAATGTGGTCTATAAGTGCATTTCTAAGATTTTGGCTGGTAGGCTACGTGTGTGGCTTCCTTTTTTTATCAGTGGTAATCAGCATGCCTTTGTTGCTGGGAGGAGTA
TTACTAATAATATTCTACTTTGTCAAGAGTTGGTTGGGGGTTATCATGGTACTTCTAGTCCGCCTGGTGTGCGATGA
mRNA sequence
ATGGAGACGACGAATTCACTCATTCCAGACACATGCCATAGCTTCCTCATCACACTCTTCCTCATTCAACTGCTTATTGAGAAAATTTGGGGGAAAATCGAAATGCCAAC
CATTACGATTCTTGAGAATGGGCTTATTTGCTTTCAATTTCGTCATCCCAAATCGATTGAGTGGATTATTTCCCATGGGCCATGGCATCTTGGAGGGAAACCTATGCTAC
TCCATAGATGGGTTTTAGGTATTGTCCCTGAAACCTTTGTCTTTAATTTTGTCCCTGTGTGGATTAAACTAGGCCGAATCCCTATGGAGTTGTGGACTGAGTCGGGTATT
GCAGTCATTGCTAGTGCTATTTGTAAACCTCTTTCTTTAGATTTGGCCACTAAAGAGAGATGTAGACTATCGTTTGCTAGGGTGTGTGTTGAAGTAGAGGGGGGTGCAGA
CTTGCCTTCTGAGGTCACAGTTACTCTGAGGGGTGTGGAATTTATTGTGCCAGTAACATATGAGTGGGAACCCCGAATGTGTAATTCCTGTCATTCTTTTAGTCATTCTG
TTGGATATTTCACTACTGTGACTCGTAAGAAGAGGGAGTTGGTATTAGTGAGAGACAAAGGGAAAGGAATGAGTATGCAGCCTTTGCAAAACTCCTTTGGTAGTCTTTCT
GAATTGAGTGAGGGAGAAAATTGGGCGTTGGAATTACAGGTTAATAGGCCCCCACCCTTACAGATTGTGGGTAGTGATGGTGGCATGCCTATTATGAGTTCGGATGGTGG
TCCGTTGGGGATATGTGTTGATTTAGTTGACATCACTTCTAGATGGTCGATCCTAGGTGTTGTCATGGGTGACTTTAATGCTATTCGTGTGCACTATGAAGCTTGTGGAG
GGAATTTGATTCCTGATGATATGGAGGAGTTTGATCTTGCTATTCGTGAGGCTGACTTGGTGGAGCTGTCTGTTCAGGATAACTGGTTCACTTGGACGATTTGGGTGAGA
AGATTAGGTGTTTCTCCGTTAGTGAGTCCTATGCGGAGTTTGCATGACCTTAAGTCTGTGCTTCGTAGACATTTTGGTAGGCATATCAGGGGCCTTAGTAAGGAGGTGCG
CTCTGCTAAAGAGGCTATGGATAGGGCCCAGCGTGAGGTTGAGAGGGATCCTGGGTTTGTTGAGAGGAGTCGTCTTGCTAGGCTAGCGACTGAGGCTTTTTGGTTGGCTA
TCCGTCTAGAAGAAGCCTCTCTTCGTCATAAATCTCGTATTAGGTGGTTGAAGCTTGGTGATCAAAATTCTGCCTTTCTTCACCGTTCGGTTCGTTCTCGTATTTGTCGT
AATAATCTACTTTCTATTGTGGATTTTGAGGGTGGTAGGGTGACGTCCTATGATGGGTTGGTTCAGGTGGCAATTAACTTCTTTCGTAATAGTTTGGGTTCCCAGATGAT
TGGCTATTGTGAGCTCTCTCCTTTGCTGGAGGAGGTGGTTAAGTTTAGGTGGTCTGAGGAGTGTTCTCATGCATTACAGGCTCTTATTAGGCATGATGAGATTAGGAAGG
TGTTATTCACTATGGATAGTAGCAAGGCTTCTGGTCCTGATGGTTTTTCGGCGGGGTTCTTCAAAGATGCTTGGAATGTGGTGGGTGAGGATTTTAGTGATGTTGTGTTG
CATTTCTTTGACACTTGTTATCTGCCTACTGAGGTTAATGCTACTGCGATAACCCTTATCTCCAAACGTTGTGGGGCTGAGCGTATGGAGGATTTTAGGCCTATTTCGTG
TTGTAATGTGGTCTATAAGTGCATTTCTAAGATTTTGGCTGGTAGGCTACGTGTGTGGCTTCCTTTTTTTATCAGTGGTAATCAGCATGCCTTTGTTGCTGGGAGGAGTA
TTACTAATAATATTCTACTTTGTCAAGAGTTGGTTGGGGGTTATCATGGTACTTCTAGTCCGCCTGGTGTGCGATGA
Protein sequence
METTNSLIPDTCHSFLITLFLIQLLIEKIWGKIEMPTITILENGLICFQFRHPKSIEWIISHGPWHLGGKPMLLHRWVLGIVPETFVFNFVPVWIKLGRIPMELWTESGI
AVIASAICKPLSLDLATKERCRLSFARVCVEVEGGADLPSEVTVTLRGVEFIVPVTYEWEPRMCNSCHSFSHSVGYFTTVTRKKRELVLVRDKGKGMSMQPLQNSFGSLS
ELSEGENWALELQVNRPPPLQIVGSDGGMPIMSSDGGPLGICVDLVDITSRWSILGVVMGDFNAIRVHYEACGGNLIPDDMEEFDLAIREADLVELSVQDNWFTWTIWVR
RLGVSPLVSPMRSLHDLKSVLRRHFGRHIRGLSKEVRSAKEAMDRAQREVERDPGFVERSRLARLATEAFWLAIRLEEASLRHKSRIRWLKLGDQNSAFLHRSVRSRICR
NNLLSIVDFEGGRVTSYDGLVQVAINFFRNSLGSQMIGYCELSPLLEEVVKFRWSEECSHALQALIRHDEIRKVLFTMDSSKASGPDGFSAGFFKDAWNVVGEDFSDVVL
HFFDTCYLPTEVNATAITLISKRCGAERMEDFRPISCCNVVYKCISKILAGRLRVWLPFFISGNQHAFVAGRSITNNILLCQELVGGYHGTSSPPGVR
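
The protein sequence above appears to be the translation of the CDS sequence. The following is a minimal Python sketch for checking that correspondence, assuming the standard genetic code applies to this nuclear gene. The function name `translate` and the table names are illustrative and not part of CuGenDB. Only the first 30 and last 18 nucleotides of the CDS listed above are used here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest). Assumption: the standard code applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing characters outside T/C/A/G are reported as 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
print(translate("ATGGAGACGACGAATTCACTCATTCCAGAC"))  # METTNSLIPD
# Last 18 nt of the CDS listed above, ending in the TGA stop codon.
print(translate("CCGCCTGGTGTGCGATGA"))  # PPGVR
```

Joining the CDS lines into a single string and passing the result to `translate` would allow the full protein sequence to be compared in the same way.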