; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G010410 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G010410
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionCCHC-type domain-containing protein
Genome locationCmo_Chr15:6639234..6643852
RNA-Seq ExpressionCmoCh15G010410
SyntenyCmoCh15G010410
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017617594.1 PREDICTED: uncharacterized protein LOC108462111 [Gossypium arboreum]3.1e-10449.39Show/hide
Query:  EEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEME
        E YL++E KIE VFD +N+S+ +K+KL   +F DYA+IWW  L    RR+ E+P+ TW E+K ++R+R+IP +Y R L QKL  L QG+KSVE+YYKEME
Subjt:  EEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEME

Query:  TLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKE----------NSKRVMI---------IKNGQVVTDSEESDHD
          M  A + ED E T ARFL GLNR +A+ ++ Q Y ++ +++H+A K++  L+ + +          +  +V +         I  G + +        
Subjt:  TLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKE----------NSKRVMI---------IKNGQVVTDSEESDHD

Query:  ELVEEEIQENEEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDG
         +  +   E E+E E  S  +F DVF +EVP GLPP+RGIEH+IDFIPGA IPNR AYRANP ETKE+QRQV +L++KGY                    
Subjt:  ELVEEEIQENEEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDG

Query:  TWRMCVDCRAINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMC
                       +F DVF +EVP GLPP+RGIEH+IDFIPGA IPNR AY+ANP ETKE+Q+QV +L++KGY+RESLSPC+VPVLLVPKKDGTWRMC
Subjt:  TWRMCVDCRAINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMC

Query:  VDCRAINKITVKY
        VDC A+NKIT+KY
Subjt:  VDCRAINKITVKY

XP_022158198.1 uncharacterized protein LOC111024735 [Momordica charantia]3.4e-12755.67Show/hide
Query:  LQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLM
        LQYESKIEHVFDCNNFSEERKLKL VA+FCDYA IWWTSLK E RRNYEEP+ETWEELK LMRKRYIPKHYSR LKQKLY LQQGSKSVE+YYKEME LM
Subjt:  LQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLM

Query:  NRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS-------------------------------------------
        NRA IDED EDT A FLGGLNR LAHQVDRQ YFDM+ELLHLA KIEGQLAWEKENS                                           
Subjt:  NRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS-------------------------------------------

Query:  --------------------------------KRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEF------------DDV--FQDEVPKG
                                        KRVMIIKNG+VVTD EESD DEL+EE++QE+EEELEDGSHL              DDV   +D +   
Subjt:  --------------------------------KRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEF------------DDV--FQDEVPKG

Query:  LPPIRGIEHKIDFIPGAVIPNRSAYRAN----PTETKEIQRQVEELMEKG--------YIRESLSPCSVPVL--LVPKKDGTWRMCVDCRAINKITEFDD
           + G    +    G+     S +       PT+      +++ L + G         I  +L   +  +L  ++P   G   +    +   K  EF+D
Subjt:  LPPIRGIEHKIDFIPGAVIPNRSAYRAN----PTETKEIQRQVEELMEKG--------YIRESLSPCSVPVL--LVPKKDGTWRMCVDCRAINKITEFDD

Query:  VFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKITVKY
        VFQ+++PKGLP IRGIEH+IDFIPG  IPNR AYRANPTETKEIQRQVEELMEKGY+RESLSPCSVP++LVPKKDGTWRMCVDCRAINKITVKY
Subjt:  VFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKITVKY

XP_022925325.1 uncharacterized protein LOC111432615 [Cucurbita moschata]1.4e-11249.07Show/hide
Query:  MNQDGGNGGEINEARWREAQVGTITRLNQVISTLTDRMEQIEIAL----------------------------------------------EEYLQYESK
        MNQD GNGGEINEARWREAQVGTITRLNQVISTLTDRME+I+IAL                                              EEYLQYESK
Subjt:  MNQDGGNGGEINEARWREAQVGTITRLNQVISTLTDRMEQIEIAL----------------------------------------------EEYLQYESK

Query:  IEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLMNRACID
        IEHVFDCNNFSEERKLKL VA+FCDYAIIWWTSLK EWRRNYEEP+ETWEE K LMRKR                                         
Subjt:  IEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLMNRACID

Query:  EDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS-------------------------------------------------
                              DRQAYFDMQELLHLA KIEGQLAWEKENS                                                 
Subjt:  EDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS-------------------------------------------------

Query:  ---------KRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQR
                 KRV+IIKNGQVVTDSEESDHDELVEE+IQENEEELEDGS L         V +     + IE+ +D     +   R   +           
Subjt:  ---------KRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQR

Query:  QVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVD-CRAINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEE
            +++ G     +S   +  LL+P +       +        + EFDDVFQDEVPKGLPPIRGIEHKI+FI GAVI NR AYRANPTETKEIQR+VEE
Subjt:  QVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVD-CRAINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEE

Query:  LMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKI
        LMEKGYIRESLSPCSVP+LLV  K GTWRMCVDCRAINKI
Subjt:  LMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKI

XP_022972407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111470974 [Cucurbita maxima]7.8e-13253.41Show/hide
Query:  MEQIEIALEEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYT--------
        ME+IEIALEEYLQYESKIEHVFDCNNFSEERKLKL VA+FCDYAIIWWTSLK EWRRNYEEP+ETWEELKTLMRKRYIPKHYSR+LKQKLYT        
Subjt:  MEQIEIALEEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYT--------

Query:  LQQGSKSVEEYYKEMETLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKEN-------------------------
        LQQGSKSVEEYYKEMETLMNRACIDEDE+DT A FLGGLNRQLAHQVDRQAYFDMQELLHLA KI+GQLAWEKEN                         
Subjt:  LQQGSKSVEEYYKEMETLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKEN-------------------------

Query:  -----------------------------------------------------SKRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEFDDV
                                                             +KRVMIIKNGQVVTDSE+SDHD+L EE+IQE+EEELEDGS L     
Subjt:  -----------------------------------------------------SKRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEFDDV

Query:  FQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKK-----------DGTWRMCVDCRAINKI
            V + L   +  E+ +D     +   R   +  P            +++ G     +S   +  LL+P +           + +  M V  + +   
Subjt:  FQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKK-----------DGTWRMCVDCRAINKI

Query:  T------------------------------EFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCS
        T                              EFDDVFQDEVPKGLPPIRGIEHKIDFIPG +IPN+ AY A+PTETKEI+R++EELMEKGYIRESLSPCS
Subjt:  T------------------------------EFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCS

Query:  VPVLLVPKKDGTWRMCVDCRAINKITVK
        VP+LL+PKKDGTWRMC+D   +  I  K
Subjt:  VPVLLVPKKDGTWRMCVDCRAINKITVK

XP_023534142.1 uncharacterized protein LOC111795765 [Cucurbita pepo subsp. pepo]4.4e-12752.17Show/hide
Query:  MNQDGGNGGEINEARWREAQVGTITRLNQVISTLTDRMEQIEIAL-------------------------------------------------------
        MNQDGGNGGEINEARWREAQVGTITRLNQVISTLTDRME+IEIAL                                                       
Subjt:  MNQDGGNGGEINEARWREAQVGTITRLNQVISTLTDRMEQIEIAL-------------------------------------------------------

Query:  -------------------------------EEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRY
                                       EEYLQYESKIEHVFDCNNFSEERKLKL VA+FCDYAIIWWT LK EWRRNYEEP+ETWEEL+TLMRKRY
Subjt:  -------------------------------EEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRY

Query:  IPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS---------
        IPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLMNRACIDEDEEDT ARFLGGLNRQLAHQVDRQAYFDMQELLHLA KIEGQLAWEK+NS         
Subjt:  IPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS---------

Query:  ---------------------------------------------------------------------KRVMIIKNGQVVTDSEESDHDELVEEEIQEN
                                                                             KRVMIIKNGQVVTDSEESDHDELVEEEIQEN
Subjt:  ---------------------------------------------------------------------KRVMIIKNGQVVTDSEESDHDELVEEEIQEN

Query:  EEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVD-CR
        EEELEDGSHL            G P                                       +++ G     +S   +  LL+P +       +    
Subjt:  EEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVD-CR

Query:  AINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEI
            + EFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNR AYRANPTET+EI
Subjt:  AINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEI

TrEMBL top hitse value%identityAlignment
A0A5D3DGA7 Transposon Ty3-I Gag-Pol polyprotein2.0e-9348.97Show/hide
Query:  EEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEME
        E YLQ+E KIEHVFDCN FSE +K+KL +A+F +YA  W+  LK E RR  E+P+ETWEELK  MRKR++PKHY R LK KL +L+QG+KSV EYY+EME
Subjt:  EEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEME

Query:  TLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENSKRVMIIKNGQVVTDSEESDHDELVEEE---------IQEN
        TL+ RA I EDEEDT +RFLGGLN+++AH VDR   ++M+++ H A KIE QL  EKE SKR   +      + S   + D  V            +   
Subjt:  TLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENSKRVMIIKNGQVVTDSEESDHDELVEEE---------IQEN

Query:  EEELEDGSHLEFDDVFQDEVPKGLPPI-----RGIEH-KIDFIPGAVIPNRSAYRANPTETKEIQRQVEELME----KGYIRE--SLSPCSVPVLLVPKK
        + E E  +  +F+     EV +    I     +G  H   D I   V+  R+    +  E +E   Q+EE  E      YI E  S+S  +  VL V  K
Subjt:  EEELEDGSHLEFDDVFQDEVPKGLPPI-----RGIEH-KIDFIPGAVIPNRSAYRANPTETKEIQRQVEELME----KGYIRE--SLSPCSVPVLLVPKK

Query:  D-------------------GTWRMCVDCRAINKITEFDDVF-QDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRE
        +                       + +D  +   + EF+D+F  ++ P GLPP+RGIEH+IDFIPGA +PN +AYR NPTETKEIQRQVEELM+KGYIRE
Subjt:  D-------------------GTWRMCVDCRAINKITEFDDVF-QDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRE

Query:  SLSPCSVPVLLVPKKDGTWRMCVDCRAINKITVKY
        S+SPCSVPV+LVPKKDGTWRMCVDCRAINKITVKY
Subjt:  SLSPCSVPVLLVPKKDGTWRMCVDCRAINKITVKY

A0A6J1DVF2 uncharacterized protein LOC1110247351.6e-12755.67Show/hide
Query:  LQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLM
        LQYESKIEHVFDCNNFSEERKLKL VA+FCDYA IWWTSLK E RRNYEEP+ETWEELK LMRKRYIPKHYSR LKQKLY LQQGSKSVE+YYKEME LM
Subjt:  LQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLM

Query:  NRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS-------------------------------------------
        NRA IDED EDT A FLGGLNR LAHQVDRQ YFDM+ELLHLA KIEGQLAWEKENS                                           
Subjt:  NRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS-------------------------------------------

Query:  --------------------------------KRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEF------------DDV--FQDEVPKG
                                        KRVMIIKNG+VVTD EESD DEL+EE++QE+EEELEDGSHL              DDV   +D +   
Subjt:  --------------------------------KRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEF------------DDV--FQDEVPKG

Query:  LPPIRGIEHKIDFIPGAVIPNRSAYRAN----PTETKEIQRQVEELMEKG--------YIRESLSPCSVPVL--LVPKKDGTWRMCVDCRAINKITEFDD
           + G    +    G+     S +       PT+      +++ L + G         I  +L   +  +L  ++P   G   +    +   K  EF+D
Subjt:  LPPIRGIEHKIDFIPGAVIPNRSAYRAN----PTETKEIQRQVEELMEKG--------YIRESLSPCSVPVL--LVPKKDGTWRMCVDCRAINKITEFDD

Query:  VFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKITVKY
        VFQ+++PKGLP IRGIEH+IDFIPG  IPNR AYRANPTETKEIQRQVEELMEKGY+RESLSPCSVP++LVPKKDGTWRMCVDCRAINKITVKY
Subjt:  VFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKITVKY

A0A6J1EHL9 uncharacterized protein LOC1114326156.7e-11349.07Show/hide
Query:  MNQDGGNGGEINEARWREAQVGTITRLNQVISTLTDRMEQIEIAL----------------------------------------------EEYLQYESK
        MNQD GNGGEINEARWREAQVGTITRLNQVISTLTDRME+I+IAL                                              EEYLQYESK
Subjt:  MNQDGGNGGEINEARWREAQVGTITRLNQVISTLTDRMEQIEIAL----------------------------------------------EEYLQYESK

Query:  IEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLMNRACID
        IEHVFDCNNFSEERKLKL VA+FCDYAIIWWTSLK EWRRNYEEP+ETWEE K LMRKR                                         
Subjt:  IEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLMNRACID

Query:  EDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS-------------------------------------------------
                              DRQAYFDMQELLHLA KIEGQLAWEKENS                                                 
Subjt:  EDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENS-------------------------------------------------

Query:  ---------KRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQR
                 KRV+IIKNGQVVTDSEESDHDELVEE+IQENEEELEDGS L         V +     + IE+ +D     +   R   +           
Subjt:  ---------KRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQR

Query:  QVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVD-CRAINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEE
            +++ G     +S   +  LL+P +       +        + EFDDVFQDEVPKGLPPIRGIEHKI+FI GAVI NR AYRANPTETKEIQR+VEE
Subjt:  QVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVD-CRAINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEE

Query:  LMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKI
        LMEKGYIRESLSPCSVP+LLV  K GTWRMCVDCRAINKI
Subjt:  LMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKI

A0A6J1I4Q5 LOW QUALITY PROTEIN: uncharacterized protein LOC1114709743.8e-13253.41Show/hide
Query:  MEQIEIALEEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYT--------
        ME+IEIALEEYLQYESKIEHVFDCNNFSEERKLKL VA+FCDYAIIWWTSLK EWRRNYEEP+ETWEELKTLMRKRYIPKHYSR+LKQKLYT        
Subjt:  MEQIEIALEEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYT--------

Query:  LQQGSKSVEEYYKEMETLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKEN-------------------------
        LQQGSKSVEEYYKEMETLMNRACIDEDE+DT A FLGGLNRQLAHQVDRQAYFDMQELLHLA KI+GQLAWEKEN                         
Subjt:  LQQGSKSVEEYYKEMETLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKEN-------------------------

Query:  -----------------------------------------------------SKRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEFDDV
                                                             +KRVMIIKNGQVVTDSE+SDHD+L EE+IQE+EEELEDGS L     
Subjt:  -----------------------------------------------------SKRVMIIKNGQVVTDSEESDHDELVEEEIQENEEELEDGSHLEFDDV

Query:  FQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKK-----------DGTWRMCVDCRAINKI
            V + L   +  E+ +D     +   R   +  P            +++ G     +S   +  LL+P +           + +  M V  + +   
Subjt:  FQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKK-----------DGTWRMCVDCRAINKI

Query:  T------------------------------EFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCS
        T                              EFDDVFQDEVPKGLPPIRGIEHKIDFIPG +IPN+ AY A+PTETKEI+R++EELMEKGYIRESLSPCS
Subjt:  T------------------------------EFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCS

Query:  VPVLLVPKKDGTWRMCVDCRAINKITVK
        VP+LL+PKKDGTWRMC+D   +  I  K
Subjt:  VPVLLVPKKDGTWRMCVDCRAINKITVK

A0A6P4MA28 uncharacterized protein LOC1084621111.5e-10449.39Show/hide
Query:  EEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEME
        E YL++E KIE VFD +N+S+ +K+KL   +F DYA+IWW  L    RR+ E+P+ TW E+K ++R+R+IP +Y R L QKL  L QG+KSVE+YYKEME
Subjt:  EEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLMRKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEME

Query:  TLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKE----------NSKRVMI---------IKNGQVVTDSEESDHD
          M  A + ED E T ARFL GLNR +A+ ++ Q Y ++ +++H+A K++  L+ + +          +  +V +         I  G + +        
Subjt:  TLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKE----------NSKRVMI---------IKNGQVVTDSEESDHD

Query:  ELVEEEIQENEEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDG
         +  +   E E+E E  S  +F DVF +EVP GLPP+RGIEH+IDFIPGA IPNR AYRANP ETKE+QRQV +L++KGY                    
Subjt:  ELVEEEIQENEEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDG

Query:  TWRMCVDCRAINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMC
                       +F DVF +EVP GLPP+RGIEH+IDFIPGA IPNR AY+ANP ETKE+Q+QV +L++KGY+RESLSPC+VPVLLVPKKDGTWRMC
Subjt:  TWRMCVDCRAINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMC

Query:  VDCRAINKITVKY
        VDC A+NKIT+KY
Subjt:  VDCRAINKITVKY

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein6.9e-0634.29Show/hide
Query:  EEELEDGSHLEFDDVFQDEVPKGLP-PIRGIEHKIDFI-PGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDC
        E EL D  + EF D+  +   + LP PI+G+E +++       +P R+ Y   P + + +  ++ + ++ G IRES +  + PV+ VPKK+GT RM VD 
Subjt:  EEELEDGSHLEFDDVFQDEVPKGLP-PIRGIEHKIDFI-PGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDC

Query:  RAINK
        + +NK
Subjt:  RAINK

P0CT41 Transposon Tf2-12 polyprotein6.9e-0634.29Show/hide
Query:  EEELEDGSHLEFDDVFQDEVPKGLP-PIRGIEHKIDFI-PGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDC
        E EL D  + EF D+  +   + LP PI+G+E +++       +P R+ Y   P + + +  ++ + ++ G IRES +  + PV+ VPKK+GT RM VD 
Subjt:  EEELEDGSHLEFDDVFQDEVPKGLP-PIRGIEHKIDFI-PGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDC

Query:  RAINK
        + +NK
Subjt:  RAINK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.4e-1438.78Show/hide
Query:  EFDDVFQDEVPKGLPPIRGI--EHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKITV
        ++ ++ ++++P     I  I  +H I+  PGA +P    Y       +EI + V++L++  +I  S SPCS PV+LVPKKDGT+R+CVD R +NK T+
Subjt:  EFDDVFQDEVPKGLPPIRGI--EHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKITV

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.4e-1438.78Show/hide
Query:  EFDDVFQDEVPKGLPPIRGI--EHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKITV
        ++ ++ ++++P     I  I  +H I+  PGA +P    Y       +EI + V++L++  +I  S SPCS PV+LVPKKDGT+R+CVD R +NK T+
Subjt:  EFDDVFQDEVPKGLPPIRGI--EHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKITV

Q9UR07 Transposon Tf2-11 polyprotein6.9e-0634.29Show/hide
Query:  EEELEDGSHLEFDDVFQDEVPKGLP-PIRGIEHKIDFI-PGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDC
        E EL D  + EF D+  +   + LP PI+G+E +++       +P R+ Y   P + + +  ++ + ++ G IRES +  + PV+ VPKK+GT RM VD 
Subjt:  EEELEDGSHLEFDDVFQDEVPKGLP-PIRGIEHKIDFI-PGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDC

Query:  RAINK
        + +NK
Subjt:  RAINK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCAAGATGGAGGAAATGGAGGGGAGATCAATGAAGCTAGGTGGAGAGAAGCACAGGTCGGAACAATTACTCGTTTGAATCAAGTTATTAGTACATTGACAGACCG
GATGGAGCAAATTGAAATAGCCTTAGAGGAGTATTTGCAATATGAGAGCAAAATTGAGCATGTCTTCGATTGCAACAATTTCAGTGAAGAAAGAAAGTTGAAGCTTGTTG
TGGCTAAATTTTGTGATTATGCCATTATATGGTGGACATCTTTGAAATTAGAGTGGAGGAGAAATTATGAAGAACCAGTTGAAACGTGGGAGGAATTGAAGACATTAATG
AGAAAAAGGTACATTCCTAAGCATTATTCTCGAGTACTCAAGCAAAAACTCTATACTTTACAACAGGGATCCAAAAGTGTTGAAGAATATTACAAAGAAATGGAGACCCT
TATGAATAGAGCATGTATTGATGAAGATGAGGAAGATACAAATGCTAGATTTCTTGGTGGGTTAAACCGACAACTTGCTCATCAAGTGGATAGGCAAGCGTACTTTGATA
TGCAAGAATTATTACACCTTGCTGCCAAAATCGAAGGGCAACTAGCTTGGGAGAAGGAGAACTCCAAGAGAGTCATGATCATCAAGAATGGACAAGTTGTCACAGATAGT
GAGGAAAGTGACCATGATGAGCTAGTTGAAGAGGAAATCCAAGAGAATGAGGAGGAACTTGAAGATGGGAGTCATTTGGAATTTGATGACGTATTCCAAGATGAAGTCCC
TAAAGGACTACCTCCCATTAGAGGTATTGAACATAAAATAGACTTCATTCCAGGGGCAGTAATTCCTAATAGGTCAGCTTATAGAGCCAACCCAACGGAGACTAAAGAAA
TTCAAAGGCAAGTAGAGGAGCTTATGGAGAAGGGCTATATACGAGAGAGTTTGAGTCCATGTTCTGTACCAGTCTTATTGGTGCCAAAGAAAGATGGAACGTGGCGCATG
TGCGTGGATTGTAGAGCCATCAACAAAATCACAGAATTTGATGACGTATTCCAAGATGAAGTCCCTAAAGGACTACCTCCCATTAGAGGTATTGAACATAAAATAGACTT
CATTCCAGGGGCAGTAATTCCTAATAGGTCAGCTTATAGAGCCAACCCAACGGAGACTAAAGAAATTCAAAGGCAAGTAGAGGAGCTTATGGAGAAGGGCTATATACGAG
AGAGTTTGAGTCCATGTTCTGTACCAGTCTTATTGGTGCCAAAGAAAGATGGAACGTGGCGCATGTGCGTGGATTGTAGAGCCATCAACAAAATCACAGTAAAGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACCAAGATGGAGGAAATGGAGGGGAGATCAATGAAGCTAGGTGGAGAGAAGCACAGGTCGGAACAATTACTCGTTTGAATCAAGTTATTAGTACATTGACAGACCG
GATGGAGCAAATTGAAATAGCCTTAGAGGAGTATTTGCAATATGAGAGCAAAATTGAGCATGTCTTCGATTGCAACAATTTCAGTGAAGAAAGAAAGTTGAAGCTTGTTG
TGGCTAAATTTTGTGATTATGCCATTATATGGTGGACATCTTTGAAATTAGAGTGGAGGAGAAATTATGAAGAACCAGTTGAAACGTGGGAGGAATTGAAGACATTAATG
AGAAAAAGGTACATTCCTAAGCATTATTCTCGAGTACTCAAGCAAAAACTCTATACTTTACAACAGGGATCCAAAAGTGTTGAAGAATATTACAAAGAAATGGAGACCCT
TATGAATAGAGCATGTATTGATGAAGATGAGGAAGATACAAATGCTAGATTTCTTGGTGGGTTAAACCGACAACTTGCTCATCAAGTGGATAGGCAAGCGTACTTTGATA
TGCAAGAATTATTACACCTTGCTGCCAAAATCGAAGGGCAACTAGCTTGGGAGAAGGAGAACTCCAAGAGAGTCATGATCATCAAGAATGGACAAGTTGTCACAGATAGT
GAGGAAAGTGACCATGATGAGCTAGTTGAAGAGGAAATCCAAGAGAATGAGGAGGAACTTGAAGATGGGAGTCATTTGGAATTTGATGACGTATTCCAAGATGAAGTCCC
TAAAGGACTACCTCCCATTAGAGGTATTGAACATAAAATAGACTTCATTCCAGGGGCAGTAATTCCTAATAGGTCAGCTTATAGAGCCAACCCAACGGAGACTAAAGAAA
TTCAAAGGCAAGTAGAGGAGCTTATGGAGAAGGGCTATATACGAGAGAGTTTGAGTCCATGTTCTGTACCAGTCTTATTGGTGCCAAAGAAAGATGGAACGTGGCGCATG
TGCGTGGATTGTAGAGCCATCAACAAAATCACAGAATTTGATGACGTATTCCAAGATGAAGTCCCTAAAGGACTACCTCCCATTAGAGGTATTGAACATAAAATAGACTT
CATTCCAGGGGCAGTAATTCCTAATAGGTCAGCTTATAGAGCCAACCCAACGGAGACTAAAGAAATTCAAAGGCAAGTAGAGGAGCTTATGGAGAAGGGCTATATACGAG
AGAGTTTGAGTCCATGTTCTGTACCAGTCTTATTGGTGCCAAAGAAAGATGGAACGTGGCGCATGTGCGTGGATTGTAGAGCCATCAACAAAATCACAGTAAAGTATTGA
Protein sequenceShow/hide protein sequence
MNQDGGNGGEINEARWREAQVGTITRLNQVISTLTDRMEQIEIALEEYLQYESKIEHVFDCNNFSEERKLKLVVAKFCDYAIIWWTSLKLEWRRNYEEPVETWEELKTLM
RKRYIPKHYSRVLKQKLYTLQQGSKSVEEYYKEMETLMNRACIDEDEEDTNARFLGGLNRQLAHQVDRQAYFDMQELLHLAAKIEGQLAWEKENSKRVMIIKNGQVVTDS
EESDHDELVEEEIQENEEELEDGSHLEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRM
CVDCRAINKITEFDDVFQDEVPKGLPPIRGIEHKIDFIPGAVIPNRSAYRANPTETKEIQRQVEELMEKGYIRESLSPCSVPVLLVPKKDGTWRMCVDCRAINKITVKY