CuGenDBv2

CSPI07G13050 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G13050
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Gag-pol polyprotein
Genome location: Chr7:11493755..11495774
RNA-Seq Expression: CSPI07G13050
Synteny: CSPI07G13050
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
                  IPR043502 - DNA/RNA polymerase superfamily
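
The genome location above is given as a `chromosome:start..end` string. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention), the string can be split into its parts and the span of the locus computed. The names `Locus` and `parse_locus` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval as written in the record (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'Chr7:11493755..11495774'-style string into a Locus."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized locus string: {text!r}")
    chrom, start, end = match.groups()
    return Locus(chrom, int(start), int(end))


locus = parse_locus("Chr7:11493755..11495774")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the locus spans 2020 bp; with a half-open convention the figure would be 2019 bp.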


Homology
GenBank top hits (e-value, % identity, alignment)
AAO72413.1 gag-pol polyprotein [Oryza sativa Japonica Group] | e-value: 2.4e-193 | identity: 52.11%
Query:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENL
        G+KPHL HLRVFGC A+ K T PHLKKLDDRS+PVVY GVEEG KAHRL+DP R ++ +SRDVVF EN  W W+    +    TEF+V +   +++    
Subjt:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENL

Query:  EDAET----RVENVIPHATEIPAIGATGPSPP----------------------------------STNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL
        E A +    R       A + P +     +PP                                  S + PVR RSL  I I    V +  D+ + E LL
Subjt:  EDAET----RVENVIPHATEIPAIGATGPSPP----------------------------------STNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL

Query:  -------------KRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL
                       P W   M  EL++IEKN TWSL  LP  HK IGLKWVFKLKK+ + EV+KH ARLVAK      G+DF+EV APVARLDTVR IL
Subjt:  -------------KRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL

Query:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV
         +  ++ W+VHHLDVKS FLNG+LEEEVYV+Q EGF    K+H V++LSKALYGLRQAPRAWN +LDRS+KELGF +C Q+Q VYTR  G   ++V VYV
Subjt:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV

Query:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC
        D+LIV G S   +  FKQQMM EFEMS LGLL+YYLGIE                                        LHKD +G+ ++ TEYR ++GC
Subjt:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC

Query:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT
        LRYLL+TRP+LSY  G+AS++MERPT +H+K VKQILRYL+GTI+ GL ++ G    +I G++DSDL  D D R+STSGM FY N SLVSW+SQKQKTV 
Subjt:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT

Query:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK
        LSSC+AEF+AAT AAC ALWLR ++ E++  E R V L+VDNKSAIALMKNPVFHG  KHIDT +HFI+ECVE+G+I +EFV   EQRAD LTK L   +
Subjt:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK

Query:  LAAMCQLLGVRN
        LA    LLGVR+
Subjt:  LAAMCQLLGVRN
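
In BLAST-style pairwise output, the unlabeled middle line conventionally repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch, the identities and positives of a single block can be counted from the query line and that middle line. The function name `midline_stats` is a hypothetical helper, and the numbers apply only to the short final block above, not to the whole hit (the reported % identity is computed over the full alignment).

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Assumes the usual BLAST midline convention: a letter marks an identical
    residue, '+' marks a conservative substitution, a space marks neither.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same number of columns")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    # Positives include identical residues plus conservative substitutions.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the AAO72413.1 alignment, with the 'Query:  ' prefix removed.
query = "LAAMCQLLGVRN"
midline = "LA    LLGVR+"

identities, positives, columns = midline_stats(query, midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

For this block the count is 7 identical and 8 positive positions out of 12 columns.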

ABF94034.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] | e-value: 2.0e-216 | identity: 54.85%
Query:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENL
        GRKP L HL+VFGC A+ K T PHLKKLDDRS+P VY GVEEG KAHRL+DP R ++ +SRDVVF+EN+ W W      G+E T+F + +    +  E L
Subjt:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENL

Query:  EDAETRVENVIPH----------------ATEIPA-----------IGATGPSPPSTNT------------------PVRLRSLTHIYINTEEV-VGGDE
            T    V P+                A E+P+            G   P  PSTN+                  P R RSL  +      V +  DE
Subjt:  EDAETRVENVIPH----------------ATEIPA-----------IGATGPSPPSTNT------------------PVRLRSLTHIYINTEEV-VGGDE

Query:  QENEKLLK-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARL
         + E LL              +P W + M+ E+++IEKN TW L  LP GH+ IGLKWV+KLKK+ + E++KH ARLVAK      G+DFEEV APVARL
Subjt:  QENEKLLK-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARL

Query:  DTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEEC
        DTVRV+L + A++ W+VHHLDVKS FLNGELEEEVYV Q EGF    K+H V +L KALYGLRQAPRAWNI+LDRSL+ELGF +CTQ+Q VYTR  G + 
Subjt:  DTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEEC

Query:  VLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATE
        ++V VYVD+LIV G +  ++  FK+QMM EFEMS LGLL+YYLGIEV+Q +    LKQ  YAK++LSQFGM +CN+   P++P++QL KD EG P++ATE
Subjt:  VLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATE

Query:  YRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNS
        YR I+G LRYLL+TRP+LSY  G+AS++MERPT +H+K VKQILRY++GT+ +GL Y  G     I GY+DSDL  DLD R+ST GM FY+N+SLV+W+S
Subjt:  YRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNS

Query:  QKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLT
        QKQKTV LSSC+AEF+AATTAACQALWLR +++E+  +E ++V LFVDN+SAIALMKNPVFHG  KHIDT +HFI+ECV+ GQI+VEFV T EQRAD LT
Subjt:  QKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLT

Query:  KALTRVKLAAMCQLLGVRNLES
        K L   KL     LLGVR+L S
Subjt:  KALTRVKLAAMCQLLGVRNLES

EEC84282.1 hypothetical protein OsI_30754 [Oryza sativa Indica Group] | e-value: 4.1e-217 | identity: 56.66%
Query:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GVEEG KAHRL+DP   ++ +SRDV+F+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------

Query:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SPPSTNT------PVRLRSLTHIYINTEEV-VGGDEQENEKLL
                                    +L  +     +  P +T  PA G+ GP  SP S+        PVR RSL  I      V +  DE + + LL
Subjt:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SPPSTNT------PVRLRSLTHIYINTEEV-VGGDEQENEKLL

Query:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL
                      +P W   M  EL++IEKN+TW+LT LP GHKPIGLKWV+KLKK+ + EV+KH ARLVAK      G+DFEEV APVARLDTVRVIL
Subjt:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL

Query:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV
         + A++ WEVHHLDVKS FLNG+LEEEVYV Q EGF    ++H V RLSKALYGLRQAPRAWN +LD+ LKELGF +CTQ+Q VYTR +G+  V+V VYV
Subjt:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV

Query:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC
        D+LIV G + +++  FKQQMM EFEMS LGLLSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +G PI+ATEYR ++GC
Subjt:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC

Query:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT
        LRYLL+TRP+LSY  G+AS++MERPTT+H K VK ILRYL+GT+  GL +  G    DI G++DSDL  D+D R+ST GM FY+N SLVSW SQKQKTV 
Subjt:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT

Query:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK
        LSSC+AEF+AAT AAC ALWLR ++SE++  E + V LFVDNKSAIALMKNPVFHG  KHIDT +HFI+ECVE+GQI++EFV++ EQRAD +TK L   K
Subjt:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK

Query:  LAAMCQLLGVRNL
        LA    LLGVR+L
Subjt:  LAAMCQLLGVRNL

KAB8107251.1 hypothetical protein EE612_041900 [Oryza sativa] | e-value: 1.5e-216 | identity: 56.66%
Query:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GVEEG KAHRL+DP   ++ +SRDV+F+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------

Query:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SP------PSTNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL
                                    +L  +     +  P +T  PA G+ GP  SP       S   PVR RSL  I      V +  DE + + LL
Subjt:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SP------PSTNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL

Query:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL
                      +P W   M  EL++IEKN+TW+LT LP GHKPIGLKWV+KLKK+ + EV+KH ARLVAK      G+DFEEV APVARLDTVRVIL
Subjt:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL

Query:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV
         + A++ WEVHHLDVKS FLNG+LEEEVYV Q EGF    ++H V RLSKALYGLRQAPRAWN +LD+ LKELGF +CTQ+Q VYTR +G+  V+V VYV
Subjt:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV

Query:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC
        D+LIV G +  ++  FKQQMM EFEMS LGLLSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +G PI+ATEYR ++GC
Subjt:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC

Query:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT
        LRYLL+TRP+LSY  G+AS++MERPTT+H K VK ILRYL+GT+  GL +  G    DI G++DSDL  D+D R+ST GM FY+N SLVSW SQKQKTV 
Subjt:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT

Query:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK
        LSSC+AEF+AAT AAC ALWLR ++SE++  E + V LFVDNKSAIALMKNPVFHG  KHIDT +HFI+ECVE+GQI++EFV + EQRAD +TK L   K
Subjt:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK

Query:  LAAMCQLLGVRNL
        LA    LLGVR+L
Subjt:  LAAMCQLLGVRNL

KAG6523193.1 hypothetical protein ZIOFF_013046 [Zingiber officinale] | e-value: 3.5e-192 | identity: 55.59%
Query:  MGRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFEN
        MGRKPHLAHLRVFGCVAYVK TTPHLKKLDDRSSP+VY GVEEGCKAHRL+DP  +KLQ+SRDVVFQEN EW W    +  + + EF V D   +DE   
Subjt:  MGRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFEN

Query:  LEDAETRVENVIPHATE-IPAIGATGPSPPSTNT-----------PVRLRSLTHIYINTEEVVGGDEQENEKLLKRPNWYKVMENELKSIEKNNTWSLTK
        + D E   E+V P AT  +P  GA+ PS  S++T           PVR RS+  IY NTEEVVG DE+ENE +L                          
Subjt:  LEDAETRVENVIPHATE-IPAIGATGPSPPSTNT-----------PVRLRSLTHIYINTEEVVGGDEQENEKLLKRPNWYKVMENELKSIEKNNTWSLTK

Query:  LPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAKGIDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKV
                                                                                      + EE    Q    E        
Subjt:  LPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAKGIDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKV

Query:  HRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKG
                                L ELGF KC Q+  VYTR EGE  +LV VYVD+LIV G+ST ++NKFKQQMM EFEMS LGLLSYYLGIEVEQQK 
Subjt:  HRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKG

Query:  RILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIH
        RILL+Q TYAK+ILSQF MADCNATK+PMEPK QLHKD+EG P++ATEY+ ++GCLRYLL+TRP+LSY  GMAS+YMERPTT+H+KVVKQILRYL+G IH
Subjt:  RILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIH

Query:  FGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSA
        FGLTY KGP++  I GYSDSDL  DLDGRKSTSGM FY NESLVSWNSQKQKTV LSSC+AEF+AATTAAC ALWLR + SE+   +P+ VTLFVDNKSA
Subjt:  FGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSA

Query:  IALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLGVRNLESC
        IALMKNPVFHG  K+IDT FHFI+ECVENGQI+VEF+NT EQRADVLTKAL  VKL  M QLLGVR+LE C
Subjt:  IALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLGVRNLESC

TrEMBL top hits (e-value, % identity, alignment)
A0A0P0XB91 Os08g0125300 protein | e-value: 7.5e-217 | identity: 56.66%
Query:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GVEEG KAHRL+DP   ++ +SRDV+F+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------

Query:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SP------PSTNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL
                                    +L  +     +  P +T  PA G+ GP  SP       S   PVR RSL  I      V +  DE + + LL
Subjt:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SP------PSTNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL

Query:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL
                      +P W   M  EL++IEKN+TW+LT LP GHKPIGLKWV+KLKK+ + EV+KH ARLVAK      G+DFEEV APVARLDTVRVIL
Subjt:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL

Query:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV
         + A++ WEVHHLDVKS FLNG+LEEEVYV Q EGF    ++H V RLSKALYGLRQAPRAWN +LD+ LKELGF +CTQ+Q VYTR +G+  V+V VYV
Subjt:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV

Query:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC
        D+LIV G +  ++  FKQQMM EFEMS LGLLSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +G PI+ATEYR ++GC
Subjt:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC

Query:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT
        LRYLL+TRP+LSY  G+AS++MERPTT+H K VK ILRYL+GT+  GL +  G    DI G++DSDL  D+D R+ST GM FY+N SLVSW SQKQKTV 
Subjt:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT

Query:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK
        LSSC+AEF+AAT AAC ALWLR ++SE++  E + V LFVDNKSAIALMKNPVFHG  KHIDT +HFI+ECVE+GQI++EFV + EQRAD +TK L   K
Subjt:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK

Query:  LAAMCQLLGVRNL
        LA    LLGVR+L
Subjt:  LAAMCQLLGVRNL

B8BDZ6 Uncharacterized protein | e-value: 2.0e-217 | identity: 56.66%
Query:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GVEEG KAHRL+DP   ++ +SRDV+F+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------

Query:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SPPSTNT------PVRLRSLTHIYINTEEV-VGGDEQENEKLL
                                    +L  +     +  P +T  PA G+ GP  SP S+        PVR RSL  I      V +  DE + + LL
Subjt:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SPPSTNT------PVRLRSLTHIYINTEEV-VGGDEQENEKLL

Query:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL
                      +P W   M  EL++IEKN+TW+LT LP GHKPIGLKWV+KLKK+ + EV+KH ARLVAK      G+DFEEV APVARLDTVRVIL
Subjt:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL

Query:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV
         + A++ WEVHHLDVKS FLNG+LEEEVYV Q EGF    ++H V RLSKALYGLRQAPRAWN +LD+ LKELGF +CTQ+Q VYTR +G+  V+V VYV
Subjt:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV

Query:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC
        D+LIV G + +++  FKQQMM EFEMS LGLLSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +G PI+ATEYR ++GC
Subjt:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC

Query:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT
        LRYLL+TRP+LSY  G+AS++MERPTT+H K VK ILRYL+GT+  GL +  G    DI G++DSDL  D+D R+ST GM FY+N SLVSW SQKQKTV 
Subjt:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT

Query:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK
        LSSC+AEF+AAT AAC ALWLR ++SE++  E + V LFVDNKSAIALMKNPVFHG  KHIDT +HFI+ECVE+GQI++EFV++ EQRAD +TK L   K
Subjt:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK

Query:  LAAMCQLLGVRNL
        LA    LLGVR+L
Subjt:  LAAMCQLLGVRNL

Q0J8A6 Os08g0125300 protein | e-value: 7.5e-217 | identity: 56.66%
Query:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------
        GRKP L HLRVFGC+A+ K TTP+ KKLDDRS+P VY GVEEG KAHRL+DP   ++ +SRDV+F+EN+ W W+ VV+  +  TEF V +          
Subjt:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCS------

Query:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SP------PSTNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL
                                    +L  +     +  P +T  PA G+ GP  SP       S   PVR RSL  I      V +  DE + + LL
Subjt:  ------------------------DEFENLEDAETRVENVIPHATEIPAIGATGP--SP------PSTNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL

Query:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL
                      +P W   M  EL++IEKN+TW+LT LP GHKPIGLKWV+KLKK+ + EV+KH ARLVAK      G+DFEEV APVARLDTVRVIL
Subjt:  K-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL

Query:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV
         + A++ WEVHHLDVKS FLNG+LEEEVYV Q EGF    ++H V RLSKALYGLRQAPRAWN +LD+ LKELGF +CTQ+Q VYTR +G+  V+V VYV
Subjt:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV

Query:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC
        D+LIV G +  ++  FKQQMM EFEMS LGLLSYYLGIEV Q +  I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +G PI+ATEYR ++GC
Subjt:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC

Query:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT
        LRYLL+TRP+LSY  G+AS++MERPTT+H K VK ILRYL+GT+  GL +  G    DI G++DSDL  D+D R+ST GM FY+N SLVSW SQKQKTV 
Subjt:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT

Query:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK
        LSSC+AEF+AAT AAC ALWLR ++SE++  E + V LFVDNKSAIALMKNPVFHG  KHIDT +HFI+ECVE+GQI++EFV + EQRAD +TK L   K
Subjt:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK

Query:  LAAMCQLLGVRNL
        LA    LLGVR+L
Subjt:  LAAMCQLLGVRNL

Q10F84 Gag-pol polyprotein | e-value: 1.2e-193 | identity: 52.11%
Query:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENL
        G+KPHL HLRVFGC A+ K T PHLKKLDDRS+PVVY GVEEG KAHRL+DP R ++ +SRDVVF EN  W W+    +    TEF+V +   +++    
Subjt:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENL

Query:  EDAET----RVENVIPHATEIPAIGATGPSPP----------------------------------STNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL
        E A +    R       A + P +     +PP                                  S + PVR RSL  I I    V +  D+ + E LL
Subjt:  EDAET----RVENVIPHATEIPAIGATGPSPP----------------------------------STNTPVRLRSLTHIYINTEEV-VGGDEQENEKLL

Query:  -------------KRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL
                       P W   M  EL++IEKN TWSL  LP  HK IGLKWVFKLKK+ + EV+KH ARLVAK      G+DF+EV APVARLDTVR IL
Subjt:  -------------KRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL

Query:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV
         +  ++ W+VHHLDVKS FLNG+LEEEVYV+Q EGF    K+H V++LSKALYGLRQAPRAWN +LDRS+KELGF +C Q+Q VYTR  G   ++V VYV
Subjt:  VLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYV

Query:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC
        D+LIV G S   +  FKQQMM EFEMS LGLL+YYLGIE                                        LHKD +G+ ++ TEYR ++GC
Subjt:  DNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGC

Query:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT
        LRYLL+TRP+LSY  G+AS++MERPT +H+K VKQILRYL+GTI+ GL ++ G    +I G++DSDL  D D R+STSGM FY N SLVSW+SQKQKTV 
Subjt:  LRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVT

Query:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK
        LSSC+AEF+AAT AAC ALWLR ++ E++  E R V L+VDNKSAIALMKNPVFHG  KHIDT +HFI+ECVE+G+I +EFV   EQRAD LTK L   +
Subjt:  LSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVK

Query:  LAAMCQLLGVRN
        LA    LLGVR+
Subjt:  LAAMCQLLGVRN

Q10RM4 Retrotransposon protein, putative, unclassified | e-value: 9.8e-217 | identity: 54.85%
Query:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENL
        GRKP L HL+VFGC A+ K T PHLKKLDDRS+P VY GVEEG KAHRL+DP R ++ +SRDVVF+EN+ W W      G+E T+F + +    +  E L
Subjt:  GRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENL

Query:  EDAETRVENVIPH----------------ATEIPA-----------IGATGPSPPSTNT------------------PVRLRSLTHIYINTEEV-VGGDE
            T    V P+                A E+P+            G   P  PSTN+                  P R RSL  +      V +  DE
Subjt:  EDAETRVENVIPH----------------ATEIPA-----------IGATGPSPPSTNT------------------PVRLRSLTHIYINTEEV-VGGDE

Query:  QENEKLLK-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARL
         + E LL              +P W + M+ E+++IEKN TW L  LP GH+ IGLKWV+KLKK+ + E++KH ARLVAK      G+DFEEV APVARL
Subjt:  QENEKLLK-------------RPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARL

Query:  DTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEEC
        DTVRV+L + A++ W+VHHLDVKS FLNGELEEEVYV Q EGF    K+H V +L KALYGLRQAPRAWNI+LDRSL+ELGF +CTQ+Q VYTR  G + 
Subjt:  DTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEEC

Query:  VLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATE
        ++V VYVD+LIV G +  ++  FK+QMM EFEMS LGLL+YYLGIEV+Q +    LKQ  YAK++LSQFGM +CN+   P++P++QL KD EG P++ATE
Subjt:  VLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATE

Query:  YRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNS
        YR I+G LRYLL+TRP+LSY  G+AS++MERPT +H+K VKQILRY++GT+ +GL Y  G     I GY+DSDL  DLD R+ST GM FY+N+SLV+W+S
Subjt:  YRNIVGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNS

Query:  QKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLT
        QKQKTV LSSC+AEF+AATTAACQALWLR +++E+  +E ++V LFVDN+SAIALMKNPVFHG  KHIDT +HFI+ECV+ GQI+VEFV T EQRAD LT
Subjt:  QKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLT

Query:  KALTRVKLAAMCQLLGVRNLES
        K L   KL     LLGVR+L S
Subjt:  KALTRVKLAAMCQLLGVRNLES

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein | e-value: 7.7e-86 | identity: 29.26%
Query:  RKPHLAHLRVFGCVAYVKTTTPHLK----KLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVF-----------------------QENLEWPWN
        +KP+L HLRVFG   YV     H+K    K DD+S   ++ G E      +L+D   EK  ++RDVV                         EN  +P +
Subjt:  RKPHLAHLRVFGCVAYVKTTTPHLK----KLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVF-----------------------QENLEWPWN

Query:  -------------------EVVSDGKEI--------------TEFQVMDQFCSDEFENLEDA---------ETRVENVIPHATEIPAIG-----------
                           + + D KE               TEF    + C D  + L+D+         E++      H  E    G           
Subjt:  -------------------EVVSDGKEI--------------TEFQVMDQFCSDEFENLEDA---------ETRVENVIPHATEIPAIG-----------

Query:  ----ATGPSPPSTNTPVRLRSLTHIYINTEEVVGGDEQEN--EKLL---------------------KRPNWYKVMENELKSIEKNNTWSLTKLPPGHKP
              G   P+ N  + + +     + T+  +  +E++N   K++                      + +W + +  EL + + NNTW++TK P     
Subjt:  ----ATGPSPPSTNTPVRLRSLTHIYINTEEVVGGDEQEN--EKLL---------------------KRPNWYKVMENELKSIEKNNTWSLTKLPPGHKP

Query:  IGLKWVFKLKKDPSVEVVKHNARLVAKG------IDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVH
        +  +WVF +K +     +++ ARLVA+G      ID+EE  APVAR+ + R IL LV   + +VH +DVK+ FLNG L+EE+Y+   +G  +      V 
Subjt:  IGLKWVFKLKKDPSVEVVKHNARLVAKG------IDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVH

Query:  RLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEG--EECVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQK
        +L+KA+YGL+QA R W    +++LKE  F   +  + +Y   +G   E + V +YVD++++A     ++N FK+ +M +F M+ L  + +++GI +E Q+
Subjt:  RLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEG--EECVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQK

Query:  GRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRY-LLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGT
         +I L Q  Y K+ILS+F M +CNA   P+  K   ++ +       T  R+++GCL Y +L TRP+L+    + S+Y  +  +  ++ +K++LRYL+GT
Subjt:  GRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRY-LLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGT

Query:  IHFGLTYTKG-PRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNE-SLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVD
        I   L + K    +  I+GY DSD       RKST+G  F + + +L+ WN+++Q +V  SS +AE++A   A  +ALWL+ +++ I       + ++ D
Subjt:  IHFGLTYTKG-PRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNE-SLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLRCIVSEIVRMEPRSVTLFVD

Query:  NKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLGV
        N+  I++  NP  H   KHID  +HF +E V+N  I +E++ T  Q AD+ TK L   +   +   LG+
Subjt:  NKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 2.8e-104 | identity: 34.78%
Query:  AHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQEN---LEWPWNEVVSDG-----------------KEITEF
        +HL+VFGC A+         KLDD+S P ++ G  +    +RL+DP ++K+  SRDVVF+E+        +E V +G                  E T  
Subjt:  AHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQEN---LEWPWNEVVSDG-----------------KEITEF

Query:  QVMDQFCSDEFENLEDAETRVENVIPHATEIPAIGATGPSPPSTNTPVRLRSLTHIYINTEEVVGGDEQENEKL---LKRP---NWYKVMENELKSIEKN
        +V +Q      E +E  E   E V     E P  G     P   +   R+ S    Y +TE V+  D++E E L   L  P      K M+ E++S++KN
Subjt:  QVMDQFCSDEFENLEDAETRVENVIPHATEIPAIGATGPSPPSTNTPVRLRSLTHIYINTEEVVGGDEQENEKL---LKRP---NWYKVMENELKSIEKN

Query:  NTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVA------KGIDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQ
         T+ L +LP G +P+  KWVFKLKKD   ++V++ ARLV       KGIDF+E+ +PV ++ ++R IL L A+   EV  LDVK+ FL+G+LEEE+Y+ Q
Subjt:  NTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVA------KGIDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQ

Query:  SEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGE-ECVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGL
         EGFEV  KKH V +L+K+LYGL+QAPR W ++ D  +K   + K      VY +   E   +++ +YVD++++ G     + K K  +   F+M  LG 
Subjt:  SEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGE-ECVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGL

Query:  LSYYLGIEV--EQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDM------EGAPIEATEYRNIVGCLRY-LLNTRPNLSYVFGMASKYM
            LG+++  E+   ++ L Q  Y +R+L +F M +      P+    +L K M      E   +    Y + VG L Y ++ TRP++++  G+ S+++
Subjt:  LSYYLGIEV--EQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDM------EGAPIEATEYRNIVGCLRY-LLNTRPNLSYVFGMASKYM

Query:  ERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLR
        E P   H++ VK ILRYLRGT    L +  G     + GY+D+D+  D+D RKS++G  F  +   +SW S+ QK V LS+ +AE+IAAT    + +WL+
Subjt:  ERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATTAACQALWLR

Query:  CIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLGVRN
          + E+  +  +   ++ D++SAI L KN ++H   KHID  +H+I+E V++  + V  ++T E  AD+LTK + R K     +L+G+ +
Subjt:  CIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLGVRN

P25600 Putative transposon Ty5-1 protein YCL074W | e-value: 2.6e-33 | identity: 31.02%
Query:  LDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEK
        +DV + FLN  ++E +YV Q  GF        V  L   +YGL+QAP  WN  ++ +LK++GF +   +  +Y RS  +  + + VYVD+L+VA  S + 
Subjt:  LDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEK

Query:  VNKFKQQMMAEFEMSHLGLLSYYLGIEVEQ-QKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNT-RPN
         ++ KQ++   + M  LG +  +LG+ + Q   G I L    Y  +  S+  +     T+ P+     L +       + T Y++IVG L +  NT RP+
Subjt:  VNKFKQQMMAEFEMSHLGLLSYYLGIEVEQ-QKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNT-RPN

Query:  LSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQK-TVTLSSCKAEFI
        +SY   + S+++  P  IH +  +++LRYL  T    L Y  G  Q  +  Y D+   +  D   ST G    L  + V+W+S+K K  + + S +AE+I
Subjt:  LSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQK-TVTLSSCKAEFI

Query:  AAT
         A+
Subjt:  AAT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 3.7e-80 | identity: 34.47%
Query:  LKRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPI-GLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVILVLVANQSWEVH
        LK   W   M +E+ +   N+TW L   PP H  I G +W+F  K +    + ++ ARLVAK      G+D+ E  +PV +  ++R++L +  ++SW + 
Subjt:  LKRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPI-GLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVILVLVANQSWEVH

Query:  HLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTE
         LDV + FL G L ++VY++Q  GF   ++ + V +L KALYGL+QAPRAW ++L   L  +GF        ++    G+  V + VYVD++++ GN   
Subjt:  HLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTE

Query:  KVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNL
         ++     +   F +     L Y+LGIE ++    + L Q  Y   +L++  M        PM P  +L         + TEYR IVG L+YL  TRP++
Subjt:  KVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNL

Query:  SYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAA
        SY     S++M  PT  H + +K+ILRYL GT + G+   KG     +  YSD+D   D D   ST+G   YL    +SW+S+KQK V  SS +AE+ + 
Subjt:  SYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAA

Query:  TTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLGV
           + +  W+  +++E+     R   ++ DN  A  L  NPVFH   KHI   +HFI+  V++G + V  V+T +Q AD LTK L+R         +GV
Subjt:  TTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    7.0e-79    34.93
Query:  LKRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPI-GLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVILVLVANQSWEVH
        +K   W + M +E+ +   N+TW L   PP    I G +W+F  K +    + ++ ARLVAK      G+D+ E  +PV +  ++R++L +  ++SW + 
Subjt:  LKRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPI-GLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVILVLVANQSWEVH

Query:  HLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTE
         LDV + FL G L +EVY++Q  GF   ++   V RL KA+YGL+QAPRAW ++L   L  +GF        ++    G   + + VYVD++++ GN T 
Subjt:  HLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTE

Query:  KVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPM--EPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRP
         +      +   F +     L Y+LGIE ++    + L Q  Y   +L++  M        PM   PK  LH   +  P + TEYR IVG L+YL  TRP
Subjt:  KVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPM--EPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRP

Query:  NLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFI
        +LSY     S+YM  PT  H+  +K++LRYL GT   G+   KG     +  YSD+D   D D   ST+G   YL    +SW+S+KQK V  SS +AE+ 
Subjt:  NLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFI

Query:  AATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLG
        +    + +  W+  +++E+         ++ DN  A  L  NPVFH   KHI   +HFI+  V++G + V  V+T +Q AD LTK L+RV      + +G
Subjt:  AATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLG

Query:  V
        V
Subjt:  V

Arabidopsis top hits    e value    %identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    3.7e-75    33.7
Query:  WYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKS
        W   M++E+ ++E  +TW +  LPP  KPIG KWV+K+K +    + ++ ARLVAK      GIDF E  +PV +L +V++IL + A  ++ +H LD+ +
Subjt:  WYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKS

Query:  TFLNGELEEEVYVTQSEGFEVPN----KKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKV
         FLNG+L+EE+Y+    G+          + V  L K++YGL+QA R W ++   +L   GF +       + +      + V VYVD++I+  N+   V
Subjt:  TFLNGELEEEVYVTQSEGFEVPN----KKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTRSEGEECVLVEVYVDNLIVAGNSTEKV

Query:  NKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSY
        ++ K Q+ + F++  LG L Y+LG+E+ +    I + Q  YA  +L + G+  C  +  PM+P         G  ++A  YR ++G L YL  TR ++S+
Subjt:  NKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNIVGCLRYLLNTRPNLSY

Query:  VFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATT
             S++ E P   H + V +IL Y++GT+  GL Y+    +  +  +SD+   S  D R+ST+G   +L  SL+SW S+KQ+ V+ SS +AE+ A + 
Subjt:  VFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAEFIAATT

Query:  AACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQE
        A  + +WL     E+     +   LF DN +AI +  N VFH   KHI++  H ++E
Subjt:  AACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQE

ATMG00240.1 Gag-Pol-related retrotransposon family protein    1.5e-04    31.17
Query:  YLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSG
        YL  TRP+L++     S++     T   + V ++L Y++GT+  GL Y+       +  ++DSD  S  D R+S +G
Subjt:  YLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein    1.7e-32    32.29
Query:  VYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNI
        +YVD++++ G+S   +N    Q+ + F M  LG + Y+LGI+++     + L Q  YA++IL+  GM DC     P+  K          P + +++R+I
Subjt:  VYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNI

Query:  VGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQK
        VG L+YL  TRP++SY   +  + M  PT   + ++K++LRY++GTI  GL Y     + ++  + DSD       R+ST+G   +L  +++SW++++Q 
Subjt:  VGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQK

Query:  TVTLSSCKAEFIAATTAACQALW
        TV+ SS + E+ A    A +  W
Subjt:  TVTLSSCKAEFIAATTAACQALW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    7.1e-10    40.91
Query:  LKRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL
        LK P W + M+ EL ++ +N TW L   P     +G KWVFK K      + +  ARLVAK      GI F E  +PV R  T+R IL
Subjt:  LKRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLVAK------GIDFEEVLAPVARLDTVRVIL


Sequences
CDS sequence
ATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGACCACAACCCCTCACCTCAAGAAACTCGACGATCGAAGCTCACCGGTGGT
ATATTTTGGTGTCGAAGAAGGATGTAAAGCTCATCGCTTATATGACCCAGGTCGTGAAAAACTACAAATTAGTAGAGATGTTGTTTTTCAAGAGAATCTTGAATGGCCTT
GGAACGAAGTCGTTAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCAATTTTGTTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTAGGGTTGAAAAT
GTCATACCACATGCAACTGAGATACCTGCGATTGGAGCAACCGGTCCATCTCCTCCATCAACGAACACACCGGTCCGTCTAAGATCTCTCACTCATATCTACATCAACAC
AGAGGAAGTTGTAGGTGGTGATGAACAAGAGAATGAGAAGTTGTTAAAGAGGCCCAACTGGTACAAAGTAATGGAGAACGAATTAAAATCCATTGAGAAAAACAACACAT
GGAGTCTGACCAAGCTTCCACCAGGACACAAACCCATTGGTCTAAAATGGGTGTTCAAATTGAAGAAAGACCCTAGTGTTGAAGTTGTCAAGCACAATGCAAGATTGGTT
GCTAAAGGCATTGACTTTGAAGAAGTTTTAGCACCAGTTGCAAGACTTGACACCGTTCGAGTCATTCTTGTACTAGTTGCAAATCAAAGTTGGGAGGTACACCATCTAGA
TGTGAAGTCGACATTTCTCAATGGAGAACTGGAAGAGGAAGTATATGTTACTCAATCAGAGGGTTTTGAGGTCCCAAATAAAAAACACAAGGTGCATAGATTGTCGAAGG
CTCTCTACGGATTAAGGCAAGCTCCACGAGCTTGGAACATTCAACTTGATAGGAGTCTCAAAGAGCTTGGTTTTGGAAAATGCACTCAAAAGCAAGTAGTCTACACAAGA
AGTGAAGGAGAAGAATGTGTTCTTGTTGAAGTGTATGTTGACAATCTCATTGTAGCAGGAAATAGCACTGAAAAGGTCAATAAGTTCAAGCAGCAAATGATGGCAGAATT
TGAAATGAGCCACTTAGGTCTTCTCTCTTACTACTTAGGAATTGAAGTTGAACAACAGAAGGGTCGAATCCTGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCC
AATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCACAAAGACATGGAAGGAGCACCGATTGAAGCTACGGAGTACAGAAACATC
GTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAAATCTTTCATATGTTTTTGGGATGGCGAGTAAGTATATGGAAAGGCCTACAACCATACATTACAAGGTTGTCAA
GCAAATACTTAGGTATTTGAGAGGGACAATTCACTTTGGGCTCACTTATACGAAAGGTCCTAGACAATTCGATATATTAGGTTACTCTGACAGTGATTTAGTCAGTGATC
TCGACGGGAGGAAAAGTACAAGTGGAATGAAATTTTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACAGTGACACTCTCATCTTGCAAAGCCGAG
TTCATTGCAGCTACTACCGCAGCTTGCCAAGCGTTGTGGTTAAGATGCATTGTTAGCGAGATAGTCAGAATGGAGCCAAGATCAGTAACATTATTCGTGGACAACAAATC
CGCGATAGCTCTCATGAAGAATCCTGTATTTCATGGTTGTGGCAAGCACATAGATACATGTTTTCATTTCATTCAAGAGTGTGTTGAGAATGGACAAATTATCGTTGAAT
TTGTCAATACTAGAGAACAACGAGCCGATGTTTTGACTAAAGCATTGACAAGAGTAAAGTTAGCTGCTATGTGTCAGCTACTTGGTGTTCGTAATTTAGAATCATGTTAG
mRNA sequence
ATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATGTAAAGACCACAACCCCTCACCTCAAGAAACTCGACGATCGAAGCTCACCGGTGGT
ATATTTTGGTGTCGAAGAAGGATGTAAAGCTCATCGCTTATATGACCCAGGTCGTGAAAAACTACAAATTAGTAGAGATGTTGTTTTTCAAGAGAATCTTGAATGGCCTT
GGAACGAAGTCGTTAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCAATTTTGTTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTAGGGTTGAAAAT
GTCATACCACATGCAACTGAGATACCTGCGATTGGAGCAACCGGTCCATCTCCTCCATCAACGAACACACCGGTCCGTCTAAGATCTCTCACTCATATCTACATCAACAC
AGAGGAAGTTGTAGGTGGTGATGAACAAGAGAATGAGAAGTTGTTAAAGAGGCCCAACTGGTACAAAGTAATGGAGAACGAATTAAAATCCATTGAGAAAAACAACACAT
GGAGTCTGACCAAGCTTCCACCAGGACACAAACCCATTGGTCTAAAATGGGTGTTCAAATTGAAGAAAGACCCTAGTGTTGAAGTTGTCAAGCACAATGCAAGATTGGTT
GCTAAAGGCATTGACTTTGAAGAAGTTTTAGCACCAGTTGCAAGACTTGACACCGTTCGAGTCATTCTTGTACTAGTTGCAAATCAAAGTTGGGAGGTACACCATCTAGA
TGTGAAGTCGACATTTCTCAATGGAGAACTGGAAGAGGAAGTATATGTTACTCAATCAGAGGGTTTTGAGGTCCCAAATAAAAAACACAAGGTGCATAGATTGTCGAAGG
CTCTCTACGGATTAAGGCAAGCTCCACGAGCTTGGAACATTCAACTTGATAGGAGTCTCAAAGAGCTTGGTTTTGGAAAATGCACTCAAAAGCAAGTAGTCTACACAAGA
AGTGAAGGAGAAGAATGTGTTCTTGTTGAAGTGTATGTTGACAATCTCATTGTAGCAGGAAATAGCACTGAAAAGGTCAATAAGTTCAAGCAGCAAATGATGGCAGAATT
TGAAATGAGCCACTTAGGTCTTCTCTCTTACTACTTAGGAATTGAAGTTGAACAACAGAAGGGTCGAATCCTGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCC
AATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCACAAAGACATGGAAGGAGCACCGATTGAAGCTACGGAGTACAGAAACATC
GTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAAATCTTTCATATGTTTTTGGGATGGCGAGTAAGTATATGGAAAGGCCTACAACCATACATTACAAGGTTGTCAA
GCAAATACTTAGGTATTTGAGAGGGACAATTCACTTTGGGCTCACTTATACGAAAGGTCCTAGACAATTCGATATATTAGGTTACTCTGACAGTGATTTAGTCAGTGATC
TCGACGGGAGGAAAAGTACAAGTGGAATGAAATTTTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACAGTGACACTCTCATCTTGCAAAGCCGAG
TTCATTGCAGCTACTACCGCAGCTTGCCAAGCGTTGTGGTTAAGATGCATTGTTAGCGAGATAGTCAGAATGGAGCCAAGATCAGTAACATTATTCGTGGACAACAAATC
CGCGATAGCTCTCATGAAGAATCCTGTATTTCATGGTTGTGGCAAGCACATAGATACATGTTTTCATTTCATTCAAGAGTGTGTTGAGAATGGACAAATTATCGTTGAAT
TTGTCAATACTAGAGAACAACGAGCCGATGTTTTGACTAAAGCATTGACAAGAGTAAAGTTAGCTGCTATGTGTCAGCTACTTGGTGTTCGTAATTTAGAATCATGTTAG
Protein sequence
MGRKPHLAHLRVFGCVAYVKTTTPHLKKLDDRSSPVVYFGVEEGCKAHRLYDPGREKLQISRDVVFQENLEWPWNEVVSDGKEITEFQVMDQFCSDEFENLEDAETRVEN
VIPHATEIPAIGATGPSPPSTNTPVRLRSLTHIYINTEEVVGGDEQENEKLLKRPNWYKVMENELKSIEKNNTWSLTKLPPGHKPIGLKWVFKLKKDPSVEVVKHNARLV
AKGIDFEEVLAPVARLDTVRVILVLVANQSWEVHHLDVKSTFLNGELEEEVYVTQSEGFEVPNKKHKVHRLSKALYGLRQAPRAWNIQLDRSLKELGFGKCTQKQVVYTR
SEGEECVLVEVYVDNLIVAGNSTEKVNKFKQQMMAEFEMSHLGLLSYYLGIEVEQQKGRILLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDMEGAPIEATEYRNI
VGCLRYLLNTRPNLSYVFGMASKYMERPTTIHYKVVKQILRYLRGTIHFGLTYTKGPRQFDILGYSDSDLVSDLDGRKSTSGMKFYLNESLVSWNSQKQKTVTLSSCKAE
FIAATTAACQALWLRCIVSEIVRMEPRSVTLFVDNKSAIALMKNPVFHGCGKHIDTCFHFIQECVENGQIIVEFVNTREQRADVLTKALTRVKLAAMCQLLGVRNLESC
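
The protein sequence listed for this gene appears to be the direct translation of the CDS under the standard genetic code. The following is a minimal sketch for checking that relationship on a sequence copied from this page. The function name `translate` and the use of NCBI translation table 1 are assumptions made for illustration; they are not taken from the database itself.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), written in T, C, A, G order.
# Assumption: this gene is nuclear-encoded and uses the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a multi-line
    sequence pasted from the page can be passed in directly.
    Codons containing unexpected characters are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nucleotides of the CDS shown above.
cds_start = "ATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTC"
print(translate(cds_start))  # MGRKPHLAHLRV
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence (with line breaks removed) is a quick way to confirm that the two entries agree.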