; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh08G004680 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh08G004680
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionReverse transcriptase
Genome locationCma_Chr08:2642410..2645712
RNA-Seq ExpressionCmaCh08G004680
SyntenyCmaCh08G004680
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022972954.1 uncharacterized protein LOC111471473 [Cucurbita maxima]9.0e-27462.71Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE
        MSTTKSTPK+QADRLTLIEEEML LKE PD +R LEARV ELS KVV IDAMG+RLDGLPIA+LMFRVTS EE+VAPTSSP+PSGSPDSSVAHKEG GEE
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE

Query:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA
        FDVLQNTMMSLFNGLADEFRTTIDD+QE+M +M TRIEVTMK V+NV+AGQ NTG NKL+FPDPR FK NRDAKELENFIFDVEQYFKAT ACTDD KV 
Subjt:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA

Query:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------
        VASMYLIDD KLWWR KVQDIE+ L TIDSWED K+ELRD FLPEN  +LAMEKL+ALK TG IRDYV+QFS LMLDI GT EKD               
Subjt:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------

Query:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK
                                                    KP  HRNGSPNRPNGGN+RPSGWTDRPPQNNQA TSRGPY QRNHP TPLQC+LCK
Subjt:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK

Query:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMF----------VDATINSRPSKSTLINSG--A
        GPHKVSYCPH ASLTALQVSIQES DTR+ETMLDKKEDQDNP MGALKFLSALQRKV+PKEI+EKGL+           +D  +        ++      
Subjt:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMF----------VDATINSRPSKSTLINSG--A

Query:  THNFIAKQVARRLGLTIG---------KDPGKMKVVNSEALPIVGVSKRVPFKL-----------GAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITK
         H  I   +A+ L +T           K PG ++++++  L   G+++  P  +                E++ V+    D    +  +   P + +  +
Subjt:  THNFIAKQVARRLGLTIG---------KDPGKMKVVNSEALPIVGVSKRVPFKL-----------GAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITK

Query:  WNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDL
            PG+KPPAKNAY+MAPPELAELRKQLDELL A FI P KA YGA VLFQKKKD TL LCIDYRALNKVTVRNKYPL IISDLFDQLHG KYFTKLDL
Subjt:  WNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDL

Query:  RSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVF
        RS YYQVRIAEGDE KTTCVTRYGAFEFLVMPFGLT APATFCTLMN                      TTLEEHKVHLKL                   
Subjt:  RSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVF

Query:  AQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR
                        I MDSDKIKAIQEWKVPTSVSD+RSFL LANYYR
Subjt:  AQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR

XP_022975516.1 uncharacterized protein LOC111474945, partial [Cucurbita maxima]3.8e-28865.78Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE
        MSTTKSTPK+QADRLTLIEEEMLFLKE PD +R LEARV ELS KVV IDAMG+RLDGLPIA+LMFRVTS EE+VAPTSSP+PSGSPDSSVAHKEG GEE
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE

Query:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA
        FDVLQNTMMSLFNGLADEFRTTIDD+QE+M +M TRIEVTMK V+NV+AGQTNTG NKL+FP+PR FKGN+DAKELENFIFDVEQYFKAT  C DD KV 
Subjt:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA

Query:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------
        VASMYL DD KLWWR KVQDIE+ LCTIDSWED K+ELRD FLPEN ++LAMEKL+ALK TGSIRDYV+QFS LMLDIRGTSEKD               
Subjt:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------

Query:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK
                                                    KP  HRNGSPNRPNGGN+RPSGWTDRPPQNNQA TSRGPY QRNHP TPLQC+LCK
Subjt:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK

Query:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMF----------VDATINSRPSKSTLINSG--A
        GPHKVSYCPH ASLTALQVSIQES D +VETMLDKKEDQDNP MGALKFLSALQRKV+PKEI+EKGL+           +D  +        ++      
Subjt:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMF----------VDATINSRPSKSTLINSG--A

Query:  THNFIAKQVARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPP
         H  I   +A+ L +T   D     +  S   P+    + VP        E++ V+    D    +  +   P   +  +    PG+KPPAKNAYRMAPP
Subjt:  THNFIAKQVARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPP

Query:  ELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCV
        ELAELRKQLDELL A FI P KA YGAPVLFQKKKD TL LCIDYRALNKVTVRNKYPL IISDLFDQLHG KYFTKLDLRS YYQVRIAEGDE KTTCV
Subjt:  ELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCV

Query:  TRYGAFEFLVMPFGLTIAPATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMD
        TRYGAFEFLVMPFGLT APATFCTLMN                      TTLEEHKVHLKL+FDKL+QNQ                       CGQI MD
Subjt:  TRYGAFEFLVMPFGLTIAPATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMD

Query:  SDKIKAIQEWKVPTSVSDVRSFLGLANYYR
        SDKIKAIQEWKVPTSVSD+RSFLGLANYYR
Subjt:  SDKIKAIQEWKVPTSVSDVRSFLGLANYYR

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]0.0e+0066.45Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE
        MSTTKSTPK+QADRLTLIEEEMLFLKE PD +R LE RV ELS KVV++DAMG+RLDGLPIA+L+FRVTS EE+VAPTSSP+PS SPDSSVAHKEG GEE
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE

Query:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA
        FD+LQNTMMSLFNGLADEFR+T+DD+QE+M AM+TRIEVTMK V+ V+AGQT+TG NKL+FPDPR FKGNRDAKELENFIFDVEQYFKAT ACTDD KV 
Subjt:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA

Query:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKDK--------------
        VA+MYL+DD KLWWR KVQDIE+ LCTIDSWED KRELR+ FLPEN  ++AMEK++ALK TG+IRDYV+QFS LMLDIRGT+EKDK              
Subjt:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKDK--------------

Query:  ---------------------------------------------PSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK
                                                     P GHRNGSPNR N       GWTDRPPQNNQA TSRGPY Q+NHP TPLQC+LCK
Subjt:  ---------------------------------------------PSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK

Query:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARR
        GPHKVSYCPH ASLTALQVSIQES + RVET+LDKKED DNP MGALKFLSALQRKV+PKEIIEKGLMFVDATINSR SKSTLI+SGATHNFIA Q ARR
Subjt:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARR

Query:  LGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV------------------------------------------------
        LGLTIGKDPGKMK VNSEALPIVGVSK VPFK+G WTGELDLVV+R+DDFDV                                                
Subjt:  LGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV------------------------------------------------

Query:  -------------------------------------------RAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPV
                                                    +  +   P   +  +    PG+KPPAKNAYRMAPPELAELRKQLDELL A FI P 
Subjt:  -------------------------------------------RAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPV

Query:  KATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPAT
        KA YGAPVLFQKKKD TL LCIDYRALNKVTVRNKYPL IISDLFDQLHG KYFTKLDLRS YYQVRI EGDE KTTCVTRYGAFEFLVMPFGLT APAT
Subjt:  KATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPAT

Query:  FCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRS
        F TLMN                      TTLEEHKVHLKL+FDKL+QNQ Y+KKEKC FAQTCINFL HV+ CGQI MDSDKIKAIQEWKVPTSVS++RS
Subjt:  FCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRS

Query:  FLGLANYYR
        FLGLANYYR
Subjt:  FLGLANYYR

XP_023526180.1 uncharacterized protein LOC111789739 [Cucurbita pepo subsp. pepo]1.8e-27759.26Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE
        MSTTKSTPK+QADRLTLIEEEMLFLKE PD +R LE RV ELS KVV++DAMG+RLDGLPIA+L+FRVTS EE+VAPTSSP+PS SPDSSVAHKEG GEE
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE

Query:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA
        FDVLQNTMMSLFNGLADEFR+T+DD+QE+M AM+TRIEVTMK                                        VE YFKAT ACTDD KV 
Subjt:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA

Query:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKDK--------------
        VA+MYL+DD KLWWR KVQDIE+ LCTIDSWED KRELR+ FLPEN  ++AMEK++ALK TG+IRDYV+QFS LMLDIRGT+EKDK              
Subjt:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKDK--------------

Query:  ---------------------------------------------PSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK
                                                     P GHRNGSPNR N       GWTDRPPQNNQA TSRGPY Q+NHP TPLQCILCK
Subjt:  ---------------------------------------------PSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK

Query:  GPH----KVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQ
        GPH    K++  P WA                                   KFLSALQRKV+PKEIIEKGLMFVDATINSR SKSTLI+SGATHNFIA Q
Subjt:  GPH----KVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQ

Query:  VARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV--------------------------------------------
         ARRLGLTIGKDPGKMK VNSEALPIVGVSK VPFK+G WTGELDLVV+R+DDFDV                                            
Subjt:  VARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV--------------------------------------------

Query:  -----------------------------------------------RAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEF
                                                        +  +   P   +  +    PG+KPPAKNAYRMAPPELAELRKQLDELL A F
Subjt:  -----------------------------------------------RAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEF

Query:  ISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTI
        I P KA YGAPVLFQKKKD TL LCIDYRALNKVTVRNKYPL IISDLFDQLHG KYFTKLDLRS YYQVRIAEGDE KTTCVTRYGAFEFLVMPFGLT 
Subjt:  ISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTI

Query:  APATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVS
        APATFCTLMN                      TTLEEHKVHLKL+FDKL+QNQ Y+KKEKC FAQTCI+FL HV+ CGQI MDSDKIKAIQEWKVPTSVS
Subjt:  APATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVS

Query:  DVRSFLGLANYYR
        ++RSFLGLANYYR
Subjt:  DVRSFLGLANYYR

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]0.0e+0066.56Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE
        MSTTKSTPK+QADRLTLIEEEMLFLKE PD +R LE RV ELS KVV++DAMG+RLDGLPIA+L+FRVTS EE+VAPTSSP+PS SPDSSVAHKEG GEE
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE

Query:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA
        FD+LQNTMMSLFNGLADEFR+T+DD+QE+M AM+TRIEVTMK V+ V+AGQT+TG NKL+FPDPR FKGNRDAKELENFIFDVEQYFKAT ACTDD KV 
Subjt:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA

Query:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKDK--------------
        VA+MYL+DD KLWWR KVQDIE+ LCTIDSWED KRELR+ FLPEN  ++AMEK++ALK TG+IRDYV+QFS LMLDIRGT+EKDK              
Subjt:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKDK--------------

Query:  ---------------------------------------------PSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK
                                                     P GHRNGSPNR N       GWTDRPPQNNQA TSRGPY Q+NHP TPLQC+LCK
Subjt:  ---------------------------------------------PSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK

Query:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARR
        GPHKVSYCPH ASLTALQVSIQES + RVET+LDKKED DNP MGALKFLSALQRKV+PKEIIEKGLMFVDATINSR SKSTLI+SGATHNFIA Q ARR
Subjt:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARR

Query:  LGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV------------------------------------------------
        LGLTIGKDPGKMK VNSEALPIVGVSK VPFK+G WTGELDLVV+R+DDFDV                                                
Subjt:  LGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV------------------------------------------------

Query:  -------------------------------------------RAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPV
                                                    +  +   P   +  +    PG+KPPAKNAYRMAPPELAELRKQLDELL A FI P 
Subjt:  -------------------------------------------RAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPV

Query:  KATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPAT
        KA YGAPVLFQKKKD TL LCIDYRALNKVTVRNKYPL IISDLFDQLHG KYFTKLDLRS YYQVRIAEGDE KTTCVTRYGAFEFLVMPFGLT APAT
Subjt:  KATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPAT

Query:  FCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRS
        FCTLMN                      TTLEEHKVHLKL+FDKL+QNQ Y+KKEKC FAQTCI+FL HV+ CGQI MDSDKIKAIQEWKVPTSVS++RS
Subjt:  FCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRS

Query:  FLGLANYYR
        FLGLANYYR
Subjt:  FLGLANYYR

TrEMBL top hitse value%identityAlignment
A0A6J1D906 Reverse transcriptase8.9e-23551.17Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVA-PTSSPR--PSGSPDSSVAHKEGC
        MSTTK   KS  DRL  IEE++L+L+E PD +R+LEARV E S K  EIDA+ +R+DGLPI  +  RV + E K   P S  R   S +P++ +  + G 
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVA-PTSSPR--PSGSPDSSVAHKEGC

Query:  GEEFDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDM
          E +   + MM LFN + ++F+ TID ++ +M  ++TR+ +TM+ V N +  Q N GFNKLK P+P+PF GNRDAK+LENF+FDVEQYFKAT   +++M
Subjt:  GEEFDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDM

Query:  KVAVASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKDK-----------
        KV +A+M+L DD KLWWR KV DI+N  CTI+SW+D K+ELR  F P+NV+++A  KL  L+ TG+IRDYVKQFS +M+DIR  SEKDK           
Subjt:  KVAVASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKDK-----------

Query:  ----------------------------PSGHRNGSPNRPNGGNER------PSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCKGPHKVSYCPHW
                                       H   +   P GGN+        SG  D+ PQ      SRGPYPQ  +    +   LCKGPH+V+ CPH 
Subjt:  ----------------------------PSGHRNGSPNRPNGGNER------PSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCKGPHKVSYCPHW

Query:  ASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVD-PKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARRLGLTIGKDPG
        A+LTALQ S+Q   +  VET  +K+ED++ P M ALKFLSA+Q++V+ PK   EKGLMFVDATIN   +KST+++SGATHNFI++Q ARRL LTI KD G
Subjt:  ASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVD-PKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARRLGLTIGKDPG

Query:  KMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV----------------------------------------------------------
        KMKVVN EALPIVGVSKRV  KLG WTG +D VV+R+DDFDV                                                          
Subjt:  KMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV----------------------------------------------------------

Query:  ---------------------RAYHKHY------------HPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLF
                             +   K Y             P   +  +    PG KPPAKNAYRMAPPELAELRKQLDELL A FI P KA +GAPVLF
Subjt:  ---------------------RAYHKHY------------HPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLF

Query:  QKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMN----
        QKKKD TL LCIDYRALNKVTVRNKYPL II+DLFDQLHG KYFTKLDLRS YYQVRIAEGDE KTTCVTRYGAFEFLVM FGLT APATFCT+MN    
Subjt:  QKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMN----

Query:  ------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR
                           TL+EH++HL+L+FDKL+QNQ Y+KKEKC FAQ  I FL HVI  GQI MD+DK+KAIQEW+VPTSV ++RSFLGLANYYR
Subjt:  ------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR

A0A6J1ID35 uncharacterized protein LOC1114714734.4e-27462.71Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE
        MSTTKSTPK+QADRLTLIEEEML LKE PD +R LEARV ELS KVV IDAMG+RLDGLPIA+LMFRVTS EE+VAPTSSP+PSGSPDSSVAHKEG GEE
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE

Query:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA
        FDVLQNTMMSLFNGLADEFRTTIDD+QE+M +M TRIEVTMK V+NV+AGQ NTG NKL+FPDPR FK NRDAKELENFIFDVEQYFKAT ACTDD KV 
Subjt:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA

Query:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------
        VASMYLIDD KLWWR KVQDIE+ L TIDSWED K+ELRD FLPEN  +LAMEKL+ALK TG IRDYV+QFS LMLDI GT EKD               
Subjt:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------

Query:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK
                                                    KP  HRNGSPNRPNGGN+RPSGWTDRPPQNNQA TSRGPY QRNHP TPLQC+LCK
Subjt:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK

Query:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMF----------VDATINSRPSKSTLINSG--A
        GPHKVSYCPH ASLTALQVSIQES DTR+ETMLDKKEDQDNP MGALKFLSALQRKV+PKEI+EKGL+           +D  +        ++      
Subjt:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMF----------VDATINSRPSKSTLINSG--A

Query:  THNFIAKQVARRLGLTIG---------KDPGKMKVVNSEALPIVGVSKRVPFKL-----------GAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITK
         H  I   +A+ L +T           K PG ++++++  L   G+++  P  +                E++ V+    D    +  +   P + +  +
Subjt:  THNFIAKQVARRLGLTIG---------KDPGKMKVVNSEALPIVGVSKRVPFKL-----------GAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITK

Query:  WNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDL
            PG+KPPAKNAY+MAPPELAELRKQLDELL A FI P KA YGA VLFQKKKD TL LCIDYRALNKVTVRNKYPL IISDLFDQLHG KYFTKLDL
Subjt:  WNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDL

Query:  RSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVF
        RS YYQVRIAEGDE KTTCVTRYGAFEFLVMPFGLT APATFCTLMN                      TTLEEHKVHLKL                   
Subjt:  RSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVF

Query:  AQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR
                        I MDSDKIKAIQEWKVPTSVSD+RSFL LANYYR
Subjt:  AQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR

A0A6J1IDF7 uncharacterized protein LOC1114742151.6e-27160.29Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE
        MS TKSTPK+QADRLTLIEE+MLFLKE PD +  LEARV ELS KVVEIDAMG+RLDGLPIA+LMF+VTS EE+VAPTSSP+PSGSPDSSVAHKEG GEE
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE

Query:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA
        FDVLQNTMMSLFNGLADEFRTTIDD+QE+M +M TRIEVTMK V+NV+AGQTNTG NKL+FP+PR FKGNRDAKELENFIFDVEQYFKAT ACTDD KV 
Subjt:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA

Query:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------
        VASMYL DD KLWWR KVQDIE+ LCTIDSWED K+ELRD FLPEN  +LAMEKL+ALK TGSIRDYV+QFS LMLDIRGTSEKD               
Subjt:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------

Query:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK
                                                    KP  HRNGSPNRPNGGN+RPSGWTDRPPQNNQA TSRGPY QRNHP TPLQC+LCK
Subjt:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK

Query:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARR
        GPHKVSYCPH ASLTALQVSIQES D +VETMLDKKEDQDNP MGALKFLSALQRKV+PKEI+EKGL+                                
Subjt:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARR

Query:  LGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV------------------------------------------------
                                    VPFK+G WTGELDLVV R+DDFDV                                                
Subjt:  LGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV------------------------------------------------

Query:  --RAYHKHYH-----PVEALIT-------------------KWNSFPGLKPP--AKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKD
          R   +  H     P+  + T                      S P   PP  AKNAYRMA PELAELRKQLDELL A FI P KA Y APVLFQKKKD
Subjt:  --RAYHKHYH-----PVEALIT-------------------KWNSFPGLKPP--AKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKD

Query:  ETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMN---------
         TLHLCIDYRALNKVTVRNK+PL IISDLFDQLHG KYFTKLDL S YYQVRIAEGDE +TTCVTRYGAFEFLVMPF LT APATFCTLMN         
Subjt:  ETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMN---------

Query:  -------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR
                     TTLEEHKVHLKL                                   I MDSDKIKAIQEWKVPTSVSD+RSFLGLANYYR
Subjt:  -------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR

A0A6J1IEF9 uncharacterized protein LOC1114749451.8e-28865.78Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE
        MSTTKSTPK+QADRLTLIEEEMLFLKE PD +R LEARV ELS KVV IDAMG+RLDGLPIA+LMFRVTS EE+VAPTSSP+PSGSPDSSVAHKEG GEE
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE

Query:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA
        FDVLQNTMMSLFNGLADEFRTTIDD+QE+M +M TRIEVTMK V+NV+AGQTNTG NKL+FP+PR FKGN+DAKELENFIFDVEQYFKAT  C DD KV 
Subjt:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA

Query:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------
        VASMYL DD KLWWR KVQDIE+ LCTIDSWED K+ELRD FLPEN ++LAMEKL+ALK TGSIRDYV+QFS LMLDIRGTSEKD               
Subjt:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------

Query:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK
                                                    KP  HRNGSPNRPNGGN+RPSGWTDRPPQNNQA TSRGPY QRNHP TPLQC+LCK
Subjt:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK

Query:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMF----------VDATINSRPSKSTLINSG--A
        GPHKVSYCPH ASLTALQVSIQES D +VETMLDKKEDQDNP MGALKFLSALQRKV+PKEI+EKGL+           +D  +        ++      
Subjt:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMF----------VDATINSRPSKSTLINSG--A

Query:  THNFIAKQVARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPP
         H  I   +A+ L +T   D     +  S   P+    + VP        E++ V+    D    +  +   P   +  +    PG+KPPAKNAYRMAPP
Subjt:  THNFIAKQVARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPP

Query:  ELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCV
        ELAELRKQLDELL A FI P KA YGAPVLFQKKKD TL LCIDYRALNKVTVRNKYPL IISDLFDQLHG KYFTKLDLRS YYQVRIAEGDE KTTCV
Subjt:  ELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCV

Query:  TRYGAFEFLVMPFGLTIAPATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMD
        TRYGAFEFLVMPFGLT APATFCTLMN                      TTLEEHKVHLKL+FDKL+QNQ                       CGQI MD
Subjt:  TRYGAFEFLVMPFGLTIAPATFCTLMN----------------------TTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMD

Query:  SDKIKAIQEWKVPTSVSDVRSFLGLANYYR
        SDKIKAIQEWKVPTSVSD+RSFLGLANYYR
Subjt:  SDKIKAIQEWKVPTSVSDVRSFLGLANYYR

A0A6J1IEY4 uncharacterized protein LOC1114757336.1e-25265.39Show/hide
Query:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE
        MSTTKSTPK+QADRLTLIEEEMLFLKE P  +R LEARV ELS KVV IDAMG+RLDGLPIA+LMFRVTS EE+VAPTSSP+PSGSPDSSVAHKEG GEE
Subjt:  MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEE

Query:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA
        FDVLQNTMMSLFNGLADEFRTTIDD+QE+M +M TRIEVTMK V+NV+AGQTNTG +KL+FPDPR FKGNRDAKELENFIFDVEQYFKAT ACTDD KV 
Subjt:  FDVLQNTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVA

Query:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------
        VASMYL DD KLWWR KVQDIE+ LCTIDSWED K+ELRD FLPEN  +LAMEKL+ALK TG IRDYV+QFS LMLDIRGTSEKD               
Subjt:  VASMYLIDDTKLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKD---------------

Query:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK
                                                    KP  HRNGSPNRPNGGN+RPSGWTDRPPQNNQA TSR PY QRNHP TPLQC+LCK
Subjt:  --------------------------------------------KPSGHRNGSPNRPNGGNERPSGWTDRPPQNNQAETSRGPYPQRNHPMTPLQCILCK

Query:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARR
        GPHKVSYCPH  SLTALQVSIQES D RVETMLDKKEDQDNP MGALKFLSALQRKV+PKEI+EKGLMFVDATIN +P++STLI+SGATHNFIA Q ARR
Subjt:  GPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPSKSTLINSGATHNFIAKQVARR

Query:  LGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV------------------------------------------------
        LGLTIG+DPGKMK +NSEALPIVGVSKRVPFK+G WTGELDLVV R+DDFDV                                                
Subjt:  LGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDV------------------------------------------------

Query:  -------------------------------------------RAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPV
                                                    +  +   P   +  +    PG+KPPAKNAY MAPPELAELRKQLDELL A FI P 
Subjt:  -------------------------------------------RAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPV

Query:  KATYGAPVLFQKKKDETLHLCIDYRALNKVT
        KA YGAPVLFQKKKD TL LCIDYRALNKVT
Subjt:  KATYGAPVLFQKKKDETLHLCIDYRALNKVT

SwissProt top hitse value%identityAlignment
P0CT42 Transposon Tf2-7 polyprotein5.0e-3333.61Show/hide
Query:  PAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRI
        P +N Y + P ++  +  ++++ L +  I   KA    PV+F  KK+ TL + +DY+ LNK    N YPL +I  L  ++ G   FTKLDL+S Y+ +R+
Subjt:  PAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRI

Query:  AEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL----------------------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLR
         +GDE K       G FE+LVMP+G++IAPA F   +NT L                       EH  H+K +  KL+     I + KC F Q+ + F+ 
Subjt:  AEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL----------------------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLR

Query:  HVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR
        + IS        + I  + +WK P +  ++R FLG  NY R
Subjt:  HVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR

P0CT43 Transposon Tf2-8 polyprotein5.0e-3333.61Show/hide
Query:  PAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRI
        P +N Y + P ++  +  ++++ L +  I   KA    PV+F  KK+ TL + +DY+ LNK    N YPL +I  L  ++ G   FTKLDL+S Y+ +R+
Subjt:  PAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRI

Query:  AEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL----------------------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLR
         +GDE K       G FE+LVMP+G++IAPA F   +NT L                       EH  H+K +  KL+     I + KC F Q+ + F+ 
Subjt:  AEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL----------------------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLR

Query:  HVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR
        + IS        + I  + +WK P +  ++R FLG  NY R
Subjt:  HVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.3e-3635.25Show/hide
Query:  PGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRY
        PG + P    Y +      E+ K + +LL  +FI P K+   +PV+   KKD T  LC+DYR LNK T+ + +PL  I +L  ++   + FT LDL S Y
Subjt:  PGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRY

Query:  YQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL--------------------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCIN
        +Q+ +   D  KT  VT  G +E+ VMPFGL  AP+TF   M  T                     EEH  HL  + ++L+     +KK+KC FA     
Subjt:  YQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL--------------------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCIN

Query:  FLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR
        FL + I   +I     K  AI+++  P +V   + FLG+ NYYR
Subjt:  FLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-3729.82Show/hide
Query:  DQDNPPMGALKFLSALQRKVDPKEIIEKG------LMFVDATINSRPSKSTLINSGATHNFIAKQVARRLGLTIGKDPGKMKVVNS----EALPIVGVSK
        ++ NP    L F   +    DPK    +G      L     T  +    +TL N G+T +  A+         + +D     VV++    E       +K
Subjt:  DQDNPPMGALKFLSALQRKVDPKEIIEKG------LMFVDATINSRPSKSTLINSGATHNFIAKQVARRLGLTIGKDPGKMKVVNS----EALPIVGVSK

Query:  RVPFKLGAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETL
             L  W  +    +IR +D   R    +  PV+  I      PG + P    Y +      E+ K + +LL  +FI P K+   +PV+   KKD T 
Subjt:  RVPFKLGAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITKWNSFPGLKPPAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETL

Query:  HLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL---------
         LC+DYR LNK T+ + +PL  I +L  ++   + FT LDL S Y+Q+ +   D  KT  VT  G +E+ VMPFGL  AP+TF   M  T          
Subjt:  HLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL---------

Query:  -----------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR
                   EEH  HL  + ++L+     +KK+KC FA     FL + I   +I     K  AI+++  P +V   + FLG+ NYYR
Subjt:  -----------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR

Q9UR07 Transposon Tf2-11 polyprotein5.0e-3333.61Show/hide
Query:  PAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRI
        P +N Y + P ++  +  ++++ L +  I   KA    PV+F  KK+ TL + +DY+ LNK    N YPL +I  L  ++ G   FTKLDL+S Y+ +R+
Subjt:  PAKNAYRMAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRI

Query:  AEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL----------------------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLR
         +GDE K       G FE+LVMP+G++IAPA F   +NT L                       EH  H+K +  KL+     I + KC F Q+ + F+ 
Subjt:  AEGDESKTTCVTRYGAFEFLVMPFGLTIAPATFCTLMNTTL----------------------EEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLR

Query:  HVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR
        + IS        + I  + +WK P +  ++R FLG  NY R
Subjt:  HVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGACGACAAAGTCGACACCCAAATCACAAGCCGATCGGCTTACATTAATAGAGGAGGAAATGTTGTTCCTCAAAGAAGCCCCTGACATCATCCGCGTCCTG
GAAGCACGGGTGAAAGAATTGAGTGGGAAAGTCGTAGAGATCGACGCAATGGGTAGCCGCCTGGATGGGTTGCCAATCGCAAAATTGATGTTTCGGGTGACCTCA
TTCGAAGAAAAGGTTGCTCCTACGAGCAGCCCAAGACCGTCTGGTAGCCCCGATAGCTCTGTCGCACACAAGGAGGGATGTGGCGAAGAGTTCGACGTGCTACAA
AATACAATGATGAGTTTGTTCAATGGATTAGCTGACGAATTCAGAACAACCATCGATGACATGCAAGAAAAGATGTTTGCCATGAACACTCGAATCGAGGTGACC
ATGAAAGTTGTGCAGAACGTCTCGGCTGGACAAACTAATACAGGGTTCAACAAACTGAAGTTCCCAGATCCTAGACCTTTTAAAGGGAACCGGGATGCCAAAGAG
TTGGAGAACTTTATCTTTGATGTCGAACAGTACTTCAAAGCCACACCGGCCTGTACCGACGACATGAAGGTGGCAGTAGCCTCGATGTATCTCATAGACGACACC
AAACTTTGGTGGCGTATGAAGGTACAAGACATCGAGAATGAATTGTGCACCATAGACTCGTGGGAAGACTTCAAGAGAGAGTTGAGGGACCTATTCCTCCCCGAA
AACGTAGATTATCTAGCAATGGAAAAACTAATAGCTCTAAAGCAAACTGGAAGCATAAGGGACTATGTCAAACAATTTTCGCCCCTGATGCTAGATATTAGGGGC
ACATCAGAGAAGGACAAGCCGTCAGGTCACCGAAATGGAAGCCCCAACAGGCCAAACGGAGGTAACGAAAGACCAAGCGGGTGGACAGATAGACCTCCTCAGAAC
AACCAAGCGGAGACATCTCGAGGACCTTACCCTCAAAGGAACCACCCGATGACACCTTTACAATGCATATTGTGTAAAGGCCCCCACAAAGTGTCTTACTGTCCT
CATTGGGCCTCTCTCACTGCACTCCAAGTGTCCATTCAAGAGAGCATCGACACAAGAGTCGAGACTATGCTAGACAAGAAGGAAGATCAAGACAACCCCCCAATG
GGCGCGCTCAAATTCTTGTCAGCCCTCCAACGGAAGGTCGACCCGAAAGAGATAATAGAGAAAGGGCTCATGTTTGTGGATGCAACAATAAACTCTCGACCGAGC
AAGAGCACTTTGATAAACTCAGGAGCGACTCACAACTTCATCGCCAAACAAGTAGCCCGAAGATTGGGACTCACCATAGGAAAAGACCCTGGAAAAATGAAAGTT
GTCAACTCCGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCCCTTCAAATTAGGGGCTTGGACAGGAGAGCTGGACCTGGTCGTAATTCGCATAGACGAC
TTTGACGTGAGAGCCTACCACAAACATTACCACCCCGTCGAGGCATTGATCACGAAATGGAACTCCTTCCCGGGGTTAAAACCCCCAGCGAAGAACGCATACCGG
ATGGCTCCCCCTGAGCTAGCTGAATTGAGAAAACAACTAGATGAGTTGTTGACAGCAGAATTCATCTCCCCGGTAAAAGCAACTTACGGAGCCCCCGTATTATTT
CAGAAAAAGAAGGATGAGACGTTGCATCTGTGCATAGATTATAGGGCCTTAAACAAGGTGACGGTACGCAACAAATACCCACTGTCGATAATATCCGACTTGTTC
GACCAACTTCATGGGCCCAAATACTTCACAAAGTTGGACTTACGATCAAGGTACTACCAAGTACGTATCGCCGAGGGGGACGAGTCCAAGACGACGTGTGTGACA
AGATATGGGGCCTTCGAGTTCCTGGTAATGCCCTTTGGCTTGACAATCGCCCCAGCTACGTTTTGCACGTTAATGAACACAACCCTAGAGGAACACAAGGTGCAC
TTGAAGCTGATATTTGACAAGCTGCAACAGAACCAGTCGTACATCAAGAAAGAAAAATGTGTCTTCGCACAAACATGCATCAACTTCCTCAGACATGTCATCAGT
TGTGGACAGATTGGGATGGATAGCGATAAGATAAAAGCTATCCAGGAGTGGAAAGTCCCTACTTCCGTATCCGATGTGCGGTCCTTCTTAGGATTAGCAAACTAC
TATAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGACGACAAAGTCGACACCCAAATCACAAGCCGATCGGCTTACATTAATAGAGGAGGAAATGTTGTTCCTCAAAGAAGCCCCTGACATCATCCGCGTCCTG
GAAGCACGGGTGAAAGAATTGAGTGGGAAAGTCGTAGAGATCGACGCAATGGGTAGCCGCCTGGATGGGTTGCCAATCGCAAAATTGATGTTTCGGGTGACCTCA
TTCGAAGAAAAGGTTGCTCCTACGAGCAGCCCAAGACCGTCTGGTAGCCCCGATAGCTCTGTCGCACACAAGGAGGGATGTGGCGAAGAGTTCGACGTGCTACAA
AATACAATGATGAGTTTGTTCAATGGATTAGCTGACGAATTCAGAACAACCATCGATGACATGCAAGAAAAGATGTTTGCCATGAACACTCGAATCGAGGTGACC
ATGAAAGTTGTGCAGAACGTCTCGGCTGGACAAACTAATACAGGGTTCAACAAACTGAAGTTCCCAGATCCTAGACCTTTTAAAGGGAACCGGGATGCCAAAGAG
TTGGAGAACTTTATCTTTGATGTCGAACAGTACTTCAAAGCCACACCGGCCTGTACCGACGACATGAAGGTGGCAGTAGCCTCGATGTATCTCATAGACGACACC
AAACTTTGGTGGCGTATGAAGGTACAAGACATCGAGAATGAATTGTGCACCATAGACTCGTGGGAAGACTTCAAGAGAGAGTTGAGGGACCTATTCCTCCCCGAA
AACGTAGATTATCTAGCAATGGAAAAACTAATAGCTCTAAAGCAAACTGGAAGCATAAGGGACTATGTCAAACAATTTTCGCCCCTGATGCTAGATATTAGGGGC
ACATCAGAGAAGGACAAGCCGTCAGGTCACCGAAATGGAAGCCCCAACAGGCCAAACGGAGGTAACGAAAGACCAAGCGGGTGGACAGATAGACCTCCTCAGAAC
AACCAAGCGGAGACATCTCGAGGACCTTACCCTCAAAGGAACCACCCGATGACACCTTTACAATGCATATTGTGTAAAGGCCCCCACAAAGTGTCTTACTGTCCT
CATTGGGCCTCTCTCACTGCACTCCAAGTGTCCATTCAAGAGAGCATCGACACAAGAGTCGAGACTATGCTAGACAAGAAGGAAGATCAAGACAACCCCCCAATG
GGCGCGCTCAAATTCTTGTCAGCCCTCCAACGGAAGGTCGACCCGAAAGAGATAATAGAGAAAGGGCTCATGTTTGTGGATGCAACAATAAACTCTCGACCGAGC
AAGAGCACTTTGATAAACTCAGGAGCGACTCACAACTTCATCGCCAAACAAGTAGCCCGAAGATTGGGACTCACCATAGGAAAAGACCCTGGAAAAATGAAAGTT
GTCAACTCCGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCCCTTCAAATTAGGGGCTTGGACAGGAGAGCTGGACCTGGTCGTAATTCGCATAGACGAC
TTTGACGTGAGAGCCTACCACAAACATTACCACCCCGTCGAGGCATTGATCACGAAATGGAACTCCTTCCCGGGGTTAAAACCCCCAGCGAAGAACGCATACCGG
ATGGCTCCCCCTGAGCTAGCTGAATTGAGAAAACAACTAGATGAGTTGTTGACAGCAGAATTCATCTCCCCGGTAAAAGCAACTTACGGAGCCCCCGTATTATTT
CAGAAAAAGAAGGATGAGACGTTGCATCTGTGCATAGATTATAGGGCCTTAAACAAGGTGACGGTACGCAACAAATACCCACTGTCGATAATATCCGACTTGTTC
GACCAACTTCATGGGCCCAAATACTTCACAAAGTTGGACTTACGATCAAGGTACTACCAAGTACGTATCGCCGAGGGGGACGAGTCCAAGACGACGTGTGTGACA
AGATATGGGGCCTTCGAGTTCCTGGTAATGCCCTTTGGCTTGACAATCGCCCCAGCTACGTTTTGCACGTTAATGAACACAACCCTAGAGGAACACAAGGTGCAC
TTGAAGCTGATATTTGACAAGCTGCAACAGAACCAGTCGTACATCAAGAAAGAAAAATGTGTCTTCGCACAAACATGCATCAACTTCCTCAGACATGTCATCAGT
TGTGGACAGATTGGGATGGATAGCGATAAGATAAAAGCTATCCAGGAGTGGAAAGTCCCTACTTCCGTATCCGATGTGCGGTCCTTCTTAGGATTAGCAAACTAC
TATAGGTGA
Protein sequenceShow/hide protein sequence
MSTTKSTPKSQADRLTLIEEEMLFLKEAPDIIRVLEARVKELSGKVVEIDAMGSRLDGLPIAKLMFRVTSFEEKVAPTSSPRPSGSPDSSVAHKEGCGEEFDVLQ
NTMMSLFNGLADEFRTTIDDMQEKMFAMNTRIEVTMKVVQNVSAGQTNTGFNKLKFPDPRPFKGNRDAKELENFIFDVEQYFKATPACTDDMKVAVASMYLIDDT
KLWWRMKVQDIENELCTIDSWEDFKRELRDLFLPENVDYLAMEKLIALKQTGSIRDYVKQFSPLMLDIRGTSEKDKPSGHRNGSPNRPNGGNERPSGWTDRPPQN
NQAETSRGPYPQRNHPMTPLQCILCKGPHKVSYCPHWASLTALQVSIQESIDTRVETMLDKKEDQDNPPMGALKFLSALQRKVDPKEIIEKGLMFVDATINSRPS
KSTLINSGATHNFIAKQVARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVPFKLGAWTGELDLVVIRIDDFDVRAYHKHYHPVEALITKWNSFPGLKPPAKNAYR
MAPPELAELRKQLDELLTAEFISPVKATYGAPVLFQKKKDETLHLCIDYRALNKVTVRNKYPLSIISDLFDQLHGPKYFTKLDLRSRYYQVRIAEGDESKTTCVT
RYGAFEFLVMPFGLTIAPATFCTLMNTTLEEHKVHLKLIFDKLQQNQSYIKKEKCVFAQTCINFLRHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANY
YR