; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G009760 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G009760
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase
Genome locationCG_Chr02:15346952..15353759
RNA-Seq ExpressionClCG02G009760
SyntenyClCG02G009760
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR022546 - Uncharacterised protein family Ycf68
IPR036397 - Ribonuclease H superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD5336141.1 unnamed protein product [Arabidopsis thaliana]2.0e-16152.53Show/hide
Query:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSGWAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEI
        IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR                                        S PFEI
Subjt:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSGWAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEI

Query:  LRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHS
        LRRVALWRAQYDESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHS
Subjt:  LRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHS

Query:  LKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPV
        LKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG                                                     
Subjt:  LKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPV

Query:  AESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEP
                                                                       AV W         KA                      
Subjt:  AESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEP

Query:  RASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLS
                              RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGC+NASVGERSAL GS  AS GGRSGSENVGLS
Subjt:  RASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLS

Query:  NANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEG
        NANIGENPMPRKPKGSSARFVHGG  R   G  +  + C S     A                  +VTHAILPGKARTTFNKRVPVPETDT         
Subjt:  NANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEG

Query:  VPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSL
                                                                 DMSVKMRTTCTWTERPYEASLFPGIGFG FLRSL
Subjt:  VPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSL

CAD5336145.1 unnamed protein product [Arabidopsis thaliana]9.3e-16748.77Show/hide
Query:  AMYNGELYAAFGKDESLPKRNLLILSQLVGPPGWPSYAKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGNGEEDRNMPLKDSTETKMGCQ
        A YNGELYAAFGKDESLPKRNLLILSQL                +E A      A LG          P L L                           
Subjt:  AMYNGELYAAFGKDESLPKRNLLILSQLVGPPGWPSYAKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGNGEEDRNMPLKDSTETKMGCQ

Query:  ERRGGRMGSWSDLVWIVHGRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSGWAVRVGEGQSLILKTS
                                 IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR                     
Subjt:  ERRGGRMGSWSDLVWIVHGRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSGWAVRVGEGQSLILKTS

Query:  ILKTKEAGGKGGKALRSWFSLPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFG
                           S PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFG
Subjt:  ILKTKEAGGKGGKALRSWFSLPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFG

Query:  SSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKR
        SSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG                            
Subjt:  SSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKR

Query:  KQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAV
                                                                                                AV W        
Subjt:  KQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAV

Query:  AKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVG
         KA                                            RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGC+NASVG
Subjt:  AKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVG

Query:  ERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGK
        ERSAL GS  AS GGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGG  R   G  +  + C S     A                  +VTHAILPGK
Subjt:  ERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGK

Query:  ARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYE
        ARTTFNKRVPVPETDT                                                                  DMSVKMRTTCTWTERPYE
Subjt:  ARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYE

Query:  ASLFPGIGFGPFLRSL
        ASLFPGIGFG FLRSL
Subjt:  ASLFPGIGFGPFLRSL

KAD3640919.1 hypothetical protein E3N88_30142 [Mikania micrantha]1.7e-20553.73Show/hide
Query:  STETKMGC--QERRGGRMGSWSDLVWIVHG-----RVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSG
        S   ++GC   +  G  + S  D  W V G     RVP SGIPGEEDQVGPCEQLDALSPFNPLSE+RQKEGKSMDRPH LHPVGTTR PQGRLR P   
Subjt:  STETKMGC--QERRGGRMGSWSDLVWIVHG-----RVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSG

Query:  WAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVN
                                  G ++                                             G LRGGGLPCGGCQRFESAYLQLVN
Subjt:  WAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVN

Query:  LADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAG
        LADTKLYDST FFRFG SIYDLSFMDVDKI PFSSTLGWHSLK+KGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENK RSGDSRI             
Subjt:  LADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAG

Query:  KRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQ
                                  GEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTD                      
Subjt:  KRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQ

Query:  DSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNG
            AVAWLREPTGAVAKASLH AIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLG                                 +K   G
Subjt:  DSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNG

Query:  AKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGR
        A R +                                                                              GTEEARLAERWLSVQGR
Subjt:  AKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGR

Query:  KVPLF--------------------FQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGG
        KV LF                     +VTHAILPGKART FNKRVPVPETDT            H  G +    A CRKVKEVGDLMTGEPATEAPVNGG
Subjt:  KVPLF--------------------FQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGG

Query:  RNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGRARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGS
        RNYNGPK+AKFLVG                   PYEASLFPGIGFGPFLRSL     GR R     +        +VTEACKGFLGPDGDWPSSAKAEGS
Subjt:  RNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGRARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGS

Query:  LTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
        LTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
Subjt:  LTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV

KAG7528872.1 hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica]5.6e-19666.67Show/hide
Query:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSGWAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEI
        IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRL                                               
Subjt:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSGWAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEI

Query:  LRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHS
          RVALWRAQYDESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHS
Subjt:  LRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHS

Query:  LKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPV
        LKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRI                                       GEAVEC TLDGESPV
Subjt:  LKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPV

Query:  AESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEP
        AESITSL SDPSSMGHVESRVNQQGPPCKAKYSWVTD                          AVAWLREPTGAVAKASLH AIVTAYGPEPGGEMPLEP
Subjt:  AESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEP

Query:  RASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIK----EMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSEN
        RASWFSPKCVEAQQLTGHLG               RE  +   Q   LN R  +K    +MNGAKRSAEAVGC+NASVGERSAL GS  AS GGRSGSEN
Subjt:  RASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIK----EMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSEN

Query:  VGLSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGG
        VGLSNANIGENPMPRKPKGSSARFVHGG  R   G  +  + C S     A                  +VTHAILPGKARTTFNKRVPVPETDTGG
Subjt:  VGLSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGG

OVA05688.1 hypothetical protein BVC80_4285g1 [Macleaya cordata]7.4e-18053.98Show/hide
Query:  KRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGNGEEDRNMPLKDSTETKMGCQERRGGRMGSWSDLVWIVHGRVPSSGIPGEEDQVGPCEQLDAL
        KRIEEASDSFM APLGSGGYSSVGRAPLLQL                       +G     GG              RVPSSGIPGEEDQVGPCEQLDAL
Subjt:  KRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGNGEEDRNMPLKDSTETKMGCQERRGGRMGSWSDLVWIVHGRVPSSGIPGEEDQVGPCEQLDAL

Query:  SPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSGWAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEILRRVALWRAQYDESCKL
        SPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR P     +   E Q  +   S                   S PFEILRRVALWRAQ       
Subjt:  SPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSGWAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEILRRVALWRAQYDESCKL

Query:  CSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIP
                       S GGLRGGGLPCGGCQRFESAYLQLVNLADTK+YDST FFRFGSSIYDLSFMDVDKIL FSSTLGWHSLKV GEVQT+KGLRWIP
Subjt:  CSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIP

Query:  RHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHV
        RHPETRKGV SDEMLRGVENK RSGDSRIGQPF+LLLNPWAGKRQPGELKHLS                EAVEC TLDGESPVAESITSLRSDPSSMGHV
Subjt:  RHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHV

Query:  ESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTG
        ESRVNQQGPPCKAKYSWVTD                          AVAWLREPTGAVAKASLH AIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTG
Subjt:  ESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTG

Query:  HLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSS
        HLG                                 +K                             C  +G  SGS   G S   I       +PKG  
Subjt:  HLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSS

Query:  ARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCR
                                                                                     V  G+                CR
Subjt:  ARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCR

Query:  KVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEID
        KVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVG+ + D
Subjt:  KVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEID

TrEMBL top hitse value%identityAlignment
A0A2N9GIA5 Uncharacterized protein ycf684.9e-19854.53Show/hide
Query:  EILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGW
        +++  VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGW
Subjt:  EILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGW

Query:  HSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGES
        HSLKVKGEVQT+KGLRWIPRHPETRKG                                                                         
Subjt:  HSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGES

Query:  PVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPL
                                                                          +AWLREPTGAVAKASLH AIVTAYGPEPGGEMPL
Subjt:  PVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPL

Query:  EPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVG
        EPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCK ASVGERSALEGST  S GGRSGSENVG
Subjt:  EPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVG

Query:  LSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSG
        LSNANIGENPMPRKPKGSSARFVHGG  R   G  + ++ C S     A                  +VTHAILPGKARTTFNKR               
Subjt:  LSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSG

Query:  EGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGR
                         CRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVG                                             
Subjt:  EGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGR

Query:  ARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKVWHLDVGSSPPGAVVCSKGWAVRPLKRY
                        +VTEACKGFLGPDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIK                            
Subjt:  ARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKVWHLDVGSSPPGAVVCSKGWAVRPLKRY

Query:  VSWVQNVVRQFGPYPDREGRTSGVPVIVPTVNAGPPQDECSPIPTSPEPPVAQPRQRWV------LCPCGDGATEVLRIQEKL
                                    PT     PQDECSPIPTSPEPPVAQP           LCPCGDGATEVLRIQEK+
Subjt:  VSWVQNVVRQFGPYPDREGRTSGVPVIVPTVNAGPPQDECSPIPTSPEPPVAQPRQRWV------LCPCGDGATEVLRIQEKL

A0A2N9HJU4 Uncharacterized protein ycf681.8e-21667.76Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCK
        EMLRGVENKRRSGDSRI                                       GEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPP  
Subjt:  EMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCK

Query:  AKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLGGSRSASETM
                                           +AWLREPTGAVAKASLH AIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLGGSRSASETM
Subjt:  AKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLGGSRSASETM

Query:  GDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGGRSRWT
        GDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCK ASVGERSALEGST  S GGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGG  R  
Subjt:  GDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGGRSRWT

Query:  TGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCRKVKEVGDLMTGE
         G  + ++ C S     A                  +VTHAILPGKARTTFNKR                                CRKVKEVGDLMTGE
Subjt:  TGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCRKVKEVGDLMTGE

Query:  PATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGRARILTLCQDLRAKGQSQVTEACKGFLGPDGD
        PATEAPVNGGRNYNGPKVAKFLVGLG                RP                      SG AR              +VTEACKGFLGPDGD
Subjt:  PATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGRARILTLCQDLRAKGQSQVTEACKGFLGPDGD

Query:  WPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
        WPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
Subjt:  WPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV

A0A2N9HP93 Uncharacterized protein ycf683.6e-23368.5Show/hide
Query:  EILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGW
        +++  VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGW
Subjt:  EILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGW

Query:  HSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGES
        HSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRI                                       GEAVECCTLDGES
Subjt:  HSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGES

Query:  PVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPL
        PVAESITSLRSDPSSMGHVESRVNQQGPP                                     +AWLREPTGAVAKASLH AIVTAYGPEPGGEMPL
Subjt:  PVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPL

Query:  EPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVG
        EPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCK ASVGERSALEGST  S GGRSGSENVG
Subjt:  EPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVG

Query:  LSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSG
        LSNANIGENPMPRKPKGSSARFVHGG  R   G  + ++ C S     A                  +VTHAILPGKARTTFNKR               
Subjt:  LSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSG

Query:  EGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGR
                         CRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLG                RP                      SG 
Subjt:  EGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGR

Query:  ARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
        AR              +VTEACKGFLGPDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
Subjt:  ARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV

A0A2N9I678 Uncharacterized protein ycf683.6e-23368.5Show/hide
Query:  EILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGW
        +++  VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGW
Subjt:  EILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGW

Query:  HSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGES
        HSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRI                                       GEAVECCTLDGES
Subjt:  HSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGES

Query:  PVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPL
        PVAESITSLRSDPSSMGHVESRVNQQGPP                                     +AWLREPTGAVAKASLH AIVTAYGPEPGGEMPL
Subjt:  PVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPL

Query:  EPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVG
        EPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCK ASVGERSALEGST  S GGRSGSENVG
Subjt:  EPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVG

Query:  LSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSG
        LSNANIGENPMPRKPKGSSARFVHGG  R   G  + ++ C S     A                  +VTHAILPGKARTTFNKR               
Subjt:  LSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSG

Query:  EGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGR
                         CRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLG                RP                      SG 
Subjt:  EGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGR

Query:  ARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
        AR              +VTEACKGFLGPDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
Subjt:  ARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV

A0A5N6MLP8 Uncharacterized protein ycf688.4e-20653.73Show/hide
Query:  STETKMGC--QERRGGRMGSWSDLVWIVHG-----RVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSG
        S   ++GC   +  G  + S  D  W V G     RVP SGIPGEEDQVGPCEQLDALSPFNPLSE+RQKEGKSMDRPH LHPVGTTR PQGRLR P   
Subjt:  STETKMGC--QERRGGRMGSWSDLVWIVHG-----RVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSG

Query:  WAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVN
                                  G ++                                             G LRGGGLPCGGCQRFESAYLQLVN
Subjt:  WAVRVGEGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVN

Query:  LADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAG
        LADTKLYDST FFRFG SIYDLSFMDVDKI PFSSTLGWHSLK+KGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENK RSGDSRI             
Subjt:  LADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAG

Query:  KRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQ
                                  GEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTD                      
Subjt:  KRQPGELKHLSSQRKRKQKRFPVVLLGEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQ

Query:  DSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNG
            AVAWLREPTGAVAKASLH AIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLG                                 +K   G
Subjt:  DSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNG

Query:  AKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGR
        A R +                                                                              GTEEARLAERWLSVQGR
Subjt:  AKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLSNANIGENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGR

Query:  KVPLF--------------------FQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGG
        KV LF                     +VTHAILPGKART FNKRVPVPETDT            H  G +    A CRKVKEVGDLMTGEPATEAPVNGG
Subjt:  KVPLF--------------------FQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARATCRKVKEVGDLMTGEPATEAPVNGG

Query:  RNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGRARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGS
        RNYNGPK+AKFLVG                   PYEASLFPGIGFGPFLRSL     GR R     +        +VTEACKGFLGPDGDWPSSAKAEGS
Subjt:  RNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGRARILTLCQDLRAKGQSQVTEACKGFLGPDGDWPSSAKAEGS

Query:  LTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
        LTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV
Subjt:  LTARPTRRAGTKVGLSDPTVPSGRAVAQRIKV

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.1e-2920.99Show/hide
Query:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT
        L++++G ++ +K+D +S YH ++VR+ D  K  FR   G +E+LVMP+G++  PA F   +N I     +  V+ ++DDIL++S+   +  +H+K VLQ 
Subjt:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT

Query:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE
        L+   L    +K EF   QV F+G+ +S    +   +  + +++ ++PK   +LR                             + ++ W     Q  + 
Subjt:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE

Query:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------
        +K+ +V+ P+L         L E+  +                         +++   L+  ++ K++                                
Subjt:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------

Query:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN
                 N R  RW   ++D++  I Y PG  +  +  ++               N    VN   +     +++V + + D  +  L+  E K    N
Subjt:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN

Query:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------
        +++K                    + I++K H     +HP                  + +I EYV  C  CQ  K    KP G L  +P          
Subjt:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------

Query:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER
              +PE+ +G + ++V VDR  K A  +P   + T ++ A+M+  ++                                 ++FS  +  QTDGQ ER
Subjt:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER

Query:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY
        T QT+E +LR         W  H+SL++  YNN    +  MTP+E ++
Subjt:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY

P0CT35 Transposon Tf2-2 polyprotein1.1e-2920.99Show/hide
Query:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT
        L++++G ++ +K+D +S YH ++VR+ D  K  FR   G +E+LVMP+G++  PA F   +N I     +  V+ ++DDIL++S+   +  +H+K VLQ 
Subjt:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT

Query:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE
        L+   L    +K EF   QV F+G+ +S    +   +  + +++ ++PK   +LR                             + ++ W     Q  + 
Subjt:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE

Query:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------
        +K+ +V+ P+L         L E+  +                         +++   L+  ++ K++                                
Subjt:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------

Query:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN
                 N R  RW   ++D++  I Y PG  +  +  ++               N    VN   +     +++V + + D  +  L+  E K    N
Subjt:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN

Query:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------
        +++K                    + I++K H     +HP                  + +I EYV  C  CQ  K    KP G L  +P          
Subjt:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------

Query:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER
              +PE+ +G + ++V VDR  K A  +P   + T ++ A+M+  ++                                 ++FS  +  QTDGQ ER
Subjt:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER

Query:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY
        T QT+E +LR         W  H+SL++  YNN    +  MTP+E ++
Subjt:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY

P0CT36 Transposon Tf2-3 polyprotein1.1e-2920.99Show/hide
Query:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT
        L++++G ++ +K+D +S YH ++VR+ D  K  FR   G +E+LVMP+G++  PA F   +N I     +  V+ ++DDIL++S+   +  +H+K VLQ 
Subjt:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT

Query:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE
        L+   L    +K EF   QV F+G+ +S    +   +  + +++ ++PK   +LR                             + ++ W     Q  + 
Subjt:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE

Query:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------
        +K+ +V+ P+L         L E+  +                         +++   L+  ++ K++                                
Subjt:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------

Query:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN
                 N R  RW   ++D++  I Y PG  +  +  ++               N    VN   +     +++V + + D  +  L+  E K    N
Subjt:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN

Query:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------
        +++K                    + I++K H     +HP                  + +I EYV  C  CQ  K    KP G L  +P          
Subjt:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------

Query:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER
              +PE+ +G + ++V VDR  K A  +P   + T ++ A+M+  ++                                 ++FS  +  QTDGQ ER
Subjt:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER

Query:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY
        T QT+E +LR         W  H+SL++  YNN    +  MTP+E ++
Subjt:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY

P0CT37 Transposon Tf2-4 polyprotein1.1e-2920.99Show/hide
Query:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT
        L++++G ++ +K+D +S YH ++VR+ D  K  FR   G +E+LVMP+G++  PA F   +N I     +  V+ ++DDIL++S+   +  +H+K VLQ 
Subjt:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT

Query:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE
        L+   L    +K EF   QV F+G+ +S    +   +  + +++ ++PK   +LR                             + ++ W     Q  + 
Subjt:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE

Query:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------
        +K+ +V+ P+L         L E+  +                         +++   L+  ++ K++                                
Subjt:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------

Query:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN
                 N R  RW   ++D++  I Y PG  +  +  ++               N    VN   +     +++V + + D  +  L+  E K    N
Subjt:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN

Query:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------
        +++K                    + I++K H     +HP                  + +I EYV  C  CQ  K    KP G L  +P          
Subjt:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------

Query:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER
              +PE+ +G + ++V VDR  K A  +P   + T ++ A+M+  ++                                 ++FS  +  QTDGQ ER
Subjt:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER

Query:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY
        T QT+E +LR         W  H+SL++  YNN    +  MTP+E ++
Subjt:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY

P0CT41 Transposon Tf2-12 polyprotein1.1e-2920.99Show/hide
Query:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT
        L++++G ++ +K+D +S YH ++VR+ D  K  FR   G +E+LVMP+G++  PA F   +N I     +  V+ ++DDIL++S+   +  +H+K VLQ 
Subjt:  LSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYEFLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQT

Query:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE
        L+   L    +K EF   QV F+G+ +S    +   +  + +++ ++PK   +LR                             + ++ W     Q  + 
Subjt:  LREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMIKLR-----------------------------NTKFGWNEKCEQKFQE

Query:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------
        +K+ +V+ P+L         L E+  +                         +++   L+  ++ K++                                
Subjt:  LKRRMVTAPIL--------ALPESGLT-------------------------RVTNPTLDLKLATKKL--------------------------------

Query:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN
                 N R  RW   ++D++  I Y PG  +  +  ++               N    VN   +     +++V + + D  +  L+  E K    N
Subjt:  ---------NMRQRRWLELIKDYDCTIEYHPGKNSATSLRVT---------------NGGTVVNTFPVEAKLVDEMVRKQSEDPVIKKLM-EEVKVQRRN

Query:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------
        +++K                    + I++K H     +HP                  + +I EYV  C  CQ  K    KP G L  +P          
Subjt:  MEVK--------------------QVILEKAHSLVYAMHP---------------SSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLP----------

Query:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER
              +PE+ +G + ++V VDR  K A  +P   + T ++ A+M+  ++                                 ++FS  +  QTDGQ ER
Subjt:  ------IPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRLAQMYIDKI----------------------------ALRTKLQFSTTFHLQTDGQLER

Query:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY
        T QT+E +LR         W  H+SL++  YNN    +  MTP+E ++
Subjt:  TIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGCGGTATTCATACTAGCAAGGTGGAAATTTCTCAAGCAACACATTTGCTTGAAGAATCTCACACATCTTCATACCTTTAATCAACACATTTGCTACCAAATTTC
CGTACTCGTGGACCTGCCATTTGCAGACGAGGATTCCAAAGTGTCCCAGATTGTCTTTGCTAATTTTTGCATCACAAATAGATCGAACTGTCAACTGAGTGAGCTCAGAG
GAGTTTCAGTGATTTCCAAGATAGACTCGAGATCAAGATATCATCAGCTGAAGGTTAGAGAATCAGACATACCTAAGCCGACTTTTAGAATAAGGTATGGAGATTATGAA
TTTCTGGTGATGCCATTTGGATTGACGAATGTACCAGCAGCATTCATGGATCTTATGAACAGGATATTCCATCCTTATTTCGACCAATTTGTTATTGTGTTTATTGATGA
CATATTGGTGTATTCTAGAGATAGGGAAAAGCGTGCTGAACATCTCAAGATTGTTTTGCAGACCTTAAGAGAAAAGAAGTTGTATGCTAAGTTCAGCAAGTATGAGTTTT
GGTTAGAACAGGTTGTGTTTTTAGGCCATGTGGTGTCAGTTGCAAAAGTTAGTGTAGATTCCCAAAAGGCTGAGGCTATAATGAAGTTGGAACGACCTAAGACCATGATA
AAGTTGAGGAATACTAAGTTCGGATGGAATGAGAAGTGCGAGCAAAAATTTCAAGAACTAAAGAGAAGAATGGTGACTGCACCTATCTTAGCACTTCCGGAGTCAGGCCT
CACGAGAGTAACTAATCCTACCCTTGATCTAAAGTTAGCTACAAAGAAACTTAACATGAGACAAAGGAGGTGGTTAGAGTTGATCAAAGATTATGACTGTACTATTGAAT
ATCACCCAGGAAAAAATAGTGCTACTAGTTTAAGAGTGACTAATGGTGGAACTGTTGTTAACACATTTCCAGTTGAGGCCAAGTTAGTTGATGAGATGGTAAGGAAGCAG
TCAGAAGATCCTGTGATTAAAAAGCTAATGGAGGAAGTAAAAGTCCAGCGAAGAAATATGGAAGTTAAACAAGTGATTCTTGAAAAGGCACATAGCTTAGTTTATGCTAT
GCATCCTAGTAGTACCAAGATAGAGATAGCTGAGTATGTAACAAGATGTTTAATTTGTCAGCAAGTAAAGCTTGAACGTCAAAAGCCAGTAGGGTCGTTGAATCTACTCC
CTATTCCTGAGACTCTAACTGGTGTTGACGGGGTTTGGGTAACTGTGGACAGGCTGAAAAAGACAGCACGTTGCTTACCAGTTAAGGCAACTTATACGTTGGATAGACTG
GCACAAATGTACATAGATAAGATTGCTTTGAGAACTAAGTTGCAGTTCAGTACAACTTTTCATCTTCAGACTGATGGACAATTAGAGAGGACAATCCAGACCTTAGAAGA
TATGCTCCGAGTTTGTGCTTTGCAGTTCAAAGGTTGCTGGGATGTACATTTGTCTTTAATGGAATTTGATTATAATAACAACTACCAGTTGAGCATAGGTATGACTCCAT
ACGAAGCATTATATTGCAGGCCATGCAGAACCAATGTGTTCGGGAGAGGTTGGAGAAAGGAAACTATTTGGACTAGAGATTGTGTAGATTACGACAGAGAAGGAGATTTG
AAAAAGGATCTTAGAGTGTCTAGGGTTGGGCCAGGAGGGTTTCTTAACGCCTTCCTTTTTCTTCTCATCGGAGTTATTTCACAAAGACTTGCCATGTACAACGGAGAGTT
GTATGCTGCGTTCGGGAAGGATGAATCGCTCCCGAAAAGGAATCTATTGATTCTCTCCCAATTGGTTGGACCTCCAGGATGGCCCAGCTACGCCAAGGAAAAGAATAAAA
GAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGTAAT
GGGGAAGAGGACCGAAACATGCCACTGAAAGACTCTACTGAGACAAAGATGGGCTGTCAAGAACGTAGAGGAGGTAGGATGGGCAGTTGGTCAGATCTAGTATGGATCGT
ACATGGACGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGAGCGAAATGCGGCAAA
AGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGACGCCTTCGCTGTCCGCTCTCCGGTTGGGCAGTAAGGGTCGGA
GAAGGGCAATCACTCATTCTTAAAACCAGCATTCTTAAGACCAAAGAGGCGGGCGGAAAAGGGGGGAAAGCTCTCCGTTCCTGGTTCTCCTTACCTTTTGAGATTTTGAG
AAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAG
GCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGC
AGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGG
CTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGTCAAC
CTTTCAAACTGCTGCTGAATCCATGGGCAGGCAAGAGACAACCTGGCGAACTGAAACATCTTAGTAGCCAGAGGAAAAGAAAGCAAAAGCGATTCCCCGTCGTGCTGCTA
GGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGTGGAATCCCGTGTGAA
TCAGCAAGGACCACCTTGCAAGGCTAAATACTCCTGGGTGACCGATAAACCCCCATCGGGGAGTGAAATAGAACATGAAACCGTAAGCTTCCAAGCAGTGGGAGGAGACC
AGGACTCTGACCGCGCAGTGGCTTGGTTAAGGGAACCCACCGGAGCCGTAGCGAAAGCGAGTCTTCATGGGGCAATTGTCACTGCTTATGGACCCGAACCTGGGGGTGAA
ATGCCACTCGAACCCAGAGCTAGCTGGTTCTCCCCGAAATGCGTTGAGGCGCAGCAGTTGACTGGACATCTAGGGGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAA
GCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCAGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGATGAACGGGGCTAAGCGATCTGCCGAAGCTGTGGGAT
GTAAAAATGCATCGGTAGGGGAGCGTTCCGCCTTAGAGGGAAGCACCTGCGCGAGCAGTGGTGGACGAAGCGGAAGCGAGAATGTCGGCTTGAGTAACGCAAACATTGGT
GAGAATCCAATGCCCCGAAAACCTAAGGGTTCCTCCGCAAGGTTCGTCCACGGAGGGCGTAGTCGATGGACAACAGGTGAATATTCCTGTACTACCCCTTGTTGGTCCCG
AGGGACGGAGGAGGCTAGGTTAGCCGAAAGATGGTTATCGGTTCAAGGACGCAAGGTGCCCCTGTTTTTTCAGGTAACCCATGCCATACTCCCAGGAAAAGCTCGAACGA
CCTTCAACAAAAGGGTACCTGTACCCGAAACCGACACAGGTGGCCCCGTAACTTCGGGAGAAGGGGTGCCTCCTCACAAAGGGGGTCGCAGTGACCAGGCCCGGGCGACT
TGCCGGAAGGTCAAGGAAGTTGGTGACCTGATGACAGGGGAGCCGGCGACCGAAGCCCCGGTGAACGGCGGCCGTAACTATAACGGTCCTAAGGTAGCGAAATTCCTTGT
CGGGCTCGGTGAAATAGACATGTCTGTGAAGATGCGGACTACCTGCACCTGGACAGAAAGACCCTATGAAGCTTCACTGTTCCCTGGGATTGGCTTTGGGCCTTTCCTGC
GCAGCTTAGTGAGATACCACTCTGGAAGAGCTAGAATTCTAACCTTGTGTCAGGACCTACGGGCCAAGGGACAGTCTCAGGTAACGGAGGCGTGCAAAGGTTTCCTCGGG
CCAGACGGAGATTGGCCCTCGAGTGCAAAGGCAGAAGGGAGCTTGACTGCAAGACCCACCCGTCGAGCAGGGACGAAAGTCGGCCTTAGTGATCCGACGGTGCCGAGTGG
AAGGGCCGTCGCTCAACGGATAAAAGTTTGGCACCTCGATGTCGGCTCTTCGCCACCTGGGGCTGTAGTATGTTCCAAGGGTTGGGCTGTTCGCCCATTAAAGCGGTACG
TGAGCTGGGTTCAGAACGTCGTGAGACAGTTCGGTCCATATCCGGACCGGGAAGGACGCACCTCTGGTGTACCAGTTATCGTGCCCACGGTAAACGCTGGCCCACCCCAA
GATGAGTGCTCTCCTATTCCGACTTCCCCAGAGCCTCCGGTAGCACAGCCGAGACAGCGATGGGTTCTCTGCCCCTGCGGGGATGGAGCGACAGAAGTTTTGAGAATTCA
AGAGAAGCTGAGGCATCCTAACAGACCGGTAGACTTGAACCTTGTTCCTACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGCGGTATTCATACTAGCAAGGTGGAAATTTCTCAAGCAACACATTTGCTTGAAGAATCTCACACATCTTCATACCTTTAATCAACACATTTGCTACCAAATTTC
CGTACTCGTGGACCTGCCATTTGCAGACGAGGATTCCAAAGTGTCCCAGATTGTCTTTGCTAATTTTTGCATCACAAATAGATCGAACTGTCAACTGAGTGAGCTCAGAG
GAGTTTCAGTGATTTCCAAGATAGACTCGAGATCAAGATATCATCAGCTGAAGGTTAGAGAATCAGACATACCTAAGCCGACTTTTAGAATAAGGTATGGAGATTATGAA
TTTCTGGTGATGCCATTTGGATTGACGAATGTACCAGCAGCATTCATGGATCTTATGAACAGGATATTCCATCCTTATTTCGACCAATTTGTTATTGTGTTTATTGATGA
CATATTGGTGTATTCTAGAGATAGGGAAAAGCGTGCTGAACATCTCAAGATTGTTTTGCAGACCTTAAGAGAAAAGAAGTTGTATGCTAAGTTCAGCAAGTATGAGTTTT
GGTTAGAACAGGTTGTGTTTTTAGGCCATGTGGTGTCAGTTGCAAAAGTTAGTGTAGATTCCCAAAAGGCTGAGGCTATAATGAAGTTGGAACGACCTAAGACCATGATA
AAGTTGAGGAATACTAAGTTCGGATGGAATGAGAAGTGCGAGCAAAAATTTCAAGAACTAAAGAGAAGAATGGTGACTGCACCTATCTTAGCACTTCCGGAGTCAGGCCT
CACGAGAGTAACTAATCCTACCCTTGATCTAAAGTTAGCTACAAAGAAACTTAACATGAGACAAAGGAGGTGGTTAGAGTTGATCAAAGATTATGACTGTACTATTGAAT
ATCACCCAGGAAAAAATAGTGCTACTAGTTTAAGAGTGACTAATGGTGGAACTGTTGTTAACACATTTCCAGTTGAGGCCAAGTTAGTTGATGAGATGGTAAGGAAGCAG
TCAGAAGATCCTGTGATTAAAAAGCTAATGGAGGAAGTAAAAGTCCAGCGAAGAAATATGGAAGTTAAACAAGTGATTCTTGAAAAGGCACATAGCTTAGTTTATGCTAT
GCATCCTAGTAGTACCAAGATAGAGATAGCTGAGTATGTAACAAGATGTTTAATTTGTCAGCAAGTAAAGCTTGAACGTCAAAAGCCAGTAGGGTCGTTGAATCTACTCC
CTATTCCTGAGACTCTAACTGGTGTTGACGGGGTTTGGGTAACTGTGGACAGGCTGAAAAAGACAGCACGTTGCTTACCAGTTAAGGCAACTTATACGTTGGATAGACTG
GCACAAATGTACATAGATAAGATTGCTTTGAGAACTAAGTTGCAGTTCAGTACAACTTTTCATCTTCAGACTGATGGACAATTAGAGAGGACAATCCAGACCTTAGAAGA
TATGCTCCGAGTTTGTGCTTTGCAGTTCAAAGGTTGCTGGGATGTACATTTGTCTTTAATGGAATTTGATTATAATAACAACTACCAGTTGAGCATAGGTATGACTCCAT
ACGAAGCATTATATTGCAGGCCATGCAGAACCAATGTGTTCGGGAGAGGTTGGAGAAAGGAAACTATTTGGACTAGAGATTGTGTAGATTACGACAGAGAAGGAGATTTG
AAAAAGGATCTTAGAGTGTCTAGGGTTGGGCCAGGAGGGTTTCTTAACGCCTTCCTTTTTCTTCTCATCGGAGTTATTTCACAAAGACTTGCCATGTACAACGGAGAGTT
GTATGCTGCGTTCGGGAAGGATGAATCGCTCCCGAAAAGGAATCTATTGATTCTCTCCCAATTGGTTGGACCTCCAGGATGGCCCAGCTACGCCAAGGAAAAGAATAAAA
GAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGTAAT
GGGGAAGAGGACCGAAACATGCCACTGAAAGACTCTACTGAGACAAAGATGGGCTGTCAAGAACGTAGAGGAGGTAGGATGGGCAGTTGGTCAGATCTAGTATGGATCGT
ACATGGACGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGAGCGAAATGCGGCAAA
AGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGACGCCTTCGCTGTCCGCTCTCCGGTTGGGCAGTAAGGGTCGGA
GAAGGGCAATCACTCATTCTTAAAACCAGCATTCTTAAGACCAAAGAGGCGGGCGGAAAAGGGGGGAAAGCTCTCCGTTCCTGGTTCTCCTTACCTTTTGAGATTTTGAG
AAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAG
GCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGC
AGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGG
CTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGTCAAC
CTTTCAAACTGCTGCTGAATCCATGGGCAGGCAAGAGACAACCTGGCGAACTGAAACATCTTAGTAGCCAGAGGAAAAGAAAGCAAAAGCGATTCCCCGTCGTGCTGCTA
GGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGTGGAATCCCGTGTGAA
TCAGCAAGGACCACCTTGCAAGGCTAAATACTCCTGGGTGACCGATAAACCCCCATCGGGGAGTGAAATAGAACATGAAACCGTAAGCTTCCAAGCAGTGGGAGGAGACC
AGGACTCTGACCGCGCAGTGGCTTGGTTAAGGGAACCCACCGGAGCCGTAGCGAAAGCGAGTCTTCATGGGGCAATTGTCACTGCTTATGGACCCGAACCTGGGGGTGAA
ATGCCACTCGAACCCAGAGCTAGCTGGTTCTCCCCGAAATGCGTTGAGGCGCAGCAGTTGACTGGACATCTAGGGGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAA
GCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCAGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGATGAACGGGGCTAAGCGATCTGCCGAAGCTGTGGGAT
GTAAAAATGCATCGGTAGGGGAGCGTTCCGCCTTAGAGGGAAGCACCTGCGCGAGCAGTGGTGGACGAAGCGGAAGCGAGAATGTCGGCTTGAGTAACGCAAACATTGGT
GAGAATCCAATGCCCCGAAAACCTAAGGGTTCCTCCGCAAGGTTCGTCCACGGAGGGCGTAGTCGATGGACAACAGGTGAATATTCCTGTACTACCCCTTGTTGGTCCCG
AGGGACGGAGGAGGCTAGGTTAGCCGAAAGATGGTTATCGGTTCAAGGACGCAAGGTGCCCCTGTTTTTTCAGGTAACCCATGCCATACTCCCAGGAAAAGCTCGAACGA
CCTTCAACAAAAGGGTACCTGTACCCGAAACCGACACAGGTGGCCCCGTAACTTCGGGAGAAGGGGTGCCTCCTCACAAAGGGGGTCGCAGTGACCAGGCCCGGGCGACT
TGCCGGAAGGTCAAGGAAGTTGGTGACCTGATGACAGGGGAGCCGGCGACCGAAGCCCCGGTGAACGGCGGCCGTAACTATAACGGTCCTAAGGTAGCGAAATTCCTTGT
CGGGCTCGGTGAAATAGACATGTCTGTGAAGATGCGGACTACCTGCACCTGGACAGAAAGACCCTATGAAGCTTCACTGTTCCCTGGGATTGGCTTTGGGCCTTTCCTGC
GCAGCTTAGTGAGATACCACTCTGGAAGAGCTAGAATTCTAACCTTGTGTCAGGACCTACGGGCCAAGGGACAGTCTCAGGTAACGGAGGCGTGCAAAGGTTTCCTCGGG
CCAGACGGAGATTGGCCCTCGAGTGCAAAGGCAGAAGGGAGCTTGACTGCAAGACCCACCCGTCGAGCAGGGACGAAAGTCGGCCTTAGTGATCCGACGGTGCCGAGTGG
AAGGGCCGTCGCTCAACGGATAAAAGTTTGGCACCTCGATGTCGGCTCTTCGCCACCTGGGGCTGTAGTATGTTCCAAGGGTTGGGCTGTTCGCCCATTAAAGCGGTACG
TGAGCTGGGTTCAGAACGTCGTGAGACAGTTCGGTCCATATCCGGACCGGGAAGGACGCACCTCTGGTGTACCAGTTATCGTGCCCACGGTAAACGCTGGCCCACCCCAA
GATGAGTGCTCTCCTATTCCGACTTCCCCAGAGCCTCCGGTAGCACAGCCGAGACAGCGATGGGTTCTCTGCCCCTGCGGGGATGGAGCGACAGAAGTTTTGAGAATTCA
AGAGAAGCTGAGGCATCCTAACAGACCGGTAGACTTGAACCTTGTTCCTACATGA
Protein sequenceShow/hide protein sequence
MVAVFILARWKFLKQHICLKNLTHLHTFNQHICYQISVLVDLPFADEDSKVSQIVFANFCITNRSNCQLSELRGVSVISKIDSRSRYHQLKVRESDIPKPTFRIRYGDYE
FLVMPFGLTNVPAAFMDLMNRIFHPYFDQFVIVFIDDILVYSRDREKRAEHLKIVLQTLREKKLYAKFSKYEFWLEQVVFLGHVVSVAKVSVDSQKAEAIMKLERPKTMI
KLRNTKFGWNEKCEQKFQELKRRMVTAPILALPESGLTRVTNPTLDLKLATKKLNMRQRRWLELIKDYDCTIEYHPGKNSATSLRVTNGGTVVNTFPVEAKLVDEMVRKQ
SEDPVIKKLMEEVKVQRRNMEVKQVILEKAHSLVYAMHPSSTKIEIAEYVTRCLICQQVKLERQKPVGSLNLLPIPETLTGVDGVWVTVDRLKKTARCLPVKATYTLDRL
AQMYIDKIALRTKLQFSTTFHLQTDGQLERTIQTLEDMLRVCALQFKGCWDVHLSLMEFDYNNNYQLSIGMTPYEALYCRPCRTNVFGRGWRKETIWTRDCVDYDREGDL
KKDLRVSRVGPGGFLNAFLFLLIGVISQRLAMYNGELYAAFGKDESLPKRNLLILSQLVGPPGWPSYAKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGN
GEEDRNMPLKDSTETKMGCQERRGGRMGSWSDLVWIVHGRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRCPLSGWAVRVG
EGQSLILKTSILKTKEAGGKGGKALRSWFSLPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFG
SSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFPVVLL
GEAVECCTLDGESPVAESITSLRSDPSSMGHVESRVNQQGPPCKAKYSWVTDKPPSGSEIEHETVSFQAVGGDQDSDRAVAWLREPTGAVAKASLHGAIVTAYGPEPGGE
MPLEPRASWFSPKCVEAQQLTGHLGGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEMNGAKRSAEAVGCKNASVGERSALEGSTCASSGGRSGSENVGLSNANIG
ENPMPRKPKGSSARFVHGGRSRWTTGEYSCTTPCWSRGTEEARLAERWLSVQGRKVPLFFQVTHAILPGKARTTFNKRVPVPETDTGGPVTSGEGVPPHKGGRSDQARAT
CRKVKEVGDLMTGEPATEAPVNGGRNYNGPKVAKFLVGLGEIDMSVKMRTTCTWTERPYEASLFPGIGFGPFLRSLVRYHSGRARILTLCQDLRAKGQSQVTEACKGFLG
PDGDWPSSAKAEGSLTARPTRRAGTKVGLSDPTVPSGRAVAQRIKVWHLDVGSSPPGAVVCSKGWAVRPLKRYVSWVQNVVRQFGPYPDREGRTSGVPVIVPTVNAGPPQ
DECSPIPTSPEPPVAQPRQRWVLCPCGDGATEVLRIQEKLRHPNRPVDLNLVPT