CuGenDBv2

Spg021545 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg021545
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: tetratricopeptide repeat protein SKI3
Genome location: scaffold9:7145537..7164196
RNA-Seq Expression: Spg021545
Synteny: Spg021545
Gene Ontology terms:
  GO:0034427 - nuclear-transcribed mRNA catabolic process, exonucleolytic, 3'-5' (biological process)
  GO:0055087 - Ski complex (cellular component)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
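
The genome location in this record is written as a scaffold name followed by a start..end range. As a minimal sketch, the snippet below splits that string and computes the length of the gene span. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("scaffold9:7145537..7164196")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(seqid, span)  # scaffold9 18660
```

Under that assumption the gene spans 18,660 bp on scaffold9, considerably longer than the 2,712-nt CDS listed in the Sequences section of this record.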


Homology
GenBank top hits    e value    % identity    Alignment
KAG6591181.1 Tetratricopeptide repeat protein SKI3, partial [Cucurbita argyrosperma subsp. sororia]    5.4e-180    73.77
Query:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY
        T  CLG + G++ +       P+  +M     RNLLGYLLLSNEERDD+HTATRCCNM+YGFDQQNKG KSAYEIHGAGAV+CYTIGTSHPR SFPTCSY
Subjt:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY

Query:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK
        QCQNGIGTIRQLQK                          CLRQ+PWNYDARYLLI N LQKAREERFPCHL   IERLILVAFSNEPYF +D SHQYKK
Subjt:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK

Query:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK
        FQLLLCASEIS Q   QIKCINYAKAASSISLPD+YL  AHLLLCRAYAAENDS NL  EFI+CLDLKT+NYLGWVCLKFIASRYELHVESN+LELSFKK
Subjt:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK

Query:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL
         +VESK+LQHM++PM  LVDGLISFWSQDFMAAEKYFAQACS GHDD CLLLCHG                      V CMELAKQLCS HFL LAVNSL
Subjt:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL

Query:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL
        LKAQVISVVPIP+VSI LAQAEGSLGLKENWES LRFEWFSWPP +
Subjt:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL

KAG7024067.1 Tetratricopeptide repeat protein SKI3 [Cucurbita argyrosperma subsp. argyrosperma]    1.0e-178    73.54
Query:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY
        T  CLG   G++ +       P+  +M     RNLLGYLLLSNEERDD+HTATRCCNM+YGFDQQNKG KSAYEIHGAGAV+CYTIGTSHPR SFPTCS 
Subjt:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY

Query:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK
        QCQNGIGTIRQLQK                          CLRQ+PWNYDARYLLI N LQKAREERFPCHL   IERLILVAFSNEPYF +D SHQYKK
Subjt:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK

Query:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK
        FQLLLCASEIS Q   QIKCINYAKAASSISLPD+YL  AHLLLCRAYAAENDS NL  EFI+CLDLKT+NYLGWVCLKFIASRYELHVESN+LELSFKK
Subjt:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK

Query:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL
         +VESK+LQHM++PM  LVDGLISFWSQDFMAAEKYFAQACS GHDD CLLLCHG                      V CMELAKQLCS HFL LAVNSL
Subjt:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL

Query:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL
        LKAQVISVVPIP+VSI LAQAEGSLGLKENWES LRFEWFSWPP +
Subjt:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL

XP_022936099.1 tetratricopeptide repeat protein SKI3 isoform X1 [Cucurbita moschata]    5.4e-180    73.99
Query:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY
        T  CLG   G++ +       P+  +M     RNLLGYLLLSNEERDD+HTATRCCNM+YGFDQQNKG KSAYEIHGAGAVACYTIGTSHPR SFPTCSY
Subjt:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY

Query:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK
        QCQNGIGTIRQLQK                          CLRQ+PWNYDARYLLI N LQKAREERFPCHL   IERLILVAFSNEPYF +D SHQYKK
Subjt:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK

Query:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK
        FQLLLCASEIS Q   QIKCINYAKAASSISLPD+YL  AHLLLCRAYAAENDS NL  EFI+CLDLKT+NYLGWVCLKFIASRYELHVESN+LELSFKK
Subjt:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK

Query:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL
         SVESK+LQHM++P+  LVDGLISFWSQDFMAAEKYFAQACS GHDD CLLLCHG                      V CMELAKQLCS HFL LAVNSL
Subjt:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL

Query:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL
        LKAQVISVVPIP+VSI LAQAEGSLGLKENWES LRFEWFSWPP +
Subjt:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL

XP_022936100.1 tetratricopeptide repeat protein SKI3 isoform X2 [Cucurbita moschata]    5.4e-180    73.99
Query:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY
        T  CLG   G++ +       P+  +M     RNLLGYLLLSNEERDD+HTATRCCNM+YGFDQQNKG KSAYEIHGAGAVACYTIGTSHPR SFPTCSY
Subjt:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY

Query:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK
        QCQNGIGTIRQLQK                          CLRQ+PWNYDARYLLI N LQKAREERFPCHL   IERLILVAFSNEPYF +D SHQYKK
Subjt:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK

Query:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK
        FQLLLCASEIS Q   QIKCINYAKAASSISLPD+YL  AHLLLCRAYAAENDS NL  EFI+CLDLKT+NYLGWVCLKFIASRYELHVESN+LELSFKK
Subjt:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK

Query:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL
         SVESK+LQHM++P+  LVDGLISFWSQDFMAAEKYFAQACS GHDD CLLLCHG                      V CMELAKQLCS HFL LAVNSL
Subjt:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL

Query:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL
        LKAQVISVVPIP+VSI LAQAEGSLGLKENWES LRFEWFSWPP +
Subjt:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL

XP_022975311.1 tetratricopeptide repeat protein SKI3 isoform X1 [Cucurbita maxima]    1.8e-175    72.65
Query:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY
        T  CLG   G++ +       P+  +M     RNLLGYLLLSNEERDD+HTATRCCNM+YGFDQQNK  KSAYEIHGAGAVACYTIGTSHPR SFPTCSY
Subjt:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY

Query:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK
        QCQNGIGTIRQLQK                          CLRQ+PWNYDARYLLI N LQKAREERFPCHL   +ERLILVAFSNEPYF +D SHQYKK
Subjt:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK

Query:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK
        FQLLLCASEIS Q   QIKCINYAKAASSISL D+YL  AHLLLCRAYAAENDS NL  EFI+CLDLKT+NYLGWVCLKFIASRYELHVESN+LELSFKK
Subjt:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK

Query:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL
         SVE K+LQH+++P+ SLVDGLISFWSQDFMAAEKYFAQACS G DD CLLLCHG                      V CM LAKQLCS HFL LAVNSL
Subjt:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL

Query:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL
        LKAQVISVVPIP+VSI LAQAEGSLGLKENWES LRFEWFSWPP +
Subjt:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL

TrEMBL top hits    e value    % identity    Alignment
A0A6J1DZ84 tetratricopeptide repeat protein SKI3 isoform X1    3.5e-172    74.29
Query:  LRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSYQCQNGIGTIRQLQKYASKKLGKRSQGDEKRL
        +RNLLGYLLLSNEERDDSH+ATRCCNM+YGFD+QN G KSAYEIHGAGAVACYTIGTSHPR SFPTCSYQCQNGIG IR+LQK                 
Subjt:  LRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSYQCQNGIGTIRQLQKYASKKLGKRSQGDEKRL

Query:  AKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKKFQLLLCASEISFQRGSQIKCINYAKAASSIS
                 CL QEPWNYDARYLLI NFLQKAREERFP HL V IERLI VAFSNE Y K+D S+QYKKFQLLLCASEISFQ GSQIKCINYA+AASSIS
Subjt:  AKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKKFQLLLCASEISFQRGSQIKCINYAKAASSIS

Query:  LPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKKWSVESKNLQHMLLPMFSLVDGLISFWSQDFM
        LPD+YL CAHLLLCRAYAAENDS NL +EFI+CLDL+T+NYLGWVCLKFIASRYELH ESN LE SFK+   E +NLQHM +PMFS+VDGL SFWSQDFM
Subjt:  LPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKKWSVESKNLQHMLLPMFSLVDGLISFWSQDFM

Query:  AAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSLLKAQVISVVPIPIVSIILAQAEGSLGLKENW
         AEK+FAQACSLG DD C LLCHG                      V CMELA+QLCSSHFLMLAVNSLLKAQVISVVP+PIVSI+LAQAEGSLGLKE W
Subjt:  AAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSLLKAQVISVVPIPIVSIILAQAEGSLGLKENW

Query:  ESYLRFEWFSWPPGLSIYEV
        ES LRFEWFSWPP +   E+
Subjt:  ESYLRFEWFSWPPGLSIYEV

A0A6J1F7C5 tetratricopeptide repeat protein SKI3 isoform X1    2.6e-180    73.99
Query:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY
        T  CLG   G++ +       P+  +M     RNLLGYLLLSNEERDD+HTATRCCNM+YGFDQQNKG KSAYEIHGAGAVACYTIGTSHPR SFPTCSY
Subjt:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY

Query:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK
        QCQNGIGTIRQLQK                          CLRQ+PWNYDARYLLI N LQKAREERFPCHL   IERLILVAFSNEPYF +D SHQYKK
Subjt:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK

Query:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK
        FQLLLCASEIS Q   QIKCINYAKAASSISLPD+YL  AHLLLCRAYAAENDS NL  EFI+CLDLKT+NYLGWVCLKFIASRYELHVESN+LELSFKK
Subjt:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK

Query:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL
         SVESK+LQHM++P+  LVDGLISFWSQDFMAAEKYFAQACS GHDD CLLLCHG                      V CMELAKQLCS HFL LAVNSL
Subjt:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL

Query:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL
        LKAQVISVVPIP+VSI LAQAEGSLGLKENWES LRFEWFSWPP +
Subjt:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL

A0A6J1FCN2 tetratricopeptide repeat protein SKI3 isoform X2    2.6e-180    73.99
Query:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY
        T  CLG   G++ +       P+  +M     RNLLGYLLLSNEERDD+HTATRCCNM+YGFDQQNKG KSAYEIHGAGAVACYTIGTSHPR SFPTCSY
Subjt:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY

Query:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK
        QCQNGIGTIRQLQK                          CLRQ+PWNYDARYLLI N LQKAREERFPCHL   IERLILVAFSNEPYF +D SHQYKK
Subjt:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK

Query:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK
        FQLLLCASEIS Q   QIKCINYAKAASSISLPD+YL  AHLLLCRAYAAENDS NL  EFI+CLDLKT+NYLGWVCLKFIASRYELHVESN+LELSFKK
Subjt:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK

Query:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL
         SVESK+LQHM++P+  LVDGLISFWSQDFMAAEKYFAQACS GHDD CLLLCHG                      V CMELAKQLCS HFL LAVNSL
Subjt:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL

Query:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL
        LKAQVISVVPIP+VSI LAQAEGSLGLKENWES LRFEWFSWPP +
Subjt:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL

A0A6J1IDT6 tetratricopeptide repeat protein SKI3 isoform X1    8.8e-176    72.65
Query:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY
        T  CLG   G++ +       P+  +M     RNLLGYLLLSNEERDD+HTATRCCNM+YGFDQQNK  KSAYEIHGAGAVACYTIGTSHPR SFPTCSY
Subjt:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY

Query:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK
        QCQNGIGTIRQLQK                          CLRQ+PWNYDARYLLI N LQKAREERFPCHL   +ERLILVAFSNEPYF +D SHQYKK
Subjt:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK

Query:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK
        FQLLLCASEIS Q   QIKCINYAKAASSISL D+YL  AHLLLCRAYAAENDS NL  EFI+CLDLKT+NYLGWVCLKFIASRYELHVESN+LELSFKK
Subjt:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK

Query:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL
         SVE K+LQH+++P+ SLVDGLISFWSQDFMAAEKYFAQACS G DD CLLLCHG                      V CM LAKQLCS HFL LAVNSL
Subjt:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL

Query:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL
        LKAQVISVVPIP+VSI LAQAEGSLGLKENWES LRFEWFSWPP +
Subjt:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL

A0A6J1IGE1 tetratricopeptide repeat protein SKI3 isoform X2    8.8e-176    72.65
Query:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY
        T  CLG   G++ +       P+  +M     RNLLGYLLLSNEERDD+HTATRCCNM+YGFDQQNK  KSAYEIHGAGAVACYTIGTSHPR SFPTCSY
Subjt:  TIECLGGERGILKM-----CGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSY

Query:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK
        QCQNGIGTIRQLQK                          CLRQ+PWNYDARYLLI N LQKAREERFPCHL   +ERLILVAFSNEPYF +D SHQYKK
Subjt:  QCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKK

Query:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK
        FQLLLCASEIS Q   QIKCINYAKAASSISL D+YL  AHLLLCRAYAAENDS NL  EFI+CLDLKT+NYLGWVCLKFIASRYELHVESN+LELSFKK
Subjt:  FQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKK

Query:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL
         SVE K+LQH+++P+ SLVDGLISFWSQDFMAAEKYFAQACS G DD CLLLCHG                      V CM LAKQLCS HFL LAVNSL
Subjt:  WSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSL

Query:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL
        LKAQVISVVPIP+VSI LAQAEGSLGLKENWES LRFEWFSWPP +
Subjt:  LKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGL

SwissProt top hits    e value    % identity    Alignment
F4I3Z5 Tetratricopeptide repeat protein SKI3    1.7e-91    44.3
Query:  GTIECLGGERGILKMCGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSYQCQN
        G  + LG E+GI  +            +RNLLGY+LL+ E   D+ TA+RCC +        +G KSA E+ G G+VAC  IG + PR SFPTC  Q  N
Subjt:  GTIECLGGERGILKMCGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSYQCQN

Query:  GIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKKFQLL
            + +LQ++                          L QEP N   RYLLI N +QKARE+RFP  L   IERLI VA S+E   K+    +YKKFQLL
Subjt:  GIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKKFQLL

Query:  LCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKKWSVE
        LCASEIS Q G+  + IN+A+ ASS+SLP  YL   HL LCRAYAA   + N+ +E+  CL+LKT++ +GW+CLK I S+Y L  ++N LE+S ++ S +
Subjt:  LCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKKWSVE

Query:  SKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSLLKAQ
         KN     + ++SL  GL S   +DF +AE++ AQACSL + ++CLLLCHG +                      CMELA+Q   S FL LAV SL K Q
Subjt:  SKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSLLKAQ

Query:  VISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGLSIYEV
          S+ P+PIV  +LAQA GSLG KE WE  LR EWF WPP +   EV
Subjt:  VISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGLSIYEV

Arabidopsis top hits    e value    % identity    Alignment
AT1G76630.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.2e-92    44.3
Query:  GTIECLGGERGILKMCGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSYQCQN
        G  + LG E+GI  +            +RNLLGY+LL+ E   D+ TA+RCC +        +G KSA E+ G G+VAC  IG + PR SFPTC  Q  N
Subjt:  GTIECLGGERGILKMCGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSYQCQN

Query:  GIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKKFQLL
            + +LQ++                          L QEP N   RYLLI N +QKARE+RFP  L   IERLI VA S+E   K+    +YKKFQLL
Subjt:  GIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKKFQLL

Query:  LCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKKWSVE
        LCASEIS Q G+  + IN+A+ ASS+SLP  YL   HL LCRAYAA   + N+ +E+  CL+LKT++ +GW+CLK I S+Y L  ++N LE+S ++ S +
Subjt:  LCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKKWSVE

Query:  SKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSLLKAQ
         KN     + ++SL  GL S   +DF +AE++ AQACSL + ++CLLLCHG +                      CMELA+Q   S FL LAV SL K Q
Subjt:  SKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSLLKAQ

Query:  VISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGLSIYEV
          S+ P+PIV  +LAQA GSLG KE WE  LR EWF WPP +   EV
Subjt:  VISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGLSIYEV

AT1G76630.2 Tetratricopeptide repeat (TPR)-like superfamily protein    1.2e-92    44.3
Query:  GTIECLGGERGILKMCGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSYQCQN
        G  + LG E+GI  +            +RNLLGY+LL+ E   D+ TA+RCC +        +G KSA E+ G G+VAC  IG + PR SFPTC  Q  N
Subjt:  GTIECLGGERGILKMCGPSLDIMFLFGLRNLLGYLLLSNEERDDSHTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSYQCQN

Query:  GIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKKFQLL
            + +LQ++                          L QEP N   RYLLI N +QKARE+RFP  L   IERLI VA S+E   K+    +YKKFQLL
Subjt:  GIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNFLQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKKFQLL

Query:  LCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKKWSVE
        LCASEIS Q G+  + IN+A+ ASS+SLP  YL   HL LCRAYAA   + N+ +E+  CL+LKT++ +GW+CLK I S+Y L  ++N LE+S ++ S +
Subjt:  LCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKTNNYLGWVCLKFIASRYELHVESNTLELSFKKWSVE

Query:  SKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSLLKAQ
         KN     + ++SL  GL S   +DF +AE++ AQACSL + ++CLLLCHG +                      CMELA+Q   S FL LAV SL K Q
Subjt:  SKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVACMELAKQLCSSHFLMLAVNSLLKAQ

Query:  VISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGLSIYEV
          S+ P+PIV  +LAQA GSLG KE WE  LR EWF WPP +   EV
Subjt:  VISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGLSIYEV

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    2.2e-09    27.75
Query:  INPSPVRESVFSCLW-KVKAPKKVLFFAWQVILGRVNTCDRLSRVKAPLVGPFCCILCRKAEENLDHLLWDCEFARSVWSLFFKVFEFQFASQRHCRDLI
        +NP   R   F  +W K K PK   F AW  +  R++T DR+  +    + P  C+ C   +E   HL +DCEFAR VW          F S+ H     
Subjt:  INPSPVRESVFSCLW-KVKAPKKVLFFAWQVILGRVNTCDRLSRVKAPLVGPFCCILCRKAEENLDHLLWDCEFARSVWSLFFKVFEFQFASQRHCRDLI

Query:  EEFLLHPPFRERGKLLWHVGVC--------------AILWGLWGERNNRTFRGVERHPC----EVWDTIRFHV
            + PP      + W    C              A ++ +W ERN R      R       E+   IR H+
Subjt:  EEFLLHPPFRERGKLLWHVGVC--------------AILWGLWGERNNRTFRGVERHPC----EVWDTIRFHV

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    4.1e-08    26.51
Query:  DMRFWSLD---PSAGFSCRSFFQFLINPSPVRESVFSCLWKVKAPKKVLFFAWQVILGRVNTCDRLSRVKAPLVGPFCCILCRKAEENLDHLLWDCEFAR
        D   W  D   PS  FS    +  L +P          +W      K  F  W V   R++T DRL      +  P  C+LC   +++  HL ++C+F+ 
Subjt:  DMRFWSLD---PSAGFSCRSFFQFLINPSPVRESVFSCLWKVKAPKKVLFFAWQVILGRVNTCDRLSRVKAPLVGPFCCILCRKAEENLDHLLWDCEFAR

Query:  SVWSLFFKVFEFQFASQRHCRDLIEEFLLHPPFRERGKLLWHVGVCAILWGLWGERNNRTFRGVER
         VW  F         +Q    D +  +LL P   +   L+  +   + ++ +W ERN R   GV R
Subjt:  SVWSLFFKVFEFQFASQRHCRDLIEEFLLHPPFRERGKLLWHVGVCAILWGLWGERNNRTFRGVER

AT4G29090.1 Ribonuclease H-like superfamily protein    4.1e-08    25.12
Query:  LEGVILRSGRR--DMRFWSLDPSAGFSCRSFFQFLI-------NPSPVRE----SVFSCLWKVKAPKKVLFFAWQVILGRVNTCDRLSRVKAPLVGPFCC
        L G +   GRR  D   W    S  ++ +S +  L        +P  V E     ++  +WK +   K+  F W+ +   +     L+     L     C
Subjt:  LEGVILRSGRR--DMRFWSLDPSAGFSCRSFFQFLI-------NPSPVRE----SVFSCLWKVKAPKKVLFFAWQVILGRVNTCDRLSRVKAPLVGPFCC

Query:  ILCRKAEENLDHLLWDCEFARSVWSLFFKVFEF--QFASQRHCRDLIEEFLL---HPPFRERGKLL-WHVGVCAILWGLWGERNNRTFRGVERHPCEVWD
        I C   +E ++HLL+ C FAR  W++         ++A   +  +L   F L   +P + +  +L+ W      +LW LW  RN   FRG E +  EV  
Subjt:  ILCRKAEENLDHLLWDCEFARSVWSLFFKVFEF--QFASQRHCRDLIEEFLL---HPPFRERGKLL-WHVGVCAILWGLWGERNNRTFRGVERHPCEVWD

Query:  TIRFHVSLWASVTKA
             +  W   T+A
Subjt:  TIRFHVSLWASVTKA


Sequences
CDS sequence
ATGACCCTCTTATCTCTGCTTGAGGGGGTTATCCTTAGATCTGGCAGGAGGGATATGCGTTTTTGGAGTCTCGATCCCTCTGCGGGCTTCTCCTGCAGATCGTTCTTTCA
GTTCCTGATCAATCCCTCCCCTGTTAGAGAATCTGTTTTTTCGTGTCTGTGGAAGGTTAAAGCTCCGAAGAAAGTCTTATTCTTTGCTTGGCAGGTCATCTTAGGTCGTG
TTAACACTTGTGATAGGCTCTCGAGAGTGAAGGCTCCTTTAGTTGGCCCTTTCTGTTGTATTCTTTGTCGGAAGGCTGAGGAAAATCTTGATCACTTGTTATGGGATTGT
GAGTTTGCTCGCTCTGTCTGGAGCCTTTTTTTCAAGGTCTTCGAGTTTCAGTTTGCGAGCCAGAGGCATTGTAGGGATTTGATCGAGGAGTTCCTCCTCCATCCGCCTTT
TCGGGAGAGGGGAAAGTTACTTTGGCATGTGGGCGTGTGTGCTATTTTGTGGGGTTTATGGGGGGAAAGGAACAATAGAACTTTTAGAGGAGTTGAGAGACATCCTTGTG
AGGTTTGGGATACTATTAGATTTCATGTTTCCTTATGGGCCTCGGTTACCAAGGCTTTTTGGAGGTCTCCTTCGTTTTCTTTTGGCTTCAGTCATCCATTGTCCAATAGG
GAAACGACGGATGTTATGTCCCTTATGTCTTTGATTGAGGAGTTTGATTTCAGGGTGAGGAGGAGGGATTTCCGTTGTTGGAACCCTAGTCCTTCTGACGGGTTCTCTTG
CAGCTCCCTATTCCGTTGGCTCTTGCACCCTTCTCCCCAAAGTGAGTCCATTTTTTCGCGTGTGTGGAAGGTGAAAGTCCCGAAGAAGATTAGGTTTTTTATGTGGCAAG
TTATCCATGGCAGGGTTAATACCTACGATCGTCTGATGAAAAAGGATGCCCTCTTTGGTTGGTTCGTTTTGCTGCATTCTGTGTCGAAAGGTGGAGGAAGATCTGGATCA
CATTCTGTGGAGTTATGCCTATGCGCGTGCAGTGTGGGATCGGTTTGGTCAGGCTTTTGGGTTGTAGGGGATTTCTTTTGTTCACCACAGGGAGATGATTCGAGGAGTTC
CTCCTCCATCCGCCTTTTCGTGAGAGAGCGAGATTTTTGTGGCTTGGGGGGATGTGTGCATTGTTGTGGAACCTTTGGGGTGAAAGGAACAATAGAGTGTTTAGGGGGAG
AAAGAGGGATCCTGAAGATGTGTGGTCCCTCACTCGATATCATGTTTCTCTTTGGGCTTCGGAACCTGCTTGGTTATCTTTTGCTATCCAACGAAGAAAGGGATGACAGT
CACACAGCTACTAGGTGCTGCAACATGATGTATGGTTTTGACCAACAAAACAAAGGTCCGAAATCTGCATATGAAATTCATGGTGCTGGAGCTGTGGCCTGCTATACAAT
TGGCACCAGTCATCCAAGGATTTCTTTCCCAACATGTTCATATCAGTGCCAGAATGGAATTGGGACCATCCGACAACTTCAAAAGTACGCATCGAAAAAATTGGGAAAGA
GAAGTCAAGGGGATGAGAAACGATTGGCAAAATCGAGAAAGAGAAGGCAAAGATGCTTACGTCAAGAGCCATGGAATTATGATGCTCGATATCTTCTTATACAAAACTTT
CTGCAGAAGGCACGTGAAGAGAGATTTCCTTGTCATCTAGGTGTAATTATTGAGCGGCTAATCTTGGTTGCCTTTTCCAATGAACCATATTTTAAGCAAGATATGTCTCA
TCAATATAAAAAGTTTCAGCTGCTACTGTGTGCATCTGAGATCAGTTTCCAACGCGGTAGCCAAATTAAATGTATCAACTATGCCAAAGCTGCTTCTTCTATTTCACTTC
CTGATGATTATCTTATTTGTGCACACTTGTTACTGTGTCGAGCCTATGCTGCAGAAAATGATTCGATCAACCTCCACAAAGAGTTCATAAGATGTTTGGATTTAAAGACA
AATAACTATCTTGGTTGGGTATGTCTTAAATTCATTGCATCTCGATATGAGCTTCATGTTGAATCCAATACCTTAGAACTTAGTTTCAAGAAATGGTCAGTAGAGAGCAA
GAATCTGCAACACATGTTACTACCCATGTTTAGTCTGGTGGATGGTTTGATATCTTTTTGGAGCCAGGATTTTATGGCTGCTGAGAAGTATTTTGCACAAGCTTGTTCTT
TGGGACATGACGATGCCTGTCTCCTCCTCTGTCATGGGATTATCTCTTTTCCTTGTAAGAGTTTCTTCATTAATAAAACTCTTTTCCCTCACTTACACGTGTTAGTAGCC
TGCATGGAACTTGCAAAGCAGCTTTGCAGTTCTCATTTCTTGATGCTGGCTGTGAACAGTCTCCTTAAAGCTCAAGTTATTTCTGTTGTTCCAATACCAATTGTCTCGAT
CATACTGGCTCAAGCAGAAGGGAGCCTTGGTTTGAAAGAAAATTGGGAGTCATATCTTCGTTTTGAATGGTTCTCGTGGCCCCCAGGTTTATCTATATATGAGGTCTGCG
GAGCTCTTGTTTCAAATGCATCTCCTTGCAAAACAGTCGAAGATACTGGAAGGTGCTGCAAAGCTTGTGGAATGAGGCCTGACATTTTTGTGCCATACCAGACTTCAACT
GTTGGTCGTGTGATATCCAGTTCTCGGAATTCTCTTGTCAATCGCCCTGATTTTGTCTCCAAGTTTCTGTAG
mRNA sequence
ATGACCCTCTTATCTCTGCTTGAGGGGGTTATCCTTAGATCTGGCAGGAGGGATATGCGTTTTTGGAGTCTCGATCCCTCTGCGGGCTTCTCCTGCAGATCGTTCTTTCA
GTTCCTGATCAATCCCTCCCCTGTTAGAGAATCTGTTTTTTCGTGTCTGTGGAAGGTTAAAGCTCCGAAGAAAGTCTTATTCTTTGCTTGGCAGGTCATCTTAGGTCGTG
TTAACACTTGTGATAGGCTCTCGAGAGTGAAGGCTCCTTTAGTTGGCCCTTTCTGTTGTATTCTTTGTCGGAAGGCTGAGGAAAATCTTGATCACTTGTTATGGGATTGT
GAGTTTGCTCGCTCTGTCTGGAGCCTTTTTTTCAAGGTCTTCGAGTTTCAGTTTGCGAGCCAGAGGCATTGTAGGGATTTGATCGAGGAGTTCCTCCTCCATCCGCCTTT
TCGGGAGAGGGGAAAGTTACTTTGGCATGTGGGCGTGTGTGCTATTTTGTGGGGTTTATGGGGGGAAAGGAACAATAGAACTTTTAGAGGAGTTGAGAGACATCCTTGTG
AGGTTTGGGATACTATTAGATTTCATGTTTCCTTATGGGCCTCGGTTACCAAGGCTTTTTGGAGGTCTCCTTCGTTTTCTTTTGGCTTCAGTCATCCATTGTCCAATAGG
GAAACGACGGATGTTATGTCCCTTATGTCTTTGATTGAGGAGTTTGATTTCAGGGTGAGGAGGAGGGATTTCCGTTGTTGGAACCCTAGTCCTTCTGACGGGTTCTCTTG
CAGCTCCCTATTCCGTTGGCTCTTGCACCCTTCTCCCCAAAGTGAGTCCATTTTTTCGCGTGTGTGGAAGGTGAAAGTCCCGAAGAAGATTAGGTTTTTTATGTGGCAAG
TTATCCATGGCAGGGTTAATACCTACGATCGTCTGATGAAAAAGGATGCCCTCTTTGGTTGGTTCGTTTTGCTGCATTCTGTGTCGAAAGGTGGAGGAAGATCTGGATCA
CATTCTGTGGAGTTATGCCTATGCGCGTGCAGTGTGGGATCGGTTTGGTCAGGCTTTTGGGTTGTAGGGGATTTCTTTTGTTCACCACAGGGAGATGATTCGAGGAGTTC
CTCCTCCATCCGCCTTTTCGTGAGAGAGCGAGATTTTTGTGGCTTGGGGGGATGTGTGCATTGTTGTGGAACCTTTGGGGTGAAAGGAACAATAGAGTGTTTAGGGGGAG
AAAGAGGGATCCTGAAGATGTGTGGTCCCTCACTCGATATCATGTTTCTCTTTGGGCTTCGGAACCTGCTTGGTTATCTTTTGCTATCCAACGAAGAAAGGGATGACAGT
CACACAGCTACTAGGTGCTGCAACATGATGTATGGTTTTGACCAACAAAACAAAGGTCCGAAATCTGCATATGAAATTCATGGTGCTGGAGCTGTGGCCTGCTATACAAT
TGGCACCAGTCATCCAAGGATTTCTTTCCCAACATGTTCATATCAGTGCCAGAATGGAATTGGGACCATCCGACAACTTCAAAAGTACGCATCGAAAAAATTGGGAAAGA
GAAGTCAAGGGGATGAGAAACGATTGGCAAAATCGAGAAAGAGAAGGCAAAGATGCTTACGTCAAGAGCCATGGAATTATGATGCTCGATATCTTCTTATACAAAACTTT
CTGCAGAAGGCACGTGAAGAGAGATTTCCTTGTCATCTAGGTGTAATTATTGAGCGGCTAATCTTGGTTGCCTTTTCCAATGAACCATATTTTAAGCAAGATATGTCTCA
TCAATATAAAAAGTTTCAGCTGCTACTGTGTGCATCTGAGATCAGTTTCCAACGCGGTAGCCAAATTAAATGTATCAACTATGCCAAAGCTGCTTCTTCTATTTCACTTC
CTGATGATTATCTTATTTGTGCACACTTGTTACTGTGTCGAGCCTATGCTGCAGAAAATGATTCGATCAACCTCCACAAAGAGTTCATAAGATGTTTGGATTTAAAGACA
AATAACTATCTTGGTTGGGTATGTCTTAAATTCATTGCATCTCGATATGAGCTTCATGTTGAATCCAATACCTTAGAACTTAGTTTCAAGAAATGGTCAGTAGAGAGCAA
GAATCTGCAACACATGTTACTACCCATGTTTAGTCTGGTGGATGGTTTGATATCTTTTTGGAGCCAGGATTTTATGGCTGCTGAGAAGTATTTTGCACAAGCTTGTTCTT
TGGGACATGACGATGCCTGTCTCCTCCTCTGTCATGGGATTATCTCTTTTCCTTGTAAGAGTTTCTTCATTAATAAAACTCTTTTCCCTCACTTACACGTGTTAGTAGCC
TGCATGGAACTTGCAAAGCAGCTTTGCAGTTCTCATTTCTTGATGCTGGCTGTGAACAGTCTCCTTAAAGCTCAAGTTATTTCTGTTGTTCCAATACCAATTGTCTCGAT
CATACTGGCTCAAGCAGAAGGGAGCCTTGGTTTGAAAGAAAATTGGGAGTCATATCTTCGTTTTGAATGGTTCTCGTGGCCCCCAGGTTTATCTATATATGAGGTCTGCG
GAGCTCTTGTTTCAAATGCATCTCCTTGCAAAACAGTCGAAGATACTGGAAGGTGCTGCAAAGCTTGTGGAATGAGGCCTGACATTTTTGTGCCATACCAGACTTCAACT
GTTGGTCGTGTGATATCCAGTTCTCGGAATTCTCTTGTCAATCGCCCTGATTTTGTCTCCAAGTTTCTGTAG
Protein sequence
MTLLSLLEGVILRSGRRDMRFWSLDPSAGFSCRSFFQFLINPSPVRESVFSCLWKVKAPKKVLFFAWQVILGRVNTCDRLSRVKAPLVGPFCCILCRKAEENLDHLLWDC
EFARSVWSLFFKVFEFQFASQRHCRDLIEEFLLHPPFRERGKLLWHVGVCAILWGLWGERNNRTFRGVERHPCEVWDTIRFHVSLWASVTKAFWRSPSFSFGFSHPLSNR
ETTDVMSLMSLIEEFDFRVRRRDFRCWNPSPSDGFSCSSLFRWLLHPSPQSESIFSRVWKVKVPKKIRFFMWQVIHGRVNTYDRLMKKDALFGWFVLLHSVSKGGGRSGS
HSVELCLCACSVGSVWSGFWVVGDFFCSPQGDDSRSSSSIRLFVRERDFCGLGGCVHCCGTFGVKGTIECLGGERGILKMCGPSLDIMFLFGLRNLLGYLLLSNEERDDS
HTATRCCNMMYGFDQQNKGPKSAYEIHGAGAVACYTIGTSHPRISFPTCSYQCQNGIGTIRQLQKYASKKLGKRSQGDEKRLAKSRKRRQRCLRQEPWNYDARYLLIQNF
LQKAREERFPCHLGVIIERLILVAFSNEPYFKQDMSHQYKKFQLLLCASEISFQRGSQIKCINYAKAASSISLPDDYLICAHLLLCRAYAAENDSINLHKEFIRCLDLKT
NNYLGWVCLKFIASRYELHVESNTLELSFKKWSVESKNLQHMLLPMFSLVDGLISFWSQDFMAAEKYFAQACSLGHDDACLLLCHGIISFPCKSFFINKTLFPHLHVLVA
CMELAKQLCSSHFLMLAVNSLLKAQVISVVPIPIVSIILAQAEGSLGLKENWESYLRFEWFSWPPGLSIYEVCGALVSNASPCKTVEDTGRCCKACGMRPDIFVPYQTST
VGRVISSSRNSLVNRPDFVSKFL
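
The protein sequence above can be cross-checked against the CDS by translating it in frame. The following is a minimal sketch, not the database's own pipeline: it assumes the standard genetic code (NCBI translation table 1), and the function name `translate` is hypothetical. Only the first eight and last four codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1); codons are enumerated
# with bases in the order T, C, A, G, first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing anything other than A, C, G, T map to 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed in this record.
print(translate("ATGACCCTCTTATCTCTGCTTGAG"))  # MTLLSLLE
# Last four codons of the CDS, ending in the TAG stop codon.
print(translate("AAGTTTCTGTAG"))  # KFL
```

Both fragments agree with the start (MTLLSLLE) and end (KFL) of the protein sequence listed above. To check the full record, paste the whole CDS into `translate`; the function strips line breaks before translating.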