; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029107 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029107
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTetratricopeptide repeat protein 38
Genome locationtig00153210:3270945..3277569
RNA-Seq ExpressionSgr029107
SyntenySgr029107
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033891 - Tetratricopeptide repeat protein 38


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151169.1 tetratricopeptide repeat protein 38-like [Momordica charantia]1.9e-23786.99Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        M DGVKLDKWGYEIRTSSDACI+AINA+YDQVLSYGRRRSVILEAPVHDK CVLAN L AHF+SSSEPSRV HHLRAAEARLD ATSYE+AVFDAI+CLI
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        SKDRDD VAVELH+ELL KFPKDLVSLKRAQVLCFY+G ANLSLALVQQVLP NQ+EDFIYGMLAFPLLELG MAEAENAARRGLDINKK+ WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQ+ CHFKEAVEFMEACSPSWSDC SFM+THNWWHVALCYLEAN PLNKILEIYD YIWKE+EKPDAMGPEVYLNA+GLMLRLFVRG+FDQCEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        AN     ANW+LEWHFD+LTLWALAKTG+ SAAEELLGSL+SR+ KM  KKQEKMQRGMLLAEA+Y+YGRG YKRALDLLGLDFDAND KMIGASNEQ+D
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNT HA+KAIEVI+KQ++ REGVP++WRLLERGYSKT RPEEAAIAGGK  SLE AYFK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

XP_022956843.1 tetratricopeptide repeat protein 38-like isoform X1 [Cucurbita moschata]2.7e-23685.71Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        MEDG+KL KWGYE+RTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDK CVLAN+ AA+F+SSS+PSRVHHHL+AA+  LD AT YEKAV+DAINCL+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S DRDD VAVEL ++LL  FPKDL+SLK+AQVLCFY+G  +LSLALVQQVLPQNQ+E FIYGMLAFPLLELG M EAE AARRGLDINKKD WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQY C FKEAVEFMEACSP+WSDCLSF++THNWWHVALCYLEANSPL+KILEIYDNYIWKELEKPDAMGP+VYLNALGLMLRLFVRG+F  CEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        ANVLTDKANW+LEWHFDLLT WALAK+G+  AAEELLGSLKSRM KM PKKQEKMQRGMLLAEALY YGRG YKRALDL+GLDFDANDCKMIGASNEQLD
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNTGHA KAIEVIEKQIK+RE  PYLWRLLERGYSK  RPEEAAIAG K RSLE A+FK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

XP_022976859.1 tetratricopeptide repeat protein 38-like isoform X1 [Cucurbita maxima]4.6e-23685.29Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        MEDG+KL KWGYE+ TSSD+CISAINAFYDQVLSYGRRRSVILEAPVHDK CVLAN+ AA+F+SSS+PSRVHHHL+AA+ARLD AT YEKAV+DAI+CL+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S DRDD VAVEL ++LL  FPKDL+SLK+AQVLCFY+G  +LSLALVQQVLPQNQ+E FIYGMLAFPLLE+G M EAE AA+RGLDINKKD WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQY C FKEAVEFMEACSP+WSDCLSF++THNWWHVALCYLEANSPL+KILEIYDNYIWKELEKPDAMGP+VYLNALGLMLRLFVRG+F  CEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        ANVLTDKANW+LEWHFDLLT WALAK+G+  AAEELLGSLKSRM KM PKKQEKMQRGMLLAEALY+YGRG YKRALDLLGLDFDANDCKMIGASNEQLD
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNTGHA KAIEVIEKQIK+RE  PYLWRLLERGYSK  RPEEAAIAG K RSLE A+FK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

XP_023537841.1 tetratricopeptide repeat protein 38-like isoform X1 [Cucurbita pepo subsp. pepo]5.7e-23484.86Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        MEDG+KL KWGYE+RTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDK CVLAN+ AA+F+SSS+PSRVHHHL+AA+A LD AT YEKAV+DAI+CL+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S DRDD VAVEL ++LL  FPKDL+SLK+AQVLCFY+G  +LSLALVQQVLPQNQ+E FIYGMLAFPLLELG M EAE AARRGLDINKKD WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQY C FKEAVEFMEACSP+WSDCLSF++THNWWHVALCYLEANSPL+KILEIYD+YIWKELEKPDAMGP+VYLNALGLMLRLFVRG+F  CEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        ANVLTDKANW+LEWHFDLLT WALAK+G+   AEELLGSL+SRM KM PKKQEKMQRGMLLAEALY+YGRG YK ALDLLGLDFDANDCKMIGASNEQLD
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNTGHA KAIEVIEKQIK+RE  PYLWRLLERG+SK  RPEEAAIAG K RSLE A+FK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

XP_038891808.1 tetratricopeptide repeat protein 38 isoform X3 [Benincasa hispida]9.1e-23284.43Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        ME GVKL KWGY+IRTSSDACISAINAFYDQVLSYGR+RSVIL+A VHDK CVLAN+LA HF+SSS+PSRVHHHL+AA+A LD AT YEKAVFDAI+CL+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S +RDD VAVELHSELL KFPKDL+SLKRAQ+LCFY+G A+LSLALVQQVLPQNQ+E FIYGMLAF LLELG M EAE AARRGLDINKKD WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQY CHFKEAVEFMEACSPSWSDCLSFM+THNWWHVALCYLEANS L++ILEIYDNYIWKELEKPDAMGP+VYLNALGLMLRLFVRG++D CEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        A VLTDKANW+LEWHFD+LT WALAK G+  AAEELLGSLKSR+ KM PK++EKMQRGMLLAEALY+YGRG Y+ ALDLLGLDFDAND KMIGASNEQLD
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNTGHA KAIEVIEKQIK+RE VPYLWRLLERG+ K  RPEEAAIA  K RSLE A+FK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

TrEMBL top hitse value%identityAlignment
A0A0A0KSI1 Tetratricopeptide repeat protein 385.7e-23284.22Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        MEDGVKL KWGY IRTSSD CISAINAFYDQVLSYGR+RSVILEA VHDK CVLAN+L AHF+SSS+PSR H+HL+ A+A LD AT YEKAVFDAI+CL+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S DRDD VAVELH+ELL  FPKDL SLKRAQVLCFYLG A+LSLALVQQVLPQNQ+E FIYGMLAFPLLELG M EAE AARRGLDINKKD WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQY CHFKEAVEFME CSPSW DC+SFM+THNWWHVALCYLEANSPL+KILEIYDNYIWKELEKPDA+GPEVYLNALGLMLRLFVRG++D CEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        ANVLTDKANW+LEWHFD+LTLWALAK G+  AA+ELLGSLKSR+SKM  KK+EKMQR +LLAEALY+YGRG Y+RALDLLGLDFDAND KMIGASNEQLD
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNTGHA KAIEVIEKQIK+RE VPYLW LLERGY+K  RP+E+AIAG K RSLE A+FK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

A0A5A7STS3 Tetratricopeptide repeat protein 382.0e-22482.52Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        MED VKL KWGY IRTSSD CISAIN FYDQVLSYGRRRSVILEA VHDK CVLAN+LAAHF+SSS+ SR H+HL+AA+A +D AT YEKAVFDAI+ L+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S DRDD VAVELH+ELL  FPKDL SLKRAQVLCFYLG  +LSLALV+QVLPQNQ+E FIYGMLAF LLELG M EAE AARRGLDI+KKD WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQY CHFKEAVEFMEACSPSW DC SFM+THNWWHVALCYLEANSP +KILE+YDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRG+FD CEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        ANVLTDKANW+LEWHFD+LT WALAK G+  AA++LLGSLKSR+ KM  KK+EKMQRG+LLAEALY+YGRG Y+ ALDLLGLDFDAND KMIGASNEQLD
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNTGH  KAIEVIEKQ K+RE VPYLW LLERGY+K  RPEEAAIAG K RSLE A+FK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

A0A6J1DC80 Tetratricopeptide repeat protein 389.1e-23886.99Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        M DGVKLDKWGYEIRTSSDACI+AINA+YDQVLSYGRRRSVILEAPVHDK CVLAN L AHF+SSSEPSRV HHLRAAEARLD ATSYE+AVFDAI+CLI
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        SKDRDD VAVELH+ELL KFPKDLVSLKRAQVLCFY+G ANLSLALVQQVLP NQ+EDFIYGMLAFPLLELG MAEAENAARRGLDINKK+ WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQ+ CHFKEAVEFMEACSPSWSDC SFM+THNWWHVALCYLEAN PLNKILEIYD YIWKE+EKPDAMGPEVYLNA+GLMLRLFVRG+FDQCEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        AN     ANW+LEWHFD+LTLWALAKTG+ SAAEELLGSL+SR+ KM  KKQEKMQRGMLLAEA+Y+YGRG YKRALDLLGLDFDAND KMIGASNEQ+D
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNT HA+KAIEVI+KQ++ REGVP++WRLLERGYSKT RPEEAAIAGGK  SLE AYFK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

A0A6J1GYC7 Tetratricopeptide repeat protein 381.3e-23685.71Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        MEDG+KL KWGYE+RTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDK CVLAN+ AA+F+SSS+PSRVHHHL+AA+  LD AT YEKAV+DAINCL+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S DRDD VAVEL ++LL  FPKDL+SLK+AQVLCFY+G  +LSLALVQQVLPQNQ+E FIYGMLAFPLLELG M EAE AARRGLDINKKD WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQY C FKEAVEFMEACSP+WSDCLSF++THNWWHVALCYLEANSPL+KILEIYDNYIWKELEKPDAMGP+VYLNALGLMLRLFVRG+F  CEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        ANVLTDKANW+LEWHFDLLT WALAK+G+  AAEELLGSLKSRM KM PKKQEKMQRGMLLAEALY YGRG YKRALDL+GLDFDANDCKMIGASNEQLD
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNTGHA KAIEVIEKQIK+RE  PYLWRLLERGYSK  RPEEAAIAG K RSLE A+FK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

A0A6J1IPU9 Tetratricopeptide repeat protein 382.2e-23685.29Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        MEDG+KL KWGYE+ TSSD+CISAINAFYDQVLSYGRRRSVILEAPVHDK CVLAN+ AA+F+SSS+PSRVHHHL+AA+ARLD AT YEKAV+DAI+CL+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S DRDD VAVEL ++LL  FPKDL+SLK+AQVLCFY+G  +LSLALVQQVLPQNQ+E FIYGMLAFPLLE+G M EAE AA+RGLDINKKD WAQHALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQY C FKEAVEFMEACSP+WSDCLSF++THNWWHVALCYLEANSPL+KILEIYDNYIWKELEKPDAMGP+VYLNALGLMLRLFVRG+F  CEGRLKIL
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        ANVLTDKANW+LEWHFDLLT WALAK+G+  AAEELLGSLKSRM KM PKKQEKMQRGMLLAEALY+YGRG YKRALDLLGLDFDANDCKMIGASNEQLD
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK
        VFNEVWYDILMNTGHA KAIEVIEKQIK+RE  PYLWRLLERGYSK  RPEEAAIAG K RSLE A+FK
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFK

SwissProt top hitse value%identityAlignment
Q6DIV2 Tetratricopeptide repeat protein 387.4e-1925.57Show/hide
Query:  AVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQD----EDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCHVLQY
        A +L   +L   P DL++LK A    FYLG        V +VLP  +       ++ GM +F LLE     +A   A+  L +++ D W+ H + HV + 
Subjt:  AVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQD----EDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCHVLQY

Query:  GCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKILANVL
               + FM+    +W      +  H +WH AL ++E        L +YDN+I  +      M   V  +++   L+L      D+ +  L+I  +  
Subjt:  GCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKILANVL

Query:  TDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGML------LAEALYEYGRGGYKRALDLL-GLDFDANDCKMIGASNE
         D      + HF + +L         S  E++   L   M ++     E  Q G++      L  AL EY RG Y +A DL+  + +       IG S+ 
Subjt:  TDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGML------LAEALYEYGRGGYKRALDLL-GLDFDANDCKMIGASNE

Query:  QLDVFNEVWYDILMNTG---HATKAIEVIEKQIKQREGVPYLWRLLER
        Q D+FN+V     +N+    H   A  ++ ++   R   P   RL+++
Subjt:  QLDVFNEVWYDILMNTG---HATKAIEVIEKQIKQREGVPYLWRLLER

Arabidopsis top hitse value%identityAlignment
AT1G27110.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-15456.26Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        M++  +    GYE+ TSSD CI++IN++ DQVL YGR + VILEAP +D  CVLANILAAH++SS +P R   +  AAE+RL  AT YEKAVF+A++ L+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S++ DD VA+ELHS+LL KFPKDL+S KR + LC Y+GR +LSL L +++LPQN+ + ++ GMLAF L+ELG + EAE AAR+G +IN+ D WA HALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQ-CEGRLKI
        VLQ  C FKEAV+FME  S SW  C S   +HNWWHVA+CYLE  S ++K+ E+YD+ +WKELEK DA+  +VY +ALGL+LRL  RG  D   + RL+ 
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQ-CEGRLKI

Query:  LANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQL
        LA+ LTDKA W  +W FD+ T+WAL+K  K S A ELL  LKSR S M PKKQ+ MQ+ +LLAEA+YEYG+G Y+ AL+LLGLDFDA + K+IG S  Q+
Subjt:  LANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQL

Query:  DVFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFKT
        DVFNE+WY +L+  G ++ AIEV+EK  KQR+G P+LWRLLE  YS   + +    AG K ++LE +YFK+
Subjt:  DVFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFKT

AT1G27110.2 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-14857.6Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        M++  +    GYE+ TSSD CI++IN++ DQVL YGR + VILEAP +D  CVLANILAAH++SS +P R   +  AAE+RL  AT YEKAVF+A++ L+
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S++ DD VA+ELHS+LL KFPKDL+S KR + LC Y+GR +LSL L +++LPQN+ + ++ GMLAF L+ELG + EAE AAR+G +IN+ D WA HALCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQ-CEGRLKI
        VLQ  C FKEAV+FME  S SW  C S   +HNWWHVA+CYLE  S ++K+ E+YD+ +WKELEK DA+  +VY +ALGL+LRL  RG  D   + RL+ 
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQ-CEGRLKI

Query:  LANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQL
        LA+ LTDKA W  +W FD+ T+WAL+K  K S A ELL  LKSR S M PKKQ+ MQ+ +LLAEA+YEYG+G Y+ AL+LLGLDFDA + K+IG S  Q+
Subjt:  LANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQL

Query:  DVFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLL
        DVFNE+WY +L+  G ++ AIEV+EK  KQR+G P+LWRLL
Subjt:  DVFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLL

AT1G27110.3 Tetratricopeptide repeat (TPR)-like superfamily protein5.3e-12156.95Show/hide
Query:  DDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCHVLQY
        DD VA+ELHS+LL KFPKDL+S KR + LC Y+GR +LSL L +++LPQN+ + ++ GMLAF L+ELG + EAE AAR+G +IN+ D WA HALCHVLQ 
Subjt:  DDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCHVLQY

Query:  GCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQ-CEGRLKILANV
         C FKEAV+FME  S SW  C S   +HNWWHVA+CYLE  S ++K+ E+YD+ +WKELEK DA+  +VY +ALGL+LRL  RG  D   + RL+ LA+ 
Subjt:  GCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQ-CEGRLKILANV

Query:  LTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLDVFN
        LTDKA W  +W FD+ T+WAL+K  K S A ELL  LKSR S M PKKQ+ MQ+ +LLAEA+YEYG+G Y+ AL+LLGLDFDA + K+IG S  Q+DVFN
Subjt:  LTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLDVFN

Query:  EVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFKT
        E+WY +L+  G ++ AIEV+EK  KQR+G P+LWRLLE  YS   + +    AG K ++LE +YFK+
Subjt:  EVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFKT

AT1G27150.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.8e-16859.62Show/hide
Query:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI
        ME  V+  +WGYE+ TSSDACI AIN+++ QVLSYGR+R VILEAP++DK CVL +ILAAHF+SSS+PSR + ++ AA + L+ +T YEKAV++A+  LI
Subjt:  MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLI

Query:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH
        S+DRDD +A E+H++LL +FPKDL SLKRAQ+L FY+G+ +  L LVQQVLP NQ+E +I+G+LAFPLLELGRM EA  A+R+G +INK+D WA H LCH
Subjt:  SKDRDDYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCH

Query:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL
        VLQ+ C FKEAVEFMEA + +W  C SFM THNWWHVALCYLE  SP++K+ EIYD++IWKELEK DA+ PEVYLNALGL++RL VR   D  E RLK L
Subjt:  VLQYGCHFKEAVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKIL

Query:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD
        A  LT++ANW LEWH D+L +WALAK G+ S A ELL  LK R+SK   KKQ+ MQ+G+ L EA+YEY RG Y++AL+LLG +F+A   K++GAS+EQ+D
Subjt:  ANVLTDKANWNLEWHFDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLD

Query:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYF
        VFNE+W  +L+ TG ++ A EVI ++IK R+G+P++WRLLE+ YS     E  + A  + + LE  YF
Subjt:  VFNEVWYDILMNTGHATKAIEVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATGGAGTGAAATTGGACAAGTGGGGTTATGAAATTCGAACGTCGTCTGACGCTTGCATCTCTGCCATCAATGCATTCTACGACCAGGTGCTTAGTTAC
GGGAGGCGGAGGTCTGTGATTTTGGAGGCGCCGGTCCATGACAAAGGCTGCGTGCTTGCGAACATTTTGGCTGCTCATTTTATTTCCTCCTCCGAACCTTCCCGA
GTTCACCATCATCTCCGAGCAGCCGAGGCCCGTCTGGATCTTGCAACCTCGTACGAGAAAGCCGTTTTCGATGCTATCAATTGTTTGATTTCCAAGGACAGAGAC
GATTATGTCGCTGTTGAGCTGCACTCTGAGCTCCTTAACAAATTTCCCAAGGATCTGGTGTCTCTGAAAAGGGCGCAAGTGCTATGCTTTTACTTGGGAAGAGCC
AATCTATCTTTGGCTTTGGTTCAACAGGTTTTACCACAAAATCAAGATGAAGATTTCATTTATGGCATGCTTGCTTTTCCTTTGTTGGAGCTTGGCCGCATGGCA
GAAGCTGAAAATGCTGCAAGAAGGGGACTTGATATCAACAAGAAAGACTGTTGGGCACAGCATGCGTTGTGCCATGTTCTTCAATATGGGTGTCATTTTAAAGAA
GCCGTTGAGTTTATGGAAGCATGCTCGCCTTCGTGGAGTGACTGTTTATCATTCATGCTGACACATAATTGGTGGCATGTGGCTCTCTGTTACCTGGAAGCCAAT
TCTCCATTAAATAAAATCCTTGAAATATATGACAACTATATATGGAAGGAGTTAGAAAAACCCGATGCTATGGGACCAGAGGTATACTTGAATGCCCTTGGTTTG
ATGTTGCGGTTATTTGTGCGTGGTGACTTTGATCAATGTGAGGGTCGTCTGAAGATCTTGGCCAATGTTTTAACTGATAAAGCCAACTGGAACTTAGAGTGGCAC
TTTGACTTATTGACATTATGGGCTTTGGCTAAAACTGGAAAGTTTTCTGCAGCAGAAGAGTTGCTTGGGAGCTTGAAATCCCGAATGTCGAAAATGATGCCGAAG
AAACAAGAAAAGATGCAAAGAGGAATGCTGCTTGCAGAAGCTCTCTACGAGTACGGAAGAGGTGGTTACAAACGCGCATTAGACTTGCTTGGTCTGGATTTTGAT
GCAAATGACTGCAAGATGATTGGCGCATCCAACGAACAGCTCGATGTATTTAATGAAGTATGGTACGACATCTTGATGAATACAGGACATGCTACAAAGGCAATC
GAAGTAATCGAAAAGCAGATCAAGCAGAGGGAAGGAGTGCCATACTTGTGGCGCCTTCTGGAGAGAGGGTACAGCAAAACAGCAAGGCCGGAGGAAGCAGCCATT
GCCGGAGGCAAAACCAGGAGCCTGGAGATGGCATATTTTAAAACCTTCGTTTTCCTCGTTAAACGGTGGTCTAGAATTATGAAATTAGTGGTTGTCATATCTTCA
CAATTCCTTAACTTCCACCCACGTTCGACATCGACGACCGAACCCCTCCACGTGTGCGGTTTCGAATTCGCCGGAGCTCCTCCACGATCTCCTTGGCGTCGGGGC
GGTCGTCCTTGTCCGCCGCCACACATCGGAATGCCAGCTCCGCCACGGCTTCAACCCCGTCTATTACCTCTCCGTCAATACCCAAAACGGAGTCCACCACCTGAT
GAAGCTGACCCATTTGGATCTTGGATACTACCAGATCCGCCAGCGCCATTTCTCTCCTGTCTCGACTCTGATCCACAGCTTTCAGGCCAGAAATCAGCTCCAGTA
GCACCACTCCGAAGCTGTACACGTCACTTTTTTCCGTCAACCGGAAGGACCGGTGGTAGTCCGGATCCAAGTAACCCGGTGTGCCCTGAGCCTCGAAAGCCCAAA
ATCCCCCACTTTGATTCTCATATCTTTCTCCACAAAAATGTTGGAGGAGGTGATGTCTCTGTGAACAATGGGCGGCACCACCATGAAATGCAAATACTCCATAGC
CATGGCTATTTGCAGAGCAATATCAATCCTTACTTGCCAGGTCAGAGATCCCTTCCGGTACAAGCTCTTGGTGCCATGGAGATGGTCGGCGAGCGTTCCGTTTGG
GACAATCAAGATTTCGTTACAGAACGACTTGGTGAAGAAAGCACGGCCGGAGGCGGCGGCGGCGTGGTGCTTGTGAAGATACTTCACGGCCACCAGCCGGCCATC
GTTAAGCTGACCCAAATAAACAGAGCCAAAGCCACCGTCGCCGAGTTTCCGCTTGGGGTCGAACCTGTTGGTCGAAGATTCGAGTTCTTCGAAGGGGAGAACTGG
GATCTCAACCATCTTGACCTGAAGAAAGCCATCGTCACCGCAATAATCAATAACAAGCACATCAACGCGAAGACAGAAGACAAAATCGCGATTCTATTTGGCTTC
ACAAACATTACACTTCAAGAAGTAAGGGTCCTGATCTGCATCCCAGTCGACTTCCAGACCAAATCTGAGAAAATTGTCCAAGAACTCCAAAACCTCGCCCTGACA
CCCTTGCTCCGACACGGGCTGGTGCTCGTCGCCGCAATGCGGAGGCGAGCAGGGCTGGAGAAGAGAGAGTCGAGAACAAGAACCATCGGAGATTCGGAACGGCGA
GCCCGAGAGATTAATGGAGCGGCTGGGAATGGAGGGGAAGAGAACGGAAGAACAGCTCCGGTTCGCATCTTTGACTGTGGCAGTTACATTGGGTGGAAGAGGGGA
AAGGAGGAGAGTGGTGGAGCTCGTATTGAAACTCAACAGAGAAAAAGAAACGCCATTGATGGAGATAATAGAGTGGGGAGAAGAGCAACGTACCTGGAAAGAAGG
GTGGCCGTAACCAGTGACGGAAGAAAAGGGGAAAACAAGTTTGGAAGGGAAAGCTGTAAACAAAGGACAGAGCTGAAGGAGGAAGACAAGAATGGCGGAAACAGA
AAGGAGATTCCTTTAGCAATGCAACGTTGTTGGAATTGCAAAAGAGTGAAATGGGGTCAGTGTTGGAGGAAGACAAAGAAGGTAGGTCACGTGAACTGTGAAGAA
GGCAAGGGTTTTGGAATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATGGAGTGAAATTGGACAAGTGGGGTTATGAAATTCGAACGTCGTCTGACGCTTGCATCTCTGCCATCAATGCATTCTACGACCAGGTGCTTAGTTAC
GGGAGGCGGAGGTCTGTGATTTTGGAGGCGCCGGTCCATGACAAAGGCTGCGTGCTTGCGAACATTTTGGCTGCTCATTTTATTTCCTCCTCCGAACCTTCCCGA
GTTCACCATCATCTCCGAGCAGCCGAGGCCCGTCTGGATCTTGCAACCTCGTACGAGAAAGCCGTTTTCGATGCTATCAATTGTTTGATTTCCAAGGACAGAGAC
GATTATGTCGCTGTTGAGCTGCACTCTGAGCTCCTTAACAAATTTCCCAAGGATCTGGTGTCTCTGAAAAGGGCGCAAGTGCTATGCTTTTACTTGGGAAGAGCC
AATCTATCTTTGGCTTTGGTTCAACAGGTTTTACCACAAAATCAAGATGAAGATTTCATTTATGGCATGCTTGCTTTTCCTTTGTTGGAGCTTGGCCGCATGGCA
GAAGCTGAAAATGCTGCAAGAAGGGGACTTGATATCAACAAGAAAGACTGTTGGGCACAGCATGCGTTGTGCCATGTTCTTCAATATGGGTGTCATTTTAAAGAA
GCCGTTGAGTTTATGGAAGCATGCTCGCCTTCGTGGAGTGACTGTTTATCATTCATGCTGACACATAATTGGTGGCATGTGGCTCTCTGTTACCTGGAAGCCAAT
TCTCCATTAAATAAAATCCTTGAAATATATGACAACTATATATGGAAGGAGTTAGAAAAACCCGATGCTATGGGACCAGAGGTATACTTGAATGCCCTTGGTTTG
ATGTTGCGGTTATTTGTGCGTGGTGACTTTGATCAATGTGAGGGTCGTCTGAAGATCTTGGCCAATGTTTTAACTGATAAAGCCAACTGGAACTTAGAGTGGCAC
TTTGACTTATTGACATTATGGGCTTTGGCTAAAACTGGAAAGTTTTCTGCAGCAGAAGAGTTGCTTGGGAGCTTGAAATCCCGAATGTCGAAAATGATGCCGAAG
AAACAAGAAAAGATGCAAAGAGGAATGCTGCTTGCAGAAGCTCTCTACGAGTACGGAAGAGGTGGTTACAAACGCGCATTAGACTTGCTTGGTCTGGATTTTGAT
GCAAATGACTGCAAGATGATTGGCGCATCCAACGAACAGCTCGATGTATTTAATGAAGTATGGTACGACATCTTGATGAATACAGGACATGCTACAAAGGCAATC
GAAGTAATCGAAAAGCAGATCAAGCAGAGGGAAGGAGTGCCATACTTGTGGCGCCTTCTGGAGAGAGGGTACAGCAAAACAGCAAGGCCGGAGGAAGCAGCCATT
GCCGGAGGCAAAACCAGGAGCCTGGAGATGGCATATTTTAAAACCTTCGTTTTCCTCGTTAAACGGTGGTCTAGAATTATGAAATTAGTGGTTGTCATATCTTCA
CAATTCCTTAACTTCCACCCACGTTCGACATCGACGACCGAACCCCTCCACGTGTGCGGTTTCGAATTCGCCGGAGCTCCTCCACGATCTCCTTGGCGTCGGGGC
GGTCGTCCTTGTCCGCCGCCACACATCGGAATGCCAGCTCCGCCACGGCTTCAACCCCGTCTATTACCTCTCCGTCAATACCCAAAACGGAGTCCACCACCTGAT
GAAGCTGACCCATTTGGATCTTGGATACTACCAGATCCGCCAGCGCCATTTCTCTCCTGTCTCGACTCTGATCCACAGCTTTCAGGCCAGAAATCAGCTCCAGTA
GCACCACTCCGAAGCTGTACACGTCACTTTTTTCCGTCAACCGGAAGGACCGGTGGTAGTCCGGATCCAAGTAACCCGGTGTGCCCTGAGCCTCGAAAGCCCAAA
ATCCCCCACTTTGATTCTCATATCTTTCTCCACAAAAATGTTGGAGGAGGTGATGTCTCTGTGAACAATGGGCGGCACCACCATGAAATGCAAATACTCCATAGC
CATGGCTATTTGCAGAGCAATATCAATCCTTACTTGCCAGGTCAGAGATCCCTTCCGGTACAAGCTCTTGGTGCCATGGAGATGGTCGGCGAGCGTTCCGTTTGG
GACAATCAAGATTTCGTTACAGAACGACTTGGTGAAGAAAGCACGGCCGGAGGCGGCGGCGGCGTGGTGCTTGTGAAGATACTTCACGGCCACCAGCCGGCCATC
GTTAAGCTGACCCAAATAAACAGAGCCAAAGCCACCGTCGCCGAGTTTCCGCTTGGGGTCGAACCTGTTGGTCGAAGATTCGAGTTCTTCGAAGGGGAGAACTGG
GATCTCAACCATCTTGACCTGAAGAAAGCCATCGTCACCGCAATAATCAATAACAAGCACATCAACGCGAAGACAGAAGACAAAATCGCGATTCTATTTGGCTTC
ACAAACATTACACTTCAAGAAGTAAGGGTCCTGATCTGCATCCCAGTCGACTTCCAGACCAAATCTGAGAAAATTGTCCAAGAACTCCAAAACCTCGCCCTGACA
CCCTTGCTCCGACACGGGCTGGTGCTCGTCGCCGCAATGCGGAGGCGAGCAGGGCTGGAGAAGAGAGAGTCGAGAACAAGAACCATCGGAGATTCGGAACGGCGA
GCCCGAGAGATTAATGGAGCGGCTGGGAATGGAGGGGAAGAGAACGGAAGAACAGCTCCGGTTCGCATCTTTGACTGTGGCAGTTACATTGGGTGGAAGAGGGGA
AAGGAGGAGAGTGGTGGAGCTCGTATTGAAACTCAACAGAGAAAAAGAAACGCCATTGATGGAGATAATAGAGTGGGGAGAAGAGCAACGTACCTGGAAAGAAGG
GTGGCCGTAACCAGTGACGGAAGAAAAGGGGAAAACAAGTTTGGAAGGGAAAGCTGTAAACAAAGGACAGAGCTGAAGGAGGAAGACAAGAATGGCGGAAACAGA
AAGGAGATTCCTTTAGCAATGCAACGTTGTTGGAATTGCAAAAGAGTGAAATGGGGTCAGTGTTGGAGGAAGACAAAGAAGGTAGGTCACGTGAACTGTGAAGAA
GGCAAGGGTTTTGGAATGTGA
Protein sequenceShow/hide protein sequence
MEDGVKLDKWGYEIRTSSDACISAINAFYDQVLSYGRRRSVILEAPVHDKGCVLANILAAHFISSSEPSRVHHHLRAAEARLDLATSYEKAVFDAINCLISKDRD
DYVAVELHSELLNKFPKDLVSLKRAQVLCFYLGRANLSLALVQQVLPQNQDEDFIYGMLAFPLLELGRMAEAENAARRGLDINKKDCWAQHALCHVLQYGCHFKE
AVEFMEACSPSWSDCLSFMLTHNWWHVALCYLEANSPLNKILEIYDNYIWKELEKPDAMGPEVYLNALGLMLRLFVRGDFDQCEGRLKILANVLTDKANWNLEWH
FDLLTLWALAKTGKFSAAEELLGSLKSRMSKMMPKKQEKMQRGMLLAEALYEYGRGGYKRALDLLGLDFDANDCKMIGASNEQLDVFNEVWYDILMNTGHATKAI
EVIEKQIKQREGVPYLWRLLERGYSKTARPEEAAIAGGKTRSLEMAYFKTFVFLVKRWSRIMKLVVVISSQFLNFHPRSTSTTEPLHVCGFEFAGAPPRSPWRRG
GRPCPPPHIGMPAPPRLQPRLLPLRQYPKRSPPPDEADPFGSWILPDPPAPFLSCLDSDPQLSGQKSAPVAPLRSCTRHFFPSTGRTGGSPDPSNPVCPEPRKPK
IPHFDSHIFLHKNVGGGDVSVNNGRHHHEMQILHSHGYLQSNINPYLPGQRSLPVQALGAMEMVGERSVWDNQDFVTERLGEESTAGGGGGVVLVKILHGHQPAI
VKLTQINRAKATVAEFPLGVEPVGRRFEFFEGENWDLNHLDLKKAIVTAIINNKHINAKTEDKIAILFGFTNITLQEVRVLICIPVDFQTKSEKIVQELQNLALT
PLLRHGLVLVAAMRRRAGLEKRESRTRTIGDSERRAREINGAAGNGGEENGRTAPVRIFDCGSYIGWKRGKEESGGARIETQQRKRNAIDGDNRVGRRATYLERR
VAVTSDGRKGENKFGRESCKQRTELKEEDKNGGNRKEIPLAMQRCWNCKRVKWGQCWRKTKKVGHVNCEEGKGFGM