; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1709 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1709
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionTetratricopeptide repeat protein 38
Genome locationMC09:22441814..22448879
RNA-Seq ExpressionMC09g1709
SyntenyMC09g1709
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033891 - Tetratricopeptide repeat protein 38


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135208.1 tetratricopeptide repeat protein 38 [Cucumis sativus]1.25e-29182.39Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        M DGVKL KWGY IRTSSD CI+AINA+YDQVLSYGR+RSVILEA VHDKDCVLAN L AHFLSSS+PSR  +HL+ A+AGLD AT YE+AVFDAISCL+
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        S DRDD+VAVELH ELLK FPKDL SLKRAQVLCFY+G A+LSLALVQQV        LP NQEE FIYGMLAFPLLELGCM EAE AARRGLDINKK+G
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQ++CHFKEAVEFME CSPSW DC SFMVTHNWWHVALCYLEAN PL+KILEIYD YIWKE+EKPDA+GPEVYLNA+GLMLRLFVRGE+D 
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILANVLTDKANWHLEWHFDILTLWALAK GEI AA+ELLGSL+SRL KMT+KK+EKMQR +LLAEA+YKYGRGDY+RALDLLGLDFDANDYKMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQ+DVFNEVWYDILMNT HA+KAIEVI+KQ++ RE VP++W LLERGY+K GRP+E+AIAG KA SLEKA+FK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

XP_022151169.1 tetratricopeptide repeat protein 38-like [Momordica charantia]0.096.86Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEA LDHATSYERAVFDAISCLI
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQV        LPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILAN     ANWHLEWHFDILTLWALAKTGE SAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

XP_022956843.1 tetratricopeptide repeat protein 38-like isoform X1 [Cucurbita moschata]3.59e-29181.97Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        M DG+KL KWGYE+RTSSDACI+AINA+YDQVLSYGRRRSVILEAPVHDK CVLAN   A+FLSSS+PSRV HHL+AA+ GLD AT YE+AV+DAI+CL+
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        S DRDD+VAVEL  +LLK FPKDL+SLK+AQVLCFYMG+ +LSLALVQQV        LP NQEE FIYGMLAFPLLELGCM EAE AARRGLDINKK+G
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQ++C FKEAVEFMEACSP+WSDC SF+VTHNWWHVALCYLEAN PL+KILEIYD YIWKE+EKPDAMGP+VYLNA+GLMLRLFVRGEF  
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILANVLTDKANWHLEWHFD+LT WALAK+GEI AAEELLGSL+SR+LKMT KKQEKMQRGMLLAEA+Y YGRGDYKRALDL+GLDFDAND KMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQ+DVFNEVWYDILMNT HA+KAIEVI+KQ++ RE  P++WRLLERGYSK GRPEEAAIAG KA SLEKA+FK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

XP_022976859.1 tetratricopeptide repeat protein 38-like isoform X1 [Cucurbita maxima]3.59e-29181.97Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        M DG+KL KWGYE+ TSSD+CI+AINA+YDQVLSYGRRRSVILEAPVHDKDCVLAN   A+FLSSS+PSRV HHL+AA+A LD AT YE+AV+DAISCL+
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        S DRDD+VAVEL  +LLK FPKDL+SLK+AQVLCFYMG+ +LSLALVQQV        LP NQEE FIYGMLAFPLLE+GCM EAE AA+RGLDINKK+G
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQ++C FKEAVEFMEACSP+WSDC SF+VTHNWWHVALCYLEAN PL+KILEIYD YIWKE+EKPDAMGP+VYLNA+GLMLRLFVRGEF  
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILANVLTDKANWHLEWHFD+LT WALAK+GEI AAEELLGSL+SR+LKMT KKQEKMQRGMLLAEA+YKYGRGDYKRALDLLGLDFDAND KMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQ+DVFNEVWYDILMNT HA+KAIEVI+KQ++ RE  P++WRLLERGYSK GRPEEAAIAG KA SLEKA+FK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

XP_023537841.1 tetratricopeptide repeat protein 38-like isoform X1 [Cucurbita pepo subsp. pepo]1.25e-29182.39Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        M DG+KL KWGYE+RTSSDACI+AINA+YDQVLSYGRRRSVILEAPVHDK CVLAN   A+FLSSS+PSRV HHL+AA+AGLD AT YE+AV+DAISCL+
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        S DRDD+VAVEL  +LLK FPKDL+SLK+AQVLCFYMG+ +LSLALVQQV        LP NQEE FIYGMLAFPLLELGCM EAE AARRGLDINKK+G
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQ++C FKEAVEFMEACSP+WSDC SF+VTHNWWHVALCYLEAN PL+KILEIYD YIWKE+EKPDAMGP+VYLNA+GLMLRLFVRGEF  
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILANVLTDKANWHLEWHFD+LT WALAK+GEI  AEELLGSLESR+LKMT KKQEKMQRGMLLAEA+YKYGRGDYK ALDLLGLDFDAND KMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQ+DVFNEVWYDILMNT HA+KAIEVI+KQ++ RE  P++WRLLERG+SK GRPEEAAIAG KA SLEKA+FK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

TrEMBL top hitse value%identityAlignment
A0A0A0KSI1 Tetratricopeptide repeat protein 386.07e-29282.39Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        M DGVKL KWGY IRTSSD CI+AINA+YDQVLSYGR+RSVILEA VHDKDCVLAN L AHFLSSS+PSR  +HL+ A+AGLD AT YE+AVFDAISCL+
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        S DRDD+VAVELH ELLK FPKDL SLKRAQVLCFY+G A+LSLALVQQV        LP NQEE FIYGMLAFPLLELGCM EAE AARRGLDINKK+G
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQ++CHFKEAVEFME CSPSW DC SFMVTHNWWHVALCYLEAN PL+KILEIYD YIWKE+EKPDA+GPEVYLNA+GLMLRLFVRGE+D 
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILANVLTDKANWHLEWHFDILTLWALAK GEI AA+ELLGSL+SRL KMT+KK+EKMQR +LLAEA+YKYGRGDY+RALDLLGLDFDANDYKMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQ+DVFNEVWYDILMNT HA+KAIEVI+KQ++ RE VP++W LLERGY+K GRP+E+AIAG KA SLEKA+FK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

A0A5A7STS3 Tetratricopeptide repeat protein 387.15e-28480.92Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        M D VKL KWGY IRTSSD CI+AIN +YDQVLSYGRRRSVILEA VHDKDCVLAN L AHFLSSS+ SR  +HL+AA+AG+D AT YE+AVFDAIS L+
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        S DRDD+VAVELH ELLK FPKDL SLKRAQVLCFY+G  +LSLALV+QV        LP NQEE FIYGMLAF LLELGCM EAE AARRGLDI+KK+ 
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQ++CHFKEAVEFMEACSPSW DC SFMVTHNWWHVALCYLEAN P +KILE+YD YIWKE+EKPDAMGPEVYLNA+GLMLRLFVRGEFD 
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILANVLTDKANWHLEWHFDILT WALAK GE  AA++LLGSL+SRLLKMTSKK+EKMQRG+LLAEA+YKYGRGDY+ ALDLLGLDFDANDYKMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQ+DVFNEVWYDILMNT H +KAIEVI+KQ + RE VP++W LLERGY+K GRPEEAAIAG KA SLEKA+FK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

A0A6J1DC80 Tetratricopeptide repeat protein 380.096.86Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEA LDHATSYERAVFDAISCLI
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQV        LPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILAN     ANWHLEWHFDILTLWALAKTGE SAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

A0A6J1GYC7 Tetratricopeptide repeat protein 381.74e-29181.97Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        M DG+KL KWGYE+RTSSDACI+AINA+YDQVLSYGRRRSVILEAPVHDK CVLAN   A+FLSSS+PSRV HHL+AA+ GLD AT YE+AV+DAI+CL+
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        S DRDD+VAVEL  +LLK FPKDL+SLK+AQVLCFYMG+ +LSLALVQQV        LP NQEE FIYGMLAFPLLELGCM EAE AARRGLDINKK+G
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQ++C FKEAVEFMEACSP+WSDC SF+VTHNWWHVALCYLEAN PL+KILEIYD YIWKE+EKPDAMGP+VYLNA+GLMLRLFVRGEF  
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILANVLTDKANWHLEWHFD+LT WALAK+GEI AAEELLGSL+SR+LKMT KKQEKMQRGMLLAEA+Y YGRGDYKRALDL+GLDFDAND KMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQ+DVFNEVWYDILMNT HA+KAIEVI+KQ++ RE  P++WRLLERGYSK GRPEEAAIAG KA SLEKA+FK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

A0A6J1IPU9 Tetratricopeptide repeat protein 381.74e-29181.97Show/hide
Query:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI
        M DG+KL KWGYE+ TSSD+CI+AINA+YDQVLSYGRRRSVILEAPVHDKDCVLAN   A+FLSSS+PSRV HHL+AA+A LD AT YE+AV+DAISCL+
Subjt:  MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLI

Query:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG
        S DRDD+VAVEL  +LLK FPKDL+SLK+AQVLCFYMG+ +LSLALVQQV        LP NQEE FIYGMLAFPLLE+GCM EAE AA+RGLDINKK+G
Subjt:  SKDRDDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEG

Query:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ
        WAQHALCHVLQ++C FKEAVEFMEACSP+WSDC SF+VTHNWWHVALCYLEAN PL+KILEIYD YIWKE+EKPDAMGP+VYLNA+GLMLRLFVRGEF  
Subjt:  WAQHALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ

Query:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI
        CEGRLKILANVLTDKANWHLEWHFD+LT WALAK+GEI AAEELLGSL+SR+LKMT KKQEKMQRGMLLAEA+YKYGRGDYKRALDLLGLDFDAND KMI
Subjt:  CEGRLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMI

Query:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        GASNEQ+DVFNEVWYDILMNT HA+KAIEVI+KQ++ RE  P++WRLLERGYSK GRPEEAAIAG KA SLEKA+FK
Subjt:  GASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27110.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-14855.34Show/hide
Query:  GYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLISKDRDDDVAV
        GYE+ TSSD CIA+IN+Y DQVL YGR + VILEAP +D DCVLAN L AH+LSS +P R   +  AAE+ L  AT YE+AVF+A+S L+S++ DDDVA+
Subjt:  GYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLISKDRDDDVAV

Query:  ELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEGWAQHALCHVL
        ELH++LLKKFPKDL+S KR + LC YMG  +LSL L ++        +LP N+ + ++ GMLAF L+ELG + EAE AAR+G +IN+ + WA HALCHVL
Subjt:  ELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEGWAQHALCHVL

Query:  QHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ-CEGRLKILA
        Q +C FKEAV+FME  S SW  CSS   +HNWWHVA+CYLE    ++K+ E+YD  +WKE+EK DA+  +VY +A+GL+LRL  RG+ D   + RL+ LA
Subjt:  QHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ-CEGRLKILA

Query:  NVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMIGASNEQVDV
        + LTDKA W+ +W FDI T+WAL+K  + S A ELL  L+SR   M  KKQ+ MQ+ +LLAEAVY+YG+G+Y+ AL+LLGLDFDA +YK+IG S  Q+DV
Subjt:  NVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMIGASNEQVDV

Query:  FNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
        FNE+WY +L+    +S AIEV++K  + R+G PF+WRLLE  YS  G+ +    AG KA +LE +YFK
Subjt:  FNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

AT1G27110.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-14156.04Show/hide
Query:  GYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLISKDRDDDVAV
        GYE+ TSSD CIA+IN+Y DQVL YGR + VILEAP +D DCVLAN L AH+LSS +P R   +  AAE+ L  AT YE+AVF+A+S L+S++ DDDVA+
Subjt:  GYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLISKDRDDDVAV

Query:  ELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEGWAQHALCHVL
        ELH++LLKKFPKDL+S KR + LC YMG  +LSL L ++        +LP N+ + ++ GMLAF L+ELG + EAE AAR+G +IN+ + WA HALCHVL
Subjt:  ELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEGWAQHALCHVL

Query:  QHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ-CEGRLKILA
        Q +C FKEAV+FME  S SW  CSS   +HNWWHVA+CYLE    ++K+ E+YD  +WKE+EK DA+  +VY +A+GL+LRL  RG+ D   + RL+ LA
Subjt:  QHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ-CEGRLKILA

Query:  NVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMIGASNEQVDV
        + LTDKA W+ +W FDI T+WAL+K  + S A ELL  L+SR   M  KKQ+ MQ+ +LLAEAVY+YG+G+Y+ AL+LLGLDFDA +YK+IG S  Q+DV
Subjt:  NVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMIGASNEQVDV

Query:  FNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLL
        FNE+WY +L+    +S AIEV++K  + R+G PF+WRLL
Subjt:  FNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLL

AT1G27110.3 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-11554.28Show/hide
Query:  DDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEGWAQH
        DDDVA+ELH++LLKKFPKDL+S KR + LC YMG  +LSL L ++        +LP N+ + ++ GMLAF L+ELG + EAE AAR+G +IN+ + WA H
Subjt:  DDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEGWAQH

Query:  ALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ-CEG
        ALCHVLQ +C FKEAV+FME  S SW  CSS   +HNWWHVA+CYLE    ++K+ E+YD  +WKE+EK DA+  +VY +A+GL+LRL  RG+ D   + 
Subjt:  ALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQ-CEG

Query:  RLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMIGAS
        RL+ LA+ LTDKA W+ +W FDI T+WAL+K  + S A ELL  L+SR   M  KKQ+ MQ+ +LLAEAVY+YG+G+Y+ AL+LLGLDFDA +YK+IG S
Subjt:  RLKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMIGAS

Query:  NEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK
          Q+DVFNE+WY +L+    +S AIEV++K  + R+G PF+WRLLE  YS  G+ +    AG KA +LE +YFK
Subjt:  NEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK

AT1G27150.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.3e-16759.75Show/hide
Query:  VKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLISKDR
        V+  +WGYE+ TSSDACI AIN+Y+ QVLSYGR+R VILEAP++DKDCVL + L AHFLSSS+PSR   ++ AA + L+ +T YE+AV++A++ LIS+DR
Subjt:  VKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLISKDR

Query:  DDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEGWAQH
        DDD+A E+H +LLK+FPKDL SLKRAQ+L FYMG  +  L LVQQ        VLP NQEE +I+G+LAFPLLELG M EA  A+R+G +INK++ WA H
Subjt:  DDDVAVELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEGWAQH

Query:  ALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQCEGR
         LCHVLQH+C FKEAVEFMEA + +W  CSSFM THNWWHVALCYLE   P++K+ EIYD +IWKE+EK DA+ PEVYLNA+GL++RL VR   D  E R
Subjt:  ALCHVLQHKCHFKEAVEFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQCEGR

Query:  LKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMIGASN
        LK LA  LT++ANW+LEWH DIL +WALAK GE S A ELL  L+ RL K   KKQ+ MQ+G+ L EAVY+Y RG+Y++AL+LLG +F+A  YK++GAS+
Subjt:  LKILANVLTDKANWHLEWHFDILTLWALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMIGASN

Query:  EQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYF
        EQ+DVFNE+W  +L+ T  +S A EVI+++++ R+G+PFMWRLLE+ YS  G  E  + A  +A  LE  YF
Subjt:  EQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREGVPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGATGGAGTCAAATTGGACAAGTGGGGCTATGAAATTAGAACCTCATCTGACGCTTGCATCGCCGCCATCAATGCGTACTATGACCAGGTGCTTAGTTATGGGAG
GCGGAGGTCCGTAATTTTGGAGGCGCCGGTCCATGACAAGGACTGTGTACTCGCCAACACTTTGGTTGCTCATTTTCTTTCCTCCTCCGAACCTTCTCGAGTTCTTCATC
ATCTTCGAGCAGCTGAGGCCGGTCTGGACCATGCAACCTCGTACGAGAGAGCCGTTTTTGATGCTATCAGTTGTTTGATTTCCAAGGATAGAGACGATGATGTCGCTGTT
GAGCTACACGCTGAGCTGCTTAAAAAATTTCCAAAGGATCTGGTGTCTTTGAAAAGGGCTCAAGTGCTGTGCTTTTACATGGGAGATGCCAATCTATCTTTGGCTTTGGT
TCAGCAGGTTCCTTTATTACCTAAGAATGTAGTTTTACCACATAATCAAGAAGAAGATTTCATTTATGGCATGCTTGCTTTTCCTTTGTTGGAGCTTGGCTGCATGGCAG
AAGCTGAAAACGCTGCAAGAAGGGGGCTTGATATCAACAAGAAAGAGGGTTGGGCACAGCATGCGTTGTGCCATGTTCTTCAACATAAGTGTCATTTTAAAGAAGCGGTT
GAGTTTATGGAAGCATGCTCGCCTTCATGGAGTGACTGTTCATCATTCATGGTGACACATAATTGGTGGCATGTGGCTCTCTGTTACCTGGAAGCCAATTTTCCATTAAA
TAAAATTCTTGAAATATATGATAAGTATATATGGAAGGAGGTAGAAAAACCTGATGCTATGGGACCAGAGGTATACTTGAATGCCGTTGGTTTGATGTTGCGGTTGTTTG
TGCGTGGTGAATTTGATCAGTGTGAGGGTCGTCTGAAGATCTTGGCCAATGTTTTGACTGATAAAGCCAACTGGCATTTAGAGTGGCACTTCGACATATTGACATTATGG
GCTTTGGCTAAAACTGGAGAGATTTCTGCAGCAGAAGAGTTACTTGGGAGCTTGGAGTCCCGATTGTTGAAAATGACGTCAAAGAAACAAGAAAAGATGCAAAGAGGAAT
GCTGCTTGCAGAAGCTGTCTACAAGTACGGGAGAGGAGATTACAAACGCGCACTAGACTTACTTGGTCTGGATTTTGATGCAAATGACTATAAGATGATTGGTGCATCGA
ATGAACAGGTTGATGTATTTAATGAAGTATGGTACGACATCTTGATGAACACAGTACACGCTTCAAAGGCAATCGAAGTAATCAAGAAGCAACTCGAGACGAGGGAAGGA
GTTCCTTTTATGTGGCGCCTTTTGGAGAGAGGGTACAGCAAAACAGGAAGGCCGGAGGAAGCTGCCATTGCCGGAGGCAAAGCCGGAAGCCTGGAGAAGGCATATTTCAA
GTAG
mRNA sequenceShow/hide mRNA sequence
GTTGGTGGCGTAGATGTTTAACGATATGGCTTCGAATTATCAGAAAATTGCCTCCATCGCATTCTCCACCTACTCAGCCCAACACAAGTCGCTCTTTCTAGAAAGCAGCT
TTTGAAATCTAATGCTTTTTTATGGATATCGAACTCGACAAATCGATAGCGCTTTGTTTTTTTTTTTTTGGCAAAGGGATAGCCCTTGTTCTACTCGCAGAGTCCTTTGA
TATTATAAATATACAGTAATTCGACCGACATGGGGTTGGTTGGTCACAATCGGACTCCAATACAACATCGGGAGTTGGGAGTTGGGAGTGGGACTAATCTTCTCTGCCGA
GAAAAACTCTGATTCCTGAGAGAGCCATGGGAGATGGAGTCAAATTGGACAAGTGGGGCTATGAAATTAGAACCTCATCTGACGCTTGCATCGCCGCCATCAATGCGTAC
TATGACCAGGTGCTTAGTTATGGGAGGCGGAGGTCCGTAATTTTGGAGGCGCCGGTCCATGACAAGGACTGTGTACTCGCCAACACTTTGGTTGCTCATTTTCTTTCCTC
CTCCGAACCTTCTCGAGTTCTTCATCATCTTCGAGCAGCTGAGGCCGGTCTGGACCATGCAACCTCGTACGAGAGAGCCGTTTTTGATGCTATCAGTTGTTTGATTTCCA
AGGATAGAGACGATGATGTCGCTGTTGAGCTACACGCTGAGCTGCTTAAAAAATTTCCAAAGGATCTGGTGTCTTTGAAAAGGGCTCAAGTGCTGTGCTTTTACATGGGA
GATGCCAATCTATCTTTGGCTTTGGTTCAGCAGGTTCCTTTATTACCTAAGAATGTAGTTTTACCACATAATCAAGAAGAAGATTTCATTTATGGCATGCTTGCTTTTCC
TTTGTTGGAGCTTGGCTGCATGGCAGAAGCTGAAAACGCTGCAAGAAGGGGGCTTGATATCAACAAGAAAGAGGGTTGGGCACAGCATGCGTTGTGCCATGTTCTTCAAC
ATAAGTGTCATTTTAAAGAAGCGGTTGAGTTTATGGAAGCATGCTCGCCTTCATGGAGTGACTGTTCATCATTCATGGTGACACATAATTGGTGGCATGTGGCTCTCTGT
TACCTGGAAGCCAATTTTCCATTAAATAAAATTCTTGAAATATATGATAAGTATATATGGAAGGAGGTAGAAAAACCTGATGCTATGGGACCAGAGGTATACTTGAATGC
CGTTGGTTTGATGTTGCGGTTGTTTGTGCGTGGTGAATTTGATCAGTGTGAGGGTCGTCTGAAGATCTTGGCCAATGTTTTGACTGATAAAGCCAACTGGCATTTAGAGT
GGCACTTCGACATATTGACATTATGGGCTTTGGCTAAAACTGGAGAGATTTCTGCAGCAGAAGAGTTACTTGGGAGCTTGGAGTCCCGATTGTTGAAAATGACGTCAAAG
AAACAAGAAAAGATGCAAAGAGGAATGCTGCTTGCAGAAGCTGTCTACAAGTACGGGAGAGGAGATTACAAACGCGCACTAGACTTACTTGGTCTGGATTTTGATGCAAA
TGACTATAAGATGATTGGTGCATCGAATGAACAGGTTGATGTATTTAATGAAGTATGGTACGACATCTTGATGAACACAGTACACGCTTCAAAGGCAATCGAAGTAATCA
AGAAGCAACTCGAGACGAGGGAAGGAGTTCCTTTTATGTGGCGCCTTTTGGAGAGAGGGTACAGCAAAACAGGAAGGCCGGAGGAAGCTGCCATTGCCGGAGGCAAAGCC
GGAAGCCTGGAGAAGGCATATTTCAAGTAGATGCCTTGAAGTTGCAGGTCTTCATGATCTTCTGCCAGAGTTGTTCTGTGTTTCCGTTTTGGACAAAAAATTTAAACCCA
CGTAGATTACGAAGATTACTGCCTGTGTAATCTTTTAGTTAACAACTGCTGCCATGTAAATTATGACAAAGAGGGGATTCAGATATTTAGTTTGTTTCTCGAAAAGTAGA
GAATATGAAATTAGCGGTTTTGTTATTTCTTCACAACTCCTTGAGCAAAGTGGTAACAGAGCAAATAAAGGTGTAAAATTGGGAAAATAAGAAACAGCAAAAGGATTTTA
CATTTTACAGACTTCTTTCTACTCTCAGCATCGGGCATCAGCTGGGTCGCTAAAGCCACGTGGATCTTTGACAATTTACAGGCAAGCATCCAAAAATTGTTGTGACCTGT
CGATGTAGGTACTGTGGACTAGTCAAACCTCCCACATGCGCAAGTGGCCCCCACGAAAATACCACGTCAACCAATCAACTTCTAGCCACGTCAGCACCCACGTTCGAAAT
CGACGACCGAACCACTCCACCACGTGTGCGGTTCCGAATCCGCCGAAGCTCCTCCACGATCTCCTTGGCGTCGGGGCGGTCGTCCTTGTCCGCCGCCACGCACCGGAACG
CCAGCTCCGCCACCGCTTCGACGCCGTCGATAACCTCTCCGTCGATACCCAAAACGGAATCCACCACTTGATGAAGCTGACCCATTTGGATCTTCGACACTACCAGATCC
GCCAGAGCCATTTCTCTCCTGTCCCGGCTCTGATCCACCGCTTTCAGACCAGAAATCAGCTCCAGTAGCACTACTCCGAAGCTGTACACGTCACTTTTCTCCGTCAACCG
GAAGGACCGGTGGTAGTCCGGATCCAAGTAACCGGGCGTCCCCTGAGGTCCGGTGCAGACATACCCAGACGAAGACGACGTCGTATCGGAGAAAACCAGCAGCCTCGAAA
GCCCAAAATCTCCAACTTTGATTCTCATATCTTTCTCCACAAAAATATTCGAGGAGGTGATGTCTCTGTGAACAATCGGCGGCACCACCGAGAAATGCAGATACTCCATG
GCCATGGCGATTTGCAGAGCAATTTCAATCCTCACTTGCCACGTCAGAGATCCCTTCCGGTACAAGCTCTTCGGGCCATGGAGATGGTCGGCGAGCGTGCCATTGGGGAC
ATAATCGTAAACTAGGATAAGCCCTCTTGGGTCGCTGCAATACCCATGAAGCCTAACGAGATTCGGGTGATTAATCGAGGAAAGAATCAAGATTTCGTTACAGAACGACT
TGGTGAAGAAAGCCCTACCGGAGGAGGCGGCGGCGGTGGCGTGGTGCTTGTGAAGATACTTCACGGCCACTAATCGGCCGTCGTTGAGCTGACCCAAATAAACAGACCCA
AATCCACCGTCGCCCAATTTCCTTTTGGGGTCGAATCTGTTGGTTGAGGATTCGAGTTCTTCGTAGGGGAAAACTGGTGGGAGAAGATTGGGCGAGCGATGACGACTGAG
GAACTGGGTCGTGGGGTCTACTTCAATGGCGAGGGATCTCAACCATCTGGACCTGAAAAATGCCATCGTCACCGCGATGATCAGTAACAAACACATCAACGCGAAAACAG
AGGACAGAATAGCGATTCGATTAGGGTAGCCATGATGAGTCCATGGCGATGCGTATCGAGTCTTATGATAACATTTAAATTCCCTATCTGGGTCTGAGGAATCGAAGCCA
CACACACCATTTATAGCTTCACAATCATTACACTTCACAAAGTAAGGGTCCTGATCTGCGTCCCACTCAACTTCAAGCCCGAATCTGAGGAATTTGTCCAGAAACTCCAA
AACCTCACCCTGACAGCCCTGCTCCGTCACGGGCTGGCGCTCGACGCCGCACCCATGGAGCAAATTAACTGGTTTCTTGATGAGATTGCACTCCCATGGACAATGGCTGC
AGTTGGGAAGATGCGGAGGCGAACAGGGGCGAAGAAGAGACAGTCGAGAACAAGACCCGTCAGAGATTCTGAATGGCGAGCCCGAGAGATTAATCGAGCGGCTGGGAATG
GAGGGGAAAAGATCAGAAAAGCAGCTCCGGTTCGATTCTTTGACTGTGGCAGTGACATTGAGTGGAAGAGGGGAAAGGAGGAGAGTGGTGGAGCTCGTATTGAAACTCAA
GAGAGAAAAAGAAACGCCATTGATGGAGATGATAGAGTGGGGAGAAGAGCAACGCACCTGGAAAGAAGGGTGCCCATAGCCAAGAACAGAAGAAAACGGGAAAACGAGTT
TGGAAGAGAAAGGAGGACAGGGAAACTTCAAAGACTCTGCCGCCGCCGCCGCCGCCCCATCTGCTGAAAACAGAGGACAGAGCTCGAGGAAGAAGACAACAATGGCGAAA
ACGGAAATAGAGCAAGACGGCGAAGGCATAGAAAGACGAAATATGCAGAGAAAGGAAGAGGGCAAGAAGAGAAGTGAGATCTAAATCTGAAATCTAAGGAAAGGAAAGGA
AAAGAGATTCCTTTAGCAATGTTGTTGGAATTTGGAAGGAGGGAAAAGAGGAGTGGGAGGAGGAGAAAGAAGGGAGGTCACGTGAAGGCAAGGGCGAGTTTTTGGCTTTG
GGAGTGTGAAGACAAAACAAAAACTTTAAAAGCAGCACTGCGCCTCAGGCTCATCCAAAGTCCAAAGTTTCTTCCTTGTTTTTTTTTTTAATTTTGAAATCTGCTTTTGC
TTTTGCTTTTTGTTGTGTCAATTTTCTCAAACCTTTGCTCACTAAACTGTTGACCCAAGTCGGTTTTTGGGTACATGTGAGAGAGCGTTTTTTCTTTTAATTGTTAGACA
TTTGTGGAAAACTTTTTTGACTTTTCTATGTAACATTGAAATGAAAGTTGAACTTCTCCC
Protein sequenceShow/hide protein sequence
MGDGVKLDKWGYEIRTSSDACIAAINAYYDQVLSYGRRRSVILEAPVHDKDCVLANTLVAHFLSSSEPSRVLHHLRAAEAGLDHATSYERAVFDAISCLISKDRDDDVAV
ELHAELLKKFPKDLVSLKRAQVLCFYMGDANLSLALVQQVPLLPKNVVLPHNQEEDFIYGMLAFPLLELGCMAEAENAARRGLDINKKEGWAQHALCHVLQHKCHFKEAV
EFMEACSPSWSDCSSFMVTHNWWHVALCYLEANFPLNKILEIYDKYIWKEVEKPDAMGPEVYLNAVGLMLRLFVRGEFDQCEGRLKILANVLTDKANWHLEWHFDILTLW
ALAKTGEISAAEELLGSLESRLLKMTSKKQEKMQRGMLLAEAVYKYGRGDYKRALDLLGLDFDANDYKMIGASNEQVDVFNEVWYDILMNTVHASKAIEVIKKQLETREG
VPFMWRLLERGYSKTGRPEEAAIAGGKAGSLEKAYFK