; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g04310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g04310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUPF0481 protein At3g47200-like
Genome locationchr8:3146853..3148196
RNA-Seq ExpressionMoc08g04310
SyntenyMoc08g04310
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131631.1 UPF0481 protein At3g47200-like [Momordica charantia]5.7e-12260.15Show/hide
Query:  ITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCF
        I+EECSIYR+SK LHNI+D A+ PQAISIGPFHHGR+ELMAME+LKL FL  YL +VGM+  AAF+IAR WE RAR+CYAEPI+M S DFV ++LVD  F
Subjt:  ITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCF

Query:  LVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILECLFKRSGRSCIYFT-CRYLGGWYIGVARL--PKVPAREANHLVDLLSFYYAIP
        LV   I+    SG+       LF AI  D+  DLILLENQLP+FIL  LF++   S  +    R    WY+G   L    + AR+ NHLVD LSFYYA+P
Subjt:  LVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILECLFKRSGRSCIYFT-CRYLGGWYIGVARL--PKVPAREANHLVDLLSFYYAIP

Query:  TVTISGN---NNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFR--DGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKC-LQYFSFLDDLIST
          T + +   N  +K   PPTAT+L EAGVRFQKA + K I DI F+  +GVL IP  +I  +FET +RNLLA+EH YH+G D++C +QY  FLDDLIST
Subjt:  TVTISGN---NNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFR--DGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKC-LQYFSFLDDLIST

Query:  EKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL
        E+DV LLVKAGII+N IGGN E+++K+FN++ K  ++   FYYY+QIS+ LRKYC+TPWHRW+ASLKRDYFNSPWTSISFLAAT  ILLTVVQT+YS +
Subjt:  EKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL

XP_022131632.1 UPF0481 protein At3g47200-like [Momordica charantia]2.0e-22889.71Show/hide
Query:  MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLR
        MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLR
Subjt:  MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLR

Query:  FLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILEC
        FLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQ                               
Subjt:  FLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILEC

Query:  LFKRSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIP
                       YLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIP
Subjt:  LFKRSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIP

Query:  CFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDT
        CFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDT
Subjt:  CFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDT

Query:  PWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQYPNK
        PWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQYPNK
Subjt:  PWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQYPNK

XP_022132066.1 UPF0481 protein At3g47200-like [Momordica charantia]1.2e-14060.66Show/hide
Query:  MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHV-VSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKL
        MED+HIET+               D N +++E +    HV +S++N L+KLHPI+EECSIYRVSKRLHNI+D A+ PQAISIGPFHHG++E MAMEQLKL
Subjt:  MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHV-VSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKL

Query:  RFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSG-SQTQLSYPLFSAISTDLLHDLILLENQLPYFIL
        RFL  YLRRVGM ++ AF+IA+ WETRARKCYAE IDMKS++FV MMLVDG FLVEF  M Y+++  +Q  L+Y LF AI  D+  DLILLENQLP+FIL
Subjt:  RFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSG-SQTQLSYPLFSAISTDLLHDLILLENQLPYFIL

Query:  ECLFKRSGRSC--IYFTCRYLGGWYIGVARL--PKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKW---ERPPTATQLKEAGVRFQKAIDWKH-ITDI
        ECL  +   S   + FT  +   WY G   L   K+  ++ NHLVD LSFYYA+PTVT  G N+  K+   E PPTAT+L EAGV FQKA + K  I DI
Subjt:  ECLFKRSGRSC--IYFTCRYLGGWYIGVARL--PKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKW---ERPPTATQLKEAGVRFQKAIDWKH-ITDI

Query:  SFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCL-QYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQ
         F+DGVL IP  +I  +FET VRNLLA+EH YH+G D++CL QY  FLD+LISTE+DV LLVKAGII+NNIGGN+ED++KLFND+ K  NIS  FYYY+ 
Subjt:  SFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCL-QYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQ

Query:  ISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY
        IS+DL KYC+T WHR  ASL+RDYFN+PW  ISFLAATFL+LLT +Q +YSA+ Y
Subjt:  ISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY

XP_022155136.1 UPF0481 protein At3g47200-like [Momordica charantia]4.1e-20582.25Show/hide
Query:  MEDNHIETHQQKNNDDACYGISNMDENNEVE-ETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKL
        MEDNH+ THQQ N DD CYGI+N+ +    +  TQ NS HV SI+  L  LHP+TEECSIYRVSKRLHNIHD AFAPQ ISIGPFHHGR+ELMAMEQLKL
Subjt:  MEDNHIETHQQKNNDDACYGISNMDENNEVE-ETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKL

Query:  RFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILE
        RFLHGYLRRVGM+VKAAF IAR+WE RARKCYAEPIDMKSEDFVTMMLVDGCFLVE FIM YEFSG+QTQLSYPLFSAISTDLLHDLI LENQLPYFILE
Subjt:  RFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILE

Query:  CLFKRSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKI
        CLFK S  SCI     YLGGWYIG   LPKVP REANHLVDLLSFYYAIPTVTISG NNGQ+WERPPTATQLKEAGV+FQKA D KHITDISF+DGVLKI
Subjt:  CLFKRSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKI

Query:  PCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCD
        PCFQITT+FET VRNLLAFEHYYHLG  K+CLQYFSFLDDLI+TEKDV LLV+AGIISNNIGGN+E+IAKLFNDMVK+ NI+S  +YYSQISL+LRKYC 
Subjt:  PCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCD

Query:  TPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY
        T  HRW ASL+RDYFN+PWT ISFLAATFLILLTVVQTLYSAL Y
Subjt:  TPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]4.1e-9646.62Show/hide
Query:  CYGISNMDENNEVEETQGNSIHVV--SIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKA
        C G+  +  N         ++  V  SI+  L++L P+ EEC+I+RV +RL   +  A+ PQ ISIGPFHHGR++LM MEQ KLRFL  YLRR    ++ 
Subjt:  CYGISNMDENNEVEETQGNSIHVV--SIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKA

Query:  AFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYP--LFSAISTDLLHDLILLENQLPYFILECLFKR----SGRSC
           I R WET AR CYAEPI+M S++FV MMLVDGCF+VE  +M     GS+T+  +   LF A+ TDL  DLI+LENQLP+F+L+ LF +    +G S 
Subjt:  AFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYP--LFSAISTDLLHDLILLENQLPYFILECLFKR----SGRSC

Query:  IYFTCRYLGGWYIGVARLPKVP------AREANHLVDLLSFYYAIPTVTISGNNNG-----QKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLK
        +  T  +     +   R  ++P        + NHLVD LSFYYA    ++S  ++      +K   PPT T+L EAG+ F+KA+  KHI DISF+D VL+
Subjt:  IYFTCRYLGGWYIGVARLPKVP------AREANHLVDLLSFYYAIPTVTISGNNNG-----QKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLK

Query:  IPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYC
        IP  +I   FET VRNL+AFE Y++ G  K  +QYF FL+ LIS E+DV LLVKA II+N IGGN+++++ LFND+ K   +      ++ I+  L ++C
Subjt:  IPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYC

Query:  DTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL
           W++  ASL+RDYFN+PW  ISF+AA FLILLT +QTL+SA+
Subjt:  DTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL

TrEMBL top hitse value%identityAlignment
A0A6J1BQ17 UPF0481 protein At3g47200-like2.8e-12260.15Show/hide
Query:  ITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCF
        I+EECSIYR+SK LHNI+D A+ PQAISIGPFHHGR+ELMAME+LKL FL  YL +VGM+  AAF+IAR WE RAR+CYAEPI+M S DFV ++LVD  F
Subjt:  ITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCF

Query:  LVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILECLFKRSGRSCIYFT-CRYLGGWYIGVARL--PKVPAREANHLVDLLSFYYAIP
        LV   I+    SG+       LF AI  D+  DLILLENQLP+FIL  LF++   S  +    R    WY+G   L    + AR+ NHLVD LSFYYA+P
Subjt:  LVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILECLFKRSGRSCIYFT-CRYLGGWYIGVARL--PKVPAREANHLVDLLSFYYAIP

Query:  TVTISGN---NNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFR--DGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKC-LQYFSFLDDLIST
          T + +   N  +K   PPTAT+L EAGVRFQKA + K I DI F+  +GVL IP  +I  +FET +RNLLA+EH YH+G D++C +QY  FLDDLIST
Subjt:  TVTISGN---NNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFR--DGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKC-LQYFSFLDDLIST

Query:  EKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL
        E+DV LLVKAGII+N IGGN E+++K+FN++ K  ++   FYYY+QIS+ LRKYC+TPWHRW+ASLKRDYFNSPWTSISFLAAT  ILLTVVQT+YS +
Subjt:  EKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL

A0A6J1BR71 UPF0481 protein At3g47200-like5.9e-14160.66Show/hide
Query:  MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHV-VSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKL
        MED+HIET+               D N +++E +    HV +S++N L+KLHPI+EECSIYRVSKRLHNI+D A+ PQAISIGPFHHG++E MAMEQLKL
Subjt:  MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHV-VSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKL

Query:  RFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSG-SQTQLSYPLFSAISTDLLHDLILLENQLPYFIL
        RFL  YLRRVGM ++ AF+IA+ WETRARKCYAE IDMKS++FV MMLVDG FLVEF  M Y+++  +Q  L+Y LF AI  D+  DLILLENQLP+FIL
Subjt:  RFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSG-SQTQLSYPLFSAISTDLLHDLILLENQLPYFIL

Query:  ECLFKRSGRSC--IYFTCRYLGGWYIGVARL--PKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKW---ERPPTATQLKEAGVRFQKAIDWKH-ITDI
        ECL  +   S   + FT  +   WY G   L   K+  ++ NHLVD LSFYYA+PTVT  G N+  K+   E PPTAT+L EAGV FQKA + K  I DI
Subjt:  ECLFKRSGRSC--IYFTCRYLGGWYIGVARL--PKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKW---ERPPTATQLKEAGVRFQKAIDWKH-ITDI

Query:  SFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCL-QYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQ
         F+DGVL IP  +I  +FET VRNLLA+EH YH+G D++CL QY  FLD+LISTE+DV LLVKAGII+NNIGGN+ED++KLFND+ K  NIS  FYYY+ 
Subjt:  SFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCL-QYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQ

Query:  ISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY
        IS+DL KYC+T WHR  ASL+RDYFN+PW  ISFLAATFL+LLT +Q +YSA+ Y
Subjt:  ISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY

A0A6J1BRK2 UPF0481 protein At3g47200-like9.9e-22989.71Show/hide
Query:  MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLR
        MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLR
Subjt:  MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLR

Query:  FLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILEC
        FLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQ                               
Subjt:  FLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILEC

Query:  LFKRSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIP
                       YLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIP
Subjt:  LFKRSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIP

Query:  CFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDT
        CFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDT
Subjt:  CFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDT

Query:  PWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQYPNK
        PWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQYPNK
Subjt:  PWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQYPNK

A0A6J1DQT2 UPF0481 protein At3g47200-like2.0e-20582.25Show/hide
Query:  MEDNHIETHQQKNNDDACYGISNMDENNEVE-ETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKL
        MEDNH+ THQQ N DD CYGI+N+ +    +  TQ NS HV SI+  L  LHP+TEECSIYRVSKRLHNIHD AFAPQ ISIGPFHHGR+ELMAMEQLKL
Subjt:  MEDNHIETHQQKNNDDACYGISNMDENNEVE-ETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKL

Query:  RFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILE
        RFLHGYLRRVGM+VKAAF IAR+WE RARKCYAEPIDMKSEDFVTMMLVDGCFLVE FIM YEFSG+QTQLSYPLFSAISTDLLHDLI LENQLPYFILE
Subjt:  RFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILE

Query:  CLFKRSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKI
        CLFK S  SCI     YLGGWYIG   LPKVP REANHLVDLLSFYYAIPTVTISG NNGQ+WERPPTATQLKEAGV+FQKA D KHITDISF+DGVLKI
Subjt:  CLFKRSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKI

Query:  PCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCD
        PCFQITT+FET VRNLLAFEHYYHLG  K+CLQYFSFLDDLI+TEKDV LLV+AGIISNNIGGN+E+IAKLFNDMVK+ NI+S  +YYSQISL+LRKYC 
Subjt:  PCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCD

Query:  TPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY
        T  HRW ASL+RDYFN+PWT ISFLAATFLILLTVVQTLYSAL Y
Subjt:  TPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X22.0e-9646.62Show/hide
Query:  CYGISNMDENNEVEETQGNSIHVV--SIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKA
        C G+  +  N         ++  V  SI+  L++L P+ EEC+I+RV +RL   +  A+ PQ ISIGPFHHGR++LM MEQ KLRFL  YLRR    ++ 
Subjt:  CYGISNMDENNEVEETQGNSIHVV--SIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKA

Query:  AFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYP--LFSAISTDLLHDLILLENQLPYFILECLFKR----SGRSC
           I R WET AR CYAEPI+M S++FV MMLVDGCF+VE  +M     GS+T+  +   LF A+ TDL  DLI+LENQLP+F+L+ LF +    +G S 
Subjt:  AFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYP--LFSAISTDLLHDLILLENQLPYFILECLFKR----SGRSC

Query:  IYFTCRYLGGWYIGVARLPKVP------AREANHLVDLLSFYYAIPTVTISGNNNG-----QKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLK
        +  T  +     +   R  ++P        + NHLVD LSFYYA    ++S  ++      +K   PPT T+L EAG+ F+KA+  KHI DISF+D VL+
Subjt:  IYFTCRYLGGWYIGVARLPKVP------AREANHLVDLLSFYYAIPTVTISGNNNG-----QKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLK

Query:  IPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYC
        IP  +I   FET VRNL+AFE Y++ G  K  +QYF FL+ LIS E+DV LLVKA II+N IGGN+++++ LFND+ K   +      ++ I+  L ++C
Subjt:  IPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYC

Query:  DTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL
           W++  ASL+RDYFN+PW  ISF+AA FLILLT +QTL+SA+
Subjt:  DTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL

SwissProt top hitse value%identityAlignment
Q9SD53 UPF0481 protein At3g472004.1e-3025.93Show/hide
Query:  EECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYL---RRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGC
        E C I+RV +    ++  A+ P+ +SIGP+H+G + L  ++Q K R L  +L   ++  ++     K   D E + RK Y+E +     D + MM++DGC
Subjt:  EECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYL---RRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGC

Query:  FLVEFFIMGYEFSGSQTQLSYPLFSA--ISTDLLHDLILLENQLPYFILECLFKRS--------GRSCIYFTCRYL---GGWYIGVARLPKVPAREANHL
        F++  F++    SG+      P+FS   + + +  DL+LLENQ+P+F+L+ L+  S         R   +F    +   G ++       K    +A HL
Subjt:  FLVEFFIMGYEFSGSQTQLSYPLFSA--ISTDLLHDLILLENQLPYFILECLFKRS--------GRSCIYFTCRYL---GGWYIGVARLPKVPAREANHL

Query:  VDLL----------SFYYAIPTVTI------SGNNNGQKWERPP---TATQLKEAGVRFQ-KAIDWKHITDISFRDGVLKIPCFQITTSFETRVRNLLAF
        +DL+          S   + P V +      SGN      +  P   +A +L+  G++F+ +      I ++  +   L+IP  +      +   N +AF
Subjt:  VDLL----------SFYYAIPTVTI------SGNNNGQKWERPP---TATQLKEAGVRFQ-KAIDWKHITDISFRDGVLKIPCFQITTSFETRVRNLLAF

Query:  EHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPW
        E +Y      +   Y  F+  L++ E+DV  L    +I  N  G++ ++++ F  + K         Y + +   + +Y    ++   A  +  +F SPW
Subjt:  EHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPW

Query:  TSISFLAATFLILLTVVQTLYSALQYPN
        T +S  A  F+ILLT++Q+  + L Y N
Subjt:  TSISFLAATFLILLTVVQTLYSALQYPN

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)3.1e-4927.96Show/hide
Query:  EDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLH-----PITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQ
        +D +I  +QQ  +D A Y I    +++  +        V+SI ++L++ H      +  +  IYRV   L    + ++ PQ +S+GP+HHG++ L +M++
Subjt:  EDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLH-----PITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQ

Query:  LKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSY----PLFSAIST--DLLHDLILLE
         K R ++  L+R    +K      R+ E +AR CY  P+ + S +F+ M+++DGCF++E F    E     T+L Y    P+F+   +   +  D+++LE
Subjt:  LKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSY----PLFSAIST--DLLHDLILLE

Query:  NQLPYFILECLFK-----RSGRSCI-YFTCRYLGGWY--------IGVARLPKVPAREAN----------HLVDLLSFYYAIPTVTISGNNNGQKWERPP
        NQLP F+L  L +     R+    +     R+              G ++L    AR+ +          H +D+        +         ++W R  
Subjt:  NQLPYFILECLFK-----RSGRSCI-YFTCRYLGGWY--------IGVARLPKVPAREAN----------HLVDLLSFYYAIPTVTISGNNNGQKWERPP

Query:  ------------TATQLKEAGVRFQKAIDWKHITDISFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAG
                      T+LKEAG++F++        D+ F++G L+IP   I    ++   NL+AFE   H+        Y  F+D+LI + +DV  L   G
Subjt:  ------------TATQLKEAGVRFQKAIDWKHITDISFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAG

Query:  IISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY
        II + +G + E +A LFN + +     +   Y S++S+++ +Y D  W+ W+A+LK  YFN+PW  +SF AA  L++LT  Q+ Y+   Y
Subjt:  IISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY

AT3G50160.1 Plant protein of unknown function (DUF247)2.7e-4529.19Show/hide
Query:  EDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPIT----EECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQL
        E  +IE  ++K  +     + ++++ NE +  +   I V+S+ +++K L        +   IYRV   L      ++ PQ +SIGP+HHG + LM ME+ 
Subjt:  EDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPIT----EECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQL

Query:  KLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYE-FSGSQTQLSYPLFS--AISTDLLHDLILLENQLP
        K R ++  + R   D++      ++ E +AR CY  PI+M   +F+ M+++DG F++E F    E F       + P+F    +   +  D+++LENQLP
Subjt:  KLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYE-FSGSQTQLSYPLFS--AISTDLLHDLILLENQLP

Query:  YFILECLFKRSGRSCIYFTCRYLGGWYIGVARLPKVPARE------ANHLVDLLSFYYAIPTVTISGNN---NGQKWERPPTATQLKEAGVRFQKAIDWK
        + +L+ L +      +      L   +      P +P RE        H +D+L       + T   +    N Q  +     T+L+ AGV F +  +  
Subjt:  YFILECLFKRSGRSCIYFTCRYLGGWYIGVARLPKVPARE------ANHLVDLLSFYYAIPTVTISGNN---NGQKWERPPTATQLKEAGVRFQKAIDWK

Query:  HITDISFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFY
        H  DI F++G LKIP   I    ++   NL+AFE   H+   KK   Y  F+D+LI++ +DV  L   GII N +G + E ++ LFN + K      +  
Subjt:  HITDISFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFY

Query:  YYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY
        Y S ++ ++  Y    W+  KA+L+  YFN+PW   SF+AA  L++ T  Q+ ++   Y
Subjt:  YYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY

AT3G50170.1 Plant protein of unknown function (DUF247)3.9e-4428.78Show/hide
Query:  EDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLH-----PITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQ
        +D +++ H+Q +++    G   ++E    EET G+S  V+SI ++L++        I  +  IYRV   L      ++ PQ +S+GP+HHG++ L  ME+
Subjt:  EDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLH-----PITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQ

Query:  LKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSY----PLFS--AISTDLLHDLILLE
         K R L+  L+R+   ++      R+ E +AR CY  PI +   +F  M+++DGCF++E F    E     T++ Y    P+F+   +   +  D+I+LE
Subjt:  LKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSY----PLFS--AISTDLLHDLILLE

Query:  NQLPYFILECLFK------------------------RSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLL--SFYYAIPTVT-------ISGNN
        NQLP F+L+ L +                         +G +        L  W      L  +  +   H +D+   S   + PT         ++ N 
Subjt:  NQLPYFILECLFK------------------------RSGRSCIYFTCRYLGGWYIGVARLPKVPAREANHLVDLL--SFYYAIPTVT-------ISGNN

Query:  ---NGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAG
           + ++ +     T+L+EAGV+F+K        DI F++G L+IP   I    ++   NL+AFE   H+        Y  F+D+LI++ +DV  L   G
Subjt:  ---NGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAG

Query:  IISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY
        II + +G + E +A LFN + +         + S++S D+ +Y +  W+  KA+L   YFN+PW   SF AA  L+LLT+ Q+ Y+   Y
Subjt:  IISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSALQY

AT4G31980.1 unknown protein1.6e-6635.71Show/hide
Query:  CYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKAAF
        C+    M++N      +G+++ V SI+ +L  L  ++ +C IY+V  +L  ++  A+ P+ +S GP H G+EEL AME  K R+L  ++ R    ++   
Subjt:  CYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVGMDVKAAF

Query:  KIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGY--EFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILECLF-------KRSGRS
        ++AR WE  AR CYAE + + S++FV M++VDG FLVE  +  +     G   ++     S + TD+  D+IL+ENQLP+F+++ +F       ++   S
Subjt:  KIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGY--EFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILECLF-------KRSGRS

Query:  CIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIPCFQITTSF
         I    R+   +++      K    E  H VDLL   Y +P   I       K +  P AT+L  AGVRF+ A     + DISF DGVLKIP   +    
Subjt:  CIYFTCRYLGGWYIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIPCFQITTSF

Query:  ETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKAS
        E+  +N++ FE       +K  L Y   L   I +  D  LL+ +GII N + GN  D++ LFN + K   I    +Y+S +S +L+ YC+TPW+RWKA 
Subjt:  ETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKAS

Query:  LKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL
        L+RDYF++PW   S  AA  L+LLT +Q++ S L
Subjt:  LKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL

AT5G11290.1 Plant protein of unknown function (DUF247)1.8e-4933.52Show/hide
Query:  MEQLKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMG-YEFSGSQTQLSYPLFSAISTDLLHDLILLENQ
        ME  KLR+L  ++ R  + ++   ++AR WE RAR CY E + + S+++V M++VD  FLVE  +   ++         Y     I  D+ HD++LLENQ
Subjt:  MEQLKLRFLHGYLRRVGMDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMG-YEFSGSQTQLSYPLFSAISTDLLHDLILLENQ

Query:  LPYFILECLF-------KRSGRSCIYFTCRYLGGWYIGVARLPK-VPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDW
        LPYF++E +F        R           +   +++ +    + +   +  H VDLL   + +P V      + +  +   +A +++ AGV+ Q A + 
Subjt:  LPYFILECLF-------KRSGRSCIYFTCRYLGGWYIGVARLPK-VPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDW

Query:  KHITDISFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSF
            DISF +GVL IP  +I    E+  RN++ FE  + L  D   + Y  FL   I +  D  L +  GII N   GN ED+++LFN ++K    S S 
Subjt:  KHITDISFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKCLQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSF

Query:  YYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL
        +YY  +  +L+ +C+ PW++WKA+L+RDYF++PW++ S +AA  L+LLT VQ + S L
Subjt:  YYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYSAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACAATCACATTGAAACGCATCAACAGAAGAATAACGACGACGCATGTTATGGTATTTCAAATATGGATGAGAATAATGAAGTTGAAGAGACCCAAGGTAATTC
AATCCATGTGGTATCCATCGAAAACAGGCTTAAGAAATTGCATCCTATCACTGAAGAATGCAGCATCTATCGGGTTTCCAAACGACTACACAATATTCATGATACGGCCT
TTGCGCCTCAAGCCATCTCCATCGGTCCTTTTCACCATGGTCGAGAGGAGTTGATGGCCATGGAACAACTTAAACTCCGCTTTCTCCATGGTTATCTACGCCGCGTAGGA
ATGGACGTTAAGGCTGCATTTAAAATTGCTCGGGATTGGGAGACACGAGCCCGCAAGTGCTACGCAGAACCCATAGACATGAAGAGCGAGGACTTTGTGACAATGATGCT
TGTGGATGGATGTTTCTTAGTGGAGTTCTTCATAATGGGTTATGAATTCAGTGGAAGTCAAACCCAGTTAAGCTATCCATTGTTCAGTGCTATAAGTACTGACTTACTTC
ACGACTTAATACTGCTTGAGAATCAACTCCCTTATTTTATTCTTGAATGTCTATTCAAACGTAGTGGCCGTTCTTGTATATATTTTACGTGCAGATATCTCGGTGGTTGG
TATATAGGAGTGGCTCGGCTTCCTAAGGTGCCCGCGAGAGAAGCAAACCACTTGGTTGATTTATTAAGCTTTTACTACGCCATCCCCACAGTGACAATTAGTGGAAACAA
CAACGGCCAGAAATGGGAGCGTCCCCCAACTGCAACCCAGCTTAAAGAGGCTGGTGTTAGGTTCCAGAAAGCAATAGACTGGAAACACATTACAGACATAAGCTTCAGAG
ACGGTGTTTTGAAGATCCCTTGTTTCCAAATTACGACGAGCTTTGAAACCCGTGTGCGAAACCTGTTGGCGTTTGAGCACTATTACCACTTGGGGCGTGATAAGAAGTGT
TTACAATATTTTTCATTTCTGGACGATTTGATAAGCACGGAGAAAGACGTAGGTTTACTTGTGAAGGCAGGAATCATCTCTAATAATATCGGCGGTAATCATGAAGACAT
TGCGAAGTTGTTTAACGATATGGTCAAATATGGCAACATTTCATCCTCGTTTTACTACTACAGCCAAATTAGCCTGGATTTACGTAAGTACTGCGACACACCGTGGCACC
GGTGGAAGGCTTCACTAAAACGTGACTATTTCAATAGTCCATGGACTTCTATCTCCTTCCTTGCTGCAACCTTCCTCATTCTCCTCACTGTCGTGCAAACCCTCTACTCT
GCTCTACAATATCCCAACAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGACAATCACATTGAAACGCATCAACAGAAGAATAACGACGACGCATGTTATGGTATTTCAAATATGGATGAGAATAATGAAGTTGAAGAGACCCAAGGTAATTC
AATCCATGTGGTATCCATCGAAAACAGGCTTAAGAAATTGCATCCTATCACTGAAGAATGCAGCATCTATCGGGTTTCCAAACGACTACACAATATTCATGATACGGCCT
TTGCGCCTCAAGCCATCTCCATCGGTCCTTTTCACCATGGTCGAGAGGAGTTGATGGCCATGGAACAACTTAAACTCCGCTTTCTCCATGGTTATCTACGCCGCGTAGGA
ATGGACGTTAAGGCTGCATTTAAAATTGCTCGGGATTGGGAGACACGAGCCCGCAAGTGCTACGCAGAACCCATAGACATGAAGAGCGAGGACTTTGTGACAATGATGCT
TGTGGATGGATGTTTCTTAGTGGAGTTCTTCATAATGGGTTATGAATTCAGTGGAAGTCAAACCCAGTTAAGCTATCCATTGTTCAGTGCTATAAGTACTGACTTACTTC
ACGACTTAATACTGCTTGAGAATCAACTCCCTTATTTTATTCTTGAATGTCTATTCAAACGTAGTGGCCGTTCTTGTATATATTTTACGTGCAGATATCTCGGTGGTTGG
TATATAGGAGTGGCTCGGCTTCCTAAGGTGCCCGCGAGAGAAGCAAACCACTTGGTTGATTTATTAAGCTTTTACTACGCCATCCCCACAGTGACAATTAGTGGAAACAA
CAACGGCCAGAAATGGGAGCGTCCCCCAACTGCAACCCAGCTTAAAGAGGCTGGTGTTAGGTTCCAGAAAGCAATAGACTGGAAACACATTACAGACATAAGCTTCAGAG
ACGGTGTTTTGAAGATCCCTTGTTTCCAAATTACGACGAGCTTTGAAACCCGTGTGCGAAACCTGTTGGCGTTTGAGCACTATTACCACTTGGGGCGTGATAAGAAGTGT
TTACAATATTTTTCATTTCTGGACGATTTGATAAGCACGGAGAAAGACGTAGGTTTACTTGTGAAGGCAGGAATCATCTCTAATAATATCGGCGGTAATCATGAAGACAT
TGCGAAGTTGTTTAACGATATGGTCAAATATGGCAACATTTCATCCTCGTTTTACTACTACAGCCAAATTAGCCTGGATTTACGTAAGTACTGCGACACACCGTGGCACC
GGTGGAAGGCTTCACTAAAACGTGACTATTTCAATAGTCCATGGACTTCTATCTCCTTCCTTGCTGCAACCTTCCTCATTCTCCTCACTGTCGTGCAAACCCTCTACTCT
GCTCTACAATATCCCAACAAGTGA
Protein sequenceShow/hide protein sequence
MEDNHIETHQQKNNDDACYGISNMDENNEVEETQGNSIHVVSIENRLKKLHPITEECSIYRVSKRLHNIHDTAFAPQAISIGPFHHGREELMAMEQLKLRFLHGYLRRVG
MDVKAAFKIARDWETRARKCYAEPIDMKSEDFVTMMLVDGCFLVEFFIMGYEFSGSQTQLSYPLFSAISTDLLHDLILLENQLPYFILECLFKRSGRSCIYFTCRYLGGW
YIGVARLPKVPAREANHLVDLLSFYYAIPTVTISGNNNGQKWERPPTATQLKEAGVRFQKAIDWKHITDISFRDGVLKIPCFQITTSFETRVRNLLAFEHYYHLGRDKKC
LQYFSFLDDLISTEKDVGLLVKAGIISNNIGGNHEDIAKLFNDMVKYGNISSSFYYYSQISLDLRKYCDTPWHRWKASLKRDYFNSPWTSISFLAATFLILLTVVQTLYS
ALQYPNK