; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g07310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g07310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPlant protein of unknown function (DUF247)
Genome locationchr5:5202397..5203698
RNA-Seq ExpressionMoc05g07310
SyntenyMoc05g07310
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018572.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-17171.36Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        M+ SRALSH+ID+PA S+  S+EESLL S+E K+EAFCSSI IF+ P+EISI++R VFVP+KVSIGPFHHGAPHLESME+LKW YL AFLK+NPSV L  
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        L+E V KSESR+RKCYE EF+  DS KF+++M+LDCCF+LELLLRFS KRL+RRND VFTTPGLL DL+ DL+LLENQIPYFLL++VYE VQD  EE M 
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        LNDL FRFF+T+VAG+RQ VYDNF  +ADHLL++VHSCFLSTYPR+ET N+KSK+ ELP ASKLK+AGIK KNA + KS+LDIKFQNG LEIP L+V + 
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
        TE IL+NL+AYEI Q GS +QVKSY++FMSHLLQSD+D+K+L  RKIL + E DE QII NLKWM  ++ +LSGTYFAG+VQKLNE PDR +  WR+LRR
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR

Query:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
        NPVAIG+VAV  +VVIFVAAFFSA S+LQRRY+
Subjt:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR

XP_022138112.1 UPF0481 protein At3g47200-like [Momordica charantia]3.6e-246100Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
        TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR

Query:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
        NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
Subjt:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR

XP_022955709.1 UPF0481 protein At3g47200-like [Cucurbita moschata]4.5e-17271.59Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        M+ SRALSH+ID+PA S+  S+EESLL S+E K+EAFCSSI IF+ P+EISI++R VFVPAKVSIGPFHHGAPHLESME+LKW YL AFLK+NPSV L  
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        L+E V KSESR+RKCYE EF+  DS KF+++M+LDCCF+LELLLRFS KRL+RRND VFTTPGLL DL+ DL+LLENQIPYFLL++VYE VQD  EENM 
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        LNDL FRFF+T+VAG+RQ VYDNF  +ADHLL++VHSCFLSTYPR+ET N+KSK+ ELP ASKLK+AGIK KNA + KS+LDIKFQNG LEIP L+V + 
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
        TE IL+NL+AYEI Q GS +QVKSY++FMSHLLQSD+D+K+L  RKIL + E DE QII NLKWM  ++ +LSGTYFAG+VQKLNE PDR +  WR+LRR
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR

Query:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
         PVAIG+VAV  +VVIFVAAFFSA S+LQRRY+
Subjt:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR

XP_023526431.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]1.8e-17372.06Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        M+ SRALSH+ID+PA S+  S+EESLL S+E K+EAFCSSI IF+ P+EISI++R VFVPAKVSIGPFHHGAPHLESME+LKW YL AFLK+NPSV L  
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        L+E V KSESR+RKCYE EF+  DS KF+++M+LDCCF+LELLLRFS KRL+RRND VFTTPGLL DL+ DL+LLENQIPYFLL++VYE VQD  EENM 
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        LNDL FRFF+T+V G+RQ VYDNF  +ADHLL++VHSCFLSTYPR+ET N+KSK+ ELP ASKLK+AGIK KNA + KS+LDIKFQNG LEIP L+V + 
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
        TE IL+NL+AYEI Q GS +QVKSY++FMSHLLQSD+D+K+L  RKILI+ E DE QII NLKWM  +K +LSGTYFAG+VQKLNE PDR +  WR+LRR
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR

Query:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
        NPVAIG+VAV  +VVIFVAAFFSA S+LQRRY+
Subjt:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR

XP_038880915.1 UPF0481 protein At3g47200-like [Benincasa hispida]7.1e-17872.75Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        M  S+  SH+IDI AI++  S EESLL S+E K+EAFCSSI IF+ P++ISI+++ VFVPAKVSIGPFHHGAPHLE ME+LKW YL  FLKHNPS+ LDD
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        L+E V KSESR+RKCYE EF+DLDS KF++MM+LDCCF+LELLLR+S KR +R NDPVF TPGLL DL+ DL+LLENQIPYFLL EVYE VQD  EENM 
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        LNDL FRFF+T+VAG+R+ VYDNF  +ADHLL++VHSCFLSTYPR+ET N+KSK+ ELP ASKLK+AGIKFKNA +PKS+LDIKFQ G LEIP L V + 
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
        TE IL+NL AYEI Q GS  QVKSY++FMSHLLQSDED+K+LC RKILI+LE DE QII NLKWMR++K +LSGTYFAG+VQKLNE PDR +  WR LRR
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR

Query:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
        NPVAIGV AVW +VVIFVAAFFSA+SLLQRRY+
Subjt:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR

TrEMBL top hitse value%identityAlignment
A0A0A0L821 Uncharacterized protein3.5e-15464.2Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        M+PS  +SH I+I  IS+E   EESLL  +E K+EA CSS  I+K P EI+I++R VF+PAKVSIGPFHHGAPHLES+E LKW+YL  FL H PS+ L D
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        L++ V KSESR RKCYE EF+  D  +F+++M+LDCCF+LELLLR++ +R +R NDPVFTTPGLL DL+ DL+LLENQIPYFLL E+Y KV D  EENM 
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        L+DL  RFFRT+V G+R+ + DNF  +A+HLL++V+SCFLSTYP +ET N+K K+ ELP ASKLK+AGIKFKNA + KS+LDIKFQNG LEIP L V + 
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
        TETIL+NL AYEICQ G+  QVKSY++FMSHLLQSDED+K+LC +KIL  L+ +E QII  LKW+R+QK +LSGT+FAG+VQKL E PDR +  WRRLR 
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR

Query:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
        N  AI V  V  +VVIF AAFF+A S+LQRRY+
Subjt:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR

A0A1S3AY98 UPF0481 protein At3g47200-like7.7e-16267.67Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        M+ S  +SH I+IP IS+E S EESLL S+E K+EA CSS+ IFK P EI+I+ R VFVPAKVSIGPFHHGA HL+S+E+LKW YL  FLKHN S+ L D
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        L++ V KSESR++KCYE +F  LD  +F+ +M+LDCCF+LELLLR+S +R KRRNDPVFTTPGLL D+K DL+LLENQIPYFLL E+YEKV D REENM 
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        L+DL FRFFRT+V G+R+ + DNF  +ADHLL++VHSCFLSTYP ++T N+K K+ ELP ASKLK+AGIKFKNA + KS+LDIKFQNG LEIP L V + 
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
        TE IL+NL AYEI Q G+ QQVKSY+ FMSHLLQSD D+K+LC +KIL  LE DE QII NLKW+R+QK +LSGTYFAG+VQKLNE PDR +V WRRLRR
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR

Query:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
         P AIGV A   +VVIF AAFF+A S+LQRRY+
Subjt:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR

A0A6J1CA62 UPF0481 protein At3g47200-like1.7e-246100Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
        TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR

Query:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
        NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
Subjt:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR

A0A6J1GVU1 UPF0481 protein At3g47200-like2.2e-17271.59Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        M+ SRALSH+ID+PA S+  S+EESLL S+E K+EAFCSSI IF+ P+EISI++R VFVPAKVSIGPFHHGAPHLESME+LKW YL AFLK+NPSV L  
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        L+E V KSESR+RKCYE EF+  DS KF+++M+LDCCF+LELLLRFS KRL+RRND VFTTPGLL DL+ DL+LLENQIPYFLL++VYE VQD  EENM 
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        LNDL FRFF+T+VAG+RQ VYDNF  +ADHLL++VHSCFLSTYPR+ET N+KSK+ ELP ASKLK+AGIK KNA + KS+LDIKFQNG LEIP L+V + 
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR
        TE IL+NL+AYEI Q GS +QVKSY++FMSHLLQSD+D+K+L  RKIL + E DE QII NLKWM  ++ +LSGTYFAG+VQKLNE PDR +  WR+LRR
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRR

Query:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR
         PVAIG+VAV  +VVIFVAAFFSA S+LQRRY+
Subjt:  NPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR

A0A6J1IWZ4 UPF0481 protein At3g47200-like2.5e-15269.6Show/hide
Query:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD
        M+ SRALSH+ID+PA S+  S+EESLL S+E K+EAFCSSI IF+  +EISI++R VFVPAKVSIGPFHHGAPHLESME+LKW YL AFLK+NPSV L  
Subjt:  MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD

Query:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP
        L+E V KSESR+RKCYE EF+  DS KF+++M+LDCCF+LELLLR+S KRL+RRND VFTTPGLL DL+ DL+LLENQIPYFLL++VY  VQD  EENM 
Subjt:  LLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP

Query:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH
        LNDL FRFF+T+VAG+RQ VYDNF  +ADHLL+++HSCFLSTYPR+ET N+ SK+ ELP ASKLK+AGIK KN  + KS+LDIKFQNG LEIP L+V + 
Subjt:  LNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH

Query:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRL
        TE IL+NL+AYEI Q GS +QVKSY++FMSHLLQSD+D+K+L  RKIL + E DE QII NLKWMR +K +LSGTYFAG+VQKLN+  DR  VW + L
Subjt:  TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRL

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.6e-1520.16Show/hide
Query:  SIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFV
        ++ IF VP  +   + + + P +VSIGP+H   P L  ME  K            S    DL+E +   E ++R CY  ++   + +    +M +D  F+
Subjt:  SIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFV

Query:  LELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREEN-----MPLNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDI
        +E L  +S ++++   + V        ++  D++++ENQIP F+LR+  E   +S E         L  L       ++  +   +     Q+ +H+LD 
Subjt:  LELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREEN-----MPLNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDI

Query:  VHSCFLSTYPRIE----------------------------------------------------------------------TKNNKSKTAE-------
        ++   +   PRIE                                                                       + N++ T         
Subjt:  VHSCFLSTYPRIE----------------------------------------------------------------------TKNNKSKTAE-------

Query:  ------------LPRASKLKSAGIKFK-NAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCG
                    +P  S L  AG++FK  A    S +     +G   +P + +  +TET+L+NL+AYE            Y + ++ ++ S+ED++LL  
Subjt:  ------------LPRASKLKSAGIKFK-NAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCG

Query:  RKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVW-WRRLRRNPVAIGVVAVWALVVIFVAAFFSALSLLQ
        + +L++  K + +  A   W    K+        G + K  E  +R+    W+      V + V   W ++    A     L  LQ
Subjt:  RKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVW-WRRLRRNPVAIGVVAVWALVVIFVAAFFSALSLLQ

Q9SD53 UPF0481 protein At3g472006.3e-3630.11Show/hide
Query:  EESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD--LLEFVAKSESRVRKCYEVEF
        +E +L    A  E+ C    IF+VP+     N + + P  VSIGP+H+G  HL+ ++  K   L  FL       +++  L++ V   E ++RK Y  E 
Subjt:  EESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDD--LLEFVAKSESRVRKCYEVEF

Query:  ---HDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGER
           HDL       MMVLD CF+L + L  S   ++   DP+F+ P LL  ++SDL+LLENQ+P+F+L+ +Y  V      +  LN +AF FF+  +  E 
Subjt:  ---HDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGER

Query:  QSVYDNFQQDADHLLDIVHSCFL--------STYPRIETKNNKSKTAELP-----------RASKLKSAGIKFK-NAVTPKSVLDIKFQNGGLEIPTLEV
             +    A HLLD++   FL        ++ P ++ + ++ K+  +P            A +L+  GIKF+       S+L+++ +   L+IP L  
Subjt:  QSVYDNFQQDADHLLDIVHSCFL--------STYPRIETKNNKSKTAELP-----------RASKLKSAGIKFK-NAVTPKSVLDIKFQNGGLEIPTLEV

Query:  SKHTETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILI
             +   N +A+E     S+ ++ +Y+ FM  LL ++ED+  L   K++I
Subjt:  SKHTETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILI

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)6.4e-4429.43Show/hide
Query:  IDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSES
        I I  + ++  +   LL S   K    CS   IF+VP  +   N   + P  VSIGP+H G   L+ +E+ KW YL   L    ++ L+D ++ V   E 
Subjt:  IDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSES

Query:  RVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYE--KVQDSREENMPLNDLAFRF
          R+CY    H +DS++F  MMVLD CF+LEL  + +       NDP+     +L     D + LENQIP+F+L  ++   +  +  E N  L  LAF F
Subjt:  RVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYE--KVQDSREENMPLNDLAFRF

Query:  FRTIVAGERQSVYDNFQQDADHLLDIVHSCF-----LSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTET
        F  ++    + +    +  A HLLD++ S F     L T P       K  +  +   SKL+ AGIK +     +S L ++F++G +E+P + V     +
Subjt:  FRTIVAGERQSVYDNFQQDADHLLDIVHSCF-----LSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTET

Query:  ILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKA-NLSGTYFAGVVQKLNE
         L+N +AYE C +  +    +Y   +  L  + +D++ LC + I+ N    +T++   +  + +  A +++  Y   + +++NE
Subjt:  ILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKA-NLSGTYFAGVVQKLNE

AT3G50130.1 Plant protein of unknown function (DUF247)3.5e-4234.51Show/hide
Query:  IIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKW---NYLCAFLKHNPSVGLDDLLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCC
        + I++VP  +  +N++ + P  VS+GPFHHG  HL  M+  KW   N + A  KH+  + +D + E     E R R CYE    DL S KF+ M+VLD C
Subjt:  IIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKW---NYLCAFLKHNPSVGLDDLLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCC

Query:  FVLELLLRFSIKRLKR----RNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGER--QSVYDNFQQD----
        FVLE L R + +        RNDPVF   G +  ++ D+++LENQ+P F+L  + E     R +   ++ LA RFF  ++  +       D+ +QD    
Subjt:  FVLELLLRFSIKRLKR----RNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGER--QSVYDNFQQD----

Query:  --AD------HLLDIVHSCFL----STYPRIETKN--------NKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKN
          AD      H LD+     L    +  PR+            +K +   +   ++L+ AGIKF+   T +   DI+F+NG LEIP L +   T+++  N
Subjt:  --AD------HLLDIVHSCFL----STYPRIETKN--------NKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKN

Query:  LIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLL--CG
        LIA+E C I S+  + SY+ FM +L+ S ED++ L  CG
Subjt:  LIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLL--CG

AT3G50180.1 Plant protein of unknown function (DUF247)2.5e-4335.11Show/hide
Query:  IIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVL
        + I+KVP  +  ++++ + P  VS+GP+HHG    +SME  KW  +   LK   + G++  L+ + + E + R CYE     L S +F  M++LD CF+L
Subjt:  IIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSESRVRKCYEVEFHDLDSQKFARMMVLDCCFVL

Query:  ELLLRFSIKRLK---RRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGERQSVYDNFQQDAD----HLLD
        ELL   +   LK     NDPVF   G +  ++ D+I+LENQ+P F+L  + E +Q   +    L +L  RFF  ++        ++  +       H LD
Subjt:  ELLLRFSIKRLK---RRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGERQSVYDNFQQDAD----HLLD

Query:  IVHSCFLSTYPRIETKNNKSKTAE------LPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKNLIAYEICQIGSAQQVKSYVD
        + H   L  +PR   K N S+ A+      +P  ++L+ AG KFK   T +   DIKF NG LEIP L +   T+++  NLIA+E C I S+  + SY+ 
Subjt:  IVHSCFLSTYPRIETKNNKSKTAE------LPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKNLIAYEICQIGSAQQVKSYVD

Query:  FMSHLLQSDEDMKLL--CG
        FM +L+ S ED+  L  CG
Subjt:  FMSHLLQSDEDMKLL--CG

AT4G31980.1 unknown protein1.3e-4730.99Show/hide
Query:  RSDEESLLCSMEAKMEAFCSSI----IIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSESRVRKC
        +++ ++L+ S++AK+ AF SS+     I+KVP+++   N + + P  VS GP H G   L++MED K+ YL +F+    S  L+DL+      E   R C
Subjt:  RSDEESLLCSMEAKMEAFCSSI----IIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSESRVRKC

Query:  YEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP----LNDLAFRFFRT
        Y  E   L S +F  M+V+D  F++ELLLR    RL+  ND +F    ++ D+  D+IL+ENQ+P+F+++E++  + +  ++  P    L    F +F +
Subjt:  YEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMP----LNDLAFRFFRT

Query:  IVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKNLIAY
         +  E+      F  + +H +D++ SC+L  +P I+ +    K    P A++L +AG++FK A T   +LDI F +G L+IPT+ V   TE++ KN+I +
Subjt:  IVAGERQSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKNLIAY

Query:  EICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKL----NEPPDRFIVWWRRLRR----NPV
        E C+  S +    Y+  +   ++S  D  LL    I++N   +   +      + ++       YF+ + + L    N P +R   W   LRR    NP 
Subjt:  EICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKL----NEPPDRFIVWWRRLRR----NPV

Query:  AIGVV--AVWALVVIFVAAFFSALSL
        A+  V  A+  L++ F+ +  S L+L
Subjt:  AIGVV--AVWALVVIFVAAFFSALSL

AT5G22540.1 Plant protein of unknown function (DUF247)3.5e-4230.95Show/hide
Query:  SDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGL--DDLLEFVAKSESRVRKCYEV
        S++  L    E+     C    I ++P  ++  N + + P  VSIGP+HHG  HL+  +  K  +L  F+      G    +L++ V+  E  +R  Y  
Subjt:  SDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGL--DDLLEFVAKSESRVRKCYEV

Query:  EFHDLDSQKFARMMVLDCCFVLELLLRFSIK-RLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGER
        +   LDS+   +MMVLD CF+L L    S K      +DP+F  P +L  +++DL+LLENQ+PY LL+ ++E           LN++AF FF   +    
Subjt:  EFHDLDSQKFARMMVLDCCFVLELLLRFSIK-RLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGER

Query:  QSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTA--------ELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKNLI
             ++  +A HLLD++   F+    +   K++ SK++         +  A KL   GIKFK      S+LDI + NG L IP + +   T +I  N +
Subjt:  QSVYDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTA--------ELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKNLI

Query:  AYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKA-NLSGTYFAGVVQKLNE
        A+E     S+  + SYV FM+ L+  + D   L  R+IL N    E ++    K + +  A +L  +Y A V + +NE
Subjt:  AYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKA-NLSGTYFAGVVQKLNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCATCAAGAGCACTTTCTCATGCGATTGATATTCCGGCAATCTCACGTGAAAGATCCGACGAAGAATCCCTCCTATGCTCCATGGAAGCAAAAATGGAAGCCTT
CTGCTCATCAATTATCATCTTCAAAGTTCCAGACGAAATCAGTATCGACAACAGAGAAGTCTTCGTCCCGGCCAAAGTTTCGATCGGCCCTTTCCACCACGGCGCTCCAC
ATCTGGAATCCATGGAAGATCTGAAGTGGAACTACTTGTGCGCTTTCCTCAAGCACAATCCGTCTGTCGGTTTAGATGATCTTCTCGAATTCGTTGCAAAATCAGAGAGC
CGAGTGAGAAAATGCTATGAGGTAGAGTTCCATGATCTCGACAGCCAAAAGTTCGCGCGGATGATGGTGCTCGATTGCTGCTTCGTTCTCGAGCTGCTTTTGCGATTCTC
GATAAAGAGGCTCAAACGCCGGAACGATCCTGTTTTCACTACTCCTGGTTTGCTCCTCGATTTGAAGTCCGATTTGATACTGCTTGAAAATCAGATTCCGTATTTCCTTC
TGAGAGAGGTTTATGAAAAAGTGCAAGATTCAAGGGAGGAAAATATGCCTCTCAATGACCTCGCCTTCCGATTCTTCAGAACTATAGTTGCCGGAGAACGGCAATCTGTT
TACGACAATTTCCAGCAAGATGCAGATCATCTGCTTGATATCGTGCACTCTTGTTTCCTCTCCACATATCCTCGAATCGAAACGAAAAACAACAAATCGAAGACGGCAGA
ATTACCTCGTGCGTCGAAGCTTAAATCTGCGGGAATCAAATTCAAGAACGCCGTAACTCCGAAGAGCGTACTGGACATCAAATTTCAGAACGGCGGCCTCGAAATTCCCA
CTCTCGAAGTGTCCAAGCACACAGAAACGATTCTTAAGAATCTGATCGCGTACGAGATCTGTCAAATCGGAAGCGCTCAGCAAGTGAAATCGTATGTCGATTTCATGAGT
CACCTTCTCCAGTCGGACGAGGACATGAAGCTGCTCTGCGGACGAAAAATCCTGATCAATCTCGAGAAGGATGAGACGCAGATTATCGCGAATCTGAAATGGATGAGGCA
GCAGAAGGCCAACTTGTCGGGAACGTACTTCGCCGGCGTTGTTCAGAAATTAAACGAGCCGCCGGACCGATTCATAGTATGGTGGCGGAGGCTGAGAAGAAATCCGGTGG
CCATCGGCGTCGTCGCAGTTTGGGCGTTGGTTGTGATCTTCGTGGCGGCCTTCTTCTCTGCACTTTCTCTCCTTCAGCGCCGTTACAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCCATCAAGAGCACTTTCTCATGCGATTGATATTCCGGCAATCTCACGTGAAAGATCCGACGAAGAATCCCTCCTATGCTCCATGGAAGCAAAAATGGAAGCCTT
CTGCTCATCAATTATCATCTTCAAAGTTCCAGACGAAATCAGTATCGACAACAGAGAAGTCTTCGTCCCGGCCAAAGTTTCGATCGGCCCTTTCCACCACGGCGCTCCAC
ATCTGGAATCCATGGAAGATCTGAAGTGGAACTACTTGTGCGCTTTCCTCAAGCACAATCCGTCTGTCGGTTTAGATGATCTTCTCGAATTCGTTGCAAAATCAGAGAGC
CGAGTGAGAAAATGCTATGAGGTAGAGTTCCATGATCTCGACAGCCAAAAGTTCGCGCGGATGATGGTGCTCGATTGCTGCTTCGTTCTCGAGCTGCTTTTGCGATTCTC
GATAAAGAGGCTCAAACGCCGGAACGATCCTGTTTTCACTACTCCTGGTTTGCTCCTCGATTTGAAGTCCGATTTGATACTGCTTGAAAATCAGATTCCGTATTTCCTTC
TGAGAGAGGTTTATGAAAAAGTGCAAGATTCAAGGGAGGAAAATATGCCTCTCAATGACCTCGCCTTCCGATTCTTCAGAACTATAGTTGCCGGAGAACGGCAATCTGTT
TACGACAATTTCCAGCAAGATGCAGATCATCTGCTTGATATCGTGCACTCTTGTTTCCTCTCCACATATCCTCGAATCGAAACGAAAAACAACAAATCGAAGACGGCAGA
ATTACCTCGTGCGTCGAAGCTTAAATCTGCGGGAATCAAATTCAAGAACGCCGTAACTCCGAAGAGCGTACTGGACATCAAATTTCAGAACGGCGGCCTCGAAATTCCCA
CTCTCGAAGTGTCCAAGCACACAGAAACGATTCTTAAGAATCTGATCGCGTACGAGATCTGTCAAATCGGAAGCGCTCAGCAAGTGAAATCGTATGTCGATTTCATGAGT
CACCTTCTCCAGTCGGACGAGGACATGAAGCTGCTCTGCGGACGAAAAATCCTGATCAATCTCGAGAAGGATGAGACGCAGATTATCGCGAATCTGAAATGGATGAGGCA
GCAGAAGGCCAACTTGTCGGGAACGTACTTCGCCGGCGTTGTTCAGAAATTAAACGAGCCGCCGGACCGATTCATAGTATGGTGGCGGAGGCTGAGAAGAAATCCGGTGG
CCATCGGCGTCGTCGCAGTTTGGGCGTTGGTTGTGATCTTCGTGGCGGCCTTCTTCTCTGCACTTTCTCTCCTTCAGCGCCGTTACAGATGA
Protein sequenceShow/hide protein sequence
MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVPAKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSES
RVRKCYEVEFHDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIPYFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGERQSV
YDNFQQDADHLLDIVHSCFLSTYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKHTETILKNLIAYEICQIGSAQQVKSYVDFMS
HLLQSDEDMKLLCGRKILINLEKDETQIIANLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRRNPVAIGVVAVWALVVIFVAAFFSALSLLQRRYR