; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008823 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008823
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF616)
Genome locationscaffold4:4049466..4052452
RNA-Seq ExpressionMS008823
SyntenyMS008823
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022133586.1 uncharacterized protein LOC111006123 [Momordica charantia]4.7e-27998.97Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
        +KKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
        FQVNG VEVHCGFYS+NGGFRISDGDRNYM+TCTLVVSTCAFGGGDDLYQPIGMSDASLRK VCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
Subjt:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD

Query:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
        LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
Subjt:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK

Query:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
        RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
Subjt:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS

XP_022941376.1 uncharacterized protein LOC111446685 [Cucurbita moschata]1.6e-23483.16Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MS SSNSVSI VSDNE D++ERMRVRARRKRKK G+R KNELARRVFR++L+YW+VVFFL+A  LL+FEATRIGRKSR+E KSE+  +TRPT+ + V+ T
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
         KKS LD+KPDGNLN+LDPVTR VAGVRE CLKLLPPKELEQLDIPV D SPVP IDV+YIT+ND+S+L DK S  +QS   TRFNLFTGYQTL+QRE+S
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
        FQ N TVEVHCGFY DNGGFRIS+ DRNYM  C+LVVSTCAFGGGDDLYQPIGMS+ASLRK VCYVAFWDEITLSAQE +GH+IGE+GF GKWR+IVVRD
Subjt:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD

Query:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
        LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLW S+SE AISQHGARSSVYDEAGAVV+KHKATPEEV++QI +Y HDHFPDDK
Subjt:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK

Query:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
        RFNGHKALAEASVIVREH+PVVNL MCLWFNEVVRFTSRDQLSFPYVLWRLK +KKI+MFPVC RKDLVNSMGHIRKAKPLNVSRLS
Subjt:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS

XP_023523253.1 uncharacterized protein LOC111787501 [Cucurbita pepo subsp. pepo]9.9e-23783.78Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MS SSNSVSI VSDNE D++ERMRVRARRKRKK G+R KNELARRVFR++L+YW+VVFFL+A  LL+FEATRIGRKSR+E KSE+  +TRPT+ + V+GT
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
         KKS LD+KPDGNLNRLDPVTR VAGVRE CLKLLPPKELEQLDIPV DGSPVP IDV+YIT+ND+S+L DK S S+QS   TRFNLFTGYQTL+QRE+S
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
        FQ N TVEVHCGFY DNGGFRIS+ DRNYM+ C+LVVSTCAFGGGDDLYQPIGMS+ASLRK VCYVAFWDEITLSAQE +GH+IGE+GF GKWR+IVVRD
Subjt:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD

Query:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
        LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLW S+SE AISQHGARSSVYDEAGAVV+KHKATPEEV++QI +Y HDHFPDDK
Subjt:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK

Query:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
        R NGHKALAEASVIVREH+PVVNL MCLWFNEVVRFTSRDQLSFPYVLWRLK +KKI+MFPVC RKDLVNSMGHIRKAKPLNVSRLS
Subjt:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS

XP_038894398.1 uncharacterized protein LOC120083005 isoform X1 [Benincasa hispida]2.8e-23985.3Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MSN+SN+VSI VSDNE DE+ERMRVR RRKRKK G+RVKNELA R+FRMML+YWLVVFFL+AAGLL+FEAT+IG+KSR+EAKSE    TR  L DS LG 
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
          +SGLDNKPDGNLNRLDPVTRMV+G+RE CLKLLPPKELEQLDIPV +GSPVP IDVNYIT +D+S+L DK S SRQS  ATRFNLFTGYQTLDQREKS
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
        F+VN TVEVHCGFYSD+GGFRIS+ DR +M+TCTLVVSTCAFGGGDDLYQPIGMS+ASLRK VCYVAFWDEITLSAQE++GH+IGE+GF GKWR++VVRD
Subjt:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD

Query:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
        LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLW SNSE AISQHGARSSVYDEAGAVV+KHKATPEEVD+QIKRY HD+FPDDK
Subjt:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK

Query:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNV
        RFNGHKALAEASVIVREHSP+VNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKL+KKI+MFPVC RKDLVNSMGHIRKAKPLN+
Subjt:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNV

XP_038894402.1 uncharacterized protein LOC120083005 isoform X2 [Benincasa hispida]8.7e-24185.22Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MSN+SN+VSI VSDNE DE+ERMRVR RRKRKK G+RVKNELA R+FRMML+YWLVVFFL+AAGLL+FEAT+IG+KSR+EAKSE    TR  L DS LG 
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
          +SGLDNKPDGNLNRLDPVTRMV+G+RE CLKLLPPKELEQLDIPV +GSPVP IDVNYIT +D+S+L DK S SRQS  ATRFNLFTGYQTLDQREKS
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
        F+VN TVEVHCGFYSD+GGFRIS+ DR +M+TCTLVVSTCAFGGGDDLYQPIGMS+ASLRK VCYVAFWDEITLSAQE++GH+IGE+GF GKWR++VVRD
Subjt:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD

Query:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
        LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLW SNSE AISQHGARSSVYDEAGAVV+KHKATPEEVD+QIKRY HD+FPDDK
Subjt:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK

Query:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
        RFNGHKALAEASVIVREHSP+VNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKL+KKI+MFPVC RKDLVNSMGHIRKAKPLN+SR+S
Subjt:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS

TrEMBL top hitse value%identityAlignment
A0A1S3CRP8 uncharacterized protein LOC103503985 isoform X16.1e-23280.64Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MSN+SNSVSIPVSDNE D++ERMRVR RRKRKK G+RV NELA RVF+MML+YWLVVFFL+AAGLL+FEAT+IG+ SR+E KSE   TT P L DS L T
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
         K+SGLD KPDGNLNRLDPVTRMV GVRE CLK+LPPKELEQLDIPV +GSPVP IDVNYI+++DNS+  DKTS SRQS  +TRFNLFTGYQTL+QRE S
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  F--------------QVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGE
        +              QVN TVEVHCGFYSD+GGF+IS+ D+ +M+TCT VVSTCAFGGGDDLYQPIGMS+ASLRK VCYVAFWDE+TLS QES G +IGE
Subjt:  F--------------QVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGE

Query:  NGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDM
         GF GKWR++VVRDLPF DQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLW SNSE AISQHGARSSVYDEAGAVV+KHKATPEEVD+
Subjt:  NGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDM

Query:  QIKRYHHDHFPDDKRFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRL
        QIK+Y HD FPDDKRFNGHKALAEASVIVR+HSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLK++KKI+MFPVC RKDLVNSMGHI KAKPLNVSRL
Subjt:  QIKRYHHDHFPDDKRFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRL

Query:  S
        S
Subjt:  S

A0A1S3CRR6 uncharacterized protein LOC103503985 isoform X22.2e-23482.75Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MSN+SNSVSIPVSDNE D++ERMRVR RRKRKK G+RV NELA RVF+MML+YWLVVFFL+AAGLL+FEAT+IG+ SR+E KSE   TT P L DS L T
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
         K+SGLD KPDGNLNRLDPVTRMV GVRE CLK+LPPKELEQLDIPV +GSPVP IDVNYI+++DNS+  DKTS SRQS  +TRFNLFTGYQTL+QRE S
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
        ++VN TVEVHCGFYSD+GGF+IS+ D+ +M+TCT VVSTCAFGGGDDLYQPIGMS+ASLRK VCYVAFWDE+TLS QES G +IGE GF GKWR++VVRD
Subjt:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD

Query:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
        LPF DQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLW SNSE AISQHGARSSVYDEAGAVV+KHKATPEEVD+QIK+Y HD FPDDK
Subjt:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK

Query:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
        RFNGHKALAEASVIVR+HSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLK++KKI+MFPVC RKDLVNSMGHI KAKPLNVSRLS
Subjt:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS

A0A6J1BVI9 uncharacterized protein LOC1110061232.3e-27998.97Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
        +KKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
        FQVNG VEVHCGFYS+NGGFRISDGDRNYM+TCTLVVSTCAFGGGDDLYQPIGMSDASLRK VCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
Subjt:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD

Query:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
        LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
Subjt:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK

Query:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
        RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
Subjt:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS

A0A6J1FKY2 uncharacterized protein LOC1114466857.7e-23583.16Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MS SSNSVSI VSDNE D++ERMRVRARRKRKK G+R KNELARRVFR++L+YW+VVFFL+A  LL+FEATRIGRKSR+E KSE+  +TRPT+ + V+ T
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
         KKS LD+KPDGNLN+LDPVTR VAGVRE CLKLLPPKELEQLDIPV D SPVP IDV+YIT+ND+S+L DK S  +QS   TRFNLFTGYQTL+QRE+S
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
        FQ N TVEVHCGFY DNGGFRIS+ DRNYM  C+LVVSTCAFGGGDDLYQPIGMS+ASLRK VCYVAFWDEITLSAQE +GH+IGE+GF GKWR+IVVRD
Subjt:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD

Query:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
        LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLW S+SE AISQHGARSSVYDEAGAVV+KHKATPEEV++QI +Y HDHFPDDK
Subjt:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK

Query:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
        RFNGHKALAEASVIVREH+PVVNL MCLWFNEVVRFTSRDQLSFPYVLWRLK +KKI+MFPVC RKDLVNSMGHIRKAKPLNVSRLS
Subjt:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS

A0A6J1IY59 uncharacterized protein LOC1114809981.3e-23482.96Show/hide
Query:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT
        MS SSNSVSI VSDNE D++ERMRVRARRKRKK G+R KNELARRVFR++L+YW+VVFFL+A  LL+FEATRIGRKSR+E KSE+  +TRPT+ + V+G 
Subjt:  MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGT

Query:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS
         KKS LD+KPDGNLNRLDPVTR VAGVRE CLKLLPPKELEQLDIPV DGSPV  IDV+YIT+ND+++L DK S S+QS   TRFNLFTGYQTL+QRE+S
Subjt:  NKKSGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKS

Query:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD
        FQ N TVEVHCGFY DNGGF+IS  DRNYM+ C+LVVSTCAFGGGDDLYQPIGMS+ASLRK VCYVAFWDEITLSAQE +GH+IGE+GF GKWR+IVVRD
Subjt:  FQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRD

Query:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK
        LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLW S+SE AISQHGARSSVYDEAGAVV+KHKATPEEV++QI +Y HDHFPDDK
Subjt:  LPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDK

Query:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS
        RFNGHKALAEASVIVREH P VNL MCLWFNEVVRFTSRDQLSFPYVLWRLK++KKI+MFPVC RKDLVNSMGHIRKAKPLNVSRLS
Subjt:  RFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI703.0e-5039.8Show/hide
Query:  FTGYQTLDQREKSFQVNGTVEVHCGFYSD-----NGGFRISDGDRNYMKTCT-LVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMG
        F GY TL  R  SF +  T+ VHCGF        N GF I + D   MK C  +VV++  F   DD+  P  +S  +  + VC+  F DE T S  +   
Subjt:  FTGYQTLDQREKSFQVNGTVEVHCGFYSD-----NGGFRISDGDRNYMKTCT-LVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMG

Query:  HIIGENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATP
         + G N   G WR++VV +LP++D R NGK+PK+L HR+FPN +YS+W+D K +   DP  + E  LW  N+ FAIS+H  R  V  EA A     K   
Subjt:  HIIGENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATP

Query:  EEVDMQIKRYHHDHFP--DDKRFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKK----IHMFPVCTRKDLVNSMGH
          +D Q+  Y ++        +      + E  VI+REH P+ NLF CLWFNEV RFTSRDQ+SF  V  R K+  K    + MF  C R++ V    H
Subjt:  EEVDMQIKRYHHDHFP--DDKRFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKK----IHMFPVCTRKDLVNSMGH

Arabidopsis top hitse value%identityAlignment
AT1G34550.1 Protein of unknown function (DUF616)8.5e-6140.33Show/hide
Query:  FTGYQTLDQREKSFQVNGTVEVHCGFY-----SDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASL-RKVVCYVAFWDEITLSAQESMG
        F G+Q+L +RE SF V    ++HCGF      S + GF +++ D NY+  C + VS+C FG  D L  P     + L RK VC++ F DEIT+    + G
Subjt:  FTGYQTLDQREKSFQVNGTVEVHCGFY-----SDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASL-RKVVCYVAFWDEITLSAQESMG

Query:  HIIGENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATP
        H     GF G W+++VV++LP+ D R  GKIPKML HRLFP+ +YSIW+DSK + + DPL + E  LW    E+AIS H  R  +++E     + +K   
Subjt:  HIIGENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATP

Query:  EEVDMQIKRYHHDHFPDDKRFNGH-------KALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLK-----KIHMFPVCTRKDLV
          ++ Q + Y  D      RFN           + E S IVR H+P+ NLF CLWFNEV RFT RDQLSF Y   +L+ +       +HMF  C R+ + 
Subjt:  EEVDMQIKRYHHDHFPDDKRFNGH-------KALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLK-----KIHMFPVCTRKDLV

Query:  NSMGH
            H
Subjt:  NSMGH

AT2G02910.1 Protein of unknown function (DUF616)2.4e-6343.93Show/hide
Query:  FTGYQTLDQREKSFQ-VNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPI--GMSDASLRKVVCYVAFWDEITLSAQESMGHII
        F G+QTL +RE+S+  VN T  +HCGF     GF +S+ DR YMK C + VS+C FG  D L +P    +S+ S R  VC+V F DE TLS   S GH+ 
Subjt:  FTGYQTLDQREKSFQ-VNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPI--GMSDASLRKVVCYVAFWDEITLSAQESMGHII

Query:  GENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEV
         + GF G W+ +VV +LP+ D R  GK+PK L HRLFP+ +YSIW+DSK +   DP+ + +  LW + SEFAIS H  R  V+DE     R +K     +
Subjt:  GENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEV

Query:  DMQIKRYHHDHF----PDDKRFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLK-----KIHMFPVCTRKDLVNSMGHI
        D Q   Y  D      P D        + E S IVR H+P+ NLF CLWFNEV RFTSRDQLSF Y   +L+ L      +++MF  C R+ L     H 
Subjt:  DMQIKRYHHDHF----PDDKRFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLK-----KIHMFPVCTRKDLVNSMGHI

Query:  RKAKP
          + P
Subjt:  RKAKP

AT4G09630.1 Protein of unknown function (DUF616)7.7e-6240.72Show/hide
Query:  FTGYQTLDQREKSFQVNGTVEVHCGFYS-----DNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASL-RKVVCYVAFWDEITLSAQESMG
        F G+Q+L +RE SF V    ++HCGF        + GF +++ D NY+  C + V +C FG  D L  P     +SL RK VC+V F DEIT+    + G
Subjt:  FTGYQTLDQREKSFQVNGTVEVHCGFYS-----DNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASL-RKVVCYVAFWDEITLSAQESMG

Query:  HIIGENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATP
         +    GF G W+++VVR+LP+TD R  GKIPK+L HRLF + +YSIW+DSK + + DPL + E  LW    E+AIS H  R  +++E     + +K   
Subjt:  HIIGENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATP

Query:  EEVDMQIKRYHHDHFPDDKRFNGHKAL----AEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLK-----KIHMFPVCTRKDLVNSM
          +D Q + Y  D        + HK L     E S IVREH+P+ NLF CLWFNEV RFT RDQLSF Y   +L  +       +HMF  C R+ +    
Subjt:  EEVDMQIKRYHHDHFPDDKRFNGHKAL----AEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLK-----KIHMFPVCTRKDLVNSM

Query:  GHIRKAK
         H  + K
Subjt:  GHIRKAK

AT4G38500.1 Protein of unknown function (DUF616)1.8e-5537.03Show/hide
Query:  NSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKSFQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCY
        ++I+R+ T  +   +  ++F LF G  +  +RE+SF++   ++VHCGF    GG  +S  D+ Y+K C  VV+T  F   D+ +QP  +S  S+  + C+
Subjt:  NSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKSFQVNGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCY

Query:  VAFWDEITLS---AQESMGHIIGENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGA
        +   DE++L       ++   +    + G WR+I+++  P+ + R NGK+PK+L HRLFP  +YSIW+D K +   DPL + E  LW     FAI+QH  
Subjt:  VAFWDEITLS---AQESMGHIIGENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGA

Query:  RSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHF-PDDKRFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFP
          ++Y+EA A  R+ +     VD+ +K Y ++   P   + N    + E +VI+REH+ + NLF CLWFNEV   T RDQLSF YV+ RLK   K+ MF 
Subjt:  RSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHF-PDDKRFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFP

Query:  VCTRKDLVNSMGHIRK
         C    L     HIR+
Subjt:  VCTRKDLVNSMGHIRK

AT5G42660.1 Protein of unknown function (DUF616)3.2e-17764.44Show/hide
Query:  SSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGTNKK
        ++NSVSI VSD+E +++ R+R R RRKRKK+  R   EL R V R+ +RYW+V+ FL+A GLL+FE+TRIG KS         D  +P L       N K
Subjt:  SSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGTNKK

Query:  SGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKSFQV
             K +GNLNRLDP T+++ GVR+ CLKLLPP+ELE LDI     S  P   V Y+T+ D S+   +  +       TRFNLFTG QT  +RE SFQV
Subjt:  SGLDNKPDGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKSFQV

Query:  NGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRDLPF
          TV +HCGF+++NGGFRISD D+ +M +C +VVSTCAFGGGD+LY+PIGMS  S +K VCYVAFWDE+TL+ QE+ GH I EN   GKWRI++V+DLPF
Subjt:  NGTVEVHCGFYSDNGGFRISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRDLPF

Query:  TDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDKRFN
        TDQRLNGKIPKML HRLFP+ KYSIWVDSKSQFRRDPLGV +ALLW +NS  AIS+HGARSSVYDEA AV++KHKATPEEV++QI +Y HD  P+DKRFN
Subjt:  TDQRLNGKIPKMLGHRLFPNVKYSIWVDSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDKRFN

Query:  GHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPL
        G KAL+EASVIVREH+P+ NLFMCLWFNEVVRFTSRDQLSFPYVLWRLK+LK I+MFPVCTRKDLVNS+GH+RKAKPL
Subjt:  GHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRDQLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAACAGTAGTAACAGTGTATCGATCCCGGTTTCGGACAATGAGTTTGACGAGGTGGAAAGGATGCGTGTTCGAGCCCGGAGGAAGCGCAAAAAACTAGGAAGCAG
AGTCAAGAACGAGTTGGCTCGTCGAGTTTTTAGAATGATGCTGAGATATTGGTTGGTTGTCTTCTTCCTTATTGCTGCGGGCTTGCTTATATTTGAGGCAACAAGAATTG
GTCGGAAATCGAGGATGGAGGCGAAGTCGGAACTTGGTGACACGACGAGACCTACGTTAGGCGACTCGGTACTTGGAACTAATAAGAAGTCGGGGTTGGACAATAAACCA
GATGGGAATTTGAATAGACTCGATCCTGTTACTCGGATGGTTGCCGGAGTAAGAGAACCTTGCTTGAAACTTCTGCCACCAAAAGAACTTGAGCAATTGGATATTCCTGT
GCACGATGGTTCGCCAGTTCCAGCGATTGATGTGAACTACATAACTGAGAATGATAATTCAATTTTAAGAGATAAAACTTCCCAGTCGCGACAGAGCACGGAGGCCACAA
GATTTAATCTTTTTACCGGATACCAGACTCTCGATCAGAGAGAAAAAAGTTTTCAGGTAAATGGAACTGTTGAGGTACATTGCGGTTTTTACAGTGATAATGGGGGTTTC
AGAATTTCTGATGGAGACAGAAATTACATGAAAACATGCACCCTTGTGGTGTCCACATGTGCATTTGGTGGTGGAGATGATCTTTATCAACCAATTGGAATGTCAGATGC
ATCACTTCGTAAGGTTGTTTGCTATGTTGCATTCTGGGATGAAATCACTTTAAGTGCACAGGAGTCGATGGGACATATTATTGGCGAGAATGGATTTTTTGGGAAGTGGC
GCATTATAGTTGTGAGGGATCTTCCTTTCACTGACCAAAGATTGAATGGTAAAATCCCGAAGATGTTGGGTCATCGTCTCTTCCCTAATGTGAAGTACTCCATTTGGGTA
GATTCAAAGTCTCAATTCAGGAGGGATCCTCTAGGAGTGTTTGAAGCTCTTCTTTGGCATTCAAATTCTGAATTTGCAATATCACAACATGGAGCTCGTAGTAGTGTCTA
TGACGAGGCTGGAGCTGTGGTCAGGAAGCATAAAGCCACCCCAGAAGAAGTTGATATGCAGATAAAACGGTACCATCATGATCACTTCCCAGATGACAAGAGATTTAACG
GACATAAAGCTCTTGCAGAGGCCTCTGTGATTGTTAGGGAGCACTCGCCAGTGGTGAATTTGTTCATGTGCCTCTGGTTTAATGAGGTGGTGCGCTTTACTTCGCGGGAT
CAGCTGAGCTTCCCATATGTTCTATGGCGGCTGAAACTTCTGAAGAAAATTCATATGTTCCCCGTTTGCACTCGCAAAGATCTTGTCAATAGTATGGGCCACATTCGCAA
GGCTAAGCCTCTGAATGTAAGCCGATTATCT
mRNA sequenceShow/hide mRNA sequence
ATGTCGAACAGTAGTAACAGTGTATCGATCCCGGTTTCGGACAATGAGTTTGACGAGGTGGAAAGGATGCGTGTTCGAGCCCGGAGGAAGCGCAAAAAACTAGGAAGCAG
AGTCAAGAACGAGTTGGCTCGTCGAGTTTTTAGAATGATGCTGAGATATTGGTTGGTTGTCTTCTTCCTTATTGCTGCGGGCTTGCTTATATTTGAGGCAACAAGAATTG
GTCGGAAATCGAGGATGGAGGCGAAGTCGGAACTTGGTGACACGACGAGACCTACGTTAGGCGACTCGGTACTTGGAACTAATAAGAAGTCGGGGTTGGACAATAAACCA
GATGGGAATTTGAATAGACTCGATCCTGTTACTCGGATGGTTGCCGGAGTAAGAGAACCTTGCTTGAAACTTCTGCCACCAAAAGAACTTGAGCAATTGGATATTCCTGT
GCACGATGGTTCGCCAGTTCCAGCGATTGATGTGAACTACATAACTGAGAATGATAATTCAATTTTAAGAGATAAAACTTCCCAGTCGCGACAGAGCACGGAGGCCACAA
GATTTAATCTTTTTACCGGATACCAGACTCTCGATCAGAGAGAAAAAAGTTTTCAGGTAAATGGAACTGTTGAGGTACATTGCGGTTTTTACAGTGATAATGGGGGTTTC
AGAATTTCTGATGGAGACAGAAATTACATGAAAACATGCACCCTTGTGGTGTCCACATGTGCATTTGGTGGTGGAGATGATCTTTATCAACCAATTGGAATGTCAGATGC
ATCACTTCGTAAGGTTGTTTGCTATGTTGCATTCTGGGATGAAATCACTTTAAGTGCACAGGAGTCGATGGGACATATTATTGGCGAGAATGGATTTTTTGGGAAGTGGC
GCATTATAGTTGTGAGGGATCTTCCTTTCACTGACCAAAGATTGAATGGTAAAATCCCGAAGATGTTGGGTCATCGTCTCTTCCCTAATGTGAAGTACTCCATTTGGGTA
GATTCAAAGTCTCAATTCAGGAGGGATCCTCTAGGAGTGTTTGAAGCTCTTCTTTGGCATTCAAATTCTGAATTTGCAATATCACAACATGGAGCTCGTAGTAGTGTCTA
TGACGAGGCTGGAGCTGTGGTCAGGAAGCATAAAGCCACCCCAGAAGAAGTTGATATGCAGATAAAACGGTACCATCATGATCACTTCCCAGATGACAAGAGATTTAACG
GACATAAAGCTCTTGCAGAGGCCTCTGTGATTGTTAGGGAGCACTCGCCAGTGGTGAATTTGTTCATGTGCCTCTGGTTTAATGAGGTGGTGCGCTTTACTTCGCGGGAT
CAGCTGAGCTTCCCATATGTTCTATGGCGGCTGAAACTTCTGAAGAAAATTCATATGTTCCCCGTTTGCACTCGCAAAGATCTTGTCAATAGTATGGGCCACATTCGCAA
GGCTAAGCCTCTGAATGTAAGCCGATTATCT
Protein sequenceShow/hide protein sequence
MSNSSNSVSIPVSDNEFDEVERMRVRARRKRKKLGSRVKNELARRVFRMMLRYWLVVFFLIAAGLLIFEATRIGRKSRMEAKSELGDTTRPTLGDSVLGTNKKSGLDNKP
DGNLNRLDPVTRMVAGVREPCLKLLPPKELEQLDIPVHDGSPVPAIDVNYITENDNSILRDKTSQSRQSTEATRFNLFTGYQTLDQREKSFQVNGTVEVHCGFYSDNGGF
RISDGDRNYMKTCTLVVSTCAFGGGDDLYQPIGMSDASLRKVVCYVAFWDEITLSAQESMGHIIGENGFFGKWRIIVVRDLPFTDQRLNGKIPKMLGHRLFPNVKYSIWV
DSKSQFRRDPLGVFEALLWHSNSEFAISQHGARSSVYDEAGAVVRKHKATPEEVDMQIKRYHHDHFPDDKRFNGHKALAEASVIVREHSPVVNLFMCLWFNEVVRFTSRD
QLSFPYVLWRLKLLKKIHMFPVCTRKDLVNSMGHIRKAKPLNVSRLS