; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000248 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000248
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein LNK1 isoform X1
Genome locationscaffold44:195941..199531
RNA-Seq ExpressionMS000248
SyntenyMS000248
Gene Ontology termsGO:0006354 - DNA-templated transcription, elongation (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009649 - entrainment of circadian clock (biological process)
GO:0032922 - circadian regulation of gene expression (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0070063 - RNA polymerase binding (molecular function)
InterPro domainsIPR039928 - LNK family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022140998.1 protein LNK1 isoform X1 [Momordica charantia]1.6e-22590.13Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIG+SPAISHLSAADVSDIKSNSRGDLMPKNQ
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        KSCRSTMLDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  +++ +          ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
        FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL

XP_022140999.1 protein LNK1 isoform X2 [Momordica charantia]1.6e-22590.13Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIG+SPAISHLSAADVSDIKSNSRGDLMPKNQ
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        KSCRSTMLDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  +++ +          ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
        FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL

XP_022141000.1 protein LNK1 isoform X3 [Momordica charantia]1.6e-20689.18Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIG+SPAISHLSAADVSDIKSNSRGDLMPKNQ
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        KSCRSTMLDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  +++ +          ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLV
        FHRPSDPSIMPVGGNTLPLKSHKL+
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLV

XP_022141001.1 protein LNK1 isoform X4 [Momordica charantia]1.6e-22590.13Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIG+SPAISHLSAADVSDIKSNSRGDLMPKNQ
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        KSCRSTMLDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  +++ +          ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
        FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL

XP_038903970.1 protein LNK1 isoform X3 [Benincasa hispida]1.4e-19480.26Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSNN FKT VGTDLDYCTDD IVTDNSAADENDMYQYSVSH+SQTDNDISFLDDD EN+EN DLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHGSEKLEDPSK NFKFSCCEGSTINDASEFNE+SNPVNS+PSSDGLNRNNIL GCK+NDGITDIGDS A+SHLSAAD++D K +SRGDL+PK Q
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQL    SSHYPSFDA T+AANENREKLYHQDL +SFNKNF FMS PS+E FNTSFPVRKQ  RSES+IDDGHSETGVVSRGSRAELDSSN QD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVS------KFL----TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        KSCRST LDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  ++      KF+    ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVS------KFL----TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
        FHRPSDPS+MP GGNTLPLKSHKLV  EKQ FQDET G AAA  ADQKPL+NGKKL
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL

TrEMBL top hitse value%identityAlignment
A0A0A0KVV5 Uncharacterized protein6.9e-19580.04Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSN+ FKT VGTDLDYCTDD IVTDNSAADENDMYQYSVSHMSQTDNDISFLDDD ENKEN DLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHG+EKLEDPSKPNFKFSCCEGSTINDA+EFNE+SNPVNS+ S DGLNRNNIL GCK+NDGITDIGDS AISHLSAAD+SD K NS GDL+PK Q
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQL    SSHYPSFDA T+ ANENREKLYHQDL +SFNKNF FMS PS+E FNTSFPVRKQ PRSES+IDDGHSE+GVVSRGSR ELDSSN QD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVS------KFL----TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        K CRSTMLDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  ++      KF+    ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVS------KFL----TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
        FHRPSDPS+MP GGNTL LKSHKLVP EKQ FQDETGG  AAACADQK L+NGKKL
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL

A0A6J1CGQ9 protein LNK1 isoform X37.9e-20789.18Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIG+SPAISHLSAADVSDIKSNSRGDLMPKNQ
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        KSCRSTMLDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  +++ +          ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLV
        FHRPSDPSIMPVGGNTLPLKSHKL+
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLV

A0A6J1CHC6 protein LNK1 isoform X27.6e-22690.13Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIG+SPAISHLSAADVSDIKSNSRGDLMPKNQ
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        KSCRSTMLDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  +++ +          ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
        FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL

A0A6J1CHP2 protein LNK1 isoform X47.6e-22690.13Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIG+SPAISHLSAADVSDIKSNSRGDLMPKNQ
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        KSCRSTMLDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  +++ +          ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
        FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL

A0A6J1CJA6 protein LNK1 isoform X17.6e-22690.13Show/hide
Query:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
Subjt:  MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ
        RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIG+SPAISHLSAADVSDIKSNSRGDLMPKNQ
Subjt:  RWFSPSHGSEKLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQ

Query:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
        ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD
Subjt:  ESSYASNQLQSIPSSHYPSFDAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQD

Query:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL
        KSCRSTMLDGISLEATSFRQLQQVMEQ+       +  S + L  S      C  +++ +          ++D   NRSGGFLDLETDTNPIDRSVAHLL
Subjt:  KSCRSTMLDGISLEATSFRQLQQVMEQV-------LSLSFFYLFFSV-----CGVVSKFL----------TVDFGFNRSGGFLDLETDTNPIDRSVAHLL

Query:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
        FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL
Subjt:  FHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL

SwissProt top hitse value%identityAlignment
A8MQN2 Protein LNK12.2e-4132.39Show/hide
Query:  SNNGFKTG-VGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        S++GF  G V    ++ T D ++ D SAA  + +Y YS++ +   +ND+SF D+   +KE  D L+YGW DIG+FEDVD M R+CDSTFGL +L+NE DL
Subjt:  SNNGFKTG-VGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSE------------------------KLED---PSKPNFKFSCCEGSTINDASEFNEDSNPV----------------------------NS
         WFS +  +E                        ++ED    S+PN       G TI D S   + S  V                            + 
Subjt:  RWFSPSHGSE------------------------KLED---PSKPNFKFSCCEGSTINDASEFNEDSNPV----------------------------NS

Query:  DPSSDGLNRNNI-LTGCKVNDGITDIG--------------------DSPAISHLSAADVSDIKSNSRGDLMPKNQESSYASNQLQSIPSSHYPSFDAAT
        D  SDG + N+  L    ++  I D                      + P++   +    S IKS ++      + E SY SN  QSI S   P+ D   
Subjt:  DPSSDGLNRNNI-LTGCKVNDGITDIG--------------------DSPAISHLSAADVSDIKSNSRGDLMPKNQESSYASNQLQSIPSSHYPSFDAAT

Query:  VAANENREKLY-HQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQDKSCRSTMLDGISLEATSFRQLQQ
            E R  L   QD+  SF  N    S   +  F  + P++K        +++ H       R +  EL++SN Q  SC S+++D ISLEATSFRQLQQ
Subjt:  VAANENREKLY-HQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQDKSCRSTMLDGISLEATSFRQLQQ

Query:  VMEQVLSLSFFYLFFSVCGVVSKFLTVDFGFNRS----------------GGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLVP
        V+EQ+   +   +  S+  +         G NR                  GF+D+ETDTNPIDRS+AHLLFHRPSD S+     N L  KSH ++P
Subjt:  VMEQVLSLSFFYLFFSVCGVVSKFLTVDFGFNRS----------------GGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLVP

Arabidopsis top hitse value%identityAlignment
AT3G54500.1 BEST Arabidopsis thaliana protein match is: dentin sialophosphoprotein-related (TAIR:AT5G64170.1)2.7e-0552.08Show/hide
Query:  ENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDLRWFS
        E+KE  D   Y W +IGSF+D+DRMF N    FG G+LS  D+L W S
Subjt:  ENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDLRWFS

AT3G54500.1 BEST Arabidopsis thaliana protein match is: dentin sialophosphoprotein-related (TAIR:AT5G64170.1)8.5e-0465.52Show/hide
Query:  RSGGFLDLETDTNPIDRSVAHLLFHRPSD
        R  G  D E  TNP DR+VAHLLFHRP D
Subjt:  RSGGFLDLETDTNPIDRSVAHLLFHRPSD

AT3G54500.2 BEST Arabidopsis thaliana protein match is: dentin sialophosphoprotein-related (TAIR:AT5G64170.1)2.7e-0552.08Show/hide
Query:  ENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDLRWFS
        E+KE  D   Y W +IGSF+D+DRMF N    FG G+LS  D+L W S
Subjt:  ENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDLRWFS

AT3G54500.2 BEST Arabidopsis thaliana protein match is: dentin sialophosphoprotein-related (TAIR:AT5G64170.1)8.5e-0465.52Show/hide
Query:  RSGGFLDLETDTNPIDRSVAHLLFHRPSD
        R  G  D E  TNP DR+VAHLLFHRP D
Subjt:  RSGGFLDLETDTNPIDRSVAHLLFHRPSD

AT3G54500.3 FUNCTIONS IN: molecular_function unknown2.7e-0552.08Show/hide
Query:  ENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDLRWFS
        E+KE  D   Y W +IGSF+D+DRMF N    FG G+LS  D+L W S
Subjt:  ENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDLRWFS

AT3G54500.3 FUNCTIONS IN: molecular_function unknown8.5e-0465.52Show/hide
Query:  RSGGFLDLETDTNPIDRSVAHLLFHRPSD
        R  G  D E  TNP DR+VAHLLFHRP D
Subjt:  RSGGFLDLETDTNPIDRSVAHLLFHRPSD

AT5G64170.1 dentin sialophosphoprotein-related1.6e-4232.39Show/hide
Query:  SNNGFKTG-VGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        S++GF  G V    ++ T D ++ D SAA  + +Y YS++ +   +ND+SF D+   +KE  D L+YGW DIG+FEDVD M R+CDSTFGL +L+NE DL
Subjt:  SNNGFKTG-VGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSE------------------------KLED---PSKPNFKFSCCEGSTINDASEFNEDSNPV----------------------------NS
         WFS +  +E                        ++ED    S+PN       G TI D S   + S  V                            + 
Subjt:  RWFSPSHGSE------------------------KLED---PSKPNFKFSCCEGSTINDASEFNEDSNPV----------------------------NS

Query:  DPSSDGLNRNNI-LTGCKVNDGITDIG--------------------DSPAISHLSAADVSDIKSNSRGDLMPKNQESSYASNQLQSIPSSHYPSFDAAT
        D  SDG + N+  L    ++  I D                      + P++   +    S IKS ++      + E SY SN  QSI S   P+ D   
Subjt:  DPSSDGLNRNNI-LTGCKVNDGITDIG--------------------DSPAISHLSAADVSDIKSNSRGDLMPKNQESSYASNQLQSIPSSHYPSFDAAT

Query:  VAANENREKLY-HQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQDKSCRSTMLDGISLEATSFRQLQQ
            E R  L   QD+  SF  N    S   +  F  + P++K        +++ H       R +  EL++SN Q  SC S+++D ISLEATSFRQLQQ
Subjt:  VAANENREKLY-HQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQDKSCRSTMLDGISLEATSFRQLQQ

Query:  VMEQVLSLSFFYLFFSVCGVVSKFLTVDFGFNRS----------------GGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLVP
        V+EQ+   +   +  S+  +         G NR                  GF+D+ETDTNPIDRS+AHLLFHRPSD S+     N L  KSH ++P
Subjt:  VMEQVLSLSFFYLFFSVCGVVSKFLTVDFGFNRS----------------GGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLVP

AT5G64170.2 dentin sialophosphoprotein-related1.6e-4232.39Show/hide
Query:  SNNGFKTG-VGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL
        S++GF  G V    ++ T D ++ D SAA  + +Y YS++ +   +ND+SF D+   +KE  D L+YGW DIG+FEDVD M R+CDSTFGL +L+NE DL
Subjt:  SNNGFKTG-VGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDL

Query:  RWFSPSHGSE------------------------KLED---PSKPNFKFSCCEGSTINDASEFNEDSNPV----------------------------NS
         WFS +  +E                        ++ED    S+PN       G TI D S   + S  V                            + 
Subjt:  RWFSPSHGSE------------------------KLED---PSKPNFKFSCCEGSTINDASEFNEDSNPV----------------------------NS

Query:  DPSSDGLNRNNI-LTGCKVNDGITDIG--------------------DSPAISHLSAADVSDIKSNSRGDLMPKNQESSYASNQLQSIPSSHYPSFDAAT
        D  SDG + N+  L    ++  I D                      + P++   +    S IKS ++      + E SY SN  QSI S   P+ D   
Subjt:  DPSSDGLNRNNI-LTGCKVNDGITDIG--------------------DSPAISHLSAADVSDIKSNSRGDLMPKNQESSYASNQLQSIPSSHYPSFDAAT

Query:  VAANENREKLY-HQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQDKSCRSTMLDGISLEATSFRQLQQ
            E R  L   QD+  SF  N    S   +  F  + P++K        +++ H       R +  EL++SN Q  SC S+++D ISLEATSFRQLQQ
Subjt:  VAANENREKLY-HQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQDKSCRSTMLDGISLEATSFRQLQQ

Query:  VMEQVLSLSFFYLFFSVCGVVSKFLTVDFGFNRS----------------GGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLVP
        V+EQ+   +   +  S+  +         G NR                  GF+D+ETDTNPIDRS+AHLLFHRPSD S+     N L  KSH ++P
Subjt:  VMEQVLSLSFFYLFFSVCGVVSKFLTVDFGFNRS----------------GGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAACAATGGCTTTAAAACAGGCGTGGGTACAGATCTTGATTACTGCACAGATGATCGTATTGTAACCGACAATAGTGCTGCAGACGAGAATGACATGTATCAATA
TTCTGTCAGTCACATGTCCCAAACAGATAATGATATTAGTTTTCTTGACGATGATCATGAAAACAAAGAAAATACTGATCTTTTGTATTATGGGTGGCAAGATATAGGAA
GCTTTGAGGACGTCGATAGGATGTTTAGAAATTGTGATTCAACATTTGGGCTTGGGAATCTAAGCAATGAAGATGACCTACGCTGGTTTTCCCCATCCCATGGCTCTGAA
AAACTTGAAGATCCATCAAAGCCAAACTTCAAATTTTCATGCTGTGAAGGAAGTACAATAAATGATGCATCGGAATTTAATGAAGATTCTAATCCTGTGAATTCAGACCC
TTCATCTGATGGTTTGAACAGAAATAATATTTTAACGGGGTGCAAGGTGAATGATGGGATTACAGATATTGGTGACTCTCCTGCTATTAGTCACTTATCAGCTGCTGACG
TGTCAGATATAAAAAGCAATTCTAGAGGTGACTTGATGCCTAAAAACCAGGAGTCATCTTATGCATCTAATCAACTACAGTCTATACCTAGCTCTCATTATCCTTCCTTT
GACGCTGCAACAGTTGCAGCAAATGAAAACAGAGAAAAACTGTACCACCAGGATTTACAATCCTCATTCAATAAGAATTTTGCTTTTATGTCTACGCCAAGCACAGAGCC
ATTCAATACTTCGTTTCCAGTTAGGAAGCAGGTGCCACGGTCTGAAAGTGATATTGATGATGGTCACAGTGAAACTGGAGTAGTTAGCCGAGGAAGTCGAGCCGAATTAG
ATTCGTCAAATACACAGGATAAGTCTTGCAGGAGCACTATGCTTGATGGAATCTCTTTGGAAGCAACTAGTTTTCGCCAGCTTCAACAAGTAATGGAGCAGGTACTGAGT
TTGAGCTTCTTTTATCTTTTTTTTTCCGTCTGTGGAGTTGTTTCCAAATTTCTTACTGTGGATTTCGGTTTTAACAGGAGTGGAGGATTTTTGGATCTGGAAACTGATAC
CAATCCTATAGACCGGTCCGTTGCTCACTTGCTATTTCACCGGCCTTCGGATCCATCTATAATGCCTGTTGGTGGCAACACCTTGCCTCTGAAATCTCACAAACTGGTGC
CACCCGAAAAACAAACTTTCCAGGACGAAACCGGTGGAGTTGCTGCTGCTGCTTGTGCCGATCAAAAGCCATTGGCGAATGGGAAGAAACTA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAACAATGGCTTTAAAACAGGCGTGGGTACAGATCTTGATTACTGCACAGATGATCGTATTGTAACCGACAATAGTGCTGCAGACGAGAATGACATGTATCAATA
TTCTGTCAGTCACATGTCCCAAACAGATAATGATATTAGTTTTCTTGACGATGATCATGAAAACAAAGAAAATACTGATCTTTTGTATTATGGGTGGCAAGATATAGGAA
GCTTTGAGGACGTCGATAGGATGTTTAGAAATTGTGATTCAACATTTGGGCTTGGGAATCTAAGCAATGAAGATGACCTACGCTGGTTTTCCCCATCCCATGGCTCTGAA
AAACTTGAAGATCCATCAAAGCCAAACTTCAAATTTTCATGCTGTGAAGGAAGTACAATAAATGATGCATCGGAATTTAATGAAGATTCTAATCCTGTGAATTCAGACCC
TTCATCTGATGGTTTGAACAGAAATAATATTTTAACGGGGTGCAAGGTGAATGATGGGATTACAGATATTGGTGACTCTCCTGCTATTAGTCACTTATCAGCTGCTGACG
TGTCAGATATAAAAAGCAATTCTAGAGGTGACTTGATGCCTAAAAACCAGGAGTCATCTTATGCATCTAATCAACTACAGTCTATACCTAGCTCTCATTATCCTTCCTTT
GACGCTGCAACAGTTGCAGCAAATGAAAACAGAGAAAAACTGTACCACCAGGATTTACAATCCTCATTCAATAAGAATTTTGCTTTTATGTCTACGCCAAGCACAGAGCC
ATTCAATACTTCGTTTCCAGTTAGGAAGCAGGTGCCACGGTCTGAAAGTGATATTGATGATGGTCACAGTGAAACTGGAGTAGTTAGCCGAGGAAGTCGAGCCGAATTAG
ATTCGTCAAATACACAGGATAAGTCTTGCAGGAGCACTATGCTTGATGGAATCTCTTTGGAAGCAACTAGTTTTCGCCAGCTTCAACAAGTAATGGAGCAGGTACTGAGT
TTGAGCTTCTTTTATCTTTTTTTTTCCGTCTGTGGAGTTGTTTCCAAATTTCTTACTGTGGATTTCGGTTTTAACAGGAGTGGAGGATTTTTGGATCTGGAAACTGATAC
CAATCCTATAGACCGGTCCGTTGCTCACTTGCTATTTCACCGGCCTTCGGATCCATCTATAATGCCTGTTGGTGGCAACACCTTGCCTCTGAAATCTCACAAACTGGTGC
CACCCGAAAAACAAACTTTCCAGGACGAAACCGGTGGAGTTGCTGCTGCTGCTTGTGCCGATCAAAAGCCATTGGCGAATGGGAAGAAACTA
Protein sequenceShow/hide protein sequence
MSNNGFKTGVGTDLDYCTDDRIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDHENKENTDLLYYGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDLRWFSPSHGSE
KLEDPSKPNFKFSCCEGSTINDASEFNEDSNPVNSDPSSDGLNRNNILTGCKVNDGITDIGDSPAISHLSAADVSDIKSNSRGDLMPKNQESSYASNQLQSIPSSHYPSF
DAATVAANENREKLYHQDLQSSFNKNFAFMSTPSTEPFNTSFPVRKQVPRSESDIDDGHSETGVVSRGSRAELDSSNTQDKSCRSTMLDGISLEATSFRQLQQVMEQVLS
LSFFYLFFSVCGVVSKFLTVDFGFNRSGGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLVPPEKQTFQDETGGVAAAACADQKPLANGKKL