CuGenDBv2

CSPI06G24570 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G24570
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: HTH myb-type domain-containing protein
Genome location: Chr6:21956827..21965625
RNA-Seq Expression: CSPI06G24570
Synteny: CSPI06G24570
Gene Ontology terms: NA
InterPro domains:
IPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
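
The genome location above is given as a chromosome name followed by a start..end pair. As a minimal sketch, the snippet below parses that string and reports the length of the gene locus. It assumes the coordinates are 1-based and inclusive, which this record does not state; `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record, e.g. Chr6:21956827..21965625."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'Chrom:start..end' string into a Location."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(chrom, start, end)


loc = parse_location("Chr6:21956827..21965625")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus for CSPI06G24570 spans 8,799 bp on Chr6. That is considerably longer than the mRNA listed under Sequences, which is consistent with the locus containing introns.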


Homology
GenBank top hits (e value, % identity, alignment)
XP_004143262.2 uncharacterized protein LOC101219571 isoform X2 [Cucumis sativus] (e value: 1.1e-277; % identity: 99.16)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGDLDQLLDDANEVG
        MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLG+CLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGDLDQLLDDANEVG
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGDLDQLLDDANEVG

Query:  EFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILD
        EFHATNNLPNTY EVAENSFRQNRGFQL NSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILD
Subjt:  EFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILD

Query:  NRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQCG
        NRNIHKFNTNENY+ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQCG
Subjt:  NRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQCG

Query:  TSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRN
        TSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRN
Subjt:  TSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRN

Query:  LLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
        LLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
Subjt:  LLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD

XP_008449224.1 PREDICTED: uncharacterized protein LOC103491166 isoform X1 [Cucumis melo] (e value: 1.3e-265; % identity: 95.19)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVE PFLP PLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTY EVAENSFR+NR FQLGNSSSE +S GPSRIDTDAFGISELSATMVMEAEF+NTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC
        DNRNIHKFNTNENY+ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEE LDSKSEHNKGRRKLPRKDKYLKVMSTEES HIRHEVQM PRSDSQC
Subjt:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC

Query:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR
        GTSVPVQPKSERRHP KHVP S FLSEDE SATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR
Subjt:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR

Query:  NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
        NLLRASCVNIQNKKG+EGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVK ITPPMDLIESNSLSFNWGRKKY+
Subjt:  NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD

XP_008449225.1 PREDICTED: uncharacterized protein LOC103491166 isoform X2 [Cucumis melo] (e value: 5.3e-267; % identity: 95.39)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGDLDQLLDDANEVG
        MDQEVHFCQKFTNMKSHWVKVE PFLP PLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGD DQLLDDANEVG
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGDLDQLLDDANEVG

Query:  EFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILD
        EFHATNNLPNTY EVAENSFR+NR FQLGNSSSE +S GPSRIDTDAFGISELSATMVMEAEF+NTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILD
Subjt:  EFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILD

Query:  NRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQCG
        NRNIHKFNTNENY+ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEE LDSKSEHNKGRRKLPRKDKYLKVMSTEES HIRHEVQM PRSDSQCG
Subjt:  NRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQCG

Query:  TSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRN
        TSVPVQPKSERRHP KHVP S FLSEDE SATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRN
Subjt:  TSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRN

Query:  LLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
        LLRASCVNIQNKKG+EGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVK ITPPMDLIESNSLSFNWGRKKY+
Subjt:  LLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD

XP_011657653.1 uncharacterized protein LOC101219571 isoform X1 [Cucumis sativus] (e value: 2.8e-276; % identity: 98.95)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLG+CLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGDLDQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTY EVAENSFRQNRGFQL NSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC
        DNRNIHKFNTNENY+ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC
Subjt:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC

Query:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR
        GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR
Subjt:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR

Query:  NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
        NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
Subjt:  NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD

XP_031742605.1 uncharacterized protein LOC101219571 isoform X3 [Cucumis sativus] (e value: 1.4e-251; % identity: 92.68)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLG+CLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGDLDQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTY EVAENSFRQNRGFQL NSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTK                
Subjt:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC
                       ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC
Subjt:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC

Query:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR
        GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR
Subjt:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR

Query:  NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
        NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
Subjt:  NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X1 (e value: 6.3e-266; % identity: 95.19)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVE PFLP PLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTY EVAENSFR+NR FQLGNSSSE +S GPSRIDTDAFGISELSATMVMEAEF+NTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC
        DNRNIHKFNTNENY+ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEE LDSKSEHNKGRRKLPRKDKYLKVMSTEES HIRHEVQM PRSDSQC
Subjt:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC

Query:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR
        GTSVPVQPKSERRHP KHVP S FLSEDE SATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR
Subjt:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWR

Query:  NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
        NLLRASCVNIQNKKG+EGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVK ITPPMDLIESNSLSFNWGRKKY+
Subjt:  NLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD

A0A1S3BLJ8 uncharacterized protein LOC103491166 isoform X3 (e value: 1.7e-215; % identity: 94.95)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVE PFLP PLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTY EVAENSFR+NR FQLGNSSSE +S GPSRIDTDAFGISELSATMVMEAEF+NTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC
        DNRNIHKFNTNENY+ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEE LDSKSEHNKGRRKLPRKDKYLKVMSTEES HIRHEVQM PRSDSQC
Subjt:  DNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQC

Query:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
        GTSVPVQPKSERRHP KHVP S FLSEDE SATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
Subjt:  GTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X2 (e value: 2.6e-267; % identity: 95.39)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGDLDQLLDDANEVG
        MDQEVHFCQKFTNMKSHWVKVE PFLP PLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGD DQLLDDANEVG
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGDLDQLLDDANEVG

Query:  EFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILD
        EFHATNNLPNTY EVAENSFR+NR FQLGNSSSE +S GPSRIDTDAFGISELSATMVMEAEF+NTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILD
Subjt:  EFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILD

Query:  NRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQCG
        NRNIHKFNTNENY+ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEE LDSKSEHNKGRRKLPRKDKYLKVMSTEES HIRHEVQM PRSDSQCG
Subjt:  NRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQCG

Query:  TSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRN
        TSVPVQPKSERRHP KHVP S FLSEDE SATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRN
Subjt:  TSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRN

Query:  LLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
        LLRASCVNIQNKKG+EGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVK ITPPMDLIESNSLSFNWGRKKY+
Subjt:  LLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X2 (e value: 5.4e-217; % identity: 80.87)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+  FLP PLN+ NEVE LLVE KS HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF DLDQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSN-TPVERGLTHELSPGLGTKGRCVTPLEGNICGTI
         EFHATNNLPNTY EVAENSFR+NRG QLGN SSE +SQG SR DT+AF ISELSA MV EAE +N TPV+RGLTHEL  GL TKGRC TPL+G+IC TI
Subjt:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSN-TPVERGLTHELSPGLGTKGRCVTPLEGNICGTI

Query:  LDNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQM-TPRSDS
        LDN NIHKF+TNE  LENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEE  DSKSE +KG+RK P KDKY+KV S EESNHIRH+VQM TP  +S
Subjt:  LDNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQM-TPRSDS

Query:  QCGTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRD
         CGTS+PVQ +S+RR PKKHVP S FLSE+E SATECK VYSSAKRCKK+DRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLRD
Subjt:  QCGTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRD

Query:  KWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
        KWRNLLRASCVNIQN+ GIE KQ+HASRPLPKSLLQRVYELANIYPYPKER PKSVK  T PM LIESNSLSFNWGRKKYD
Subjt:  KWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X1 (e value: 1.3e-215; % identity: 80.71)
Query:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+  FLP PLN+ NEVE LLVE KS HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF DLDQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN-GGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSN-TPVERGLTHELSPGLGTKGRCVTPLEGNICGTI
         EFHATNNLPNTY EVAENSFR+NRG QLGN SSE +SQG SR DT+AF ISELSA MV EAE +N TPV+RGLTHEL  GL TKGRC TPL+G+IC TI
Subjt:  GEFHATNNLPNTYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSN-TPVERGLTHELSPGLGTKGRCVTPLEGNICGTI

Query:  LDNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQM-TPRSDS
        LDN NIHKF+TNE  LENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEE  DSKSE +KG+RK P KDKY+KV S EESNHIRH+VQM TP  +S
Subjt:  LDNRNIHKFNTNENYLENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQM-TPRSDS

Query:  QCGTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR-
         CGTS+PVQ +S+RR PKKHVP S FLSE+E SATECK VYSSAKRCKK+DRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLR 
Subjt:  QCGTSVPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR-

Query:  DKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
        DKWRNLLRASCVNIQN+ GIE KQ+HASRPLPKSLLQRVYELANIYPYPKER PKSVK  T PM LIESNSLSFNWGRKKYD
Subjt:  DKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD

SwissProt top hits (e value, % identity, alignment)
Q6R0E3 Telomere repeat-binding protein 5 (e value: 1.1e-09; % identity: 34.41)
Query:  KRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV
        KR +   RR ++ +++ EV  LV  +   GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q ++G          P+P+ LL RV
Subjt:  KRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV

Q9C7B1 Telomere repeat-binding protein 3 (e value: 1.6e-11; % identity: 31.82)
Query:  VPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLL
        VP Q       P     +   L E E+ A     +    KR +   RR ++ +++TEV  LV  + E GTGRW  +K   F  + HRT +DL+DKW+ L+
Subjt:  VPVQPKSERRHPKKHVPDSEFLSEDELSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLL

Query:  RASCVNIQNKKGIEGKQTHASRPLPKSLLQRV
          + ++ Q ++G          P+P+ LL RV
Subjt:  RASCVNIQNKKGIEGKQTHASRPLPKSLLQRV

Q9FFY9 Telomere repeat-binding protein 4 (e value: 2.0e-11; % identity: 35.71)
Query:  VYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV
        V    KR +   RR ++ +++TEV  LV  + E GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q ++G          P+P+ LL RV
Subjt:  VYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV

Q9M347 Telomere repeat-binding protein 6 (e value: 1.1e-09; % identity: 30.95)
Query:  PKKHVPDSEFL-SEDELSATECKNV----YSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVN
        P+  +  ++ L S D   A   K+V       A + +   RR ++ +T++EV  LV  +   GTGRW  +K H F    HRT +DL+DKW+ L+  + ++
Subjt:  PKKHVPDSEFL-SEDELSATECKNV----YSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVN

Query:  IQNKKGIEGKQTHASRPLPKSLLQRV
         + ++G          P+P+ LL RV
Subjt:  IQNKKGIEGKQTHASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 2 (e value: 6.6e-10; % identity: 36.05)
Query:  RRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV
        RR ++ +++TEV  LV  + + GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q ++G          P+P+ LL RV
Subjt:  RRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV

Arabidopsis top hits (e value, % identity, alignment)
AT1G17460.1 TRF-like 3 (e value: 1.8e-15; % identity: 33.54)
Query:  IRHEVQMTP--RSDSQCGTSVPVQPKSERRHPKKHVPD-SEFLSEDEL----SATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTH
        + ++V   P  +S S+C     VQ +S++ H K    D  + + E EL      +   N   +        R+  + WT++EV +LV+G+++YG G+WT 
Subjt:  IRHEVQMTP--RSDSQCGTSVPVQPKSERRHPKKHVPD-SEFLSEDEL----SATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTH

Query:  IKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA
        IKK  F+   HRT +DL+DKWRNL +AS  N        G + H S  +P  ++ +V ELA
Subjt:  IKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA

AT1G72650.1 TRF-like 6 (e value: 4.6e-19; % identity: 28.41)
Query:  RRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMT-PRSDSQCGT--SVPVQPKSERRHPKKHVP-----DSEFLSEDELSA
        +R+RKPTRRYIEEL ++  +    +  +P KD+ L   S   S  +    ++T  R  S  G+   VP      R  P++++       S +L ED+ SA
Subjt:  RRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMT-PRSDSQCGT--SVPVQPKSERRHPKKHVP-----DSEFLSEDELSA

Query:  TECK----------------NVYSSAKRCKKYD------------------------------------------------RRRQKMWTLTEVMRLVDGI
         E                  +V  SA R  + +                                                R+  + WTL+E+ +LV+G+
Subjt:  TECK----------------NVYSSAKRCKKYD------------------------------------------------RRRQKMWTLTEVMRLVDGI

Query:  AEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA
        ++YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S         +   + H S  +P  +L RV ELA
Subjt:  AEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA

AT1G72650.2 TRF-like 6 (e value: 4.6e-19; % identity: 28.41)
Query:  RRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMT-PRSDSQCGT--SVPVQPKSERRHPKKHVP-----DSEFLSEDELSA
        +R+RKPTRRYIEEL ++  +    +  +P KD+ L   S   S  +    ++T  R  S  G+   VP      R  P++++       S +L ED+ SA
Subjt:  RRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMT-PRSDSQCGT--SVPVQPKSERRHPKKHVP-----DSEFLSEDELSA

Query:  TECK----------------NVYSSAKRCKKYD------------------------------------------------RRRQKMWTLTEVMRLVDGI
         E                  +V  SA R  + +                                                R+  + WTL+E+ +LV+G+
Subjt:  TECK----------------NVYSSAKRCKKYD------------------------------------------------RRRQKMWTLTEVMRLVDGI

Query:  AEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA
        ++YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S         +   + H S  +P  +L RV ELA
Subjt:  AEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA

AT2G37025.1 TRF-like 8 (e value: 5.5e-28; % identity: 43.75)
Query:  RRHPKK---HVPDSEFLSEDELSATECKNVYSSAK--RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRAS
        R+ P K   H    +  S+D+L+ +E ++  S  K  R K   R+ Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+AS
Subjt:  RRHPKK---HVPDSEFLSEDELSATECKNVYSSAK--RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRAS

Query:  CVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGP
             N    E K+   +R +PK +L RV ELA+++PYP  + P
Subjt:  CVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGP

AT2G37025.2 TRF-like 8 (e value: 5.5e-28; % identity: 43.75)
Query:  RRHPKK---HVPDSEFLSEDELSATECKNVYSSAK--RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRAS
        R+ P K   H    +  S+D+L+ +E ++  S  K  R K   R+ Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+AS
Subjt:  RRHPKK---HVPDSEFLSEDELSATECKNVYSSAK--RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRAS

Query:  CVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGP
             N    E K+   +R +PK +L RV ELA+++PYP  + P
Subjt:  CVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGP
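
The % identity values listed for each hit can be related to the alignments shown. In BLAST-style pairwise output, the middle line of each block conventionally carries a letter where query and subject residues are identical, a `+` for a conservative substitution, and a space otherwise. The sketch below assumes this page follows that convention; `midline_identity` is a hypothetical helper, not part of any tool referenced here. The aligned length is passed separately (taken from the Query line) because leading or trailing spaces of a middle line are easily lost when the text is copied.

```python
def midline_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style middle line.

    Letters on the middle line are counted as identical positions;
    '+' and spaces are not. aligned_length is the number of alignment
    columns, gap columns included.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / aligned_length


# Middle line of the Q9SNB9 (Telomere repeat-binding protein 2) hit.
# Only the letters matter for the count, so exact spacing is not critical.
Q9SNB9_MIDLINE = (
    "RR ++ +++TEV  LV  + + GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q ++G"
    "          P+P+ LL RV"
)
Q9SNB9_ALIGNED_LENGTH = 86  # number of residues on the Query line of that hit

print(round(midline_identity(Q9SNB9_MIDLINE, Q9SNB9_ALIGNED_LENGTH), 2))
```

For that hit the middle line holds 31 letters over 86 columns, which rounds to the 36.05 listed for Q9SNB9. For hits spread over several blocks, concatenate the middle lines and sum the Query line lengths before calling the function.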


Sequences
CDS sequence
ATGGATCAAGAAGTGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAAGTGGAGAGACCTTTTCTTCCTACACCACTAAATGATTCAAATGAAGTTGA
GGATTTACTTGTGGAGTCTAAAAGCGAGCATGTTTTAGGAAATTGTTTGAGAGTTCAAGATTTCTCTTGCGACTTTGGCTATGGAATACAAACAAACGGTGGATTGGATT
CTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAGATCTTGATCAACTGCTGGATGATGCCAATGAAGTAGGGGAATTCCATGCAACAAACAATCTGCCAAAT
ACATATGTCGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTTCAATTGGGAAACTCAAGTTCAGAGAGGGAATCTCAGGGACCAAGCAGGATTGATACTGATGC
TTTTGGAATATCAGAATTATCAGCAACAATGGTAATGGAGGCTGAATTCAGTAATACACCTGTTGAGAGGGGTTTAACTCATGAGTTGTCCCCAGGTCTGGGGACCAAAG
GTAGGTGTGTAACACCACTTGAAGGCAACATCTGTGGTACAATACTTGATAATAGAAATATCCATAAGTTCAATACTAATGAAAACTATTTAGAAAATGGCGATTTATCT
GATGAAAATGTGAAGGGTGATATTGTGGCAAACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACACGAAGATACATCGAAGAACTTTTAGATTCGAA
GTCTGAACATAACAAGGGAAGGCGAAAACTTCCTAGAAAAGATAAATACCTGAAAGTGATGTCTACTGAAGAATCCAATCACATTAGACATGAGGTACAAATGACGCCTA
GAAGTGATTCACAATGTGGTACGTCTGTTCCAGTGCAGCCTAAATCTGAAAGAAGACATCCAAAGAAGCATGTGCCAGATTCAGAATTTCTATCCGAAGATGAATTGTCT
GCAACTGAGTGTAAGAATGTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGCGCCAGAAGATGTGGACCCTCACTGAAGTAATGCGATTAGTTGATGGAAT
CGCCGAGTATGGAACTGGCCGCTGGACTCATATTAAGAAGCATCTATTTGCATCTTCTCCTCATCGTACACCTATAGATCTCAGGGACAAATGGCGAAATCTTCTGAGAG
CTAGCTGTGTTAACATACAGAACAAAAAAGGGATTGAAGGGAAGCAGACACATGCCTCACGTCCATTACCAAAGTCCCTGCTCCAACGTGTTTATGAACTGGCCAACATC
TATCCATACCCAAAGGAGCGCGGTCCAAAATCAGTCAAAGAAATTACACCTCCCATGGATCTTATTGAAAGTAACTCTTTGTCATTCAATTGGGGGCGGAAGAAGTATGA
CTGA
mRNA sequence
AAAAAACCACATTTCTATTTTTCCGGTTTCGATCCTCTGAAACATCATTCTCTCCCCAAATCTTCTCTTCAACCTAAGCTCGGATCTTCACCTTCCGTGCTGGATTGATT
GAGCTTAAAAGTAAACAAATTATGGATCAAGAAGTGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAAGTGGAGAGACCTTTTCTTCCTACACCACT
AAATGATTCAAATGAAGTTGAGGATTTACTTGTGGAGTCTAAAAGCGAGCATGTTTTAGGAAATTGTTTGAGAGTTCAAGATTTCTCTTGCGACTTTGGCTATGGAATAC
AAACAAACGGTGGATTGGATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAGATCTTGATCAACTGCTGGATGATGCCAATGAAGTAGGGGAATTCCAT
GCAACAAACAATCTGCCAAATACATATGTCGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTTCAATTGGGAAACTCAAGTTCAGAGAGGGAATCTCAGGGACC
AAGCAGGATTGATACTGATGCTTTTGGAATATCAGAATTATCAGCAACAATGGTAATGGAGGCTGAATTCAGTAATACACCTGTTGAGAGGGGTTTAACTCATGAGTTGT
CCCCAGGTCTGGGGACCAAAGGTAGGTGTGTAACACCACTTGAAGGCAACATCTGTGGTACAATACTTGATAATAGAAATATCCATAAGTTCAATACTAATGAAAACTAT
TTAGAAAATGGCGATTTATCTGATGAAAATGTGAAGGGTGATATTGTGGCAAACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACACGAAGATACAT
CGAAGAACTTTTAGATTCGAAGTCTGAACATAACAAGGGAAGGCGAAAACTTCCTAGAAAAGATAAATACCTGAAAGTGATGTCTACTGAAGAATCCAATCACATTAGAC
ATGAGGTACAAATGACGCCTAGAAGTGATTCACAATGTGGTACGTCTGTTCCAGTGCAGCCTAAATCTGAAAGAAGACATCCAAAGAAGCATGTGCCAGATTCAGAATTT
CTATCCGAAGATGAATTGTCTGCAACTGAGTGTAAGAATGTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGCGCCAGAAGATGTGGACCCTCACTGAAGT
AATGCGATTAGTTGATGGAATCGCCGAGTATGGAACTGGCCGCTGGACTCATATTAAGAAGCATCTATTTGCATCTTCTCCTCATCGTACACCTATAGATCTCAGGGACA
AATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATACAGAACAAAAAAGGGATTGAAGGGAAGCAGACACATGCCTCACGTCCATTACCAAAGTCCCTGCTCCAACGT
GTTTATGAACTGGCCAACATCTATCCATACCCAAAGGAGCGCGGTCCAAAATCAGTCAAAGAAATTACACCTCCCATGGATCTTATTGAAAGTAACTCTTTGTCATTCAA
TTGGGGGCGGAAGAAGTATGACTGACATCAACTTTGGAAGCAGCAGAAATTCCTTTGCAGTCGGAGGTTGAAGTCTAAGTAATACTTATAATTAGATGTAAAAAGATCTC
TGTTTCTGTTTTGACCCTTTTGTAATGGTGATATGCACTTTGAAATTGGGAAGAAAATCTTTTATAAAAGCCACGGAGC
Protein sequence
MDQEVHFCQKFTNMKSHWVKVERPFLPTPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGLDSNSKQGGEHELKFGDLDQLLDDANEVGEFHATNNLPN
TYVEVAENSFRQNRGFQLGNSSSERESQGPSRIDTDAFGISELSATMVMEAEFSNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILDNRNIHKFNTNENYLENGDLS
DENVKGDIVANELASCSRERRLRKPTRRYIEELLDSKSEHNKGRRKLPRKDKYLKVMSTEESNHIRHEVQMTPRSDSQCGTSVPVQPKSERRHPKKHVPDSEFLSEDELS
ATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANI
YPYPKERGPKSVKEITPPMDLIESNSLSFNWGRKKYD
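
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base; `translate` is a hypothetical helper written for this illustration, not a function of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted as-is. Codons with ambiguous bases (e.g. N) raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGGATCAAGAAGTGCATTTCTGC"))
print(translate("AAGAAGTATGACTGA"))
```

The two fragments translate to MDQEVHFC and KKYD, matching the start and end of the listed protein sequence. Passing the full CDS, with its line breaks left in, gives the complete translation for comparison.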