; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0020845 (gene) of Chayote v1 genome

Gene IDSed0020845
OrganismSechium edule (Chayote v1)
DescriptionHTH myb-type domain-containing protein
Genome locationLG14:4066878..4073127
RNA-Seq ExpressionSed0020845
SyntenySed0020845
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008449224.1 PREDICTED: uncharacterized protein LOC103491166 isoform X1 [Cucumis melo]2.9e-21781.08Show/hide
Query:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV
        MDQEVHFC KFTNMKSHWVKVEG F+  PLN++NEVE LLVE KSE+VLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGG HELKFGD DQLLDDAN+V
Subjt:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV

Query:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRVTPLEGSICDT--
        GEFHATNNLPNTYAEVA NSFR+NRR Q+GN SSE+KS G SR DTDAFGISELSATMVMEAEFNNTPVERG THEL  GL TKGR VTPLEG+IC T  
Subjt:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRVTPLEGSICDT--

Query:  ---NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESH
           NIHKFNT E+YIENGDLSDENVKGD+VAN+LASCSRERRLRKPTRRYIEEF DSKSE N+G++K P KDKY+K    EES HIRH+VQM+  RS+S 
Subjt:  ---NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESH

Query:  CGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDK
        CGTSVPVQ +S+RR P KHVPV  FLSEDESSATEC+ VYSS  R KK+DRR+  KMWTLTEVMRLVDGI+EYGTGRWTHIKK+LFASSPHRTPIDLRDK
Subjt:  CGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDK

Query:  WRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        WRNL +ASCVN+QN+KG+E KQ+HASRPLPKSLLQRVYEL NIYPYPK+R PKSV+  TPPM L ESN SLSFNWGRKKYE
Subjt:  WRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

XP_022143738.1 uncharacterized protein LOC111013581 isoform X2 [Momordica charantia]4.5e-21881.12Show/hide
Query:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV
        MDQEVHFC KFTNMKSHWVKV+GSF+  PLNE NEVEHLLVEPKS +VLG+CLR QDFSCDF YGIQTN GGLDSNSKQ G HELKF DLDQLL D N+V
Subjt:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV

Query:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRRVTPLEGSICDT-
         EFHATNNLPNTY EVA NSFR+NR LQ+GN SSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+RG THELCAGLRTKGR  TPL+GSIC T 
Subjt:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRRVTPLEGSICDT-

Query:  ----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSES
            NIHKF+T E  +ENG LSDENVKG++ A+KLA CSR+RRLRKPTRRYIEEFADSKSES++GK+KPPTKDKY+K T IEESNHIRHKVQMLT   ES
Subjt:  ----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSES

Query:  HCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRD
        HCGTS+PVQSRSQRRLPKKHVPV  FLSE+ESSATEC+ VYSS  R KKHDRRKH KMWTLTEVMRLVDGI+EYGTGRWTHIKK+LFA+SP+RTPIDLRD
Subjt:  HCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRD

Query:  KWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        KWRNL +ASCVN+QNR GIERKQSHASRPLPKSLLQRVYEL NIYPYPK+RSPKSV+ TT PM+L ESN SLSFNWGRKKY+
Subjt:  KWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

XP_038881566.1 uncharacterized protein LOC120073047 isoform X1 [Benincasa hispida]3.3e-22984.71Show/hide
Query:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGGHELKFGDLDQLLDDAND
        MDQEVHFC KFTNMKSHWV+VEG F+  PLN++NEVE LLVEPKS++VLGNCLRVQDFSCDFGYGIQTNG GGLDSNSKQGG HELKFGDLDQLLDDAN+
Subjt:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGGHELKFGDLDQLLDDAND

Query:  VGEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRV--TPLEGSICD
        VGEFHATNNL +TYAEVA NSFRQNR LQ+GN SS SKSQG SRSDTDAFGISELSATMVME EFNNTPVERG THEL  GLRTKGR V  TPLEG+ICD
Subjt:  VGEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRV--TPLEGSICD

Query:  T-----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRS
        T     NIHKFNT E+YIENGDLSDENVKGD+VANKLASCSRERRLRKPTRRYIEEFADSKSE+N+G++KPPTKDKY+K T  EESNHIRH+VQMLT RS
Subjt:  T-----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRS

Query:  ESHCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDL
        E HCGTSVPVQSRSQRR PKKHVPV  FLSEDESSATEC+ VYSS  R KK+DRR+H KMW+LTEVMRLVDGI+EYGTGRWT IKK+LFASSPHRTPIDL
Subjt:  ESHCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDL

Query:  RDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        RDKWRNL +ASCVN+QNRKGIERKQSHASRPLPKSLLQRVYEL NIYPYPK+RSPKSV+ TTPPM+L ESN SLSFNWGRKKYE
Subjt:  RDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

XP_038881567.1 uncharacterized protein LOC120073047 isoform X2 [Benincasa hispida]2.9e-21784.33Show/hide
Query:  VKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGGHELKFGDLDQLLDDANDVGEFHATNNLPNTYAEVA
        V+VEG F+  PLN++NEVE LLVEPKS++VLGNCLRVQDFSCDFGYGIQTNG GGLDSNSKQGG HELKFGDLDQLLDDAN+VGEFHATNNL +TYAEVA
Subjt:  VKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGGHELKFGDLDQLLDDANDVGEFHATNNLPNTYAEVA

Query:  GNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRV--TPLEGSICDT-----NIHKFNTYESYI
         NSFRQNR LQ+GN SS SKSQG SRSDTDAFGISELSATMVME EFNNTPVERG THEL  GLRTKGR V  TPLEG+ICDT     NIHKFNT E+YI
Subjt:  GNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRV--TPLEGSICDT-----NIHKFNTYESYI

Query:  ENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESHCGTSVPVQSRSQRRL
        ENGDLSDENVKGD+VANKLASCSRERRLRKPTRRYIEEFADSKSE+N+G++KPPTKDKY+K T  EESNHIRH+VQMLT RSE HCGTSVPVQSRSQRR 
Subjt:  ENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESHCGTSVPVQSRSQRRL

Query:  PKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNR
        PKKHVPV  FLSEDESSATEC+ VYSS  R KK+DRR+H KMW+LTEVMRLVDGI+EYGTGRWT IKK+LFASSPHRTPIDLRDKWRNL +ASCVN+QNR
Subjt:  PKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNR

Query:  KGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        KGIERKQSHASRPLPKSLLQRVYEL NIYPYPK+RSPKSV+ TTPPM+L ESN SLSFNWGRKKYE
Subjt:  KGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

XP_038881569.1 uncharacterized protein LOC120073047 isoform X3 [Benincasa hispida]7.0e-22784.5Show/hide
Query:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGGHELKFGDLDQLLDDAND
        MDQEVHFC KFTNMKSHWV+VEG F+  PLN++NEVE LLVEPKS++VLGNCLRVQDFSCDFGYGIQTNG GGLDSNSKQGG HELKFGDLDQLLDDAN+
Subjt:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGGHELKFGDLDQLLDDAND

Query:  VGEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRV--TPLEGSICD
        VGEFHATNNL N  AEVA NSFRQNR LQ+GN SS SKSQG SRSDTDAFGISELSATMVME EFNNTPVERG THEL  GLRTKGR V  TPLEG+ICD
Subjt:  VGEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRV--TPLEGSICD

Query:  T-----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRS
        T     NIHKFNT E+YIENGDLSDENVKGD+VANKLASCSRERRLRKPTRRYIEEFADSKSE+N+G++KPPTKDKY+K T  EESNHIRH+VQMLT RS
Subjt:  T-----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRS

Query:  ESHCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDL
        E HCGTSVPVQSRSQRR PKKHVPV  FLSEDESSATEC+ VYSS  R KK+DRR+H KMW+LTEVMRLVDGI+EYGTGRWT IKK+LFASSPHRTPIDL
Subjt:  ESHCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDL

Query:  RDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        RDKWRNL +ASCVN+QNRKGIERKQSHASRPLPKSLLQRVYEL NIYPYPK+RSPKSV+ TTPPM+L ESN SLSFNWGRKKYE
Subjt:  RDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

TrEMBL top hitse value%identityAlignment
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X11.4e-21781.08Show/hide
Query:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV
        MDQEVHFC KFTNMKSHWVKVEG F+  PLN++NEVE LLVE KSE+VLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGG HELKFGD DQLLDDAN+V
Subjt:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV

Query:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRVTPLEGSICDT--
        GEFHATNNLPNTYAEVA NSFR+NRR Q+GN SSE+KS G SR DTDAFGISELSATMVMEAEFNNTPVERG THEL  GL TKGR VTPLEG+IC T  
Subjt:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRVTPLEGSICDT--

Query:  ---NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESH
           NIHKFNT E+YIENGDLSDENVKGD+VAN+LASCSRERRLRKPTRRYIEEF DSKSE N+G++K P KDKY+K    EES HIRH+VQM+  RS+S 
Subjt:  ---NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESH

Query:  CGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDK
        CGTSVPVQ +S+RR P KHVPV  FLSEDESSATEC+ VYSS  R KK+DRR+  KMWTLTEVMRLVDGI+EYGTGRWTHIKK+LFASSPHRTPIDLRDK
Subjt:  CGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDK

Query:  WRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        WRNL +ASCVN+QN+KG+E KQ+HASRPLPKSLLQRVYEL NIYPYPK+R PKSV+  TPPM L ESN SLSFNWGRKKYE
Subjt:  WRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X21.3e-21580.87Show/hide
Query:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV
        MDQEVHFC KFTNMKSHWVKVEG F+  PLN++NEVE LLVE KSE+VLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGG HELKFGD DQLLDDAN+V
Subjt:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV

Query:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRVTPLEGSICDT--
        GEFHATNNLPNTYAEVA NSFR+NRR Q+GN SSE+KS G SR DTDAFGISELSATMVMEAEFNNTPVERG THEL  GL TKGR VTPLEG+IC T  
Subjt:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRVTPLEGSICDT--

Query:  ---NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESH
           NIHKFNT E+YIENGDLSDENVKGD+VAN+LASCSRERRLRKPTRRYIEEF DSKSE N+G++K P KDKY+K    EES HIRH+VQM+  RS+S 
Subjt:  ---NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESH

Query:  CGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDK
        CGTSVPVQ +S+RR P KHVPV  FLSEDESSATEC+ VYSS  R KK+DRR+  KMWTLTEVMRLVDGI+EYGTGRWTHIKK+LFASSPHRTPIDLRDK
Subjt:  CGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDK

Query:  WRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        WRNL +ASCVN+QN+KG+E KQ+HASRPLPKSLLQRVYEL NIYPYPK+R PKSV+  TPPM L ESN SLSFNWGRKKYE
Subjt:  WRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X22.2e-21881.12Show/hide
Query:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV
        MDQEVHFC KFTNMKSHWVKV+GSF+  PLNE NEVEHLLVEPKS +VLG+CLR QDFSCDF YGIQTN GGLDSNSKQ G HELKF DLDQLL D N+V
Subjt:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV

Query:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRRVTPLEGSICDT-
         EFHATNNLPNTY EVA NSFR+NR LQ+GN SSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+RG THELCAGLRTKGR  TPL+GSIC T 
Subjt:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRRVTPLEGSICDT-

Query:  ----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSES
            NIHKF+T E  +ENG LSDENVKG++ A+KLA CSR+RRLRKPTRRYIEEFADSKSES++GK+KPPTKDKY+K T IEESNHIRHKVQMLT   ES
Subjt:  ----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSES

Query:  HCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRD
        HCGTS+PVQSRSQRRLPKKHVPV  FLSE+ESSATEC+ VYSS  R KKHDRRKH KMWTLTEVMRLVDGI+EYGTGRWTHIKK+LFA+SP+RTPIDLRD
Subjt:  HCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRD

Query:  KWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        KWRNL +ASCVN+QNR GIERKQSHASRPLPKSLLQRVYEL NIYPYPK+RSPKSV+ TT PM+L ESN SLSFNWGRKKY+
Subjt:  KWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X15.4e-21780.95Show/hide
Query:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV
        MDQEVHFC KFTNMKSHWVKV+GSF+  PLNE NEVEHLLVEPKS +VLG+CLR QDFSCDF YGIQTN GGLDSNSKQ G HELKF DLDQLL D N+V
Subjt:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV

Query:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRRVTPLEGSICDT-
         EFHATNNLPNTY EVA NSFR+NR LQ+GN SSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+RG THELCAGLRTKGR  TPL+GSIC T 
Subjt:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRRVTPLEGSICDT-

Query:  ----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSES
            NIHKF+T E  +ENG LSDENVKG++ A+KLA CSR+RRLRKPTRRYIEEFADSKSES++GK+KPPTKDKY+K T IEESNHIRHKVQMLT   ES
Subjt:  ----NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSES

Query:  HCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLR-
        HCGTS+PVQSRSQRRLPKKHVPV  FLSE+ESSATEC+ VYSS  R KKHDRRKH KMWTLTEVMRLVDGI+EYGTGRWTHIKK+LFA+SP+RTPIDLR 
Subjt:  HCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLR-

Query:  DKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        DKWRNL +ASCVN+QNR GIERKQSHASRPLPKSLLQRVYEL NIYPYPK+RSPKSV+ TT PM+L ESN SLSFNWGRKKY+
Subjt:  DKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

A0A6J1ECM4 uncharacterized protein LOC111433102 isoform X13.1e-18872.77Show/hide
Query:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV
        MDQEVHFC KFTNM  HWVK+EGSF+  PLNE+NEV+H LVEPKS++ LGNCLRVQDFS DFGY IQTNG                              
Subjt:  MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDV

Query:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRVTPLEGSICDT--
                     AEV  NSFRQNR LQ+G  SSESKSQGSSRSDTDAF ISELSATMVMEAEFNNTPVER  T EL +GLRT+G   TP EG+ICDT  
Subjt:  GEFHATNNLPNTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRVTPLEGSICDT--

Query:  ---NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESH
           NIHKFNT E+Y+EN  +SDENVKGD+VA+KLASCSRERRLRKPTRRYIEEFADSKSE+N+G++KPPTKDKY+K T  EESNHIRHKVQMLT + ESH
Subjt:  ---NIHKFNTYESYIENGDLSDENVKGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESH

Query:  CGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDK
        CGTSVPVQSRSQRR P+KHVPV  FLSEDE SATEC+ VYSS    KK+DRRKH KMWTLTEVMRLVDGI+EYGTGRWT IK++LFASSPHRTPIDLRDK
Subjt:  CGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDK

Query:  WRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE
        WRNL KASCVN+QN KG E KQ HASRPLPKSLLQRVYEL NIYPYPK+RSPK V   TPPMYL ESN SLSFNWGRKKYE
Subjt:  WRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE

SwissProt top hitse value%identityAlignment
Q9C7B1 Telomere repeat-binding protein 36.5e-1036.78Show/hide
Query:  RRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRV
        +R+  + +++TEV  LV  + E GTGRW  +K   F  + HRT +DL+DKW+ L   + ++ Q R+G          P+P+ LL RV
Subjt:  RRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRV

Q9FFY9 Telomere repeat-binding protein 42.1e-0828.22Show/hide
Query:  CSRERRLRKPT-RRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESHCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATE
        CS    L  PT    + E + +      G   PP  + Y    +I   N + +  +++   S+          SR+        VPVL      ES A  
Subjt:  CSRERRLRKPT-RRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESHCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATE

Query:  CEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQ
           V     R++   RR   + +++TEV  LV  + E GTGRW  +K   F ++ HRT +DL+DKW+ L   + ++ Q R+G          P+P+ LL 
Subjt:  CEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQ

Query:  RV
        RV
Subjt:  RV

Q9LL45 Telomere-binding protein 14.2e-0936.08Show/hide
Query:  VTRSKKHD--RRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRV
        ++RSK+ D  +R+  + +T+ EV  LV+ +   GTGRW  +K   F +  HRT +DL+DKW+ L   + +  Q R+G          P+P+ LL RV
Subjt:  VTRSKKHD--RRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRV

Q9M347 Telomere repeat-binding protein 61.2e-0834.48Show/hide
Query:  RRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRV
        +R+  + +T++EV  LV  +   GTGRW  +K + F    HRT +DL+DKW+ L   + ++ + R+G          P+P+ LL RV
Subjt:  RRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 24.2e-0935.63Show/hide
Query:  RRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRV
        +R+  + +++TEV  LV  + + GTGRW  +K   F  + HRT +DL+DKW+ L   + ++ Q R+G          P+P+ LL RV
Subjt:  RRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRV

Arabidopsis top hitse value%identityAlignment
AT1G17460.1 TRF-like 32.5e-1739.2Show/hide
Query:  ESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPL
        +SS    +  +     ++    RK H+ WT++EV +LV+G+S+YG G+WT IKK  F+   HRT +DL+DKWRNLQKAS  N +   G+++   H S  +
Subjt:  ESSATECEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPL

Query:  PKSLLQRVYELDNIYPYPKKRSPKS
        P  ++ +V EL       +K+SP S
Subjt:  PKSLLQRVYELDNIYPYPKKRSPKS

AT1G72650.1 TRF-like 65.8e-2228.62Show/hide
Query:  RRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESHCGT--SVPVQSRSQRRLPKKHVPVL---------------
        +R+RKPTRRYIEE +++  +    K   P+KD+ +       S  +    ++  +R  S  G+   VP  S  +R  P++++  L               
Subjt:  RRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESHCGT--SVPVQSRSQRRLPKKHVPVL---------------

Query:  -------------------------------EFLSEDESSA----------TECEKVYSSVTRSKKHD-----------RRKHHKMWTLTEVMRLVDGIS
                                       EF + DE++            E E + SS   S +++           RRKHH+ WTL+E+ +LV+G+S
Subjt:  -------------------------------EFLSEDESSA----------TECEKVYSSVTRSKKHD-----------RRKHHKMWTLTEVMRLVDGIS

Query:  EYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYEL
        +YG G+W+ IKK+LF+S  +RT +DL+DKWRNL K S     +   +   + H S  +P  +L RV EL
Subjt:  EYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYEL

AT1G72650.2 TRF-like 65.8e-2228.62Show/hide
Query:  RRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESHCGT--SVPVQSRSQRRLPKKHVPVL---------------
        +R+RKPTRRYIEE +++  +    K   P+KD+ +       S  +    ++  +R  S  G+   VP  S  +R  P++++  L               
Subjt:  RRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESHCGT--SVPVQSRSQRRLPKKHVPVL---------------

Query:  -------------------------------EFLSEDESSA----------TECEKVYSSVTRSKKHD-----------RRKHHKMWTLTEVMRLVDGIS
                                       EF + DE++            E E + SS   S +++           RRKHH+ WTL+E+ +LV+G+S
Subjt:  -------------------------------EFLSEDESSA----------TECEKVYSSVTRSKKHD-----------RRKHHKMWTLTEVMRLVDGIS

Query:  EYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYEL
        +YG G+W+ IKK+LF+S  +RT +DL+DKWRNL K S     +   +   + H S  +P  +L RV EL
Subjt:  EYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYEL

AT2G37025.1 TRF-like 82.7e-2745.19Show/hide
Query:  HVPVLEFLSEDESSATECEKVYSSVTRSK-KHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKG
        H  + +  S+D+ + +E E   S    S+ K DRRK+ ++WTL EVM LVDGIS +G G+WT IK + F  + HR P+D+RDKWRNL KAS     N   
Subjt:  HVPVLEFLSEDESSATECEKVYSSVTRSK-KHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKG

Query:  IERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSP
         E K+   +R +PK +L RV EL +++PYP  +SP
Subjt:  IERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSP

AT2G37025.2 TRF-like 82.7e-2745.19Show/hide
Query:  HVPVLEFLSEDESSATECEKVYSSVTRSK-KHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKG
        H  + +  S+D+ + +E E   S    S+ K DRRK+ ++WTL EVM LVDGIS +G G+WT IK + F  + HR P+D+RDKWRNL KAS     N   
Subjt:  HVPVLEFLSEDESSATECEKVYSSVTRSK-KHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKG

Query:  IERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSP
         E K+   +R +PK +L RV EL +++PYP  +SP
Subjt:  IERKQSHASRPLPKSLLQRVYELDNIYPYPKKRSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAAGAAGTGCATTTCTGCCCGAAGTTCACAAATATGAAATCTCATTGGGTAAAAGTGGAGGGATCTTTTATTTCTCGACCATTAAATGAAGCAAATGAAGTTGA
ACATTTACTTGTGGAGCCAAAAAGCGAAAATGTCTTAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGTGACTTTGGCTACGGGATACAAACAAACGGTGGTGGATTGG
ATTCTAATAGCAAGCAGGGAGGCGGACATGAGCTAAAATTTGGAGATCTTGATCAACTGCTGGATGATGCCAATGACGTAGGGGAATTCCATGCAACGAACAATCTGCCA
AATACATATGCAGAAGTTGCTGGAAATTCTTTCAGACAGAATAGGCGATTACAAATGGGAAACTTCAGTTCAGAGAGCAAGTCTCAGGGATCAAGCAGGAGTGACACTGA
TGCCTTTGGAATATCAGAATTGTCAGCAACGATGGTAATGGAGGCCGAATTCAATAATACGCCTGTTGAGAGGGGTTTTACTCATGAGTTGTGTGCTGGTCTGAGGACCA
AAGGTAGGCGTGTAACACCACTTGAAGGCAGCATCTGCGATACAAATATTCATAAATTTAATACTTACGAAAGCTATATAGAAAATGGAGATTTATCTGATGAAAATGTG
AAGGGTGATGTTGTGGCAAACAAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGTCTGAAAGTAA
CCAAGGAAAGAAAAAACCTCCTACAAAAGATAAATACATGAAAAGGACGATGATTGAGGAATCTAATCATATTAGACATAAGGTGCAAATGTTGACGTCTAGAAGTGAAT
CACATTGTGGTACTTCTGTTCCAGTGCAGTCTCGATCTCAAAGAAGACTTCCAAAGAAGCATGTACCCGTTTTAGAATTTCTATCGGAAGACGAATCTTCGGCAACAGAA
TGTGAAAAAGTCTATTCATCTGTTACAAGATCTAAAAAGCATGATAGAAGGAAGCACCATAAAATGTGGACCCTAACTGAAGTAATGCGATTAGTTGATGGAATTTCTGA
ATATGGAACTGGCCGCTGGACTCATATAAAGAAGTACCTATTTGCATCTTCTCCTCACCGCACGCCGATAGATCTCAGGGACAAATGGCGAAATCTTCAGAAAGCTAGCT
GTGTTAACATGCAGAACAGAAAAGGGATTGAACGGAAGCAGTCGCATGCGTCACGTCCACTGCCGAAGTCCCTGCTTCAACGTGTCTACGAACTCGACAATATTTATCCA
TATCCAAAGAAACGCAGTCCAAAATCAGTGGAACCAACTACACCTCCCATGTATCTTGACGAAAGTAACTCCTCTTTGTCATTCAATTGGGGGCGGAAGAAATATGAATG
A
mRNA sequenceShow/hide mRNA sequence
CCGGCAAGTAAAAAACGCCACATAAATACACTTCTCCTCTTCACACTCGCTTTTTTGGCGGTTTAAAAATTACCATAATTTCCATTTCCAGGTTTTCTTCTCTTCGAGCT
AAGCTCCGATCTTCAGTTTCCTCGCTGAGCTACTAATCCCTATCCCGGGCATGTAGGATTGAGCTAAGCCTACTTATGGATCAAGAAGTGCATTTCTGCCCGAAGTTCAC
AAATATGAAATCTCATTGGGTAAAAGTGGAGGGATCTTTTATTTCTCGACCATTAAATGAAGCAAATGAAGTTGAACATTTACTTGTGGAGCCAAAAAGCGAAAATGTCT
TAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGTGACTTTGGCTACGGGATACAAACAAACGGTGGTGGATTGGATTCTAATAGCAAGCAGGGAGGCGGACATGAGCTA
AAATTTGGAGATCTTGATCAACTGCTGGATGATGCCAATGACGTAGGGGAATTCCATGCAACGAACAATCTGCCAAATACATATGCAGAAGTTGCTGGAAATTCTTTCAG
ACAGAATAGGCGATTACAAATGGGAAACTTCAGTTCAGAGAGCAAGTCTCAGGGATCAAGCAGGAGTGACACTGATGCCTTTGGAATATCAGAATTGTCAGCAACGATGG
TAATGGAGGCCGAATTCAATAATACGCCTGTTGAGAGGGGTTTTACTCATGAGTTGTGTGCTGGTCTGAGGACCAAAGGTAGGCGTGTAACACCACTTGAAGGCAGCATC
TGCGATACAAATATTCATAAATTTAATACTTACGAAAGCTATATAGAAAATGGAGATTTATCTGATGAAAATGTGAAGGGTGATGTTGTGGCAAACAAACTTGCCAGTTG
TTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGTCTGAAAGTAACCAAGGAAAGAAAAAACCTCCTACAAAAGATAAAT
ACATGAAAAGGACGATGATTGAGGAATCTAATCATATTAGACATAAGGTGCAAATGTTGACGTCTAGAAGTGAATCACATTGTGGTACTTCTGTTCCAGTGCAGTCTCGA
TCTCAAAGAAGACTTCCAAAGAAGCATGTACCCGTTTTAGAATTTCTATCGGAAGACGAATCTTCGGCAACAGAATGTGAAAAAGTCTATTCATCTGTTACAAGATCTAA
AAAGCATGATAGAAGGAAGCACCATAAAATGTGGACCCTAACTGAAGTAATGCGATTAGTTGATGGAATTTCTGAATATGGAACTGGCCGCTGGACTCATATAAAGAAGT
ACCTATTTGCATCTTCTCCTCACCGCACGCCGATAGATCTCAGGGACAAATGGCGAAATCTTCAGAAAGCTAGCTGTGTTAACATGCAGAACAGAAAAGGGATTGAACGG
AAGCAGTCGCATGCGTCACGTCCACTGCCGAAGTCCCTGCTTCAACGTGTCTACGAACTCGACAATATTTATCCATATCCAAAGAAACGCAGTCCAAAATCAGTGGAACC
AACTACACCTCCCATGTATCTTGACGAAAGTAACTCCTCTTTGTCATTCAATTGGGGGCGGAAGAAATATGAATGACATCTACTTTGGAAGCAACAGAAATGTGGAAGTC
CAAATAATAATACTTACAATTAGATGTAAAAAAATCTTTGTTTCTATTTTGATCCTTTTGTACACTGATATGTACTCTAAAGTGGGAAGATAATCTTCCATTATTAACTG
AAAATTCCTAATACTTTCTCCTCTTCCATACCCCTGTCTTCTTAAGTTGCATTTTTTAGGTAATCATAAATCACGTCATAAACATTTCCTGATACCTATTGCTAATTCTT
GTGAGAAGCTAGATTCTG
Protein sequenceShow/hide protein sequence
MDQEVHFCPKFTNMKSHWVKVEGSFISRPLNEANEVEHLLVEPKSENVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGGHELKFGDLDQLLDDANDVGEFHATNNLP
NTYAEVAGNSFRQNRRLQMGNFSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRRVTPLEGSICDTNIHKFNTYESYIENGDLSDENV
KGDVVANKLASCSRERRLRKPTRRYIEEFADSKSESNQGKKKPPTKDKYMKRTMIEESNHIRHKVQMLTSRSESHCGTSVPVQSRSQRRLPKKHVPVLEFLSEDESSATE
CEKVYSSVTRSKKHDRRKHHKMWTLTEVMRLVDGISEYGTGRWTHIKKYLFASSPHRTPIDLRDKWRNLQKASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELDNIYP
YPKKRSPKSVEPTTPPMYLDESNSSLSFNWGRKKYE