; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017181 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017181
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationtig00153033:307379..315632
RNA-Seq ExpressionSgr017181
SyntenySgr017181
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008449224.1 PREDICTED: uncharacterized protein LOC103491166 isoform X1 [Cucumis melo]5.6e-21378.54Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEG FLPAPLN+SNEVE LLVE KS H+LGNCLRVQDFSCDF YGIQTNGGGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNG-------------------------YRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARIL
        GEFHATNNLPNTY EVAENSFR+N   Q+ +++                            A  NNTPVERGLTHEL  GL TKGRCV+PLEG+IC  IL
Subjt:  GEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNG-------------------------YRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARIL

Query:  DNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESH
        DN NIHKF+TNE++IENGDLSDENVKGDI A+ L  CSRERRLRKPTRRYIEEF DSK E  KG RK P KDKY+KV S EES HIRH+VQM+ R  +S 
Subjt:  DNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESH

Query:  CGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDK
        CGTSVPVQ +S RRHP KH+P SGFLSEDESSATECKNVYSSA+RCKKYDRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSP+RTPIDLRDK
Subjt:  CGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDK

Query:  WRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        WRNLL+ASCVNIQN+KG+E KQ+HASRPLPKSLLQRVYELANIYPYPKER  KSVKA  PP+ LIESNSLSFNWGRKKYE
Subjt:  WRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

XP_022143737.1 uncharacterized protein LOC111013581 isoform X1 [Momordica charantia]1.4e-22481.74Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+GSFLPAPLNE NEVEHLLVEPKSNH+LG+CLR QDFSCDFDYGIQTN GGLDSNSKQ GEHELKF DLDQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS------------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARI
         EFHATNNLPNTYTEVAENSFR+N GLQ+  +S+     GS                        N TPV+RGLTHELC GL TKGRC +PL+GSIC+ I
Subjt:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS------------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARI

Query:  LDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGES
        LDNINIHKFDTNE  +ENG LSDENVKG+IEAS L ICSR+RRLRKPTRRYIEEFADSK E +KG RKPPTKDKY+KVTSIEESNHIRHKVQMLT GGES
Subjt:  LDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGES

Query:  HCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLR-
        HCGTS+PVQSRS RR PKKH+P SGFLSE+ESSATECK VYSSAKRCKK+DRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SPYRTPIDLR 
Subjt:  HCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLR-

Query:  DKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        DKWRNLL+ASCVNIQNR GIE+KQSHASRPLPKSLLQRVYELANIYPYPKERS KSVKAT  P+HLIESNSLSFNWGRKKY+
Subjt:  DKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

XP_022143738.1 uncharacterized protein LOC111013581 isoform X2 [Momordica charantia]5.7e-22681.91Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+GSFLPAPLNE NEVEHLLVEPKSNH+LG+CLR QDFSCDFDYGIQTN GGLDSNSKQ GEHELKF DLDQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS------------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARI
         EFHATNNLPNTYTEVAENSFR+N GLQ+  +S+     GS                        N TPV+RGLTHELC GL TKGRC +PL+GSIC+ I
Subjt:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS------------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARI

Query:  LDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGES
        LDNINIHKFDTNE  +ENG LSDENVKG+IEAS L ICSR+RRLRKPTRRYIEEFADSK E +KG RKPPTKDKY+KVTSIEESNHIRHKVQMLT GGES
Subjt:  LDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGES

Query:  HCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRD
        HCGTS+PVQSRS RR PKKH+P SGFLSE+ESSATECK VYSSAKRCKK+DRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SPYRTPIDLRD
Subjt:  HCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRD

Query:  KWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        KWRNLL+ASCVNIQNR GIE+KQSHASRPLPKSLLQRVYELANIYPYPKERS KSVKAT  P+HLIESNSLSFNWGRKKY+
Subjt:  KWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

XP_038881566.1 uncharacterized protein LOC120073047 isoform X1 [Benincasa hispida]3.5e-22381.37Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNG-GGLDSNSKQGGEHELKFGDLDQLLDDANE
        MDQEVHFCQKFTNMKSHWV+VEG FLPAPLN+SNEVE LLVEPKS+H+LGNCLRVQDFSCDF YGIQTNG GGLDSNSKQGGEHELKFGDLDQLLDDANE
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNG-GGLDSNSKQGGEHELKFGDLDQLLDDANE

Query:  VGEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNGYRAGS-------------------------NNTPVERGLTHELCGGLMTKGRCV--SPLEGSICA
        VGEFHATNNL +TY EVAENSFRQN GLQ+ +++                               NNTPVERGLTHEL  GL TKGRCV  +PLEG+IC 
Subjt:  VGEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNGYRAGS-------------------------NNTPVERGLTHELCGGLMTKGRCV--SPLEGSICA

Query:  RILDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGG
         ILDN NIHKF+TNE++IENGDLSDENVKGDI A+ L  CSRERRLRKPTRRYIEEFADSK E  KG RKPPTKDKY+KVTS EESNHIRH+VQMLT   
Subjt:  RILDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGG

Query:  ESHCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDL
        E HCGTSVPVQSRS RRHPKKH+P SGFLSEDESSATECKNVYSS KRCKKYDRR+HQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSP+RTPIDL
Subjt:  ESHCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDL

Query:  RDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        RDKWRNLL+ASCVNIQNRKGIE+KQSHASRPLPKSLLQRVYELANIYPYPKERS KSVKAT PP+HLIESNSLSFNWGRKKYE
Subjt:  RDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

XP_038881569.1 uncharacterized protein LOC120073047 isoform X3 [Benincasa hispida]9.5e-22181.16Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNG-GGLDSNSKQGGEHELKFGDLDQLLDDANE
        MDQEVHFCQKFTNMKSHWV+VEG FLPAPLN+SNEVE LLVEPKS+H+LGNCLRVQDFSCDF YGIQTNG GGLDSNSKQGGEHELKFGDLDQLLDDANE
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNG-GGLDSNSKQGGEHELKFGDLDQLLDDANE

Query:  VGEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNGYRAGS-------------------------NNTPVERGLTHELCGGLMTKGRCV--SPLEGSICA
        VGEFHATNNL N   EVAENSFRQN GLQ+ +++                               NNTPVERGLTHEL  GL TKGRCV  +PLEG+IC 
Subjt:  VGEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNGYRAGS-------------------------NNTPVERGLTHELCGGLMTKGRCV--SPLEGSICA

Query:  RILDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGG
         ILDN NIHKF+TNE++IENGDLSDENVKGDI A+ L  CSRERRLRKPTRRYIEEFADSK E  KG RKPPTKDKY+KVTS EESNHIRH+VQMLT   
Subjt:  RILDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGG

Query:  ESHCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDL
        E HCGTSVPVQSRS RRHPKKH+P SGFLSEDESSATECKNVYSS KRCKKYDRR+HQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSP+RTPIDL
Subjt:  ESHCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDL

Query:  RDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        RDKWRNLL+ASCVNIQNRKGIE+KQSHASRPLPKSLLQRVYELANIYPYPKERS KSVKAT PP+HLIESNSLSFNWGRKKYE
Subjt:  RDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

TrEMBL top hitse value%identityAlignment
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X12.7e-21378.54Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEG FLPAPLN+SNEVE LLVE KS H+LGNCLRVQDFSCDF YGIQTNGGGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNG-------------------------YRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARIL
        GEFHATNNLPNTY EVAENSFR+N   Q+ +++                            A  NNTPVERGLTHEL  GL TKGRCV+PLEG+IC  IL
Subjt:  GEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNG-------------------------YRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARIL

Query:  DNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESH
        DN NIHKF+TNE++IENGDLSDENVKGDI A+ L  CSRERRLRKPTRRYIEEF DSK E  KG RK P KDKY+KV S EES HIRH+VQM+ R  +S 
Subjt:  DNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESH

Query:  CGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDK
        CGTSVPVQ +S RRHP KH+P SGFLSEDESSATECKNVYSSA+RCKKYDRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSP+RTPIDLRDK
Subjt:  CGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDK

Query:  WRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        WRNLL+ASCVNIQN+KG+E KQ+HASRPLPKSLLQRVYELANIYPYPKER  KSVKA  PP+ LIESNSLSFNWGRKKYE
Subjt:  WRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X22.5e-21178.33Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEG FLPAPLN+SNEVE LLVE KS H+LGNCLRVQDFSCDF YGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNG-------------------------YRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARIL
        GEFHATNNLPNTY EVAENSFR+N   Q+ +++                            A  NNTPVERGLTHEL  GL TKGRCV+PLEG+IC  IL
Subjt:  GEFHATNNLPNTYTEVAENSFRQNSGLQIVSNNG-------------------------YRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARIL

Query:  DNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESH
        DN NIHKF+TNE++IENGDLSDENVKGDI A+ L  CSRERRLRKPTRRYIEEF DSK E  KG RK P KDKY+KV S EES HIRH+VQM+ R  +S 
Subjt:  DNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESH

Query:  CGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDK
        CGTSVPVQ +S RRHP KH+P SGFLSEDESSATECKNVYSSA+RCKKYDRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSP+RTPIDLRDK
Subjt:  CGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDK

Query:  WRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        WRNLL+ASCVNIQN+KG+E KQ+HASRPLPKSLLQRVYELANIYPYPKER  KSVKA  PP+ LIESNSLSFNWGRKKYE
Subjt:  WRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X22.8e-22681.91Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+GSFLPAPLNE NEVEHLLVEPKSNH+LG+CLR QDFSCDFDYGIQTN GGLDSNSKQ GEHELKF DLDQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS------------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARI
         EFHATNNLPNTYTEVAENSFR+N GLQ+  +S+     GS                        N TPV+RGLTHELC GL TKGRC +PL+GSIC+ I
Subjt:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS------------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARI

Query:  LDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGES
        LDNINIHKFDTNE  +ENG LSDENVKG+IEAS L ICSR+RRLRKPTRRYIEEFADSK E +KG RKPPTKDKY+KVTSIEESNHIRHKVQMLT GGES
Subjt:  LDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGES

Query:  HCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRD
        HCGTS+PVQSRS RR PKKH+P SGFLSE+ESSATECK VYSSAKRCKK+DRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SPYRTPIDLRD
Subjt:  HCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRD

Query:  KWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        KWRNLL+ASCVNIQNR GIE+KQSHASRPLPKSLLQRVYELANIYPYPKERS KSVKAT  P+HLIESNSLSFNWGRKKY+
Subjt:  KWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X16.8e-22581.74Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+GSFLPAPLNE NEVEHLLVEPKSNH+LG+CLR QDFSCDFDYGIQTN GGLDSNSKQ GEHELKF DLDQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS------------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARI
         EFHATNNLPNTYTEVAENSFR+N GLQ+  +S+     GS                        N TPV+RGLTHELC GL TKGRC +PL+GSIC+ I
Subjt:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS------------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARI

Query:  LDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGES
        LDNINIHKFDTNE  +ENG LSDENVKG+IEAS L ICSR+RRLRKPTRRYIEEFADSK E +KG RKPPTKDKY+KVTSIEESNHIRHKVQMLT GGES
Subjt:  LDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGES

Query:  HCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLR-
        HCGTS+PVQSRS RR PKKH+P SGFLSE+ESSATECK VYSSAKRCKK+DRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SPYRTPIDLR 
Subjt:  HCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLR-

Query:  DKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        DKWRNLL+ASCVNIQNR GIE+KQSHASRPLPKSLLQRVYELANIYPYPKERS KSVKAT  P+HLIESNSLSFNWGRKKY+
Subjt:  DKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

A0A6J1ECM4 uncharacterized protein LOC111433102 isoform X11.6e-18170Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNM  HWVK+EGSFLPAPLNESNEV+H LVEPKS+H LGNCLRVQDFS DF Y IQTNG                              
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS-----------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARIL
                      EV ENSFRQN GLQ+   S+     GS                       NNTPVER LT EL  GL T+G C +P EG+IC  IL
Subjt:  GEFHATNNLPNTYTEVAENSFRQNSGLQI--VSNNGYRAGS-----------------------NNTPVERGLTHELCGGLMTKGRCVSPLEGSICARIL

Query:  DNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESH
        DN NIHKF+TNE+++EN  +SDENVKGDI AS L  CSRERRLRKPTRRYIEEFADSK E  KG RKPPTKDKY+KVTS EESNHIRHKVQMLT   ESH
Subjt:  DNINIHKFDTNESHIENGDLSDENVKGDIEASNLVICSRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESH

Query:  CGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDK
        CGTSVPVQSRS RRHP+KH+P SGFLSEDE SATECKNVYSSAK CKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWT IK+HLFASSP+RTPIDLRDK
Subjt:  CGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDK

Query:  WRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE
        WRNLLKASCVNIQN KG E KQ HASRPLPKSLLQRVYELANIYPYPKERS K V+   PP++LIESNSLSFNWGRKKYE
Subjt:  WRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKATIPPVHLIESNSLSFNWGRKKYE

SwissProt top hitse value%identityAlignment
Q9C7B1 Telomere repeat-binding protein 36.3e-1035.63Show/hide
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRV
        +R+ ++ +++TEV  LV  + E GTGRW  +K   F  + +RT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRV

Q9FFY9 Telomere repeat-binding protein 46.3e-1034.26Show/hide
Query:  ESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPL
        ES A     V    KR  +  +R+ ++ +++TEV  LV  + E GTGRW  +K   F ++ +RT +DL+DKW+ L+  + ++ Q R+G          P+
Subjt:  ESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPL

Query:  PKSLLQRV
        P+ LL RV
Subjt:  PKSLLQRV

Q9LL45 Telomere-binding protein 17.0e-0933Show/hide
Query:  NVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRV
        NV  S  +   + +R+ ++ +T+ EV  LV+ +   GTGRW  +K   F +  +RT +DL+DKW+ L+  + +  Q R+G          P+P+ LL RV
Subjt:  NVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRV

Q9M347 Telomere repeat-binding protein 63.2e-0934.48Show/hide
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRV
        +R+ ++ +T++EV  LV  +   GTGRW  +K H F    +RT +DL+DKW+ L+  + ++ + R+G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 24.1e-0934.48Show/hide
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRV
        +R+ ++ +++TEV  LV  + + GTGRW  +K   F  + +RT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRV

Arabidopsis top hitse value%identityAlignment
AT1G17460.1 TRF-like 32.3e-1538.39Show/hide
Query:  ESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPL
        +SS     + +  A   +    RK  + WT++EV +LV+G+++YG G+WT IKK  F+   +RT +DL+DKWRNL KAS  N +   G+++   H S  +
Subjt:  ESSATECKNVYSSAKRCKKYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPL

Query:  PKSLLQRVYELA
        P  ++ +V ELA
Subjt:  PKSLLQRVYELA

AT1G72650.1 TRF-like 62.5e-2228.66Show/hide
Query:  DTNESHIENGDLSDENVKGDIEASNLVICSRE--------RRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYM----KVTSIEESNHIRHKV-QMLTR
        D   + +++   S ++V G     N    S +        +R+RKPTRRYIEE +++  +        P+KD+ +    +V SI  S+  R  V +M++ 
Subjt:  DTNESHIENGDLSDENVKGDIEASNLVICSRE--------RRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYM----KVTSIEESNHIRHKV-QMLTR

Query:  GGESHCGTSVPVQSRSHRRHPKKHIP-----DSGFLSEDESSATECK----------------NVYSSAKRCKKYD------------------------
         G       VP  S   R  P+++I       S +L ED++SA E                  +V  SA R  + +                        
Subjt:  GGESHCGTSVPVQSRSHRRHPKKHIP-----DSGFLSEDESSATECK----------------NVYSSAKRCKKYD------------------------

Query:  -----------------------RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASR
                               RRKH + WTL+E+ +LV+G+++YG G+W+ IKKHLF+S  YRT +DL+DKWRNLLK S     +   +   + H S 
Subjt:  -----------------------RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASR

Query:  PLPKSLLQRVYELA
         +P  +L RV ELA
Subjt:  PLPKSLLQRVYELA

AT1G72650.2 TRF-like 62.5e-2228.66Show/hide
Query:  DTNESHIENGDLSDENVKGDIEASNLVICSRE--------RRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYM----KVTSIEESNHIRHKV-QMLTR
        D   + +++   S ++V G     N    S +        +R+RKPTRRYIEE +++  +        P+KD+ +    +V SI  S+  R  V +M++ 
Subjt:  DTNESHIENGDLSDENVKGDIEASNLVICSRE--------RRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYM----KVTSIEESNHIRHKV-QMLTR

Query:  GGESHCGTSVPVQSRSHRRHPKKHIP-----DSGFLSEDESSATECK----------------NVYSSAKRCKKYD------------------------
         G       VP  S   R  P+++I       S +L ED++SA E                  +V  SA R  + +                        
Subjt:  GGESHCGTSVPVQSRSHRRHPKKHIP-----DSGFLSEDESSATECK----------------NVYSSAKRCKKYD------------------------

Query:  -----------------------RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASR
                               RRKH + WTL+E+ +LV+G+++YG G+W+ IKKHLF+S  YRT +DL+DKWRNLLK S     +   +   + H S 
Subjt:  -----------------------RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASR

Query:  PLPKSLLQRVYELA
         +P  +L RV ELA
Subjt:  PLPKSLLQRVYELA

AT2G37025.1 TRF-like 83.9e-3130.62Show/hide
Query:  FLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEVGEFHATNNLPNTY------TEVAE
        FL  P +++  V+  L     N  +G   ++++FS  FD     + GG+++ S Q  E  LKF  LD +LD  +EV + +A++ L +        TEV +
Subjt:  FLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEVGEFHATNNLPNTY------TEVAE

Query:  NSFRQNS-GLQIVSNNGYRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARILDNINI-HKFDTNESHIENGDLS---DENVKGDIEASNLV--I
        N    +  GL   S+  Y  G + +            GL    +   P   S   +I+ +  +   +DTNES  +   LS      +K       L+   
Subjt:  NSFRQNS-GLQIVSNNGYRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARILDNINI-HKFDTNESHIENGDLS---DENVKGDIEASNLV--I

Query:  CSR--ERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESHCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSAT
        CSR    R RK    Y     D +P      RK P K KY+  TS+E++                                           S+D+ + +
Subjt:  CSR--ERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESHCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSAT

Query:  ECKNVYSSAKRCK-KYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSL
        E ++  S  K  + K DRRK+Q++WTL EVM LVDGI+ +G G+WT IK H F  + +R P+D+RDKWRNLLKAS     N    E+K+   +R +PK +
Subjt:  ECKNVYSSAKRCK-KYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSL

Query:  LQRVYELANIYPYPKERS
        L RV ELA+++PYP  +S
Subjt:  LQRVYELANIYPYPKERS

AT2G37025.2 TRF-like 83.9e-3130.62Show/hide
Query:  FLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEVGEFHATNNLPNTY------TEVAE
        FL  P +++  V+  L     N  +G   ++++FS  FD     + GG+++ S Q  E  LKF  LD +LD  +EV + +A++ L +        TEV +
Subjt:  FLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEVGEFHATNNLPNTY------TEVAE

Query:  NSFRQNS-GLQIVSNNGYRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARILDNINI-HKFDTNESHIENGDLS---DENVKGDIEASNLV--I
        N    +  GL   S+  Y  G + +            GL    +   P   S   +I+ +  +   +DTNES  +   LS      +K       L+   
Subjt:  NSFRQNS-GLQIVSNNGYRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARILDNINI-HKFDTNESHIENGDLS---DENVKGDIEASNLV--I

Query:  CSR--ERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESHCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSAT
        CSR    R RK    Y     D +P      RK P K KY+  TS+E++                                           S+D+ + +
Subjt:  CSR--ERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESHCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSAT

Query:  ECKNVYSSAKRCK-KYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSL
        E ++  S  K  + K DRRK+Q++WTL EVM LVDGI+ +G G+WT IK H F  + +R P+D+RDKWRNLLKAS     N    E+K+   +R +PK +
Subjt:  ECKNVYSSAKRCK-KYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSL

Query:  LQRVYELANIYPYPKERS
        L RV ELA+++PYP  +S
Subjt:  LQRVYELANIYPYPKERS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATTGAGCTTAAAAGTACACAAAATTATGGATCAAGAAGTGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAGGTGGAGGGATCTTTTCTTCCTGCA
CCCTTAAATGAATCAAATGAAGTTGAGCATTTACTTGTGGAGCCCAAAAGCAACCATATTTTAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGTGACTTTGACTATGG
GATACAAACAAACGGTGGTGGATTGGATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAGATCTTGATCAACTGCTGGATGATGCCAATGAAGTAGGGG
AATTCCATGCAACAAACAATCTGCCAAACACATATACAGAAGTTGCTGAAAATTCTTTCAGACAAAATAGCGGATTACAAATTGTCAGCAACAATGGTTATAGAGCTGGA
TCCAATAATACACCTGTTGAGAGGGGTTTGACTCATGAGTTGTGCGGTGGTCTGATGACCAAAGGTAGGTGTGTATCACCACTTGAAGGCAGCATCTGCGCTAGGATACT
TGATAATATAAATATCCATAAGTTTGATACTAATGAAAGCCATATAGAAAATGGTGATTTATCTGATGAAAATGTAAAGGGTGATATTGAGGCAAGCAACCTTGTCATTT
GTTCAAGGGAGAGGAGATTGCGTAAACCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGCCTGAAGGTTACAAGGGAGGGAGAAAACCTCCTACAAAAGATAAA
TATATGAAAGTCACGTCTATTGAAGAGTCTAATCATATTAGACATAAGGTACAAATGTTGACGCGTGGAGGGGAATCACATTGTGGTACTTCTGTTCCGGTGCAGTCTCG
ATCTCATAGAAGACATCCAAAGAAGCATATACCAGATTCAGGATTTCTATCAGAAGACGAATCTTCTGCAACTGAATGTAAAAATGTTTATTCATCTGCTAAACGATGTA
AAAAGTATGATAGGAGGAAGCATCAGAAGATGTGGACCCTTACTGAAGTAATGAGATTAGTTGATGGAATTGCTGAATATGGAACTGGCCGCTGGACTCATATAAAGAAG
CACCTATTTGCATCTTCTCCTTATCGCACGCCTATAGATCTCAGGGACAAATGGCGAAATCTTCTGAAAGCTAGCTGTGTTAACATACAGAACAGAAAAGGGATCGAACA
GAAGCAATCACATGCCTCACGTCCATTGCCTAAGTCCCTGCTCCAACGTGTCTATGAACTGGCCAATATTTATCCATATCCAAAGGAGCGCAGTACAAAATCAGTCAAAG
CAACTATACCTCCCGTGCATCTTATTGAAAGTAACTCTTTGTCATTTAATTGGGGGCGGAAGAAGTATGAATGA
mRNA sequenceShow/hide mRNA sequence
GATTGAGCTTAAAAGTACACAAAATTATGGATCAAGAAGTGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAGGTGGAGGGATCTTTTCTTCCTGCA
CCCTTAAATGAATCAAATGAAGTTGAGCATTTACTTGTGGAGCCCAAAAGCAACCATATTTTAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGTGACTTTGACTATGG
GATACAAACAAACGGTGGTGGATTGGATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAGATCTTGATCAACTGCTGGATGATGCCAATGAAGTAGGGG
AATTCCATGCAACAAACAATCTGCCAAACACATATACAGAAGTTGCTGAAAATTCTTTCAGACAAAATAGCGGATTACAAATTGTCAGCAACAATGGTTATAGAGCTGGA
TCCAATAATACACCTGTTGAGAGGGGTTTGACTCATGAGTTGTGCGGTGGTCTGATGACCAAAGGTAGGTGTGTATCACCACTTGAAGGCAGCATCTGCGCTAGGATACT
TGATAATATAAATATCCATAAGTTTGATACTAATGAAAGCCATATAGAAAATGGTGATTTATCTGATGAAAATGTAAAGGGTGATATTGAGGCAAGCAACCTTGTCATTT
GTTCAAGGGAGAGGAGATTGCGTAAACCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGCCTGAAGGTTACAAGGGAGGGAGAAAACCTCCTACAAAAGATAAA
TATATGAAAGTCACGTCTATTGAAGAGTCTAATCATATTAGACATAAGGTACAAATGTTGACGCGTGGAGGGGAATCACATTGTGGTACTTCTGTTCCGGTGCAGTCTCG
ATCTCATAGAAGACATCCAAAGAAGCATATACCAGATTCAGGATTTCTATCAGAAGACGAATCTTCTGCAACTGAATGTAAAAATGTTTATTCATCTGCTAAACGATGTA
AAAAGTATGATAGGAGGAAGCATCAGAAGATGTGGACCCTTACTGAAGTAATGAGATTAGTTGATGGAATTGCTGAATATGGAACTGGCCGCTGGACTCATATAAAGAAG
CACCTATTTGCATCTTCTCCTTATCGCACGCCTATAGATCTCAGGGACAAATGGCGAAATCTTCTGAAAGCTAGCTGTGTTAACATACAGAACAGAAAAGGGATCGAACA
GAAGCAATCACATGCCTCACGTCCATTGCCTAAGTCCCTGCTCCAACGTGTCTATGAACTGGCCAATATTTATCCATATCCAAAGGAGCGCAGTACAAAATCAGTCAAAG
CAACTATACCTCCCGTGCATCTTATTGAAAGTAACTCTTTGTCATTTAATTGGGGGCGGAAGAAGTATGAATGA
Protein sequenceShow/hide protein sequence
LSLKVHKIMDQEVHFCQKFTNMKSHWVKVEGSFLPAPLNESNEVEHLLVEPKSNHILGNCLRVQDFSCDFDYGIQTNGGGLDSNSKQGGEHELKFGDLDQLLDDANEVGE
FHATNNLPNTYTEVAENSFRQNSGLQIVSNNGYRAGSNNTPVERGLTHELCGGLMTKGRCVSPLEGSICARILDNINIHKFDTNESHIENGDLSDENVKGDIEASNLVIC
SRERRLRKPTRRYIEEFADSKPEGYKGGRKPPTKDKYMKVTSIEESNHIRHKVQMLTRGGESHCGTSVPVQSRSHRRHPKKHIPDSGFLSEDESSATECKNVYSSAKRCK
KYDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPYRTPIDLRDKWRNLLKASCVNIQNRKGIEQKQSHASRPLPKSLLQRVYELANIYPYPKERSTKSVKA
TIPPVHLIESNSLSFNWGRKKYE