CuGenDBv2

Spg012116 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg012116
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: HTH myb-type domain-containing protein
Genome location: scaffold1:13089828..13100802
RNA-Seq Expression: Spg012116
Synteny: Spg012116
Gene Ontology terms: NA
InterPro domains: IPR001005 - SANT/Myb domain
                  IPR009057 - Homeobox-like domain superfamily
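
The genome location field uses a scaffold:start..end notation. The following is a minimal sketch of splitting that string and computing the locus span. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names parse_location and span_length are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
    return end - start + 1


scaffold, start, end = parse_location("scaffold1:13089828..13100802")
print(scaffold, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 10975 bp; with half-open coordinates it would be one base shorter.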


Homology
GenBank top hits | e value | %identity | Alignment
XP_022143737.1 uncharacterized protein LOC111013581 isoform X1 [Momordica charantia] | 1.0e-193 | 63.64 | see below
Query:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL
        +VKV+ SFLPAPLNE NEVEHLLVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF                              
Subjt:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL

Query:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE
                                                                                             +DLDQLL D NEVEE
Subjt:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE

Query:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILD
        FHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+ GLTHE C GLRTKGRC TPL+GSIC TILD
Subjt:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILD

Query:  NRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMK
        N NIHKF+TNE                           +ENG LSDENVKG+I ASKLA CSR+RRLRKPTRRYIEEFADSKSES+KG+RKPPTKDKY+K
Subjt:  NRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMK

Query:  GTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG
         TSIEESNHIRHKVQMLTP  ESHCGTS+PVQSRSQRR PKKHVPVSGFLSE+ESSATECK V+SSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG
Subjt:  GTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG

Query:  HWTHIKKHLFASSPHRTPIDLR--------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWG
         WTHIKKHLFA+SP+RTPIDLR                    IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPK VKATT P+ L +SNSLSFNWG
Subjt:  HWTHIKKHLFASSPHRTPIDLR--------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWG

Query:  RKKYE
        RKKY+
Subjt:  RKKYE
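
In BLAST-style pairwise output, the unlabeled middle line marks each column: a letter means the residues are identical, a plus sign means a conservative substitution, and a space means a mismatch or gap. The sketch below counts these for a single alignment block, using the short final block above as input. The helper name midline_stats is hypothetical, and one block alone will not reproduce the %identity column, which is computed over the whole alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    # Letters in the midline mark identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


ident, pos, length = midline_stats("RKKYE", "RKKY+")
print(f"{ident}/{length} identical ({100 * ident / length:.1f}%), {pos}/{length} positive")
```

To apply this to a full hit, the per-block counts would be summed across all blocks before dividing, and padded blocks would need their trailing spaces preserved so the two strings stay the same length.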

XP_022143738.1 uncharacterized protein LOC111013581 isoform X2 [Momordica charantia] | 8.0e-194 | 63.74 | see below
Query:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL
        +VKV+ SFLPAPLNE NEVEHLLVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF                              
Subjt:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL

Query:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE
                                                                                             +DLDQLL D NEVEE
Subjt:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE

Query:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILD
        FHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+ GLTHE C GLRTKGRC TPL+GSIC TILD
Subjt:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILD

Query:  NRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMK
        N NIHKF+TNE                           +ENG LSDENVKG+I ASKLA CSR+RRLRKPTRRYIEEFADSKSES+KG+RKPPTKDKY+K
Subjt:  NRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMK

Query:  GTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG
         TSIEESNHIRHKVQMLTP  ESHCGTS+PVQSRSQRR PKKHVPVSGFLSE+ESSATECK V+SSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG
Subjt:  GTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG

Query:  HWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWGR
         WTHIKKHLFA+SP+RTPIDLR                   IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPK VKATT P+ L +SNSLSFNWGR
Subjt:  HWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWGR

Query:  KKYE
        KKY+
Subjt:  KKYE

XP_038881566.1 uncharacterized protein LOC120073047 isoform X1 [Benincasa hispida] | 1.2e-205 | 67 | see below
Query:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKK
        +V+VE  FLPAPLN+SNEVE LLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG GGLDSNSKQGGEHELKFG                            
Subjt:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKK

Query:  LAAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVE
                                                                                               DLDQLLDDANEV 
Subjt:  LAAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVE

Query:  EFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCV--TPLEGSICDTI
        EFHATNNL +TYAEVAENSFRQNRGLQLGN SS SKSQG SRSDTDAFGISELSATMVME EFNNTPVE GLTHE   GLRTKGRCV  TPLEG+ICDTI
Subjt:  EFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCV--TPLEGSICDTI

Query:  LDNRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKY
        LDNRNIHKFNTNE+                         YIENGDLSDENVKGDIVA+KLASCSRERRLRKPTRRYIEEFADSKSE+NKGRRKPPTKDKY
Subjt:  LDNRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKY

Query:  MKGTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYG
        +K TS EESNHIRH+VQMLTPRSE HCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNV+SS KRCKK+DRR+HQKMW+LTEVMRLVDGIAEYG
Subjt:  MKGTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYG

Query:  TGHWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNW
        TG WT IKKHLFASSPHRTPIDLR                   IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPK VKATTPP+ L +SNSLSFNW
Subjt:  TGHWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNW

Query:  GRKKYE
        GRKKYE
Subjt:  GRKKYE

XP_038881567.1 uncharacterized protein LOC120073047 isoform X2 [Benincasa hispida] | 2.0e-205 | 67.11 | see below
Query:  VKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL
        V+VE  FLPAPLN+SNEVE LLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG GGLDSNSKQGGEHELKFG                             
Subjt:  VKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL

Query:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE
                                                                                              DLDQLLDDANEV E
Subjt:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE

Query:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCV--TPLEGSICDTIL
        FHATNNL +TYAEVAENSFRQNRGLQLGN SS SKSQG SRSDTDAFGISELSATMVME EFNNTPVE GLTHE   GLRTKGRCV  TPLEG+ICDTIL
Subjt:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCV--TPLEGSICDTIL

Query:  DNRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYM
        DNRNIHKFNTNE+                         YIENGDLSDENVKGDIVA+KLASCSRERRLRKPTRRYIEEFADSKSE+NKGRRKPPTKDKY+
Subjt:  DNRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYM

Query:  KGTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGT
        K TS EESNHIRH+VQMLTPRSE HCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNV+SS KRCKK+DRR+HQKMW+LTEVMRLVDGIAEYGT
Subjt:  KGTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGT

Query:  GHWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWG
        G WT IKKHLFASSPHRTPIDLR                   IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPK VKATTPP+ L +SNSLSFNWG
Subjt:  GHWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWG

Query:  RKKYE
        RKKYE
Subjt:  RKKYE

XP_038881569.1 uncharacterized protein LOC120073047 isoform X3 [Benincasa hispida] | 2.5e-203 | 66.83 | see below
Query:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKK
        +V+VE  FLPAPLN+SNEVE LLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG GGLDSNSKQGGEHELKFG                            
Subjt:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKK

Query:  LAAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVE
                                                                                               DLDQLLDDANEV 
Subjt:  LAAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVE

Query:  EFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCV--TPLEGSICDTI
        EFHATNNL N  AEVAENSFRQNRGLQLGN SS SKSQG SRSDTDAFGISELSATMVME EFNNTPVE GLTHE   GLRTKGRCV  TPLEG+ICDTI
Subjt:  EFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCV--TPLEGSICDTI

Query:  LDNRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKY
        LDNRNIHKFNTNE+                         YIENGDLSDENVKGDIVA+KLASCSRERRLRKPTRRYIEEFADSKSE+NKGRRKPPTKDKY
Subjt:  LDNRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKY

Query:  MKGTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYG
        +K TS EESNHIRH+VQMLTPRSE HCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNV+SS KRCKK+DRR+HQKMW+LTEVMRLVDGIAEYG
Subjt:  MKGTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYG

Query:  TGHWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNW
        TG WT IKKHLFASSPHRTPIDLR                   IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPK VKATTPP+ L +SNSLSFNW
Subjt:  TGHWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNW

Query:  GRKKYE
        GRKKYE
Subjt:  GRKKYE

TrEMBL top hits | e value | %identity | Alignment
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X1 | 2.1e-192 | 63.52 | see below
Query:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL
        +VKVE  FLPAPLN+SNEVE LLVE KS+HVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFG                             
Subjt:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL

Query:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE
                                                                                              D DQLLDDANEV E
Subjt:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE

Query:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILDN
        FHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS G SR DTDAFGISELSATMVMEAEFNNTPVE GLTHE   GL TKGRCVTPLEG+IC TILDN
Subjt:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILDN

Query:  RNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKG
        RNIHKFNTNE+                         YIENGDLSDENVKGDIVA++LASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P KDKY+K 
Subjt:  RNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKG

Query:  TSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGH
         S EES HIRH+VQM+ PRS+S CGTSVPVQ +S+RRHP KHVPVSGFLSEDESSATECKNV+SSA+RCKK+DRR+ QKMWTLTEVMRLVDGIAEYGTG 
Subjt:  TSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGH

Query:  WTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWGRK
        WTHIKKHLFASSPHRTPIDLR                   +E KQ+HASRPLPKSLLQRVYELANIYPYPKER PK VKA TPP+ L +SNSLSFNWGRK
Subjt:  WTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWGRK

Query:  KYE
        KYE
Subjt:  KYE

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X2 | 2.0e-190 | 63.35 | see below
Query:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL
        +VKVE  FLPAPLN+SNEVE LLVE KS+HVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFG                             
Subjt:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL

Query:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE
                                                                                              D DQLLDDANEV E
Subjt:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE

Query:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILDN
        FHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS G SR DTDAFGISELSATMVMEAEFNNTPVE GLTHE   GL TKGRCVTPLEG+IC TILDN
Subjt:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILDN

Query:  RNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKG
        RNIHKFNTNE+                         YIENGDLSDENVKGDIVA++LASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P KDKY+K 
Subjt:  RNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKG

Query:  TSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGH
         S EES HIRH+VQM+ PRS+S CGTSVPVQ +S+RRHP KHVPVSGFLSEDESSATECKNV+SSA+RCKK+DRR+ QKMWTLTEVMRLVDGIAEYGTG 
Subjt:  TSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGH

Query:  WTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWGRK
        WTHIKKHLFASSPHRTPIDLR                   +E KQ+HASRPLPKSLLQRVYELANIYPYPKER PK VKA TPP+ L +SNSLSFNWGRK
Subjt:  WTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWGRK

Query:  KYE
        KYE
Subjt:  KYE

A0A6J1CQ81 uncharacterized protein LOC111013581 isoform X3 | 7.1e-172 | 77.07 | see below
Query:  KLLQDLDQLLDDANEVEEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRT
        K + DLDQLL D NEVEEFHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+ GLTHE C GLRT
Subjt:  KLLQDLDQLLDDANEVEEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRT

Query:  KGRCVTPLEGSICDTILDNRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSK
        KGRC TPL+GSIC TILDN NIHKF+TNE                           +ENG LSDENVKG+I ASKLA CSR+RRLRKPTRRYIEEFADSK
Subjt:  KGRCVTPLEGSICDTILDNRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSK

Query:  SESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMW
        SES+KG+RKPPTKDKY+K TSIEESNHIRHKVQMLTP  ESHCGTS+PVQSRSQRR PKKHVPVSGFLSE+ESSATECK V+SSAKRCKKHDRRKHQKMW
Subjt:  SESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMW

Query:  TLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLR--------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKA
        TLTEVMRLVDGIAEYGTG WTHIKKHLFA+SP+RTPIDLR                    IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPK VKA
Subjt:  TLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLR--------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKA

Query:  TTPPLLLTDSNSLSFNWGRKKYE
        TT P+ L +SNSLSFNWGRKKY+
Subjt:  TTPPLLLTDSNSLSFNWGRKKYE

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X2 | 3.9e-194 | 63.74 | see below
Query:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL
        +VKV+ SFLPAPLNE NEVEHLLVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF                              
Subjt:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL

Query:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE
                                                                                             +DLDQLL D NEVEE
Subjt:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE

Query:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILD
        FHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+ GLTHE C GLRTKGRC TPL+GSIC TILD
Subjt:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILD

Query:  NRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMK
        N NIHKF+TNE                           +ENG LSDENVKG+I ASKLA CSR+RRLRKPTRRYIEEFADSKSES+KG+RKPPTKDKY+K
Subjt:  NRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMK

Query:  GTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG
         TSIEESNHIRHKVQMLTP  ESHCGTS+PVQSRSQRR PKKHVPVSGFLSE+ESSATECK V+SSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG
Subjt:  GTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG

Query:  HWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWGR
         WTHIKKHLFA+SP+RTPIDLR                   IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPK VKATT P+ L +SNSLSFNWGR
Subjt:  HWTHIKKHLFASSPHRTPIDLR-------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWGR

Query:  KKYE
        KKY+
Subjt:  KKYE

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X1 | 5.1e-194 | 63.64 | see below
Query:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL
        +VKV+ SFLPAPLNE NEVEHLLVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF                              
Subjt:  YVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKL

Query:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE
                                                                                             +DLDQLL D NEVEE
Subjt:  AAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITCRSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEE

Query:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILD
        FHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+ GLTHE C GLRTKGRC TPL+GSIC TILD
Subjt:  FHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVEGGLTHESCTGLRTKGRCVTPLEGSICDTILD

Query:  NRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMK
        N NIHKF+TNE                           +ENG LSDENVKG+I ASKLA CSR+RRLRKPTRRYIEEFADSKSES+KG+RKPPTKDKY+K
Subjt:  NRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMK

Query:  GTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG
         TSIEESNHIRHKVQMLTP  ESHCGTS+PVQSRSQRR PKKHVPVSGFLSE+ESSATECK V+SSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG
Subjt:  GTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTG

Query:  HWTHIKKHLFASSPHRTPIDLR--------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWG
         WTHIKKHLFA+SP+RTPIDLR                    IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPK VKATT P+ L +SNSLSFNWG
Subjt:  HWTHIKKHLFASSPHRTPIDLR--------------------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWG

Query:  RKKYE
        RKKY+
Subjt:  RKKYE

SwissProt top hits | e value | %identity | Alignment
Q9C7B1 Telomere repeat-binding protein 3 | 9.4e-04 | 32.04 | see below
Query:  LSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLRIERK----------QSHASRPLPKSLL
        L E E  A     ++   KR +   RR  ++ +++TEV  LV  + E GTG W  +K   F  + HRT +DL+ + K          Q     P+P+ LL
Subjt:  LSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLRIERK----------QSHASRPLPKSLL

Query:  QRV
         RV
Subjt:  QRV

Q9FFY9 Telomere repeat-binding protein 4 | 1.4e-04 | 33.33 | see below
Query:  ESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLRIERK----------QSHASRPLPKSLLQRV
        ES A     V+   KR +   RR  ++ +++TEV  LV  + E GTG W  +K   F ++ HRT +DL+ + K          Q     P+P+ LL RV
Subjt:  ESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLRIERK----------QSHASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 2 | 9.4e-04 | 32.05 | see below
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLRIERK----------QSHASRPLPKSLLQRV
        +R+ ++ +++TEV  LV  + + GTG W  +K   F  + HRT +DL+ + K          Q     P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLRIERK----------QSHASRPLPKSLLQRV

Arabidopsis top hits | e value | %identity | Alignment
AT1G17460.1 TRF-like 3 | 3.2e-07 | 30.14 | see below
Query:  RSESHCGTSVPVQSRSQRRHPKK-HVPVSGFLSEDE---SSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPH
        +S S C     VQ  S + H K     V   + E E   SS     +    A   +    RK  + WT++EV +LV+G+++YG G WT IKK  F+   H
Subjt:  RSESHCGTSVPVQSRSQRRHPKK-HVPVSGFLSEDE---SSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPH

Query:  RTPIDLRIERK---------------QSHASRPLPKSLLQRVYELA
        RT +DL+ + +               + H S  +P  ++ +V ELA
Subjt:  RTPIDLRIERK---------------QSHASRPLPKSLLQRVYELA

AT1G72650.1 TRF-like 6 | 3.5e-14 | 25.93 | see below
Query:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGT--SVPVQSRSQRRHPKKHVPV-----SGFLSEDESSA
        +R+RKPTRRYIEE +++  +    +   P+KD+ +   S   S  +    ++   R  S  G+   VP  S  +R  P++++       S +L ED++SA
Subjt:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGT--SVPVQSRSQRRHPKKHVPV-----SGFLSEDESSA

Query:  TECK----------------NVHSSAKRCKKHD-----------------------------------------------RRKHQKMWTLTEVMRLVDGI
         E                  +V  SA R  +++                                               RRKH + WTL+E+ +LV+G+
Subjt:  TECK----------------NVHSSAKRCKKHD-----------------------------------------------RRKHQKMWTLTEVMRLVDGI

Query:  AEYGTGHWTHIKKHLFASSPHRTPIDLR------------------IERKQSHASRPLPKSLLQRVYELA
        ++YG G W+ IKKHLF+S  +RT +DL+                  +   + H S  +P  +L RV ELA
Subjt:  AEYGTGHWTHIKKHLFASSPHRTPIDLR------------------IERKQSHASRPLPKSLLQRVYELA

AT1G72650.2 TRF-like 6 | 3.5e-14 | 25.93 | see below
Query:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGT--SVPVQSRSQRRHPKKHVPV-----SGFLSEDESSA
        +R+RKPTRRYIEE +++  +    +   P+KD+ +   S   S  +    ++   R  S  G+   VP  S  +R  P++++       S +L ED++SA
Subjt:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGT--SVPVQSRSQRRHPKKHVPV-----SGFLSEDESSA

Query:  TECK----------------NVHSSAKRCKKHD-----------------------------------------------RRKHQKMWTLTEVMRLVDGI
         E                  +V  SA R  +++                                               RRKH + WTL+E+ +LV+G+
Subjt:  TECK----------------NVHSSAKRCKKHD-----------------------------------------------RRKHQKMWTLTEVMRLVDGI

Query:  AEYGTGHWTHIKKHLFASSPHRTPIDLR------------------IERKQSHASRPLPKSLLQRVYELA
        ++YG G W+ IKKHLF+S  +RT +DL+                  +   + H S  +P  +L RV ELA
Subjt:  AEYGTGHWTHIKKHLFASSPHRTPIDLR------------------IERKQSHASRPLPKSLLQRVYELA

AT2G37025.1 TRF-like 8 | 1.3e-19 | 38.19 | see below
Query:  RRHPKK---HVPVSGFLSEDESSATECKNVHSSAKRCK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLR----------
        R+ P K   H  +    S+D+ + +E ++  S  K  + K DRRK+Q++WTL EVM LVDGI+ +G G WT IK H F  + HR P+D+R          
Subjt:  RRHPKK---HVPVSGFLSEDESSATECKNVHSSAKRCK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLR----------

Query:  ---------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSP
                  E K+   +R +PK +L RV ELA+++PYP  +SP
Subjt:  ---------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSP

AT2G37025.2 TRF-like 8 | 1.3e-19 | 38.19 | see below
Query:  RRHPKK---HVPVSGFLSEDESSATECKNVHSSAKRCK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLR----------
        R+ P K   H  +    S+D+ + +E ++  S  K  + K DRRK+Q++WTL EVM LVDGI+ +G G WT IK H F  + HR P+D+R          
Subjt:  RRHPKK---HVPVSGFLSEDESSATECKNVHSSAKRCK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGHWTHIKKHLFASSPHRTPIDLR----------

Query:  ---------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSP
                  E K+   +R +PK +L RV ELA+++PYP  +SP
Subjt:  ---------IERKQSHASRPLPKSLLQRVYELANIYPYPKERSP


Sequences
CDS sequence
ATGCTCGTTCACAACAATAATTATGTTGATATGCCTTTTCTTGTTGTGTTAGAGCTACTAATCCGTATCAGGACATGTAGGATTGAGCTTAAAAGTAAACCAAATTATGG
ATCAAGAAGTGCATTTCTGCCCGAAGTTCACAAATATGAAATCTCATTGGACTATATCTACGTAAAAGTGGAGGAATCTTTTCTTCCTGCACCATTAAATGAATCAAATG
AAGTTGAGCATTTACTTGTGGAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTAAGAGTTCAAGATTTCTCTTGTGACTTTGGCTATGGAATACAAACAAACGGTGGT
GGATTGGATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAACTGAAGGCATGGGAATAAATGAAGTTTCATTTGAAGAACCTCTGAGATTGCATTATAA
AGCACCTTTTGCAACGGAGAAGAAGCTGGCAGCTGCTTGGATAGAGATTTCACAGAAAGCATGCCAGAAGGGTCTAGGATTGATGAAAAATATCGATAACGTCGAGATGC
TAGGGACAGCGTCTCGACGATGTTCTGTTGCTGCACAAAACAAAGAAGGAAATTGTGGTACAACGTTGAGATGCTGTGCCCTTGGGTTTTTGGCTGGGGCAATCACCTGC
AGAAGCGTTGAGACGCCAAAAGGGAGCGTCGCGACGCTATGCTCTCGGGTCTCTAAATTGCTGCAGGATCTTGATCAACTGCTGGATGATGCCAATGAAGTAGAGGAATT
CCATGCAACAAACAATCTGCCAAATACATATGCAGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTGCAATTGGGAAACTTAAGTTCAGAGAGCAAGTCTCAGG
GATCAAGCAGGAGTGATACTGATGCTTTTGGAATATCAGAATTGTCAGCAACAATGGTAATGGAGGCTGAGTTCAATAATACACCTGTTGAGGGGGGTTTAACTCATGAG
TCGTGCACTGGTCTGAGGACCAAAGGTAGGTGTGTAACACCACTTGAAGGCAGCATCTGCGATACGATACTTGATAATAGAAATATCCATAAGTTTAATACTAATGAAAG
CTATATAGAAAATGGCGATTTATCTGATGAAAATGTGAAGGGTGATATTGTGGCAAGCAAACTTGAGTTTAATACCTATATAGAAAATGGCGATTTATCTGATGAAAATG
TGAAGGGTGATATTGTGGCAAGCAAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGTCTGAAAGT
AACAAGGGAAGGAGAAAACCTCCTACAAAAGATAAATATATGAAAGGGACATCTATTGAAGAATCTAATCACATTAGACATAAGGTACAAATGTTGACGCCTAGAAGTGA
ATCACATTGTGGTACTTCTGTTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGAAGCACGTACCAGTTTCAGGATTTCTATCGGAAGATGAATCTTCTGCAACTG
AATGTAAAAACGTTCATTCATCTGCTAAAAGATGTAAAAAGCATGATAGGAGGAAGCACCAGAAGATGTGGACCCTTACTGAAGTAATGCGATTAGTTGATGGAATTGCT
GAATATGGAACTGGCCATTGGACTCATATAAAGAAGCACCTATTTGCATCTTCTCCTCATCGCACACCTATAGATCTCAGGATCGAACGGAAGCAATCACATGCCTCGCG
TCCACTGCCAAAGTCCCTGCTCCAACGTGTCTATGAACTGGCCAATATTTATCCATATCCAAAGGAGCGCAGTCCAAAACCAGTTAAAGCAACTACACCTCCCTTGCTTC
TTACTGACAGTAACTCTTTGTCATTCAATTGGGGGCGGAAGAAGTATGAATGA
mRNA sequence
ATGCTCGTTCACAACAATAATTATGTTGATATGCCTTTTCTTGTTGTGTTAGAGCTACTAATCCGTATCAGGACATGTAGGATTGAGCTTAAAAGTAAACCAAATTATGG
ATCAAGAAGTGCATTTCTGCCCGAAGTTCACAAATATGAAATCTCATTGGACTATATCTACGTAAAAGTGGAGGAATCTTTTCTTCCTGCACCATTAAATGAATCAAATG
AAGTTGAGCATTTACTTGTGGAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTAAGAGTTCAAGATTTCTCTTGTGACTTTGGCTATGGAATACAAACAAACGGTGGT
GGATTGGATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAACTGAAGGCATGGGAATAAATGAAGTTTCATTTGAAGAACCTCTGAGATTGCATTATAA
AGCACCTTTTGCAACGGAGAAGAAGCTGGCAGCTGCTTGGATAGAGATTTCACAGAAAGCATGCCAGAAGGGTCTAGGATTGATGAAAAATATCGATAACGTCGAGATGC
TAGGGACAGCGTCTCGACGATGTTCTGTTGCTGCACAAAACAAAGAAGGAAATTGTGGTACAACGTTGAGATGCTGTGCCCTTGGGTTTTTGGCTGGGGCAATCACCTGC
AGAAGCGTTGAGACGCCAAAAGGGAGCGTCGCGACGCTATGCTCTCGGGTCTCTAAATTGCTGCAGGATCTTGATCAACTGCTGGATGATGCCAATGAAGTAGAGGAATT
CCATGCAACAAACAATCTGCCAAATACATATGCAGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTGCAATTGGGAAACTTAAGTTCAGAGAGCAAGTCTCAGG
GATCAAGCAGGAGTGATACTGATGCTTTTGGAATATCAGAATTGTCAGCAACAATGGTAATGGAGGCTGAGTTCAATAATACACCTGTTGAGGGGGGTTTAACTCATGAG
TCGTGCACTGGTCTGAGGACCAAAGGTAGGTGTGTAACACCACTTGAAGGCAGCATCTGCGATACGATACTTGATAATAGAAATATCCATAAGTTTAATACTAATGAAAG
CTATATAGAAAATGGCGATTTATCTGATGAAAATGTGAAGGGTGATATTGTGGCAAGCAAACTTGAGTTTAATACCTATATAGAAAATGGCGATTTATCTGATGAAAATG
TGAAGGGTGATATTGTGGCAAGCAAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGTCTGAAAGT
AACAAGGGAAGGAGAAAACCTCCTACAAAAGATAAATATATGAAAGGGACATCTATTGAAGAATCTAATCACATTAGACATAAGGTACAAATGTTGACGCCTAGAAGTGA
ATCACATTGTGGTACTTCTGTTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGAAGCACGTACCAGTTTCAGGATTTCTATCGGAAGATGAATCTTCTGCAACTG
AATGTAAAAACGTTCATTCATCTGCTAAAAGATGTAAAAAGCATGATAGGAGGAAGCACCAGAAGATGTGGACCCTTACTGAAGTAATGCGATTAGTTGATGGAATTGCT
GAATATGGAACTGGCCATTGGACTCATATAAAGAAGCACCTATTTGCATCTTCTCCTCATCGCACACCTATAGATCTCAGGATCGAACGGAAGCAATCACATGCCTCGCG
TCCACTGCCAAAGTCCCTGCTCCAACGTGTCTATGAACTGGCCAATATTTATCCATATCCAAAGGAGCGCAGTCCAAAACCAGTTAAAGCAACTACACCTCCCTTGCTTC
TTACTGACAGTAACTCTTTGTCATTCAATTGGGGGCGGAAGAAGTATGAATGA
Protein sequence
MLVHNNNYVDMPFLVVLELLIRIRTCRIELKSKPNYGSRSAFLPEVHKYEISLDYIYVKVEESFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGG
GLDSNSKQGGEHELKFGTEGMGINEVSFEEPLRLHYKAPFATEKKLAAAWIEISQKACQKGLGLMKNIDNVEMLGTASRRCSVAAQNKEGNCGTTLRCCALGFLAGAITC
RSVETPKGSVATLCSRVSKLLQDLDQLLDDANEVEEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVEGGLTHE
SCTGLRTKGRCVTPLEGSICDTILDNRNIHKFNTNESYIENGDLSDENVKGDIVASKLEFNTYIENGDLSDENVKGDIVASKLASCSRERRLRKPTRRYIEEFADSKSES
NKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDESSATECKNVHSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIA
EYGTGHWTHIKKHLFASSPHRTPIDLRIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKPVKATTPPLLLTDSNSLSFNWGRKKYE
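
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes but is not stated on this page. The translate function is a hypothetical helper, and the two inputs are the first 21 and last 12 nucleotides of the CDS above.

```python
BASES = "TCAG"
# Standard genetic code with codons ordered T, C, A, G at each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are emitted as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGCTCGTTCACAACAATAAT"))  # start of the CDS
print(translate("AAGTATGAATGA"))           # end of the CDS, including the stop codon
```

The two fragments translate to MLVHNNN and KYE*, matching the start and end of the protein sequence listed above. Passing the full CDS, line breaks included, should reproduce the whole protein followed by a trailing stop symbol.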