; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018058 (gene) of Snake gourd v1 genome

Gene IDTan0018058
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptioncell division cycle protein 123 homolog
Genome locationLG05:78649766..78657967
RNA-Seq ExpressionTan0018058
SyntenyTan0018058
Gene Ontology termsGO:0010197 - polar nucleus fusion (biological process)
GO:0051301 - cell division (biological process)
GO:0051726 - regulation of cell cycle (biological process)
GO:0055085 - transmembrane transport (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032153 - cell division site (cellular component)
GO:0043073 - germ cell nucleus (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsIPR000620 - EamA domain
IPR009772 - Cell division cycle protein 123
IPR030184 - WAT1-related protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583861.1 Cell division cycle protein 123-like protein, partial [Cucurbita argyrosperma subsp. sororia]9.2e-17792.19Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI
        MKQEEVN CQIQEWYPKFKSFSIKTLIHELPESFV YLLDDSGPFLLPLSISNEDALPNR+VNP E++DYQLKEGSDDESEQST+PPSFPELESDVKQSI
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI

Query:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR
         SLGGSVFPKLNWSAPKDSAWIS TGTLKC SFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNR+LVGISQR
Subjt:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR

Query:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD
        EVTTFYPALVEK E+L EVI EFFIDHVK SFESENYT DVYVTKNEAVKI+DFNPWGAFTL LLF+WEELEE+NE+++ RIVESRRAVRPGLKTAVPFD
Subjt:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD

Query:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP
        YLDTSPGSGWDQFL+NADQELQQQTRDDDGLNP
Subjt:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP

KAG7019483.1 Cell division cycle protein-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]6.0e-17691.59Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI
        MKQEEVN CQIQEWYPKFKSFSIKTLIHELPESFV YLLDDSGPFLLPLSISNEDALPNR+VNP +++DYQLKEGSDDESEQST+PPSFPELESDVKQSI
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI

Query:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR
         SLGGSVFPKLNWSAPKDSAWIS TGTLKC SFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNR+LVGISQR
Subjt:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR

Query:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD
        EVTTFYPALVEK E+L EVI EFFIDHVK SFESENYT DVYVTKNEAVKI+DFNPWGAFTL LLF+WEELEE+NE+++ RIVESRRAVRPGLKTAVPFD
Subjt:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD

Query:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP
        YLDTSPGSGWDQFL+NADQELQQQ+RDDDGLNP
Subjt:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP

XP_022927184.1 cell division cycle protein 123 homolog [Cucurbita moschata]1.9e-17490.99Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI
        MKQEEVN CQIQEWYPKFKSFSIKTLIHELPESFV YLLDDSGPFLLPLSISNEDALPNR+VNP +++DY LKEGSDDESEQST+PPSFPELESDVKQSI
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI

Query:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR
         SLGGSVFPKLNWSAPKDSAWIS TGTLKC S SEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNR+LVGISQR
Subjt:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR

Query:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD
        EVTTFYPALVEK E+L EVI EFFIDHVK  FESENYT DVYVTKNEAVKI+DFNPWGAFTL LLF+WEELEE+NE+++ RIVESRRAVRPGLKTAVPFD
Subjt:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD

Query:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP
        YLDTSPGSGWDQFL+NADQELQQQTRDDDGLNP
Subjt:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP

XP_023001444.1 cell division cycle protein 123 homolog [Cucurbita maxima]1.0e-17591.59Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI
        MKQEEVN CQIQEWYPKFKSFSIKTLIHELPESFV YLLDDSGPFLLPLSISNEDALPNR+VNP +++DYQLKEGSDDESEQST+PPSFPELESDVKQSI
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI

Query:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR
         SLGGSVFPKLNWSAPKDSAWIS TGTLKC SFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNR+LVGISQR
Subjt:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR

Query:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD
        EVTTFYPALVEK E+L EVI EFFIDH+K SFESENYT DVYVTKNEAVKI+DFNPWGAFTL LLF WEELEE+NE+++ RIVESRRAVRPGLKTAVPFD
Subjt:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD

Query:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP
        YLDTSPGSGWDQFL+NADQELQQQTRDDDGLNP
Subjt:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP

XP_023520072.1 cell division cycle protein 123 homolog [Cucurbita pepo subsp. pepo]1.1e-17491.32Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI
        MKQEEVN CQIQEWYPKFKSFSIKTLIHELPESFV YLLDDSGPFLLPLSISNEDALPNR+VNP +++DYQLK+GSDDESEQST+PPSFPELESDVKQSI
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI

Query:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR
         SLGGSVFPKLNWSAPKDSAWIS TGTLKC SFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNR+LVGISQR
Subjt:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR

Query:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD
        EVTTFYPALVEK E+L EVI EFFIDHVK SFESENYT DVYVTKNEAVKI+DFNPWGAFTL LLF+WEELEE+NE+++ RIVESRRAVRPGLKTAVPFD
Subjt:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD

Query:  YLDTSPGSGWDQFLRNADQELQQQTR-DDDGLNP
        YLDTSPGSGWDQFL+NADQELQQQTR DDDGLNP
Subjt:  YLDTSPGSGWDQFLRNADQELQQQTR-DDDGLNP

TrEMBL top hitse value%identityAlignment
A0A0A0LZ38 Uncharacterized protein1.5e-16185.5Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTA-PPSFPELESDVKQS
        MK+EEVN CQIQEWYPKFKSFSIKTLIH LPESFVHYLLDDS PF+LPLSISN+DALPNR+ NP ++ D+QLK+ SDD+S+Q T+ PPSFP+LESDVK S
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTA-PPSFPELESDVKQS

Query:  ICSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQ
        I SLGGSVFPKLNWSAPKDSAWISP GTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPS+FFLALRKWYPSLRPEMEFRCFV+NRNL+GISQ
Subjt:  ICSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQ

Query:  REVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEE-KNEDIELRIVESRRAVRPGLKTAVP
        REVTTFYPALVEKKE L+EVI EFFIDHVK +FE ENYTLDVYVT+NE+VKI+DFNPWGAFTL LLFDWEELEE + E+I+LRIVE RRAVRPGLKTAVP
Subjt:  REVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEE-KNEDIELRIVESRRAVRPGLKTAVP

Query:  FDYLDTSPGSGWDQFLRNADQELQQQTRDDD
        FDYLD S GSGWDQFL+NADQE QQQTRDD+
Subjt:  FDYLDTSPGSGWDQFLRNADQELQQQTRDDD

A0A5D3CIM0 Cell division cycle protein 123-like protein2.6e-16185.15Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI
        MK EEVN CQIQEWYPKFK FSIKTLIH LPESFVHYLLDDS PFLLPLSISN+DALPNR+ NP ++ D+QLK+ SDD+S+Q T+PPSFP+LESDVK SI
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI

Query:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR
         SLGGSVFPKLNWSAPKDSAWISP GTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPS+FFLALRKWYPSLRPEMEFRCFV+NRNL+GISQR
Subjt:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR

Query:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEEL-EEKNEDIELRIVESRRAVRPGLKTAVPF
        EVTTFYPALVEKKE L+EVI EFFIDHVK +FE ENYT DVYVT+NE+VKI+DFNPWGAFTL LLFDWEEL EE+ E+I+LRIVESRRAVRPGLKTAVPF
Subjt:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEEL-EEKNEDIELRIVESRRAVRPGLKTAVPF

Query:  DYLDTSPGSGWDQFLRNADQELQQQTRDDD
        DYLD S GSGWDQFL+NADQE Q Q RDD+
Subjt:  DYLDTSPGSGWDQFLRNADQELQQQTRDDD

A0A6J1CG73 cell division cycle protein 123 homolog3.8e-16888.41Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI
        MK+EEVN CQIQEWYP FKS SIKTLIHELPESFVHYLLDDSGPFLLP+SISNEDALPNR+V+P E++DYQLKEGSDDESEQST+PPSFPELESDVK SI
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI

Query:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR
         +LGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALL RSSDSLVHDLC AYDSC DK+SSRPSRFFLALRKWYPSLRPEMEFRCFVRNR+LVGISQR
Subjt:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR

Query:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD
        EVTTFYPALVEKKE+LQ +I EFF+DHV+ SFESENYT DVY T+NE VKILDFNPWGAFTL LLFDWEELEE+ +++E RIVESRRAVRPGLKTAVPFD
Subjt:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD

Query:  YLDTSPGSGWDQFLRNADQELQQQTRDD
        YLDTSPGSGWD FLRNADQELQQQTRDD
Subjt:  YLDTSPGSGWDQFLRNADQELQQQTRDD

A0A6J1EHA8 cell division cycle protein 123 homolog9.3e-17590.99Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI
        MKQEEVN CQIQEWYPKFKSFSIKTLIHELPESFV YLLDDSGPFLLPLSISNEDALPNR+VNP +++DY LKEGSDDESEQST+PPSFPELESDVKQSI
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI

Query:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR
         SLGGSVFPKLNWSAPKDSAWIS TGTLKC S SEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNR+LVGISQR
Subjt:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR

Query:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD
        EVTTFYPALVEK E+L EVI EFFIDHVK  FESENYT DVYVTKNEAVKI+DFNPWGAFTL LLF+WEELEE+NE+++ RIVESRRAVRPGLKTAVPFD
Subjt:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD

Query:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP
        YLDTSPGSGWDQFL+NADQELQQQTRDDDGLNP
Subjt:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP

A0A6J1KMR4 cell division cycle protein 123 homolog4.9e-17691.59Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI
        MKQEEVN CQIQEWYPKFKSFSIKTLIHELPESFV YLLDDSGPFLLPLSISNEDALPNR+VNP +++DYQLKEGSDDESEQST+PPSFPELESDVKQSI
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSI

Query:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR
         SLGGSVFPKLNWSAPKDSAWIS TGTLKC SFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNR+LVGISQR
Subjt:  CSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQR

Query:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD
        EVTTFYPALVEK E+L EVI EFFIDH+K SFESENYT DVYVTKNEAVKI+DFNPWGAFTL LLF WEELEE+NE+++ RIVESRRAVRPGLKTAVPFD
Subjt:  EVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFD

Query:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP
        YLDTSPGSGWDQFL+NADQELQQQTRDDDGLNP
Subjt:  YLDTSPGSGWDQFLRNADQELQQQTRDDDGLNP

SwissProt top hitse value%identityAlignment
Q5XEZ0 WAT1-related protein At1g010706.0e-6245.24Show/hide
Query:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVD-MKSKGGRAKVLGTLV
        YR  ISAL L P AY  ERKTRPQ+T  ++   F+S LLG +L  + FL+GL YTSAT SCA ++++P  TF LA++FR E V  +K+K G  KV+GTL+
Subjt:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVD-MKSKGGRAKVLGTLV

Query:  CISGNLILILYKGMPLT-----SPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIID
        CISG L L  YKG  ++     S G A+   N        +    WL+G L LT G  + S W L Q  +   YPC+YSSTC+MS F+A Q A+L L   
Subjt:  CISGNLILILYKGMPLT-----SPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIID

Query:  RKNSVFVVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAK
        R  + +++  +F I  +IYAG VG  +  V  +W +K+ G VF +AF P   I A +FDF ILH  ++LGSV+GS++ I+G+Y+ LWGK++E +
Subjt:  RKNSVFVVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAK

Q9LI65 WAT1-related protein At3g303406.8e-8254.55Show/hide
Query:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC
        YR  +  LFL P A F ER  RP+LT  IL  LF S+LLG +L  Y FLIGL YTS+TFS AF N+VP  TF LA++FR E +++KS  GRAK+LGT++C
Subjt:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC

Query:  ISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSVF
        I G L+L LYKG  L+   S   +T+       A T  +W +GS++L     +WSSWF++Q+++ +VYPCQY+ST I+SFF  IQSA+L LI +R  S++
Subjt:  ISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSVF

Query:  VVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDRE
        VVK KF +L+L+Y+G VGSGLCYVGMSWC++Q+G VFT++F P +++FAAIF F  LHEQI+ GSV+GS++II G+YILLWGK ++
Subjt:  VVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDRE

Q9M129 WAT1-related protein At4g014503.1e-6643.89Show/hide
Query:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC
        YR  IS LFL P+AYFWERKTRP+LT  I   LF+S+L G +L  Y +L+GL YTSAT   AF  ++P  TF++A++F  EK+ +K+K G   VLGTL+ 
Subjt:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC

Query:  ISGNLILILYKGMPLT-SPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSV
        + G L+L +Y+G+PLT SP  A    N             W+ G   L  G  ++SSW L+Q+++   YPC YSST I+S F  +Q A+L LI  R    
Subjt:  ISGNLILILYKGMPLT-SPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSV

Query:  FVVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAKDKEYALKQTVS
        ++++ +  I++++ AG V  G+C VGMSWC+KQ+GPV +++F+P + + A +FDF ILH +I+LGSV+GSV+++ G+YI LW + ++  + +     T +
Subjt:  FVVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAKDKEYALKQTVS

Query:  VEE
        VEE
Subjt:  VEE

Q9M130 WAT1-related protein At4g014405.1e-7751.03Show/hide
Query:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC
        YR  IS LFL PIA+FWERKTRP LT  IL  LF S+L+G +LT Y FL+GL YTSAT +CAF+++ P  TF++A++FR+EK++MKSK G   V+G L+C
Subjt:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC

Query:  ISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSVF
        I G L+L +YKG+PLT       +T+ +     A  P  W+IG ++L AG   + SW L+Q++V + YPCQYSST ++SFF  IQ A+L LI  R  + +
Subjt:  ISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSVF

Query:  VVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAKDK
        ++  K  I++++YAG+V  G+C VG SWC++++GP+FT+ FTP   IFA +FDF ILH QI LGSVVGS ++I G+YI L GK R  K++
Subjt:  VVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAKDK

Q9M131 WAT1-related protein At4g014307.3e-6044.44Show/hide
Query:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKV-DMKSKGGRAKVLGTLV
        YR  ISAL L P +Y WERKTRPQLT  +L   FIS LLG +L  + FL+GL YTSAT S A ++++P  TF LA++FR+E   ++KSK G  KV+GTL+
Subjt:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKV-DMKSKGGRAKVLGTLV

Query:  CISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYP-CQYSSTCIMSFFSAIQSAVLHLIIDRKNS
        CI G ++L  YKG  L++P S     +        +   +WL+G L L  G  + S W L Q ++   YP  +YSSTC+MS F++ Q A+L L   R   
Subjt:  CISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYP-CQYSSTCIMSFFSAIQSAVLHLIIDRKNS

Query:  VFVVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDRE
         ++++ KF IL  +YAG VG  +  V  SW +K  G VF + F+P   + A +FDF ILH  ++LGS++GSV+ I+G+Y+ LWG+  E
Subjt:  VFVVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDRE

Arabidopsis top hitse value%identityAlignment
AT3G30340.1 nodulin MtN21 /EamA-like transporter family protein4.9e-8354.55Show/hide
Query:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC
        YR  +  LFL P A F ER  RP+LT  IL  LF S+LLG +L  Y FLIGL YTS+TFS AF N+VP  TF LA++FR E +++KS  GRAK+LGT++C
Subjt:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC

Query:  ISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSVF
        I G L+L LYKG  L+   S   +T+       A T  +W +GS++L     +WSSWF++Q+++ +VYPCQY+ST I+SFF  IQSA+L LI +R  S++
Subjt:  ISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSVF

Query:  VVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDRE
        VVK KF +L+L+Y+G VGSGLCYVGMSWC++Q+G VFT++F P +++FAAIF F  LHEQI+ GSV+GS++II G+YILLWGK ++
Subjt:  VVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDRE

AT4G01440.1 nodulin MtN21 /EamA-like transporter family protein3.6e-7851.03Show/hide
Query:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC
        YR  IS LFL PIA+FWERKTRP LT  IL  LF S+L+G +LT Y FL+GL YTSAT +CAF+++ P  TF++A++FR+EK++MKSK G   V+G L+C
Subjt:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC

Query:  ISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSVF
        I G L+L +YKG+PLT       +T+ +     A  P  W+IG ++L AG   + SW L+Q++V + YPCQYSST ++SFF  IQ A+L LI  R  + +
Subjt:  ISGNLILILYKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSVF

Query:  VVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAKDK
        ++  K  I++++YAG+V  G+C VG SWC++++GP+FT+ FTP   IFA +FDF ILH QI LGSVVGS ++I G+YI L GK R  K++
Subjt:  VVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAKDK

AT4G01450.2 nodulin MtN21 /EamA-like transporter family protein2.2e-6743.89Show/hide
Query:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC
        YR  IS LFL P+AYFWERKTRP+LT  I   LF+S+L G +L  Y +L+GL YTSAT   AF  ++P  TF++A++F  EK+ +K+K G   VLGTL+ 
Subjt:  YRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVC

Query:  ISGNLILILYKGMPLT-SPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSV
        + G L+L +Y+G+PLT SP  A    N             W+ G   L  G  ++SSW L+Q+++   YPC YSST I+S F  +Q A+L LI  R    
Subjt:  ISGNLILILYKGMPLT-SPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSV

Query:  FVVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAKDKEYALKQTVS
        ++++ +  I++++ AG V  G+C VGMSWC+KQ+GPV +++F+P + + A +FDF ILH +I+LGSV+GSV+++ G+YI LW + ++  + +     T +
Subjt:  FVVKGKFAILSLIYAGSVGSGLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAKDKEYALKQTVS

Query:  VEE
        VEE
Subjt:  VEE

AT4G05440.1 temperature sensing protein-related2.5e-11961.86Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQL-KEGSDDESEQSTAPPSFPELESDVKQS
        MK++EVN CQIQ WYP+FKS +IKT  H+LPESF++YL+DDSGPFLLP S++NEDA+PNR+ N  E+DD+Q+ +E SDDE       PSFPELE ++++S
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQL-KEGSDDESEQSTAPPSFPELESDVKQS

Query:  ICSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQ
        I +LGG++ PKLNWS+PKD+AWISP+  L C+ F+EIALL RSSDSL HDL +AYDSC+DK SSRP  F+LALRKWYPSL+PEMEFRCFV++  LVGI Q
Subjt:  ICSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQ

Query:  REVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEE---KNEDIELRIVESRRAVRPGLKTA
        REVTTFYP L+ +K+ L+ +I EFF D ++  FESENYT DVYVTK   VK++DFN W   TL L++ WEELE+   + +++ELRIVESR +V PGLKTA
Subjt:  REVTTFYPALVEKKENLQEVIGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEE---KNEDIELRIVESRRAVRPGLKTA

Query:  VPFDYLDTSPGSGWDQFLRNADQELQQQTRDDD
        VP+DYLD S GSGW Q L+  ++E Q+  +  D
Subjt:  VPFDYLDTSPGSGWDQFLRNADQELQQQTRDDD

AT4G05440.2 temperature sensing protein-related4.4e-7666.34Show/hide
Query:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQL-KEGSDDESEQSTAPPSFPELESDVKQS
        MK++EVN CQIQ WYP+FKS +IKT  H+LPESF++YL+DDSGPFLLP S++NEDA+PNR+ N  E+DD+Q+ +E SDDE       PSFPELE ++++S
Subjt:  MKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQL-KEGSDDESEQSTAPPSFPELESDVKQS

Query:  ICSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQ
        I +LGG++ PKLNWS+PKD+AWISP+  L C+ F+EIALL RSSDSL HDL +AYDSC+DK SSRP  F+LALRKWYPSL+PEMEFRCFV++  LVGI Q
Subjt:  ICSLGGSVFPKLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQ

Query:  RE
        RE
Subjt:  RE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCGGCAAACAATTTCCGCTCTTTTCTTGACGCCGATAGCCTATTTTTGGGAAAGAAAAACCAGACCTCAGCTTACAGCTTACATCTTATTCCTCCTTTTCATCAG
TTCCTTACTTGGGTTAACACTAACATTATACCTATTTCTAATTGGTCTGCACTACACATCTGCAACTTTCTCTTGTGCATTTCTCAACCTGGTGCCTGTAAATACGTTTA
TTCTAGCCGTATTATTCCGTATGGAGAAAGTTGACATGAAAAGCAAGGGTGGAAGAGCCAAAGTTCTTGGGACTCTGGTGTGCATTTCAGGAAACTTAATTTTGATTCTG
TACAAAGGAATGCCGCTAACCAGCCCCGGATCAGCAACCGGAAAAACAAATGGAGTTCGTGCAATGGTTCAAGCTGAAACACCAGGAAGATGGTTAATAGGTTCATTGGT
TTTGACTGCAGGCTGCTTTATGTGGTCCTCTTGGTTCCTTATGCAATCAAGGGTTGGCAAGGTTTATCCATGTCAATATTCTAGTACCTGCATCATGTCCTTCTTCAGCG
CCATCCAATCGGCTGTTTTACACTTGATTATAGACAGGAAGAACTCAGTGTTCGTTGTGAAAGGGAAGTTTGCTATCTTAAGTCTCATATATGCTGGATCAGTGGGTTCA
GGGTTGTGCTATGTGGGGATGTCTTGGTGCGTAAAACAGAAAGGGCCAGTTTTCACAGCTGCTTTTACACCTTTCATGGAGATTTTTGCAGCCATATTTGATTTCTTCAT
CTTGCATGAACAGATTCACCTAGGAAGTGTTGTGGGATCAGTTTTGATCATAAGTGGAATGTACATTCTTCTATGGGGTAAAGATAGAGAAGCAAAGGACAAGGAGTATG
CATTGAAGCAGACTGTATCAGTTGAAGAAGCCCCTCATTTGATAACCATTTTTTTTTTAAATTATGCTTGTTTTTCACAATTTCTCTTCATCTGCAGCTGCAACTGCAAC
CACATGAAGCAAGAAGAAGTCAATCACTGCCAAATTCAAGAATGGTATCCCAAATTCAAATCCTTCTCTATCAAAACCCTAATTCACGAGCTTCCAGAATCTTTCGTTCA
TTACCTTCTCGACGACTCCGGACCTTTCCTCCTTCCACTTTCCATCTCAAACGAGGACGCTCTTCCCAATAGAATCGTCAATCCCCATGAACAAGACGATTACCAGTTGA
AAGAAGGATCCGATGATGAATCGGAGCAATCCACTGCACCTCCTTCGTTTCCGGAGCTTGAATCTGATGTTAAACAGTCGATTTGCTCTCTCGGAGGTTCCGTCTTCCCG
AAGTTGAACTGGAGCGCACCGAAAGACTCTGCCTGGATTAGCCCTACCGGGACTTTAAAGTGCTCTTCGTTCAGTGAGATTGCGCTCTTGCTTCGATCTTCCGATTCGCT
TGTTCACGATCTCTGTCACGCGTACGATTCCTGCACCGATAAATCGTCGTCGAGGCCATCGAGATTCTTCCTCGCGCTTCGTAAGTGGTACCCGTCCCTTCGGCCAGAGA
TGGAGTTTCGTTGCTTCGTAAGGAATCGGAACCTGGTTGGCATTTCCCAGCGCGAGGTCACAACGTTCTATCCTGCGTTAGTGGAGAAGAAGGAGAATCTGCAAGAGGTC
ATCGGAGAATTCTTCATCGACCATGTGAAGCCAAGCTTCGAATCGGAGAATTACACGTTGGATGTGTATGTGACGAAGAACGAAGCTGTTAAGATACTGGATTTCAATCC
ATGGGGAGCATTCACACTTGCATTGCTGTTCGATTGGGAGGAATTGGAGGAGAAGAACGAAGATATTGAATTGAGAATCGTGGAGAGTCGGAGGGCGGTGAGGCCTGGAT
TGAAGACTGCGGTTCCATTCGATTACTTGGATACGAGCCCTGGAAGTGGTTGGGACCAGTTTTTGAGAAATGCAGATCAAGAATTGCAGCAGCAAACCAGAGACGATGAC
GGATTGAATCCCTAA
mRNA sequenceShow/hide mRNA sequence
AAAGATGCTCAGCACTATAGATTAACAAGAAAGGCATCATGAGATTTTAACAACCTTTCCGATATCGGTCCTTATAAACTTGTGCTGCATTTCAAACGTCAGCGTCGTGA
CGATTCATGAGCTTATTCCAAACACATGACTGTTAAAGGTTCTAAAGAAAGGGAGCAGAGACAAATATGAAAGAACATTAAGATATTCTGATTCAATTTGCTTTTATAGC
AGTGGTGTTAATATACTACTCAAGGAAATTCTTAGTGAAGGGATCAGTCAGCTACTAATTGTTATGTATCGGCAAACAATTTCCGCTCTTTTCTTGACGCCGATAGCCTA
TTTTTGGGAAAGAAAAACCAGACCTCAGCTTACAGCTTACATCTTATTCCTCCTTTTCATCAGTTCCTTACTTGGGTTAACACTAACATTATACCTATTTCTAATTGGTC
TGCACTACACATCTGCAACTTTCTCTTGTGCATTTCTCAACCTGGTGCCTGTAAATACGTTTATTCTAGCCGTATTATTCCGTATGGAGAAAGTTGACATGAAAAGCAAG
GGTGGAAGAGCCAAAGTTCTTGGGACTCTGGTGTGCATTTCAGGAAACTTAATTTTGATTCTGTACAAAGGAATGCCGCTAACCAGCCCCGGATCAGCAACCGGAAAAAC
AAATGGAGTTCGTGCAATGGTTCAAGCTGAAACACCAGGAAGATGGTTAATAGGTTCATTGGTTTTGACTGCAGGCTGCTTTATGTGGTCCTCTTGGTTCCTTATGCAAT
CAAGGGTTGGCAAGGTTTATCCATGTCAATATTCTAGTACCTGCATCATGTCCTTCTTCAGCGCCATCCAATCGGCTGTTTTACACTTGATTATAGACAGGAAGAACTCA
GTGTTCGTTGTGAAAGGGAAGTTTGCTATCTTAAGTCTCATATATGCTGGATCAGTGGGTTCAGGGTTGTGCTATGTGGGGATGTCTTGGTGCGTAAAACAGAAAGGGCC
AGTTTTCACAGCTGCTTTTACACCTTTCATGGAGATTTTTGCAGCCATATTTGATTTCTTCATCTTGCATGAACAGATTCACCTAGGAAGTGTTGTGGGATCAGTTTTGA
TCATAAGTGGAATGTACATTCTTCTATGGGGTAAAGATAGAGAAGCAAAGGACAAGGAGTATGCATTGAAGCAGACTGTATCAGTTGAAGAAGCCCCTCATTTGATAACC
ATTTTTTTTTTAAATTATGCTTGTTTTTCACAATTTCTCTTCATCTGCAGCTGCAACTGCAACCACATGAAGCAAGAAGAAGTCAATCACTGCCAAATTCAAGAATGGTA
TCCCAAATTCAAATCCTTCTCTATCAAAACCCTAATTCACGAGCTTCCAGAATCTTTCGTTCATTACCTTCTCGACGACTCCGGACCTTTCCTCCTTCCACTTTCCATCT
CAAACGAGGACGCTCTTCCCAATAGAATCGTCAATCCCCATGAACAAGACGATTACCAGTTGAAAGAAGGATCCGATGATGAATCGGAGCAATCCACTGCACCTCCTTCG
TTTCCGGAGCTTGAATCTGATGTTAAACAGTCGATTTGCTCTCTCGGAGGTTCCGTCTTCCCGAAGTTGAACTGGAGCGCACCGAAAGACTCTGCCTGGATTAGCCCTAC
CGGGACTTTAAAGTGCTCTTCGTTCAGTGAGATTGCGCTCTTGCTTCGATCTTCCGATTCGCTTGTTCACGATCTCTGTCACGCGTACGATTCCTGCACCGATAAATCGT
CGTCGAGGCCATCGAGATTCTTCCTCGCGCTTCGTAAGTGGTACCCGTCCCTTCGGCCAGAGATGGAGTTTCGTTGCTTCGTAAGGAATCGGAACCTGGTTGGCATTTCC
CAGCGCGAGGTCACAACGTTCTATCCTGCGTTAGTGGAGAAGAAGGAGAATCTGCAAGAGGTCATCGGAGAATTCTTCATCGACCATGTGAAGCCAAGCTTCGAATCGGA
GAATTACACGTTGGATGTGTATGTGACGAAGAACGAAGCTGTTAAGATACTGGATTTCAATCCATGGGGAGCATTCACACTTGCATTGCTGTTCGATTGGGAGGAATTGG
AGGAGAAGAACGAAGATATTGAATTGAGAATCGTGGAGAGTCGGAGGGCGGTGAGGCCTGGATTGAAGACTGCGGTTCCATTCGATTACTTGGATACGAGCCCTGGAAGT
GGTTGGGACCAGTTTTTGAGAAATGCAGATCAAGAATTGCAGCAGCAAACCAGAGACGATGACGGATTGAATCCCTAA
Protein sequenceShow/hide protein sequence
MYRQTISALFLTPIAYFWERKTRPQLTAYILFLLFISSLLGLTLTLYLFLIGLHYTSATFSCAFLNLVPVNTFILAVLFRMEKVDMKSKGGRAKVLGTLVCISGNLILIL
YKGMPLTSPGSATGKTNGVRAMVQAETPGRWLIGSLVLTAGCFMWSSWFLMQSRVGKVYPCQYSSTCIMSFFSAIQSAVLHLIIDRKNSVFVVKGKFAILSLIYAGSVGS
GLCYVGMSWCVKQKGPVFTAAFTPFMEIFAAIFDFFILHEQIHLGSVVGSVLIISGMYILLWGKDREAKDKEYALKQTVSVEEAPHLITIFFLNYACFSQFLFICSCNCN
HMKQEEVNHCQIQEWYPKFKSFSIKTLIHELPESFVHYLLDDSGPFLLPLSISNEDALPNRIVNPHEQDDYQLKEGSDDESEQSTAPPSFPELESDVKQSICSLGGSVFP
KLNWSAPKDSAWISPTGTLKCSSFSEIALLLRSSDSLVHDLCHAYDSCTDKSSSRPSRFFLALRKWYPSLRPEMEFRCFVRNRNLVGISQREVTTFYPALVEKKENLQEV
IGEFFIDHVKPSFESENYTLDVYVTKNEAVKILDFNPWGAFTLALLFDWEELEEKNEDIELRIVESRRAVRPGLKTAVPFDYLDTSPGSGWDQFLRNADQELQQQTRDDD
GLNP