; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg25946 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg25946
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionxylulose kinase
Genome locationCarg_Chr20:3999633..4006124
RNA-Seq ExpressionCarg25946
SyntenyCarg25946
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0016301 - kinase activity (molecular function)
GO:0016773 - phosphotransferase activity, alcohol group as acceptor (molecular function)
InterPro domainsIPR018484 - Carbohydrate kinase, FGGY, N-terminal
IPR018485 - Carbohydrate kinase, FGGY, C-terminal
IPR043129 - ATPase, nucleotide binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571081.1 D-ribulose kinase, partial [Cucurbita argyrosperma subsp. sororia]3.9e-24390.36Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
        MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW

Query:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
        ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
Subjt:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL

Query:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT
        HQADWLLWFLHGKLGVSDYNNALKVGYDPE ESYPPWLLAQPYSMLLPHV                   F    +        +  FL        Q VT
Subjt:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT

Query:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
        SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
Subjt:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV

Query:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQ
        EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAAL+ALKGAQ
Subjt:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQ

KAG7010896.1 Xylulose kinase [Cucurbita argyrosperma subsp. argyrosperma]3.1e-264100Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
        MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW

Query:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
        ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
Subjt:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL

Query:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHVFQMIALHAQEPQIVLLHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHR
        HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHVFQMIALHAQEPQIVLLHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHR
Subjt:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHVFQMIALHAQEPQIVLLHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHR

Query:  LDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGAT
        LDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGAT
Subjt:  LDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGAT

Query:  QVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
        QVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
Subjt:  QVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ

XP_022944280.1 uncharacterized protein LOC111448774 isoform X1 [Cucurbita moschata]5.6e-24289.19Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
        MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVL QAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW

Query:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
        ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNE CPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
Subjt:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL

Query:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT
        HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV                   F    +        +  FL        Q VT
Subjt:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT

Query:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
        SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTD++LE+LSKQINPM+SSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
Subjt:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV

Query:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
        EYLHGILESIARIEGKAYR LKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLAL+GAQLFMQ
Subjt:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ

XP_022944283.1 uncharacterized protein LOC111448774 isoform X4 [Cucurbita moschata]5.8e-23986.49Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNH---------------GIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEG
        MLPSNLHSPAANLFLLPSTSCSPCNH               GIWISRNQNRRNRRRTTMSVATEVVL QAGNRLYLGMDFGTSGARFALIDKEGAVCAEG
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNH---------------GIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEG

Query:  KREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVS
        KREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNE CPDALPLVKSIAPVNHTVCSASSTLCKLVS
Subjt:  KREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVS

Query:  WWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLL
        WWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV                   F    +        + 
Subjt:  WWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLL

Query:  HFL--------QHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEAD
         FL        Q VTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTD++LE+LSKQINPM+SSPLDYYPLTSIGERFPEAD
Subjt:  HFL--------QHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEAD

Query:  PQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
        PQMAPRLHPRPENDVEYLHGILESIARIEGKAYR LKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLAL+GAQLFMQ
Subjt:  PQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ

XP_023512560.1 uncharacterized protein LOC111777264 isoform X1 [Cucurbita pepo subsp. pepo]3.6e-24189.19Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
        MLPSNL SPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAG RLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW

Query:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
        ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNE CPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSA SNKEYATLL
Subjt:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL

Query:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT
        HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV                   F    +        +  FL        Q VT
Subjt:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT

Query:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
        SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTD++LE+LSKQINPMKSSPLDYYPLTS+GERFPEADPQMAPRLHPRPENDV
Subjt:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV

Query:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
        EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
Subjt:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ

TrEMBL top hitse value%identityAlignment
A0A6J1FU00 uncharacterized protein LOC111448774 isoform X23.4e-23788.36Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
        MLPSNLHSPAANLFLLPSTSCSP    IWISRNQNRRNRRRTTMSVATEVVL QAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW

Query:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
        ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNE CPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
Subjt:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL

Query:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT
        HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV                   F    +        +  FL        Q VT
Subjt:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT

Query:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
        SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTD++LE+LSKQINPM+SSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
Subjt:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV

Query:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
        EYLHGILESIARIEGKAYR LKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLAL+GAQLFMQ
Subjt:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ

A0A6J1FXW8 uncharacterized protein LOC111448774 isoform X42.8e-23986.49Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNH---------------GIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEG
        MLPSNLHSPAANLFLLPSTSCSPCNH               GIWISRNQNRRNRRRTTMSVATEVVL QAGNRLYLGMDFGTSGARFALIDKEGAVCAEG
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNH---------------GIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEG

Query:  KREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVS
        KREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNE CPDALPLVKSIAPVNHTVCSASSTLCKLVS
Subjt:  KREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVS

Query:  WWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLL
        WWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV                   F    +        + 
Subjt:  WWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLL

Query:  HFL--------QHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEAD
         FL        Q VTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTD++LE+LSKQINPM+SSPLDYYPLTSIGERFPEAD
Subjt:  HFL--------QHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEAD

Query:  PQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
        PQMAPRLHPRPENDVEYLHGILESIARIEGKAYR LKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLAL+GAQLFMQ
Subjt:  PQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ

A0A6J1FYK4 uncharacterized protein LOC111448774 isoform X12.7e-24289.19Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
        MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVL QAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW

Query:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
        ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNE CPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
Subjt:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL

Query:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT
        HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV                   F    +        +  FL        Q VT
Subjt:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT

Query:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
        SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTD++LE+LSKQINPM+SSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
Subjt:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV

Query:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
        EYLHGILESIARIEGKAYR LKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLAL+GAQLFMQ
Subjt:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ

A0A6J1J7R6 uncharacterized protein LOC111484231 isoform X41.9e-23585.28Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNH---------------GIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEG
        MLPSNL SPAANLFLLPSTSCSPCNH               GIWISRNQ+RRNRRRTTM V  EVV PQAGNRLYLGMDFGTSGARFALIDKEGAVCAEG
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNH---------------GIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEG

Query:  KREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVS
        KR+YPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNE CPDALPLVKSIAPVNHTVCSASSTLCKLVS
Subjt:  KREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVS

Query:  WWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLL
        WWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV                   F    +        + 
Subjt:  WWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLL

Query:  HFL--------QHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEAD
         FL        Q VTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTD++LE+LSKQINPMKSSPLDYYPLTSIGERFPEAD
Subjt:  HFL--------QHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEAD

Query:  PQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
        PQMAPRLHPRPENDVEYLHGILESIARIEGK YRLLKDLGATQVEEV TAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGA+LFMQ
Subjt:  PQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ

A0A6J1JG91 uncharacterized protein LOC111484231 isoform X11.8e-23887.94Show/hide
Query:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW
        MLPSNL SPAANLFLLPSTSCSPCNHGIWISRNQ+RRNRRRTTM V  EVV PQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKR+YPLFKNDETIDW
Subjt:  MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDW

Query:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
        ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNE CPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL
Subjt:  ARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLL

Query:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT
        HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV                   F    +        +  FL        Q VT
Subjt:  HQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------------------FQMIALHAQEPQIVLLHFL--------QHVT

Query:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
        SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTD++LE+LSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV
Subjt:  SLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDV

Query:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ
        EYLHGILESIARIEGK YRLLKDLGATQVEEV TAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGA+LFMQ
Subjt:  EYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQLFMQ

SwissProt top hitse value%identityAlignment
P27155 Xylulose kinase2.4e-0921.12Show/hide
Query:  LGMDFGTSGARFALIDKEGAVCAEGKREYPLF--KNDETIDWARSWKTTLFSLLEDVPNHYRHL-VASISIDGTSATTIIVDSNTGQPLSKPLLYNE--A
        +G+D GTS  +  +++K G V       Y     K+  +      W       L+ + NHY H  +  IS  G     +++D   G P+   +L+N+   
Subjt:  LGMDFGTSGARFALIDKEGAVCAEGKREYPLF--KNDETIDWARSWKTTLFSLLEDVPNHYRHL-VASISIDGTSATTIIVDSNTGQPLSKPLLYNE--A

Query:  CPDALPLVK-----SIAPVNHTVCSASSTLCKLVSWWN-SADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPH
          +   + K     S+  +         TL KL+   N   D+ K     +   D++++ L G +     + A  + +  + E++   LL +    + P 
Subjt:  CPDALPLVK-----SIAPVNHTVCSASSTLCKLVSWWN-SADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPH

Query:  VF-QMIALHAQEPQIV------------------------------LLHFLQHVTSLG-STLAIKL-LSTNRIDDARFGVYSHRLDNMWLVGGASNTGGA
        +  ++IA H +  Q+                               +    + + S+G S +A+ +  ST+  +D     ++H + N   + G + + G 
Subjt:  VF-QMIALHAQEPQIV------------------------------LLHFLQHVTSLG-STLAIKL-LSTNRIDDARFGVYSHRLDNMWLVGGASNTGGA

Query:  VL----RQIFTDERLEQLSKQINPMK--SSPLDYYPLTSIGERFPEADPQMAPRLHPRPEN--DVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTA
         L    + I  DE      K IN  +  ++ L Y P   +GER P  D  +         N   ++    ++E I     ++  ++K+  A  + E+ + 
Subjt:  VL----RQIFTDERLEQLSKQINPMK--SSPLDYYPLTSIGERFPEADPQMAPRLHPRPEN--DVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTA

Query:  GGGSKNEKWTKIRERVLGLPV-SRASQTEAAYGAALLALKGAQLF
        GGG+KN +W +I+  +    + +R  +   AYGAA++A  G Q F
Subjt:  GGGSKNEKWTKIRERVLGLPV-SRASQTEAAYGAALLALKGAQLF

Q31KC7 D-ribulose kinase4.7e-8242.24Show/hide
Query:  LGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDAL
        LG+DFGTSGAR    D +          +P      + +W + W+  L+ LL  +P  +R  +  I+IDGTS T ++ D   GQP ++PLLYN+ACP  L
Subjt:  LGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDAL

Query:  PLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHVFQM-IALHAQ
          +    P +H   S++S+L KL  W     +      +L QADWL   LHG    SDY+NALK+GY P+ E +   LL      LLP V +  +A+   
Subjt:  PLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHVFQM-IALHAQ

Query:  EPQIV------------------LLHFL--------QHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQ
         P I                   +  FL        + VTSLGST+ +KLLS   + D   GVYSH+L   WL GGASN GGA LRQ F D  LE LS Q
Subjt:  EPQIV------------------LLHFL--------QHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQ

Query:  INPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRAS
        I+P K S LDYYPL S GERFP ADP   P+L PRPEN V++L G+LE + ++E   Y+ L+DLGAT ++ ++TAGGG+KN  W ++R++ +G+P++ A 
Subjt:  INPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRAS

Query:  QTEAAYGAALLALKGAQLF
         TEAA+G A LA  G   F
Subjt:  QTEAAYGAALLALKGAQLF

Q8L794 D-ribulose kinase5.6e-16065.81Show/hide
Query:  RLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACP
        +LYLGMDFGTSG RF +ID++G + A+GKREYP F  +E++ WA SWK TLFSLLED+P   R LV+SIS+DGTSATT+I++S +G+ L +P LYN++CP
Subjt:  RLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACP

Query:  DALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------
        DALP VKSIAP NHTVCS +STLCKLVSWWN+   N+E A LLHQADWLLW LHG+LGVSDYNNALKVGYDPE ESYP WLL QPYS LLP V       
Subjt:  DALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------

Query:  ---------------------------FQMIALHAQEPQIVLLHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFT
                                      +A  A EP        + VTSLGSTLAIKLLST R+DDAR+GVYSHRLD+ WLVGGASNTGGA+LRQ+F+
Subjt:  ---------------------------FQMIALHAQEPQIVLLHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFT

Query:  DERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRER
        DE+LE+LS++INPM  SPLDYYPL S GERFP ADP +APRL PRPE+DVE+LHGILESIARIEGK Y+LLK+LGAT+ EEV TAGGG+KN+KW KIR+R
Subjt:  DERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRER

Query:  VLGLPVSRASQTEAAYGAALLALKGAQ
        VLGLPV +A  TEA+YGA+LLALKGA+
Subjt:  VLGLPVSRASQTEAAYGAALLALKGAQ

Arabidopsis top hitse value%identityAlignment
AT2G21370.1 xylulose kinase-14.0e-16165.81Show/hide
Query:  RLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACP
        +LYLGMDFGTSG RF +ID++G + A+GKREYP F  +E++ WA SWK TLFSLLED+P   R LV+SIS+DGTSATT+I++S +G+ L +P LYN++CP
Subjt:  RLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDWARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACP

Query:  DALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------
        DALP VKSIAP NHTVCS +STLCKLVSWWN+   N+E A LLHQADWLLW LHG+LGVSDYNNALKVGYDPE ESYP WLL QPYS LLP V       
Subjt:  DALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV-------

Query:  ---------------------------FQMIALHAQEPQIVLLHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFT
                                      +A  A EP        + VTSLGSTLAIKLLST R+DDAR+GVYSHRLD+ WLVGGASNTGGA+LRQ+F+
Subjt:  ---------------------------FQMIALHAQEPQIVLLHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFT

Query:  DERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRER
        DE+LE+LS++INPM  SPLDYYPL S GERFP ADP +APRL PRPE+DVE+LHGILESIARIEGK Y+LLK+LGAT+ EEV TAGGG+KN+KW KIR+R
Subjt:  DERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRER

Query:  VLGLPVSRASQTEAAYGAALLALKGAQ
        VLGLPV +A  TEA+YGA+LLALKGA+
Subjt:  VLGLPVSRASQTEAAYGAALLALKGAQ

AT2G21370.2 xylulose kinase-12.6e-14466.49Show/hide
Query:  WARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATL
        WA SWK TLFSLLED+P   R LV+SIS+DGTSATT+I++S +G+ L +P LYN++CPDALP VKSIAP NHTVCS +STLCKLVSWWN+   N+E A L
Subjt:  WARSWKTTLFSLLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATL

Query:  LHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV----------------------------------FQMIALHAQEPQIVL
        LHQADWLLW LHG+LGVSDYNNALKVGYDPE ESYP WLL QPYS LLP V                                     +A  A EP    
Subjt:  LHQADWLLWFLHGKLGVSDYNNALKVGYDPEIESYPPWLLAQPYSMLLPHV----------------------------------FQMIALHAQEPQIVL

Query:  LHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRL
            + VTSLGSTLAIKLLST R+DDAR+GVYSHRLD+ WLVGGASNTGGA+LRQ+F+DE+LE+LS++INPM  SPLDYYPL S GERFP ADP +APRL
Subjt:  LHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQLSKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRL

Query:  HPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQ
         PRPE+DVE+LHGILESIARIEGK Y+LLK+LGAT+ EEV TAGGG+KN+KW KIR+RVLGLPV +A  TEA+YGA+LLALKGA+
Subjt:  HPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYGAALLALKGAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCCTTCCAATCTTCATTCTCCCGCGGCAAATTTGTTTTTGCTTCCATCAACGTCATGCTCGCCCTGTAATCATGGGATTTGGATTTCGAGGAATCAGAATCGAAG
AAATCGAAGACGGACAACAATGAGTGTTGCAACGGAAGTGGTGCTTCCTCAAGCAGGGAATCGGCTTTACCTTGGGATGGATTTTGGTACCTCTGGGGCAAGATTTGCGC
TCATTGACAAGGAAGGAGCTGTTTGTGCGGAAGGAAAGAGGGAGTATCCGCTCTTTAAGAATGACGAAACAATCGACTGGGCAAGATCGTGGAAAACAACACTTTTCTCA
TTGCTTGAAGATGTTCCAAATCATTATCGTCATTTGGTGGCATCTATTTCTATTGATGGCACATCTGCAACAACCATAATTGTGGACAGCAACACAGGACAGCCATTGTC
AAAGCCATTATTGTACAATGAGGCTTGTCCTGATGCCTTACCATTGGTGAAGTCTATTGCTCCAGTGAACCATACAGTCTGCTCTGCGTCATCTACTTTATGCAAGCTGG
TTTCATGGTGGAACAGTGCCGACTCAAATAAAGAATATGCCACATTGTTACATCAAGCAGATTGGTTGTTGTGGTTTCTACATGGGAAGCTTGGGGTTTCAGATTATAAT
AATGCTCTGAAGGTTGGCTATGATCCTGAAATTGAATCTTACCCACCCTGGCTTCTTGCTCAACCATATTCTATGCTTTTACCTCATGTTTTCCAAATGATTGCATTGCA
TGCACAGGAACCACAGATAGTATTGCTGCATTTCTTGCAGCACGTCACTTCCTTGGGATCCACGCTTGCCATCAAATTACTGAGTACCAATAGGATTGACGACGCACGGT
TTGGAGTGTACAGCCACCGACTTGACAATATGTGGCTCGTAGGAGGTGCTTCAAACACAGGTGGAGCCGTTCTAAGACAAATCTTTACTGACGAGCGATTAGAACAATTG
AGCAAACAAATCAACCCCATGAAAAGTTCACCTCTAGATTACTATCCTTTGACTTCAATTGGAGAGAGATTTCCGGAGGCAGACCCACAAATGGCTCCCAGATTACATCC
ACGGCCAGAAAATGATGTCGAATATTTGCATGGAATTTTGGAATCTATTGCCCGTATTGAGGGAAAGGCTTATAGGTTATTAAAGGATCTCGGAGCAACTCAGGTAGAAG
AAGTGTTCACAGCTGGAGGTGGATCCAAGAATGAGAAATGGACGAAGATACGAGAGAGAGTTCTTGGTTTGCCTGTGAGTCGGGCAAGTCAGACCGAGGCTGCATATGGA
GCTGCTCTATTGGCATTAAAAGGTGCGCAATTATTCATGCAATGA
mRNA sequenceShow/hide mRNA sequence
AAGAAACTGAAGCACTTCACAGAATTATGCTTCCTTCCAATCTTCATTCTCCCGCGGCAAATTTGTTTTTGCTTCCATCAACGTCATGCTCGCCCTGTAATCATGGGATT
TGGATTTCGAGGAATCAGAATCGAAGAAATCGAAGACGGACAACAATGAGTGTTGCAACGGAAGTGGTGCTTCCTCAAGCAGGGAATCGGCTTTACCTTGGGATGGATTT
TGGTACCTCTGGGGCAAGATTTGCGCTCATTGACAAGGAAGGAGCTGTTTGTGCGGAAGGAAAGAGGGAGTATCCGCTCTTTAAGAATGACGAAACAATCGACTGGGCAA
GATCGTGGAAAACAACACTTTTCTCATTGCTTGAAGATGTTCCAAATCATTATCGTCATTTGGTGGCATCTATTTCTATTGATGGCACATCTGCAACAACCATAATTGTG
GACAGCAACACAGGACAGCCATTGTCAAAGCCATTATTGTACAATGAGGCTTGTCCTGATGCCTTACCATTGGTGAAGTCTATTGCTCCAGTGAACCATACAGTCTGCTC
TGCGTCATCTACTTTATGCAAGCTGGTTTCATGGTGGAACAGTGCCGACTCAAATAAAGAATATGCCACATTGTTACATCAAGCAGATTGGTTGTTGTGGTTTCTACATG
GGAAGCTTGGGGTTTCAGATTATAATAATGCTCTGAAGGTTGGCTATGATCCTGAAATTGAATCTTACCCACCCTGGCTTCTTGCTCAACCATATTCTATGCTTTTACCT
CATGTTTTCCAAATGATTGCATTGCATGCACAGGAACCACAGATAGTATTGCTGCATTTCTTGCAGCACGTCACTTCCTTGGGATCCACGCTTGCCATCAAATTACTGAG
TACCAATAGGATTGACGACGCACGGTTTGGAGTGTACAGCCACCGACTTGACAATATGTGGCTCGTAGGAGGTGCTTCAAACACAGGTGGAGCCGTTCTAAGACAAATCT
TTACTGACGAGCGATTAGAACAATTGAGCAAACAAATCAACCCCATGAAAAGTTCACCTCTAGATTACTATCCTTTGACTTCAATTGGAGAGAGATTTCCGGAGGCAGAC
CCACAAATGGCTCCCAGATTACATCCACGGCCAGAAAATGATGTCGAATATTTGCATGGAATTTTGGAATCTATTGCCCGTATTGAGGGAAAGGCTTATAGGTTATTAAA
GGATCTCGGAGCAACTCAGGTAGAAGAAGTGTTCACAGCTGGAGGTGGATCCAAGAATGAGAAATGGACGAAGATACGAGAGAGAGTTCTTGGTTTGCCTGTGAGTCGGG
CAAGTCAGACCGAGGCTGCATATGGAGCTGCTCTATTGGCATTAAAAGGTGCGCAATTATTCATGCAATGAGAGCTTTGTACATTGTTGAACCCTGAATATTTTGCCATT
TTGGTGGCTATTAAACGAACTTATATAAGCTCCAATTAAAGAATAGTGGTACAATTAAATGAAAGATGCATACTTTCAGGCAATTTTGACAGAAAAAACAACCTTAAAGG
CAACACTCCTTATGATGCACAAGCGAAAGACTTCATAGCAGATATAGCTAAAAGAAGCCTAAGATAAAGTTAATCTGCTATCTGTTTTAAGCTTCATTCAGGAGCACAAC
CTTCCAAAATTGCCGGTGAAATTCTATGCTTCTTATATAACATTGGTTTCAGCAGCCATTCAGCCTAGATCTATTCCTTTATCAAACTAGGGCAAAGACATCGAAAAACT
ATCGAATCGTCCACAGCAGCTCACATGGACTCACCTATGCAGCTTTTGGCTTTTCTAGTTTATAATGGCTTAAATCACTGCTGTAAAGCATTCCATTAGGAGAGAGGATG
CGCCGAAGAGCGATAAGAAACCGAGCCTAAGTAAGAGAAAGTAAAGAAAGTGATTCTAGAAAGTGAGAAGAGCAGCTATCTTCACATAATAAGTACAGATATCCTATTCT
TACCCATTGAAATCCGGTCAGGGCATACCAGCAGCCTGTCAAACCATAACCTCTAGTGCTAATAACCTGCAATACAAGGGCGCCAAGAGAAAGGCATCCAGTCATCGACA
AACTAATAAATTTAAGATCTCGTCCAGCCTGCAACGTCCCTTCTAAGCTATGAGTTGGGGGTGTTATGACTAACGCCAAGAAATATGGAATCAACACTTTATGCATCTGC
ATGTACAA
Protein sequenceShow/hide protein sequence
MLPSNLHSPAANLFLLPSTSCSPCNHGIWISRNQNRRNRRRTTMSVATEVVLPQAGNRLYLGMDFGTSGARFALIDKEGAVCAEGKREYPLFKNDETIDWARSWKTTLFS
LLEDVPNHYRHLVASISIDGTSATTIIVDSNTGQPLSKPLLYNEACPDALPLVKSIAPVNHTVCSASSTLCKLVSWWNSADSNKEYATLLHQADWLLWFLHGKLGVSDYN
NALKVGYDPEIESYPPWLLAQPYSMLLPHVFQMIALHAQEPQIVLLHFLQHVTSLGSTLAIKLLSTNRIDDARFGVYSHRLDNMWLVGGASNTGGAVLRQIFTDERLEQL
SKQINPMKSSPLDYYPLTSIGERFPEADPQMAPRLHPRPENDVEYLHGILESIARIEGKAYRLLKDLGATQVEEVFTAGGGSKNEKWTKIRERVLGLPVSRASQTEAAYG
AALLALKGAQLFMQ