; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0015734 (gene) of Chayote v1 genome

Gene IDSed0015734
OrganismSechium edule (Chayote v1)
DescriptionAmino-acid racemase isoform 2
Genome locationLG12:33839418..33841002
RNA-Seq ExpressionSed0015734
SyntenySed0015734
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0036361 - racemase activity, acting on amino acids and derivatives (molecular function)
InterPro domainsIPR001920 - Asp/Glu racemase
IPR015942 - Asp/Glu/hydantoin racemase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587918.1 hypothetical protein SDJN03_16483, partial [Cucurbita argyrosperma subsp. sororia]1.1e-14380.3Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        MA+ FYALNCP PVRRNA ES+T FRR     S +QI S+V+TDENDNLP SKKI + GKSVSK +  KPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        LK+GQ+SIPFVVCSDP LGKGI  L SL+TF+ +SS H   +API+E LK+K  FLEQSGARCLITPCHLSH WLGDT  SCKLPFLHVGDCV +ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
        KLKPLEAGS+VRIGLL+TD A V GFY ERLQNQGF+VVLPD+AT+K+I+VPAVEALNKRD EGARNLLRIAVHILLIR VN+VILASDE LNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRSTGILHEKA
        ILKKCIDPMDALARAAIKWSRSTG LHEKA
Subjt:  ILKKCIDPMDALARAAIKWSRSTGILHEKA

XP_022931647.1 uncharacterized protein LOC111437803 isoform X1 [Cucurbita moschata]3.9e-14480.91Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        MAM FYALNCP PVRRNA ES+T FRR     S +QI S+V+TDENDNLP SKKI + GKSVSK +  KPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        LK+GQ+SIPFVVCSDP LGKGI  L SL+TF  +SS H   +API+E LK+K  FLEQSGARCLITPCHLSH WL DT  SCKLPFLHVGDCV +ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
        KLKPLEAGS+VRIGLL+TD A V GFY ERLQNQGF+VVLPDEAT+K+I+VPAVEALNKRD EGARNLLRIAVHILLIR VN+VILASDELLNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRSTGILHEKA
        ILKKCIDPMDALARAAIKWSRSTG LHEKA
Subjt:  ILKKCIDPMDALARAAIKWSRSTGILHEKA

XP_023006388.1 uncharacterized protein LOC111499133 isoform X1 [Cucurbita maxima]1.2e-14581.52Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        MAM FYALNCP PVRRNA ES+T FRR     S +QI S+V+TDENDNLP SKKI S+GKSVSK +  KPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        LK+GQ+SIPFVVCSDP LGKGI  L SL+TF+ +SS H   +API+E LK+K  FLEQSGARCLITPCHLSH WLGDT  SCKLPFLHVGDCV +ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
        KLKPLEAGS+VRIGLL+TD A V GFY ERLQNQGF+VVLPDEAT+K+I+VPAVEALNKRD EGARNLLRIAVHILLIR VN+VILASDELLNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRSTGILHEKA
        ILKKCIDPMDALARAAIKWSRSTG LHEKA
Subjt:  ILKKCIDPMDALARAAIKWSRSTGILHEKA

XP_023530993.1 uncharacterized protein LOC111793383 isoform X1 [Cucurbita pepo subsp. pepo]7.8e-14581.21Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        MAM FYALNCP PVRRNA ES+T FRR     S +QI S+V+TDENDNLP SKKI + GKSVSK +  KPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        LK+GQ+SIPFVVCSDP LGKGI  L SL+TF  +SS H   +API+E LK+K  FLEQSGARCLITPCHLSH WLGDT  SCKLPFLHVGDCV +ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
        KLKPLEAGS+VRIGLL+TD A V GFY ERLQNQGF+VVLPDEAT+K+I+VPAVEALNKRD EGARNLLRIAVHILLIR VN+VILASDELLNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRSTGILHEKA
        ILKKCIDPMDALARAAIKWSRSTG LHEKA
Subjt:  ILKKCIDPMDALARAAIKWSRSTGILHEKA

XP_038878952.1 uncharacterized protein LOC120071036 [Benincasa hispida]2.3e-14980.91Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        M  SFYAL CP P+RRNA E +T FRR     S VQI S+++TD NDNLPGSKKI SSGKS+SK +  KPLLVQP TVGVIGGVSV STLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        +K+GQESIPFVVCSDP LGKGI  L S  TFST+SS +GH + PI+ENLKRKRAFLEQSGARCLITPCHL+H WLGDT+ESCKLPFLHVGDCV  ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
        KLKPLE GS+VRIGLL+TDTA VAGFYHERLQNQGFD++LPDEATM++I++PAVEALNKRD EGARNLLRIAVH+LLIR VNMVILASDELLNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRSTGILHEKA
        ILKKCIDPMDALARAAIKWSRST  LHEKA
Subjt:  ILKKCIDPMDALARAAIKWSRSTGILHEKA

TrEMBL top hitse value%identityAlignment
A0A1S3CMU2 uncharacterized protein LOC103502741 isoform X11.3e-14079.88Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        MAM FYA  CP PVRRNA E +T FRR     S VQI S+V+TD NDNLP SKKI S GKS+SK +  KPLLVQPNTVGVIGGVSV STLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        LK+GQESIPFVVCS+P LGKGI  + SL TFST SS +GH +API+ENL RKRAFLE SGARCLITPCHL+H WL DT+ESCKLPFLHVGDCV  ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
         LKPLE G NVRIGLL+TDT  V G Y+ERLQNQGFDV+LPDEATMK+I++PAVEALNKRD EGARNLLRIAVH+LLIR VNMVILASDELLNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRST
        ILKKCIDPMDALARAAIKWSRST
Subjt:  ILKKCIDPMDALARAAIKWSRST

A0A5A7TXR8 Amino-acid racemase isoform 21.3e-14079.88Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        MAM FYA  CP PVRRNA E +T FRR     S VQI S+V+TD NDNLP SKKI S GKS+SK +  KPLLVQPNTVGVIGGVSV STLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        LK+GQESIPFVVCS+P LGKGI  + SL TFST SS +GH +API+ENL RKRAFLE SGARCLITPCHL+H WL DT+ESCKLPFLHVGDCV  ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
         LKPLE G NVRIGLL+TDT  V G Y+ERLQNQGFDV+LPDEATMK+I++PAVEALNKRD EGARNLLRIAVH+LLIR VNMVILASDELLNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRST
        ILKKCIDPMDALARAAIKWSRST
Subjt:  ILKKCIDPMDALARAAIKWSRST

A0A5D3E266 Amino-acid racemase isoform 22.4e-13979.57Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        MAM FYA   P PVRRNA E +T FRR     S VQI S+V+TD NDNLP SKKI S GKS+SK +  KPLLVQPNTVGVIGGVSV STLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        LK+GQESIPFVVCS+P LGKGI  + SL TFST SS +GH +API+ENL RKRAFLE SGARCLITPCHL+H WL DT+ESCKLPFLHVGDCV  ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
         LKPLE G NVRIGLL+TDT  V G Y+ERLQNQGFDV+LPDEATMK+I++PAVEALNKRD EGARNLLRIAVH+LLIR VNMVILASDELLNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRST
        ILKKCIDPMDALARAAIKWSRST
Subjt:  ILKKCIDPMDALARAAIKWSRST

A0A6J1EU92 uncharacterized protein LOC111437803 isoform X11.9e-14480.91Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        MAM FYALNCP PVRRNA ES+T FRR     S +QI S+V+TDENDNLP SKKI + GKSVSK +  KPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        LK+GQ+SIPFVVCSDP LGKGI  L SL+TF  +SS H   +API+E LK+K  FLEQSGARCLITPCHLSH WL DT  SCKLPFLHVGDCV +ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
        KLKPLEAGS+VRIGLL+TD A V GFY ERLQNQGF+VVLPDEAT+K+I+VPAVEALNKRD EGARNLLRIAVHILLIR VN+VILASDELLNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRSTGILHEKA
        ILKKCIDPMDALARAAIKWSRSTG LHEKA
Subjt:  ILKKCIDPMDALARAAIKWSRSTGILHEKA

A0A6J1L4S4 uncharacterized protein LOC111499133 isoform X15.8e-14681.52Show/hide
Query:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
        MAM FYALNCP PVRRNA ES+T FRR     S +QI S+V+TDENDNLP SKKI S+GKSVSK +  KPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS
Subjt:  MAMSFYALNCPVPVRRNACESVTCFRR-----SFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWS

Query:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA
        LK+GQ+SIPFVVCSDP LGKGI  L SL+TF+ +SS H   +API+E LK+K  FLEQSGARCLITPCHLSH WLGDT  SCKLPFLHVGDCV +ELKEA
Subjt:  LKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEA

Query:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP
        KLKPLEAGS+VRIGLL+TD A V GFY ERLQNQGF+VVLPDEAT+K+I+VPAVEALNKRD EGARNLLRIAVHILLIR VN+VILASDELLNLLPPDDP
Subjt:  KLKPLEAGSNVRIGLLSTDTAAVAGFYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDP

Query:  ILKKCIDPMDALARAAIKWSRSTGILHEKA
        ILKKCIDPMDALARAAIKWSRSTG LHEKA
Subjt:  ILKKCIDPMDALARAAIKWSRSTGILHEKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15410.1 aspartate-glutamate racemase family2.5e-7748.11Show/hide
Query:  IVRTDENDNLPGSKK---IPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWSLKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSS
        ++  DE+++LP  KK   +    ++         LL Q NTVG+IGGVS  STL F++KLV WS  +G+ S+PFV+CSDP L K    L      S  S 
Subjt:  IVRTDENDNLPGSKK---IPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWSLKNGQESIPFVVCSDPGLGKGIASLASLSTFSTSSS

Query:  PHGHDEAP-----IVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEAKLKPLEAGSNVRIGLLSTDTAAVAGFYHERL
         H  +  P     IVENL+ KR +LE+ GA+ ++ PCH++H W  +  E   +P LH+G+C+  EL+EAK+KPLEAG+ +R+G+++T     AGFY E+L
Subjt:  PHGHDEAP-----IVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEAKLKPLEAGSNVRIGLLSTDTAAVAGFYHERL

Query:  QNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDPILKKCIDPMDALARAAIKWSRS
        Q+ GF+ VLPD+ATM++ ++P++EA+ ++D EGARNLLRIA+ +LL++ VN+V+L SDE+ +LLP DDP+LKKC+DPMDALAR+AIKW+ +
Subjt:  QNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDPILKKCIDPMDALARAAIKWSRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATGTCTTTTTATGCACTGAATTGCCCTGTACCTGTTCGACGAAATGCATGTGAGAGTGTTACTTGTTTTAGAAGGAGTTTTGTACAGATTTGTTCTATTGTTCG
TACTGATGAGAATGATAACTTACCAGGATCCAAGAAGATACCGAGTTCTGGGAAATCTGTGTCAAAATGTCAAATTCATAAGCCCCTTCTTGTGCAGCCAAACACTGTGG
GTGTTATAGGGGGAGTTTCGGTTATTTCGACTTTATTGTTCTTGGAAAAGCTTGTGTGGTGGAGTTTGAAGAATGGACAGGAGAGCATACCTTTTGTTGTTTGCAGTGAT
CCAGGATTAGGGAAAGGGATTGCTTCTCTTGCTTCATTGAGTACATTCAGCACAAGTTCTTCTCCACATGGTCATGATGAAGCTCCTATCGTTGAGAATTTGAAGCGGAA
AAGGGCGTTTCTCGAGCAGTCTGGAGCTCGGTGCTTGATTACCCCTTGCCATCTTTCACATGGGTGGCTTGGTGACACAGCTGAGAGTTGCAAATTGCCTTTCCTTCATG
TGGGAGATTGTGTTACAATAGAGCTTAAAGAGGCTAAGCTTAAGCCACTTGAAGCTGGGAGCAATGTCCGGATTGGGCTGCTTAGTACTGACACAGCAGCAGTGGCTGGT
TTTTACCATGAGAGGCTACAAAACCAGGGCTTCGATGTTGTGTTGCCAGACGAAGCAACCATGAAGAATATAATAGTTCCTGCAGTTGAAGCTTTGAACAAAAGGGATCA
TGAAGGAGCAAGAAATCTGTTGAGAATTGCTGTCCATATTCTTTTGATAAGGGGTGTGAATATGGTAATACTTGCTTCTGATGAATTGCTTAATCTTCTTCCCCCTGATG
ATCCCATTTTGAAAAAATGTATTGACCCCATGGATGCCTTGGCCAGAGCAGCCATTAAATGGTCTAGATCTACAGGAATTTTACATGAGAAAGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTATGTCTTTTTATGCACTGAATTGCCCTGTACCTGTTCGACGAAATGCATGTGAGAGTGTTACTTGTTTTAGAAGGAGTTTTGTACAGATTTGTTCTATTGTTCG
TACTGATGAGAATGATAACTTACCAGGATCCAAGAAGATACCGAGTTCTGGGAAATCTGTGTCAAAATGTCAAATTCATAAGCCCCTTCTTGTGCAGCCAAACACTGTGG
GTGTTATAGGGGGAGTTTCGGTTATTTCGACTTTATTGTTCTTGGAAAAGCTTGTGTGGTGGAGTTTGAAGAATGGACAGGAGAGCATACCTTTTGTTGTTTGCAGTGAT
CCAGGATTAGGGAAAGGGATTGCTTCTCTTGCTTCATTGAGTACATTCAGCACAAGTTCTTCTCCACATGGTCATGATGAAGCTCCTATCGTTGAGAATTTGAAGCGGAA
AAGGGCGTTTCTCGAGCAGTCTGGAGCTCGGTGCTTGATTACCCCTTGCCATCTTTCACATGGGTGGCTTGGTGACACAGCTGAGAGTTGCAAATTGCCTTTCCTTCATG
TGGGAGATTGTGTTACAATAGAGCTTAAAGAGGCTAAGCTTAAGCCACTTGAAGCTGGGAGCAATGTCCGGATTGGGCTGCTTAGTACTGACACAGCAGCAGTGGCTGGT
TTTTACCATGAGAGGCTACAAAACCAGGGCTTCGATGTTGTGTTGCCAGACGAAGCAACCATGAAGAATATAATAGTTCCTGCAGTTGAAGCTTTGAACAAAAGGGATCA
TGAAGGAGCAAGAAATCTGTTGAGAATTGCTGTCCATATTCTTTTGATAAGGGGTGTGAATATGGTAATACTTGCTTCTGATGAATTGCTTAATCTTCTTCCCCCTGATG
ATCCCATTTTGAAAAAATGTATTGACCCCATGGATGCCTTGGCCAGAGCAGCCATTAAATGGTCTAGATCTACAGGAATTTTACATGAGAAAGCTTAG
Protein sequenceShow/hide protein sequence
MAMSFYALNCPVPVRRNACESVTCFRRSFVQICSIVRTDENDNLPGSKKIPSSGKSVSKCQIHKPLLVQPNTVGVIGGVSVISTLLFLEKLVWWSLKNGQESIPFVVCSD
PGLGKGIASLASLSTFSTSSSPHGHDEAPIVENLKRKRAFLEQSGARCLITPCHLSHGWLGDTAESCKLPFLHVGDCVTIELKEAKLKPLEAGSNVRIGLLSTDTAAVAG
FYHERLQNQGFDVVLPDEATMKNIIVPAVEALNKRDHEGARNLLRIAVHILLIRGVNMVILASDELLNLLPPDDPILKKCIDPMDALARAAIKWSRSTGILHEKA