; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg20761 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg20761
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionClavaminate synthase-like protein
Genome locationCarg_Chr15:9995576..9999623
RNA-Seq ExpressionCarg20761
SyntenyCarg20761
Gene Ontology termsGO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR003819 - TauD/TfdA-like domain
IPR042098 - Taurine dioxygenase TauD-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579699.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.05Show/hide
Query:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK
        MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAF WEDIRYVGPAPRTHIYK
Subjt:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK

Query:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
        RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
Subjt:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD

Query:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
        RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISE VVKRCQQIIEEESIQFKWEKGDVLFLDNY
Subjt:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY

Query:  ALLHGRRPSLPPRR-----------MMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEKNLRRQRRMFHGLDEEQEEQKEANSMNDLEDGDHDH
        ALLHGRRPSLPPRR           MMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEKNLRRQRRMFHGLDEEQEEQKEANSMNDLEDGDHDH
Subjt:  ALLHGRRPSLPPRR-----------MMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEKNLRRQRRMFHGLDEEQEEQKEANSMNDLEDGDHDH

Query:  SLDELQEHLPQNKVGETHKIEMERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEEIVKGALVKKE
        SLDELQEHLPQNKVGETHKIEMERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEEIVKGALVKKE
Subjt:  SLDELQEHLPQNKVGETHKIEMERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEEIVKGALVKKE

Query:  LMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE---------------------------------------
        LMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE                                       
Subjt:  LMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE---------------------------------------

Query:  -----------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGESARYLNKNSSPKSKEKAKQLMLEYAGTETDSIDSSKS
                                     MNKFSEVEELVYLRWINACLRYELRDDDETPAGESARYLNKNSSPKSKEKAKQLMLEYAGTETDSIDSSKS
Subjt:  -----------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGESARYLNKNSSPKSKEKAKQLMLEYAGTETDSIDSSKS

Query:  RSSSSFSEKPNLIRNNCESSGVLSLPMIGLSHGRKDRLEAVLAVRAETLTLSEVRRLQ
        RSSSSFSEKPNLIRNNCESSGVLSLPMIGLS GR DRLEAVLAVRAETLTLSEVRRLQ
Subjt:  RSSSSFSEKPNLIRNNCESSGVLSLPMIGLSHGRKDRLEAVLAVRAETLTLSEVRRLQ

KAG7017141.1 Clavaminate synthase-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK
        MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK
Subjt:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK

Query:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
        RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
Subjt:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD

Query:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
        RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
Subjt:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY

Query:  ALLHGRRPSLPPRRMMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEKNLRRQRRMFHGLDEEQEEQKEANSMNDLEDGDHDHSLDELQEHLPQ
        ALLHGRRPSLPPRRMMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEKNLRRQRRMFHGLDEEQEEQKEANSMNDLEDGDHDHSLDELQEHLPQ
Subjt:  ALLHGRRPSLPPRRMMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEKNLRRQRRMFHGLDEEQEEQKEANSMNDLEDGDHDHSLDELQEHLPQ

Query:  NKVGETHKIEMERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEEIVKGALVKKELMEAKGKIKEL
        NKVGETHKIEMERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEEIVKGALVKKELMEAKGKIKEL
Subjt:  NKVGETHKIEMERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEEIVKGALVKKELMEAKGKIKEL

Query:  QRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFEMNKFSEVEELVYLRWINACLRYELRDDDETPAGESARYLNKNSSPKSKEK
        QRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFEMNKFSEVEELVYLRWINACLRYELRDDDETPAGESARYLNKNSSPKSKEK
Subjt:  QRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFEMNKFSEVEELVYLRWINACLRYELRDDDETPAGESARYLNKNSSPKSKEK

Query:  AKQLMLEYAGTETDSIDSSKSRSSSSFSEKPNLIRNNCESSGVLSLPMIGLSHGRKDRLEAVLAVRAETLTLSEVRRLQVCSRNSVNSVATSFKLMSKTV
        AKQLMLEYAGTETDSIDSSKSRSSSSFSEKPNLIRNNCESSGVLSLPMIGLSHGRKDRLEAVLAVRAETLTLSEVRRLQVCSRNSVNSVATSFKLMSKTV
Subjt:  AKQLMLEYAGTETDSIDSSKSRSSSSFSEKPNLIRNNCESSGVLSLPMIGLSHGRKDRLEAVLAVRAETLTLSEVRRLQVCSRNSVNSVATSFKLMSKTV

Query:  EESLKQRTL
        EESLKQRTL
Subjt:  EESLKQRTL

XP_022929006.1 clavaminate synthase-like protein At3g21360 [Cucurbita moschata]1.2e-18299.05Show/hide
Query:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK
        MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAF WEDIRYVGPAPRTHIYK
Subjt:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK

Query:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
        RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
Subjt:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD

Query:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
        RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
Subjt:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY

Query:  ALLHGRRPSLPPRRMM
        ALLHGRRPSLPPRR++
Subjt:  ALLHGRRPSLPPRRMM

XP_022969958.1 clavaminate synthase-like protein At3g21360 [Cucurbita maxima]3.6e-17997.47Show/hide
Query:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK
        MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPES+KANFDSLLLAL+NNKDWLDQMIIKHSAVLLRGY+VSK EEFNEIVEAF WEDIRYVGPAPRTHIYK
Subjt:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK

Query:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
        RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
Subjt:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD

Query:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
        RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRK RRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
Subjt:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY

Query:  ALLHGRRPSLPPRRMM
        ALLHGRRPSLPPRR++
Subjt:  ALLHGRRPSLPPRRMM

XP_023551299.1 clavaminate synthase-like protein At3g21360 [Cucurbita pepo subsp. pepo]3.0e-18198.1Show/hide
Query:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK
        MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLAL+NNKDWLDQMIIKHSAVLLRGY+VSKP+EFNEIVEAF WEDIRYVGPAPRTHIYK
Subjt:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK

Query:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
        RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
Subjt:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD

Query:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
        RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
Subjt:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY

Query:  ALLHGRRPSLPPRRMM
        ALLHGRRPSLPPRR++
Subjt:  ALLHGRRPSLPPRRMM

TrEMBL top hitse value%identityAlignment
A0A0A0KV15 TauD domain-containing protein6.7e-16387.07Show/hide
Query:  MEFSS-KEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIY
        MEF+S KEFK+GKCEGQKEVDGE++PLVL PP++ KA+F+SLLL+L+ N DWL++MIIKHSAVLLRGY+VSK +EFN+IVE F WEDIRYVGPAPRTHIY
Subjt:  MEFSS-KEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIY

Query:  KRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSS
        KRIWTANEGPLSEFIY+HHEMVLIKEYPKRVILYCEIPPPEGGETP VPSFKVTE+MVKEFPKEVEEMDKKGLKYTFTALS NDTSSMRGRGW+D FGSS
Subjt:  KRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSS

Query:  DRYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDN
        D  EAEKRANALGM+VEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNT+VGMHGKEHSSALMADG EI+E+VVKRCQ+IIEEESIQF+WEKGDVLFLDN
Subjt:  DRYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDN

Query:  YALLHGRRPSLPPRRMM
        YALLHGRRPSLPPR+++
Subjt:  YALLHGRRPSLPPRRMM

A0A1S3CUA2 clavaminate synthase-like protein At3g213603.2e-16589.59Show/hide
Query:  MEFSS-KEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIY
        MEF+S KEFK+GKCEGQKEVDGET+PLVL PP+++KA+F+SLLLAL+ N DWLDQMIIKHSAVLLRGY+VSK EEFN+IVEAF WEDIRYVGPAPRTHIY
Subjt:  MEFSS-KEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIY

Query:  KRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSS
        KRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETP VPSFKVTERMVKEFPKEVEEM+KKGLKYTFTALS NDTSSMRGRGW+D FGSS
Subjt:  KRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSS

Query:  DRYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDN
        D  EAEKRANALGM+VEWLPNGAMKTILGPR LTKVFDGRKGRRMWFNT+VGMHGKEHSSALMADG EI+E+VVKRCQ+IIEEESIQFKWEKGDVLFLDN
Subjt:  DRYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDN

Query:  YALLHGRRPSLPPRRMM
        YALLHGRRPSLPPRR++
Subjt:  YALLHGRRPSLPPRRMM

A0A5D3BQA2 Clavaminate synthase-like protein3.2e-16589.59Show/hide
Query:  MEFSS-KEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIY
        MEF+S KEFK+GKCEGQKEVDGET+PLVL PP+++KA+F+SLLLAL+ N DWLDQMIIKHSAVLLRGY+VSK EEFN+IVEAF WEDIRYVGPAPRTHIY
Subjt:  MEFSS-KEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIY

Query:  KRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSS
        KRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETP VPSFKVTERMVKEFPKEVEEM+KKGLKYTFTALS NDTSSMRGRGW+D FGSS
Subjt:  KRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSS

Query:  DRYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDN
        D  EAEKRANALGM+VEWLPNGAMKTILGPR LTKVFDGRKGRRMWFNT+VGMHGKEHSSALMADG EI+E+VVKRCQ+IIEEESIQFKWEKGDVLFLDN
Subjt:  DRYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDN

Query:  YALLHGRRPSLPPRRMM
        YALLHGRRPSLPPRR++
Subjt:  YALLHGRRPSLPPRRMM

A0A6J1ELV8 clavaminate synthase-like protein At3g213605.8e-18399.05Show/hide
Query:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK
        MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAF WEDIRYVGPAPRTHIYK
Subjt:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK

Query:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
        RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
Subjt:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD

Query:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
        RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
Subjt:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY

Query:  ALLHGRRPSLPPRRMM
        ALLHGRRPSLPPRR++
Subjt:  ALLHGRRPSLPPRRMM

A0A6J1I2G0 clavaminate synthase-like protein At3g213601.8e-17997.47Show/hide
Query:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK
        MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPES+KANFDSLLLAL+NNKDWLDQMIIKHSAVLLRGY+VSK EEFNEIVEAF WEDIRYVGPAPRTHIYK
Subjt:  MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYK

Query:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
        RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD
Subjt:  RIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSD

Query:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
        RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRK RRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY
Subjt:  RYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNY

Query:  ALLHGRRPSLPPRRMM
        ALLHGRRPSLPPRR++
Subjt:  ALLHGRRPSLPPRRMM

SwissProt top hitse value%identityAlignment
E2JA28 Dapdiamide synthesis protein DdaC4.4e-1824.28Show/hide
Query:  LDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRY-VGPAPRTHIYKRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSF
        +D+++ +  A+L RG+++++ ++F+++V     E++ Y      R    + ++T+ E P ++ I  H E    +  P +++ Y      +GGETP   + 
Subjt:  LDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRY-VGPAPRTHIYKRIWTANEGPLSEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSF

Query:  KVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSDRYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMV
        +V   + +E    V E  +KG++Y        D S      W++AF +  + E E       ++ EWL +  ++T        +    RK   MWFN + 
Subjt:  KVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSDRYEAEKRANALGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMV

Query:  GMH----------------GKE--HSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNYALLHGRRP
          H                G +     A+   G EI + VV   +  + +  + F W+ GDVL  DN  + HGR+P
Subjt:  GMH----------------GKE--HSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNYALLHGRRP

Q9LI74 Protein CHUP1, chloroplastic4.5e-5537.32Show/hide
Query:  MMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEK--------NLRRQRRMFHGLDEEQEEQKEANSM--------------------NDLEDGD
        M  R G +V+ SIAA  +++L ++       S+NGE  +K        NL   + +    +EE+EE K  NS+                     DL  G+
Subjt:  MMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEK--------NLRRQRRMFHGLDEEQEEQKEANSM--------------------NDLEDGD

Query:  HDHSLDELQEHLPQNKVGETHKIEM-------ERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEE
         ++ L +   +L + +    +++EM       ERL + V ELEER+VKLEGELL Y G+K  E DI+EL++QL+ K  +I+  NITI+SLQAERK+LQEE
Subjt:  HDHSLDELQEHLPQNKVGETHKIEM-------ERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEE

Query:  IVKGALVKKELMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE-----------------------------
        + +  +V+KEL  A+ KIKELQRQIQ+DANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +KL+A +D E                             
Subjt:  IVKGALVKKELMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE-----------------------------

Query:  ---------------------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGE-SARYLNKNSSPKSKEKAKQLMLEYAG
                                               MN+FSEVEELVYLRW+NACLRYELR + +TPAG+ SAR L+KN SPKS+ KAK+LMLEYAG
Subjt:  ---------------------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGE-SARYLNKNSSPKSKEKAKQLMLEYAG

Query:  TE-----TD-------------------SIDSSKSRSSSSFSEKPNLIR------NNCESSGVLSLP---MIGLSHGR--------KDRLEAVLAVRA--
        +E     TD                   S+DSS SR  SSFS+KP LI+       + + S V S P     G S GR        +  LE+++   A  
Subjt:  TE-----TD-------------------SIDSSKSRSSSSFSEKPNLIR------NNCESSGVLSLP---MIGLSHGR--------KDRLEAVLAVRA--

Query:  ------------------ETLTLSEVRRLQVCSR--NSVNSVATSFKLMSKTVEESLKQR
                          ET  L  +R  Q  S     +NSVA SF +MSK+V+  L ++
Subjt:  ------------------ETLTLSEVRRLQVCSR--NSVNSVATSFKLMSKTVEESLKQR

Q9LIG0 Clavaminate synthase-like protein At3g213601.3e-5937.7Show/hide
Query:  QKEVDGETMPLVLQPPESD----KANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYV-GPAPRTHIYKRIWTANEGPL
        QK  + +  P V+ PP +       +       ++  K +LD ++ +  AVL RG+ V+  ++FN++VEAF ++++ YV G APRT +  R++TANE P 
Subjt:  QKEVDGETMPLVLQPPESD----KANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYV-GPAPRTHIYKRIWTANEGPL

Query:  SEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSDRYEAEKRANA
         + I +HHEM  ++E+P ++  YCEI P  GGETP V S  V ERM  + P+ V+ +++ GL Y      ++D SS  GRGW+  F + D+  AE+RA  
Subjt:  SEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSDRYEAEKRANA

Query:  LGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMH-------GKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNYALL
        LGM++EW  +G  KT++GP    K +D  + R++WFN+MV  +            +    DG  +   +V  C +I+EEE +   W++GDVL +DN+A+L
Subjt:  LGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMH-------GKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNYALL

Query:  HGRRPSLPPRRMM
        H RRP  PPRR++
Subjt:  HGRRPSLPPRRMM

Arabidopsis top hitse value%identityAlignment
AT1G52080.1 actin binding protein family3.8e-1731.83Show/hide
Query:  HKIEMERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEEIVKGALVKKELMEAKGKIKELQRQIQV
        H+ E+ RL   V  L ER+  LE +LL Y  +K  +   MEL+ +L+    +    N  I  LQAE ++L+ E  + + V  EL  AK +++ L++++ +
Subjt:  HKIEMERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEEIVKGALVKKELMEAKGKIKELQRQIQV

Query:  DANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYK----------------------------------------------------------KLRAE--
        +  Q    +L LKQ+V+ LQ +E +A+  ++E  K                                                          +LR+E  
Subjt:  DANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYK----------------------------------------------------------KLRAE--

Query:  ---KDFEM---NKFSEVEELVYLRWINACLRYELRDDDETPAGES-ARYLNKNSSPKSKEKAKQLMLEYAGTETDSIDSSKSRSSSSFS
           KD E    ++ +++E+LVYLRWINACLRYELR   + PAG++ AR L+   SP S+EKAKQL+LEYA +E D+ D  +  SS   S
Subjt:  ---KDFEM---NKFSEVEELVYLRWINACLRYELRDDDETPAGES-ARYLNKNSSPKSKEKAKQLMLEYAGTETDSIDSSKSRSSSSFS

AT3G21360.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein9.5e-6137.7Show/hide
Query:  QKEVDGETMPLVLQPPESD----KANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYV-GPAPRTHIYKRIWTANEGPL
        QK  + +  P V+ PP +       +       ++  K +LD ++ +  AVL RG+ V+  ++FN++VEAF ++++ YV G APRT +  R++TANE P 
Subjt:  QKEVDGETMPLVLQPPESD----KANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYV-GPAPRTHIYKRIWTANEGPL

Query:  SEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSDRYEAEKRANA
         + I +HHEM  ++E+P ++  YCEI P  GGETP V S  V ERM  + P+ V+ +++ GL Y      ++D SS  GRGW+  F + D+  AE+RA  
Subjt:  SEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSDRYEAEKRANA

Query:  LGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMH-------GKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNYALL
        LGM++EW  +G  KT++GP    K +D  + R++WFN+MV  +            +    DG  +   +V  C +I+EEE +   W++GDVL +DN+A+L
Subjt:  LGMEVEWLPNGAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMH-------GKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNYALL

Query:  HGRRPSLPPRRMM
        H RRP  PPRR++
Subjt:  HGRRPSLPPRRMM

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein3.2e-5637.32Show/hide
Query:  MMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEK--------NLRRQRRMFHGLDEEQEEQKEANSM--------------------NDLEDGD
        M  R G +V+ SIAA  +++L ++       S+NGE  +K        NL   + +    +EE+EE K  NS+                     DL  G+
Subjt:  MMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEK--------NLRRQRRMFHGLDEEQEEQKEANSM--------------------NDLEDGD

Query:  HDHSLDELQEHLPQNKVGETHKIEM-------ERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEE
         ++ L +   +L + +    +++EM       ERL + V ELEER+VKLEGELL Y G+K  E DI+EL++QL+ K  +I+  NITI+SLQAERK+LQEE
Subjt:  HDHSLDELQEHLPQNKVGETHKIEM-------ERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEE

Query:  IVKGALVKKELMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE-----------------------------
        + +  +V+KEL  A+ KIKELQRQIQ+DANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +KL+A +D E                             
Subjt:  IVKGALVKKELMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE-----------------------------

Query:  ---------------------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGE-SARYLNKNSSPKSKEKAKQLMLEYAG
                                               MN+FSEVEELVYLRW+NACLRYELR + +TPAG+ SAR L+KN SPKS+ KAK+LMLEYAG
Subjt:  ---------------------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGE-SARYLNKNSSPKSKEKAKQLMLEYAG

Query:  TE-----TD-------------------SIDSSKSRSSSSFSEKPNLIR------NNCESSGVLSLP---MIGLSHGR--------KDRLEAVLAVRA--
        +E     TD                   S+DSS SR  SSFS+KP LI+       + + S V S P     G S GR        +  LE+++   A  
Subjt:  TE-----TD-------------------SIDSSKSRSSSSFSEKPNLIR------NNCESSGVLSLP---MIGLSHGR--------KDRLEAVLAVRA--

Query:  ------------------ETLTLSEVRRLQVCSR--NSVNSVATSFKLMSKTVEESLKQR
                          ET  L  +R  Q  S     +NSVA SF +MSK+V+  L ++
Subjt:  ------------------ETLTLSEVRRLQVCSR--NSVNSVATSFKLMSKTVEESLKQR

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein3.2e-5637.32Show/hide
Query:  MMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEK--------NLRRQRRMFHGLDEEQEEQKEANSM--------------------NDLEDGD
        M  R G +V+ SIAA  +++L ++       S+NGE  +K        NL   + +    +EE+EE K  NS+                     DL  G+
Subjt:  MMNRFGVLVSVSIAAYAIRQLTIRSWSSLIYSENGEHTEK--------NLRRQRRMFHGLDEEQEEQKEANSM--------------------NDLEDGD

Query:  HDHSLDELQEHLPQNKVGETHKIEM-------ERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEE
         ++ L +   +L + +    +++EM       ERL + V ELEER+VKLEGELL Y G+K  E DI+EL++QL+ K  +I+  NITI+SLQAERK+LQEE
Subjt:  HDHSLDELQEHLPQNKVGETHKIEM-------ERLLEQVMELEERKVKLEGELLMYDGIKNSEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEE

Query:  IVKGALVKKELMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE-----------------------------
        + +  +V+KEL  A+ KIKELQRQIQ+DANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +KL+A +D E                             
Subjt:  IVKGALVKKELMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE-----------------------------

Query:  ---------------------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGE-SARYLNKNSSPKSKEKAKQLMLEYAG
                                               MN+FSEVEELVYLRW+NACLRYELR + +TPAG+ SAR L+KN SPKS+ KAK+LMLEYAG
Subjt:  ---------------------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGE-SARYLNKNSSPKSKEKAKQLMLEYAG

Query:  TE-----TD-------------------SIDSSKSRSSSSFSEKPNLIR------NNCESSGVLSLP---MIGLSHGR--------KDRLEAVLAVRA--
        +E     TD                   S+DSS SR  SSFS+KP LI+       + + S V S P     G S GR        +  LE+++   A  
Subjt:  TE-----TD-------------------SIDSSKSRSSSSFSEKPNLIR------NNCESSGVLSLP---MIGLSHGR--------KDRLEAVLAVRA--

Query:  ------------------ETLTLSEVRRLQVCSR--NSVNSVATSFKLMSKTVEESLKQR
                          ET  L  +R  Q  S     +NSVA SF +MSK+V+  L ++
Subjt:  ------------------ETLTLSEVRRLQVCSR--NSVNSVATSFKLMSKTVEESLKQR

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein1.5e-3438.25Show/hide
Query:  KELQEEIVKGALVKKELMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE-----------------------
        K LQEE+ +  +V+KEL  A+ KIKELQRQIQ+DANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +KL+A +D E                       
Subjt:  KELQEEIVKGALVKKELMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE-----------------------

Query:  ---------------------------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGE-SARYLNKNSSPKSKEKAKQL
                                                     MN+FSEVEELVYLRW+NACLRYELR + +TPAG+ SAR L+KN SPKS+ KAK+L
Subjt:  ---------------------------------------------MNKFSEVEELVYLRWINACLRYELRDDDETPAGE-SARYLNKNSSPKSKEKAKQL

Query:  MLEYAGTE-----TD-------------------SIDSSKSRSSSSFSEKPNLIR------NNCESSGVLSLP---MIGLSHGR--------KDRLEAVL
        MLEYAG+E     TD                   S+DSS SR  SSFS+KP LI+       + + S V S P     G S GR        +  LE+++
Subjt:  MLEYAGTE-----TD-------------------SIDSSKSRSSSSFSEKPNLIR------NNCESSGVLSLP---MIGLSHGR--------KDRLEAVL

Query:  AVRA--------------------ETLTLSEVRRLQVCSR--NSVNSVATSFKLMSKTVEESLKQR
           A                    ET  L  +R  Q  S     +NSVA SF +MSK+V+  L ++
Subjt:  AVRA--------------------ETLTLSEVRRLQVCSR--NSVNSVATSFKLMSKTVEESLKQR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTTAGCAGCAAAGAGTTCAAGGTTGGAAAATGTGAGGGCCAAAAGGAAGTGGATGGAGAAACCATGCCATTGGTTCTACAGCCTCCTGAATCAGACAAGGCCAA
CTTTGATTCCCTTTTATTGGCTCTGCAAAACAACAAAGATTGGTTGGACCAAATGATTATCAAACATAGCGCTGTTTTACTCCGAGGATACAACGTGTCGAAACCCGAGG
AGTTCAACGAGATTGTCGAAGCCTTCAGGTGGGAAGACATTCGATACGTCGGTCCAGCTCCTCGAACTCATATTTATAAGAGGATTTGGACTGCTAATGAAGGACCCCTC
TCTGAGTTCATTTACTACCACCATGAGATGGTTTTGATAAAGGAATATCCAAAGAGGGTAATCTTGTACTGTGAGATACCACCTCCAGAAGGTGGAGAAACGCCATTTGT
TCCAAGTTTCAAAGTAACAGAGCGAATGGTGAAGGAATTCCCAAAGGAAGTTGAAGAAATGGACAAGAAAGGCTTGAAATATACCTTCACTGCTCTTAGCAACAATGACA
CTTCCTCTATGAGGGGCAGAGGCTGGGAGGATGCTTTTGGTTCATCAGATCGTTATGAAGCAGAAAAAAGGGCTAATGCGTTGGGGATGGAGGTAGAGTGGCTGCCAAAC
GGCGCGATGAAGACGATACTGGGACCGAGGTGCCTAACAAAGGTGTTTGATGGAAGGAAAGGAAGAAGAATGTGGTTTAACACTATGGTGGGGATGCATGGGAAGGAGCA
TAGCTCTGCTTTAATGGCGGATGGGATGGAGATTTCAGAACATGTTGTGAAGAGATGCCAGCAGATAATTGAAGAAGAAAGCATCCAATTCAAATGGGAAAAGGGGGATG
TTTTGTTTTTGGATAACTATGCTTTGCTCCATGGAAGAAGGCCTTCTCTTCCTCCTAGAAGAATGATGAACAGATTTGGTGTTCTTGTTTCTGTTTCCATTGCAGCTTAT
GCAATTAGGCAGCTCACAATCAGATCATGGAGCTCATTAATCTATTCAGAAAATGGAGAACACACAGAGAAGAACCTAAGACGACAGCGAAGAATGTTCCATGGCTTGGA
TGAAGAACAAGAAGAACAAAAGGAAGCTAATTCAATGAATGATCTTGAAGATGGTGATCATGATCATAGTTTAGATGAACTTCAAGAACATCTACCCCAAAACAAAGTGG
GTGAAACCCATAAGATTGAAATGGAAAGGCTGCTGGAACAAGTGATGGAATTGGAGGAGAGGAAAGTGAAGCTTGAAGGTGAACTGCTCATGTATGATGGTATCAAGAAC
AGTGAAATGGATATCATGGAGTTGAAGAAGCAGCTGGAGGCCAAGAATGATGATATCAATAAGAGTAATATCACAATCAGCTCTTTGCAGGCTGAGAGGAAGGAGCTACA
AGAAGAGATAGTGAAGGGAGCATTGGTGAAGAAGGAACTAATGGAGGCTAAGGGAAAGATTAAGGAGCTGCAAAGGCAGATTCAGGTGGATGCAAACCAAACAAAACAAC
ATTTGTTATTACTCAAACAACAAGTTTCCACTTTGCAGGCAAAAGAGGAAGAAGCCCTCAAGAAAGAGGTAGAACTTTATAAGAAGCTGAGAGCGGAGAAGGATTTCGAG
ATGAACAAGTTTAGTGAAGTTGAAGAGTTAGTGTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGGGACGACGACGAAACACCGGCAGGCGAATCAGCTCG
TTATCTCAATAAGAACTCAAGTCCAAAGTCAAAAGAGAAGGCAAAACAACTCATGTTAGAGTATGCAGGAACAGAAACAGATTCAATTGATAGTTCAAAGAGCAGAAGTA
GTAGTAGTTTCAGTGAGAAGCCTAATTTGATCAGAAACAACTGCGAGTCGAGTGGTGTTTTGTCGTTGCCGATGATCGGTTTGAGCCACGGACGGAAGGATCGTTTAGAA
GCAGTGTTGGCTGTGAGGGCTGAAACTTTAACTCTCTCAGAAGTCCGCAGATTGCAGGTTTGTTCAAGAAACTCTGTTAACTCTGTTGCAACATCATTCAAACTTATGTC
TAAAACAGTTGAAGAAAGTCTAAAACAAAGAACATTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTTAGCAGCAAAGAGTTCAAGGTTGGAAAATGTGAGGGCCAAAAGGAAGTGGATGGAGAAACCATGCCATTGGTTCTACAGCCTCCTGAATCAGACAAGGCCAA
CTTTGATTCCCTTTTATTGGCTCTGCAAAACAACAAAGATTGGTTGGACCAAATGATTATCAAACATAGCGCTGTTTTACTCCGAGGATACAACGTGTCGAAACCCGAGG
AGTTCAACGAGATTGTCGAAGCCTTCAGGTGGGAAGACATTCGATACGTCGGTCCAGCTCCTCGAACTCATATTTATAAGAGGATTTGGACTGCTAATGAAGGACCCCTC
TCTGAGTTCATTTACTACCACCATGAGATGGTTTTGATAAAGGAATATCCAAAGAGGGTAATCTTGTACTGTGAGATACCACCTCCAGAAGGTGGAGAAACGCCATTTGT
TCCAAGTTTCAAAGTAACAGAGCGAATGGTGAAGGAATTCCCAAAGGAAGTTGAAGAAATGGACAAGAAAGGCTTGAAATATACCTTCACTGCTCTTAGCAACAATGACA
CTTCCTCTATGAGGGGCAGAGGCTGGGAGGATGCTTTTGGTTCATCAGATCGTTATGAAGCAGAAAAAAGGGCTAATGCGTTGGGGATGGAGGTAGAGTGGCTGCCAAAC
GGCGCGATGAAGACGATACTGGGACCGAGGTGCCTAACAAAGGTGTTTGATGGAAGGAAAGGAAGAAGAATGTGGTTTAACACTATGGTGGGGATGCATGGGAAGGAGCA
TAGCTCTGCTTTAATGGCGGATGGGATGGAGATTTCAGAACATGTTGTGAAGAGATGCCAGCAGATAATTGAAGAAGAAAGCATCCAATTCAAATGGGAAAAGGGGGATG
TTTTGTTTTTGGATAACTATGCTTTGCTCCATGGAAGAAGGCCTTCTCTTCCTCCTAGAAGAATGATGAACAGATTTGGTGTTCTTGTTTCTGTTTCCATTGCAGCTTAT
GCAATTAGGCAGCTCACAATCAGATCATGGAGCTCATTAATCTATTCAGAAAATGGAGAACACACAGAGAAGAACCTAAGACGACAGCGAAGAATGTTCCATGGCTTGGA
TGAAGAACAAGAAGAACAAAAGGAAGCTAATTCAATGAATGATCTTGAAGATGGTGATCATGATCATAGTTTAGATGAACTTCAAGAACATCTACCCCAAAACAAAGTGG
GTGAAACCCATAAGATTGAAATGGAAAGGCTGCTGGAACAAGTGATGGAATTGGAGGAGAGGAAAGTGAAGCTTGAAGGTGAACTGCTCATGTATGATGGTATCAAGAAC
AGTGAAATGGATATCATGGAGTTGAAGAAGCAGCTGGAGGCCAAGAATGATGATATCAATAAGAGTAATATCACAATCAGCTCTTTGCAGGCTGAGAGGAAGGAGCTACA
AGAAGAGATAGTGAAGGGAGCATTGGTGAAGAAGGAACTAATGGAGGCTAAGGGAAAGATTAAGGAGCTGCAAAGGCAGATTCAGGTGGATGCAAACCAAACAAAACAAC
ATTTGTTATTACTCAAACAACAAGTTTCCACTTTGCAGGCAAAAGAGGAAGAAGCCCTCAAGAAAGAGGTAGAACTTTATAAGAAGCTGAGAGCGGAGAAGGATTTCGAG
ATGAACAAGTTTAGTGAAGTTGAAGAGTTAGTGTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGGGACGACGACGAAACACCGGCAGGCGAATCAGCTCG
TTATCTCAATAAGAACTCAAGTCCAAAGTCAAAAGAGAAGGCAAAACAACTCATGTTAGAGTATGCAGGAACAGAAACAGATTCAATTGATAGTTCAAAGAGCAGAAGTA
GTAGTAGTTTCAGTGAGAAGCCTAATTTGATCAGAAACAACTGCGAGTCGAGTGGTGTTTTGTCGTTGCCGATGATCGGTTTGAGCCACGGACGGAAGGATCGTTTAGAA
GCAGTGTTGGCTGTGAGGGCTGAAACTTTAACTCTCTCAGAAGTCCGCAGATTGCAGGTTTGTTCAAGAAACTCTGTTAACTCTGTTGCAACATCATTCAAACTTATGTC
TAAAACAGTTGAAGAAAGTCTAAAACAAAGAACATTATAA
Protein sequenceShow/hide protein sequence
MEFSSKEFKVGKCEGQKEVDGETMPLVLQPPESDKANFDSLLLALQNNKDWLDQMIIKHSAVLLRGYNVSKPEEFNEIVEAFRWEDIRYVGPAPRTHIYKRIWTANEGPL
SEFIYYHHEMVLIKEYPKRVILYCEIPPPEGGETPFVPSFKVTERMVKEFPKEVEEMDKKGLKYTFTALSNNDTSSMRGRGWEDAFGSSDRYEAEKRANALGMEVEWLPN
GAMKTILGPRCLTKVFDGRKGRRMWFNTMVGMHGKEHSSALMADGMEISEHVVKRCQQIIEEESIQFKWEKGDVLFLDNYALLHGRRPSLPPRRMMNRFGVLVSVSIAAY
AIRQLTIRSWSSLIYSENGEHTEKNLRRQRRMFHGLDEEQEEQKEANSMNDLEDGDHDHSLDELQEHLPQNKVGETHKIEMERLLEQVMELEERKVKLEGELLMYDGIKN
SEMDIMELKKQLEAKNDDINKSNITISSLQAERKELQEEIVKGALVKKELMEAKGKIKELQRQIQVDANQTKQHLLLLKQQVSTLQAKEEEALKKEVELYKKLRAEKDFE
MNKFSEVEELVYLRWINACLRYELRDDDETPAGESARYLNKNSSPKSKEKAKQLMLEYAGTETDSIDSSKSRSSSSFSEKPNLIRNNCESSGVLSLPMIGLSHGRKDRLE
AVLAVRAETLTLSEVRRLQVCSRNSVNSVATSFKLMSKTVEESLKQRTL