; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041237 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041237
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionEncodes a protein whose expression is responsive to nematode infection, putative
Genome locationchr13:14238528..14242693
RNA-Seq ExpressionLag0041237
SyntenyLag0041237
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008454771.1 PREDICTED: uncharacterized protein At1g66480 isoform X1 [Cucumis melo]4.1e-8685.12Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGNTFG+KKTVKVMKISGET+KLN PVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-DKKDIIKPRE-KRRVS
        LMLARRSASDLTIMKPKSVL EE GG  EGS +GATR+KVRLPKAEVERLLKE KDEAEAAE+I+GLY  KT +SV EN  KE +KKDIIKPRE KRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-DKKDIIKPRE-KRRVS

Query:  FMATMDAGTQIAVTS
        FM TM+AGTQIAV S
Subjt:  FMATMDAGTQIAVTS

XP_008454773.1 PREDICTED: uncharacterized protein At1g66480 isoform X2 [Cucumis melo]1.7e-8785.51Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGNTFG+KKTVKVMKISGET+KLN PVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-DKKDIIKPREKRRVSF
        LMLARRSASDLTIMKPKSVL EE GG  EGS +GATR+KVRLPKAEVERLLKE KDEAEAAE+I+GLY  KT +SV EN  KE +KKDIIKPREKRRVSF
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-DKKDIIKPREKRRVSF

Query:  MATMDAGTQIAVTS
        M TM+AGTQIAV S
Subjt:  MATMDAGTQIAVTS

XP_022941735.1 uncharacterized protein At1g66480-like [Cucurbita moschata]4.5e-8581.4Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FG KKTVKVMK++GETMKLN PVQAGDVVKDYPGFVLL+SEAVKHYGVRAKPLE HQ LS KRLYFLV+LPK+PNQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEDKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKSVL EED GEGS  AG TRLKVRLP+AEVE+LLKESKD+ EAAEKIV   GLYM+KTTQS S++A K++K+D IKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEDKKDIIKPREKRRVS

Query:  FMATMDAGTQIAVTS
        FM TM+AGT+IAV S
Subjt:  FMATMDAGTQIAVTS

XP_023511768.1 uncharacterized protein At1g66480-like [Cucurbita pepo subsp. pepo]1.2e-8581.86Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FG KKTVKVMK++GETMKLN PVQAGDVVKDYPGFV+L+SEAVKHYGVRAKPLE HQ LS KRLYFLV+LPK+PNQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEDKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKSVL EED GEGS  AG TRLKVRLP+AEVE+LLKESKD+ EAAEKIV   GLYM+KTTQS S+NA KE+K+D IKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEDKKDIIKPREKRRVS

Query:  FMATMDAGTQIAVTS
        FM TM+AGT+IAV S
Subjt:  FMATMDAGTQIAVTS

XP_038891934.1 uncharacterized protein At1g66480 [Benincasa hispida]2.9e-8483.49Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGNTFG+KKTVKVM ISGETMKLN PVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLVELPKV  +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAG-----ATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQK-EDKKDIIKPREKRR
        LMLARRSASDLTIMKPKSVL EE GGE   +G     ATR+KVRLPKAEVERLLKE KDEAEAAE+I+GLY  KT ++V EN  K E+KKDIIKPREKRR
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAG-----ATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQK-EDKKDIIKPREKRR

Query:  VSFMATMDA-GTQIAVTS
        VSFM TM+A GTQIAV S
Subjt:  VSFMATMDA-GTQIAVTS

TrEMBL top hitse value%identityAlignment
A0A1S3BZC7 uncharacterized protein At1g66480 isoform X12.0e-8685.12Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGNTFG+KKTVKVMKISGET+KLN PVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-DKKDIIKPRE-KRRVS
        LMLARRSASDLTIMKPKSVL EE GG  EGS +GATR+KVRLPKAEVERLLKE KDEAEAAE+I+GLY  KT +SV EN  KE +KKDIIKPRE KRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-DKKDIIKPRE-KRRVS

Query:  FMATMDAGTQIAVTS
        FM TM+AGTQIAV S
Subjt:  FMATMDAGTQIAVTS

A0A1S3C0K8 uncharacterized protein At1g66480 isoform X28.0e-8885.51Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGNTFG+KKTVKVMKISGET+KLN PVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-DKKDIIKPREKRRVSF
        LMLARRSASDLTIMKPKSVL EE GG  EGS +GATR+KVRLPKAEVERLLKE KDEAEAAE+I+GLY  KT +SV EN  KE +KKDIIKPREKRRVSF
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-DKKDIIKPREKRRVSF

Query:  MATMDAGTQIAVTS
        M TM+AGTQIAV S
Subjt:  MATMDAGTQIAVTS

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X27.2e-8179.34Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FGLKKTVKVM +SG+T+KL PPVQA DVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLVELPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQK-EDKKD-IIKPREKRRVSFM
        LML+RRSASDLTIMKPKSVL EE G +   + ATR+KVRLPKAEVER+LKESKDEAEAAE+I+GLYM K  +SV +N +K E +KD IIKPREKRRVSFM
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQK-EDKKD-IIKPREKRRVSFM

Query:  ATMDAGTQIAVTS
         T++A  QIAV +
Subjt:  ATMDAGTQIAVTS

A0A6J1FUK8 uncharacterized protein At1g66480-like2.2e-8581.4Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FG KKTVKVMK++GETMKLN PVQAGDVVKDYPGFVLL+SEAVKHYGVRAKPLE HQ LS KRLYFLV+LPK+PNQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEDKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKSVL EED GEGS  AG TRLKVRLP+AEVE+LLKESKD+ EAAEKIV   GLYM+KTTQS S++A K++K+D IKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEDKKDIIKPREKRRVS

Query:  FMATMDAGTQIAVTS
        FM TM+AGT+IAV S
Subjt:  FMATMDAGTQIAVTS

A0A6J1JGY1 uncharacterized protein At1g66480-like1.1e-8180Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FG KKTVKVMK+SGETMKL+ PVQAGDVVKDYPGFVLL+SEAVKHYGVRAKPLE HQ LS KRLYFLV+LPK+PNQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEDKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKSVL EED GEGS  AG TRLKVRLP+AEVE+LLKESKD+ EAAEKIV   GLYM+KTTQS    A  ++++D IKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEDKKDIIKPREKRRVS

Query:  FMATMDAGTQIAVTS
        FM TM+AGT+IAV S
Subjt:  FMATMDAGTQIAVTS

SwissProt top hitse value%identityAlignment
Q6NLC8 Uncharacterized protein At1g664808.9e-3643.98Show/hide
Query:  MGNTFGLK-KTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVP---------NQQAPRRVRSAI
        MGN+  +K K  KVMKI GET ++  PV A +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+P         N+   RRV S I
Subjt:  MGNTFGLK-KTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVP---------NQQAPRRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLEEEDGGEG----SRAGATRLKVRLPKAEVERLLKESKDEAEA-AEKIVGLYMNKTTQSVSENAQKEDKKD
        ++ AK+RL+ LML+RR+ SD+TI +        DGG+G       G T +++RLP++++ +L++E+ ++A A AEKI+G+YM ++ +        + +++
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLEEEDGGEG----SRAGATRLKVRLPKAEVERLLKESKDEAEA-AEKIVGLYMNKTTQSVSENAQKEDKKD

Query:  I----IKPREKRRVSF
        +    IK REK +VSF
Subjt:  I----IKPREKRRVSF

Arabidopsis top hitse value%identityAlignment
AT1G66480.1 plastid movement impaired 26.3e-3743.98Show/hide
Query:  MGNTFGLK-KTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVP---------NQQAPRRVRSAI
        MGN+  +K K  KVMKI GET ++  PV A +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+P         N+   RRV S I
Subjt:  MGNTFGLK-KTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVP---------NQQAPRRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLEEEDGGEG----SRAGATRLKVRLPKAEVERLLKESKDEAEA-AEKIVGLYMNKTTQSVSENAQKEDKKD
        ++ AK+RL+ LML+RR+ SD+TI +        DGG+G       G T +++RLP++++ +L++E+ ++A A AEKI+G+YM ++ +        + +++
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLEEEDGGEG----SRAGATRLKVRLPKAEVERLLKESKDEAEA-AEKIVGLYMNKTTQSVSENAQKEDKKD

Query:  I----IKPREKRRVSF
        +    IK REK +VSF
Subjt:  I----IKPREKRRVSF

AT1G71015.1 unknown protein2.3e-4253.94Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN+ G KKT  +M I+GE+ KL  PV+AG VVKD+PG VLLESEAVK  G+RAKPLE HQ L +KR+YF+VELP+   ++ PRRVRS I MSAK+RLE+
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGL
        L L+RRS+SDL++MK K+ + +E+    S      +K++LPK ++E+L KES+  ++ + KI  L
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection.3.6e-4852.23Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN+ G KKT KVMKI GET KL  PV A +V+KD+PG VLL+SE+VKHYG RAKPLE+ Q+L  KRLYF+VE  K   +  PRRVRS I++SAK+RLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKS--VLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKEDKKDI------------
        LMLARRS+SDL+I+KP      EEE+G         R+KVR+PKAE+E+L+KE   EAEA +KI  L+M K  Q  +    ++D+               
Subjt:  LMLARRSASDLTIMKPKS--VLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKEDKKDI------------

Query:  -IKPREKRRVSFMATMDAGTQIAV
         +K R K RVSFMA    G++I V
Subjt:  -IKPREKRRVSFMATMDAGTQIAV

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1)4.3e-3347.13Show/hide
Query:  MGNTFGLKKT-VKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVP--NQQAPRRVRSA-INMSAKD
        MGNT  +++  VKVMKI G+  +L  PV A D  K+YPGFVLL+SE VK  GVRAKPLE +Q L     YFLV+LP V   N+   RRV S  I++ AK+
Subjt:  MGNTFGLKKT-VKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVP--NQQAPRRVRSA-INMSAKD

Query:  RLESLMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKT
        RLE LML+RR+ SD+   +   V      G+G   G TR+++RLP++++ +L++ES D +E A KI+  YM  +
Subjt:  RLESLMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAACACCTTTGGACTTAAGAAGACGGTGAAGGTGATGAAGATCTCCGGCGAGACGATGAAGCTCAACCCCCCGGTTCAAGCCGGGGACGTCGTCAAGGATTACCC
TGGCTTTGTTCTGCTCGAATCCGAGGCCGTGAAGCACTATGGAGTTCGAGCAAAGCCATTGGAGTCCCACCAGAAGCTCAGCACGAAGAGGCTCTATTTCCTCGTCGAGC
TGCCTAAGGTTCCAAATCAACAGGCTCCACGACGGGTACGGTCGGCGATCAATATGAGTGCCAAGGACAGGCTAGAGAGCTTGATGTTGGCGCGAAGGTCGGCATCGGAC
CTAACTATCATGAAACCGAAGAGCGTGTTGGAGGAGGAGGACGGAGGAGAGGGATCGAGAGCGGGAGCGACACGGTTGAAGGTGCGGCTGCCGAAGGCGGAGGTGGAGAG
GCTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCGGAGAAGATTGTGGGACTGTACATGAATAAAACTACACAAAGTGTTTCTGAAAATGCTCAAAAGGAGGACAAGA
AGGATATCATCAAGCCACGTGAGAAGCGACGTGTAAGCTTCATGGCGACAATGGATGCAGGGACGCAAATTGCAGTGACATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAACACCTTTGGACTTAAGAAGACGGTGAAGGTGATGAAGATCTCCGGCGAGACGATGAAGCTCAACCCCCCGGTTCAAGCCGGGGACGTCGTCAAGGATTACCC
TGGCTTTGTTCTGCTCGAATCCGAGGCCGTGAAGCACTATGGAGTTCGAGCAAAGCCATTGGAGTCCCACCAGAAGCTCAGCACGAAGAGGCTCTATTTCCTCGTCGAGC
TGCCTAAGGTTCCAAATCAACAGGCTCCACGACGGGTACGGTCGGCGATCAATATGAGTGCCAAGGACAGGCTAGAGAGCTTGATGTTGGCGCGAAGGTCGGCATCGGAC
CTAACTATCATGAAACCGAAGAGCGTGTTGGAGGAGGAGGACGGAGGAGAGGGATCGAGAGCGGGAGCGACACGGTTGAAGGTGCGGCTGCCGAAGGCGGAGGTGGAGAG
GCTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCGGAGAAGATTGTGGGACTGTACATGAATAAAACTACACAAAGTGTTTCTGAAAATGCTCAAAAGGAGGACAAGA
AGGATATCATCAAGCCACGTGAGAAGCGACGTGTAAGCTTCATGGCGACAATGGATGCAGGGACGCAAATTGCAGTGACATCTTAA
Protein sequenceShow/hide protein sequence
MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLESHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLESLMLARRSASD
LTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKEDKKDIIKPREKRRVSFMATMDAGTQIAVTS