CuGenDBv2

Lag0010743 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010743
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Endoribonuclease E-like protein
Genome location: chr1:5115862..5119144
RNA-Seq Expression: Lag0010743
Synteny: Lag0010743
Gene Ontology terms: NA
InterPro domains: IPR040320 - Uncharacterized protein At4g37920-like
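
The genome location above uses a `seqid:start..end` layout. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it and reports the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr1:5115862..5119144")
print(seqid, start, end, span_length(start, end))  # chr1 5115862 5119144 3283
```

Under that assumption the gene spans 3283 bp on chr1.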


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6603165.1 hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.2e-94, % identity: 86.51)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID
        MAITN LAFQLSI STKTFIF  FSA QK LP  SSA+PFK SPKNSKS+NR T T+ TPMQFNASARA DV TTEM EQAEMEVAEGYTISQFCDKIID
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID

Query:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT
        IF+NEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDP MKEKLLSL R+VKRIDDEMEIHSELLKELQDSPTDINAIVAKRR+EFTE+FFKFLT
Subjt:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT

Query:  LISETHDSLEDRDVL
        L+SETHDSLED D +
Subjt:  LISETHDSLEDRDVL

XP_022933100.1 uncharacterized protein At4g37920 [Cucurbita moschata] (e-value: 1.2e-94, % identity: 86.51)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID
        MAITN LAFQLSI STKTFIF  FSA QK LP  SSA+PFK SPKNSKS+NR T T+ TPMQFNASAR  DV TTEM EQAEMEVAEGYTISQFCDKIID
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID

Query:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT
        IF+NEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDP MKEKLLSL R+VKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTE+FFKFLT
Subjt:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT

Query:  LISETHDSLEDRDVL
        L+SETHDSLED D +
Subjt:  LISETHDSLEDRDVL

XP_022967802.1 uncharacterized protein At4g37920 [Cucurbita maxima] (e-value: 6.0e-94, % identity: 85.58)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID
        MAITN LAFQLSI ST+TFIF  FSA Q  LP  SSA PFKP+PKNSKS+NR T T+ TPMQFNASARA DV TTEM EQ EMEVAEGYTISQFCDKIID
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID

Query:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT
        IF+NEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDP MKEKLLSL R+VKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTE+FFKFLT
Subjt:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT

Query:  LISETHDSLEDRDVL
        L+SETHDSLED D +
Subjt:  LISETHDSLEDRDVL

XP_023544083.1 uncharacterized protein At4g37920 [Cucurbita pepo subsp. pepo] (e-value: 4.6e-94, % identity: 86.57)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPF-SSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKII
        MAITN L FQLSI STKTFIF  FSA QK LPP  SSA PFKPSPKNSKS+NR T T+ TPMQFNASARA DV T EM EQAEMEVAEGYTISQFCDKII
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPF-SSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKII

Query:  DIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFL
        DIF+NEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDP MKEKLLSL R+VKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTE+FFKFL
Subjt:  DIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFL

Query:  TLISETHDSLEDRDVL
        TL+SETHDSLED D +
Subjt:  TLISETHDSLEDRDVL

XP_038883875.1 uncharacterized protein At4g37920 isoform X2 [Benincasa hispida] (e-value: 4.8e-91, % identity: 85.12)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID
        MA TNHL FQLSI STK+FIF SFSAT K LP   SAS FKPSP+  KS+N   VTITTPMQF ASA   DV TTE  E+AEMEVAEGYTISQFCDKIID
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID

Query:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT
        IF+NEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDP MKEKL+SLRRKVKRIDDEMEIH ELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT
Subjt:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT

Query:  LISETHDSLEDRDVL
        LISETHDSLEDRD +
Subjt:  LISETHDSLEDRDVL

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L3X1 Uncharacterized protein (e-value: 1.4e-88, % identity: 81.48)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQ-FNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKII
        MA TNHL FQ  + STK FIF SFS T   LP   SASPFKPSPK SKS+NR +VTIT P+Q FNASAR  DV T+E  EQ EMEVA+GY++SQFCDKII
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQ-FNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKII

Query:  DIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFL
        DIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWE DP MKEKL+SLRRKVK+IDDEMEIHSELLKELQDSPTDINAIVAKR KEFT+EFFKFL
Subjt:  DIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFL

Query:  TLISETHDSLEDRDVL
        TLISETHDSLEDRD +
Subjt:  TLISETHDSLEDRDVL

A0A1S3B4W5 uncharacterized protein At4g37920, chloroplastic isoform X1 (e-value: 2.5e-90, % identity: 82.87)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQ-FNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKII
        MA TNHL FQ  I STK+FIF +FS T K LP   SASPFKPSPK SKS+NR TVTIT P+Q FNASAR  DV T+E  EQAEMEVA+GY++SQFCDKII
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQ-FNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKII

Query:  DIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFL
        DIF+NEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDP MKEKL+SLRRKVK+IDDEMEIHSELLKELQDSPTDINAIVA RRKEFT+EFFKFL
Subjt:  DIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFL

Query:  TLISETHDSLEDRDVL
        TLISETHDSLEDRD +
Subjt:  TLISETHDSLEDRDVL

A0A1S4DV48 uncharacterized protein At4g37920, chloroplastic isoform X2 (e-value: 2.5e-90, % identity: 82.87)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQ-FNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKII
        MA TNHL FQ  I STK+FIF +FS T K LP   SASPFKPSPK SKS+NR TVTIT P+Q FNASAR  DV T+E  EQAEMEVA+GY++SQFCDKII
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQ-FNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKII

Query:  DIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFL
        DIF+NEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDP MKEKL+SLRRKVK+IDDEMEIHSELLKELQDSPTDINAIVA RRKEFT+EFFKFL
Subjt:  DIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFL

Query:  TLISETHDSLEDRDVL
        TLISETHDSLEDRD +
Subjt:  TLISETHDSLEDRDVL

A0A6J1F3Z5 uncharacterized protein At4g37920 (e-value: 5.9e-95, % identity: 86.51)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID
        MAITN LAFQLSI STKTFIF  FSA QK LP  SSA+PFK SPKNSKS+NR T T+ TPMQFNASAR  DV TTEM EQAEMEVAEGYTISQFCDKIID
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID

Query:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT
        IF+NEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDP MKEKLLSL R+VKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTE+FFKFLT
Subjt:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT

Query:  LISETHDSLEDRDVL
        L+SETHDSLED D +
Subjt:  LISETHDSLEDRDVL

A0A6J1HRT8 uncharacterized protein At4g37920 (e-value: 2.9e-94, % identity: 85.58)
Query:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID
        MAITN LAFQLSI ST+TFIF  FSA Q  LP  SSA PFKP+PKNSKS+NR T T+ TPMQFNASARA DV TTEM EQ EMEVAEGYTISQFCDKIID
Subjt:  MAITNHLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIID

Query:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT
        IF+NEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDP MKEKLLSL R+VKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTE+FFKFLT
Subjt:  IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLT

Query:  LISETHDSLEDRDVL
        L+SETHDSLED D +
Subjt:  LISETHDSLEDRDVL

SwissProt top hits (e-value, % identity, alignment)
Q84WN0 Uncharacterized protein At4g37920 (e-value: 2.4e-45, % identity: 50.49)
Query:  KTFIFGSFSATQKALPPFSSASPFKPSPK--NSKSNNRRTVTI---TTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIIDIFLNEKPKTKE
        +T IF S +    + PP +S +   P     N     R++ TI   T  + +N +  A   V + + +  E+EVAEGYT++QFCDKIID+FLNEKPK K+
Subjt:  KTFIFGSFSATQKALPPFSSASPFKPSPK--NSKSNNRRTVTI---TTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIIDIFLNEKPKTKE

Query:  WRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLED
        W+ +LV R+EW KY  +FY  C+ RAD E+DP +K+KL+SL  KVK+ID EME H++LLKE+Q++PTDINAI AKRR++FT EFF+++TL+SET D LED
Subjt:  WRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLED

Query:  RDVL
        RD +
Subjt:  RDVL

Arabidopsis top hits (e-value, % identity, alignment)
AT1G36320.1 unknown protein (e-value: 8.3e-25, % identity: 37.42)
Query:  PMQFNASARAID---VVTTEMVEQAEMEVAEGYTISQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKV
        P +F  SA   D   V   E  + +E  V +   + + CDK+I++F+ +KP   +WR+ L F +EW   R  FY  CQ RAD E +P MK K+  L RK+
Subjt:  PMQFNASARAID---VVTTEMVEQAEMEVAEGYTISQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKV

Query:  KRIDDEMEIHSELLKELQDS-PTDINAIVAKRRKEFTEEFFKFLTLISET-HDSLEDRDVLPS
        K +D++++ H+ELL  ++ + P +I  +VA+RRK+FT EFF+ L  ++E+ +D+ ++++ L S
Subjt:  KRIDDEMEIHSELLKELQDS-PTDINAIVAKRRKEFTEEFFKFLTLISET-HDSLEDRDVLPS

AT4G37920.1 unknown protein (e-value: 1.7e-46, % identity: 50.49)
Query:  KTFIFGSFSATQKALPPFSSASPFKPSPK--NSKSNNRRTVTI---TTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIIDIFLNEKPKTKE
        +T IF S +    + PP +S +   P     N     R++ TI   T  + +N +  A   V + + +  E+EVAEGYT++QFCDKIID+FLNEKPK K+
Subjt:  KTFIFGSFSATQKALPPFSSASPFKPSPK--NSKSNNRRTVTI---TTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIIDIFLNEKPKTKE

Query:  WRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLED
        W+ +LV R+EW KY  +FY  C+ RAD E+DP +K+KL+SL  KVK+ID EME H++LLKE+Q++PTDINAI AKRR++FT EFF+++TL+SET D LED
Subjt:  WRKFLVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLED

Query:  RDVL
        RD +
Subjt:  RDVL
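
The unlabeled middle line of each alignment block above appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds here, the sketch below counts identities and positives from a midline string. The helper name `midline_stats` is hypothetical, and the reported % identity values cover a whole alignment, so the midlines of all blocks of a hit would need to be concatenated before comparing against them.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, columns) for a BLAST-style midline.

    ASSUMES: letters = identical residues, '+' = conservative
    substitutions, spaces = mismatches or gaps.
    """
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, len(midline)


# Midline of the short final block shown for several of the GenBank hits.
identical, positives, columns = midline_stats("L+SETHDSLED D +")
print(identical, positives, columns)  # 11 13 15
```

The percent identity of a full alignment would then be `100 * identical / columns` over the concatenated midline.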


Sequences
CDS sequence
ATGTTTGTTGGTTTTTTCTGGTTCGACTGGTTACGAAAACCCCTCTCCCTCTATGAAAAACTAAATAGAAAAACCCACGAAAATGAAGAGGTCGTGGAGTCGGAGCTGCT
GGACACGCCGGAGTCGTCGGATTCGAACGAGAGAGAAAGAACCGCCGGAGCCGTTGAAGTCGTCGGAGCTGCTGGACACGAACGGAATGACGAGGGAGGAGAAGACGAAG
TGAAGACGAAGGGAGGAAGTGCTTCCAGTCTCCACCTCCACCTACTTGGGCGGTTGGTCCTCTCTTCTCTTCGCTTTCTTTCAAAATTTCCGGCAATGGCCATCACAAAT
CACTTGGCCTTTCAGCTCTCCATCTGCTCAACCAAAACCTTCATCTTCGGCAGCTTTTCCGCCACTCAGAAAGCACTTCCACCCTTCTCCTCTGCTTCACCCTTCAAACC
ATCGCCGAAGAATTCCAAATCCAACAACCGAAGAACAGTTACAATCACAACCCCAATGCAATTCAACGCAAGTGCACGAGCGATTGATGTAGTTACAACTGAAATGGTAG
AACAAGCAGAGATGGAAGTTGCTGAGGGATATACCATCTCTCAATTTTGTGATAAAATAATTGATATTTTCTTGAATGAGAAGCCCAAGACTAAAGAATGGAGGAAGTTT
TTGGTATTTAGGGAAGAGTGGAAAAAGTATAGAGAGAGCTTCTACAGTCATTGCCAGAGGCGGGCAGATTGGGAGAGTGATCCAAATATGAAAGAGAAGTTACTTTCACT
TAGGAGAAAGGTCAAAAGGATTGATGATGAAATGGAGATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCAACTGACATCAATGCGATAGTTGCAAAGCGGCGCA
AAGAGTTCACGGAGGAATTCTTTAAGTTCCTTACTTTGATTTCGGAAACCCATGATAGCTTGGAAGATCGTGATGTTCTTCCTAGCAAATTTCTCTCTCTTCTCAACCAT
TCTTTCTCATGCATTCTCTTTTCTTTATTGATCGACGTCGGTACCAAGCTCTCTTTCACTCTCTATTTTCTCTTTTCTCTTCCCTTCTTTTCTGCTCCTCACCGTGCTTC
AAACCCTCTTCCCAAAACTTTAGAAAACAAAATCGCATCTCACCTATCCGAGTCGCAAAATCGACAAAATCGACCAACCGATTTCAGAGAGCATCCTAAAAAGACCATCC
ACCGACCGGTCCAAACCAATAGTTTAGTTTTTGGCGTCGGTTTTCGTCCAAAATCGACTCCGACCGATCGACGAATAGCCCTAAGCATGCTTAACTTCAAAGTTCCTATG
ATTGAGCCATTGAAAAGGAAAGTGCACCTTGTTGGTATAGGTAGTTACTATCAATTCTCTTCAGCTTATCTTGACCATACTTTCATCTCCTCAGGATCCCTCTCATTCGG
ATGTGGTCTTAGTTCATTCATGTACCCCTCCTATCCTTGGGCGTTACGTTACATGCCCACCAACTTCTGCTTTGGTTCGTCCTCGAACCACATCCTATGGAGAGGTTTCA
CTCTGATACCATATGTAACGCCCCCAGATTTTAAGTCTAACCTAGTGACAAGAATAAGGTAA
mRNA sequence
ATGTTTGTTGGTTTTTTCTGGTTCGACTGGTTACGAAAACCCCTCTCCCTCTATGAAAAACTAAATAGAAAAACCCACGAAAATGAAGAGGTCGTGGAGTCGGAGCTGCT
GGACACGCCGGAGTCGTCGGATTCGAACGAGAGAGAAAGAACCGCCGGAGCCGTTGAAGTCGTCGGAGCTGCTGGACACGAACGGAATGACGAGGGAGGAGAAGACGAAG
TGAAGACGAAGGGAGGAAGTGCTTCCAGTCTCCACCTCCACCTACTTGGGCGGTTGGTCCTCTCTTCTCTTCGCTTTCTTTCAAAATTTCCGGCAATGGCCATCACAAAT
CACTTGGCCTTTCAGCTCTCCATCTGCTCAACCAAAACCTTCATCTTCGGCAGCTTTTCCGCCACTCAGAAAGCACTTCCACCCTTCTCCTCTGCTTCACCCTTCAAACC
ATCGCCGAAGAATTCCAAATCCAACAACCGAAGAACAGTTACAATCACAACCCCAATGCAATTCAACGCAAGTGCACGAGCGATTGATGTAGTTACAACTGAAATGGTAG
AACAAGCAGAGATGGAAGTTGCTGAGGGATATACCATCTCTCAATTTTGTGATAAAATAATTGATATTTTCTTGAATGAGAAGCCCAAGACTAAAGAATGGAGGAAGTTT
TTGGTATTTAGGGAAGAGTGGAAAAAGTATAGAGAGAGCTTCTACAGTCATTGCCAGAGGCGGGCAGATTGGGAGAGTGATCCAAATATGAAAGAGAAGTTACTTTCACT
TAGGAGAAAGGTCAAAAGGATTGATGATGAAATGGAGATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCAACTGACATCAATGCGATAGTTGCAAAGCGGCGCA
AAGAGTTCACGGAGGAATTCTTTAAGTTCCTTACTTTGATTTCGGAAACCCATGATAGCTTGGAAGATCGTGATGTTCTTCCTAGCAAATTTCTCTCTCTTCTCAACCAT
TCTTTCTCATGCATTCTCTTTTCTTTATTGATCGACGTCGGTACCAAGCTCTCTTTCACTCTCTATTTTCTCTTTTCTCTTCCCTTCTTTTCTGCTCCTCACCGTGCTTC
AAACCCTCTTCCCAAAACTTTAGAAAACAAAATCGCATCTCACCTATCCGAGTCGCAAAATCGACAAAATCGACCAACCGATTTCAGAGAGCATCCTAAAAAGACCATCC
ACCGACCGGTCCAAACCAATAGTTTAGTTTTTGGCGTCGGTTTTCGTCCAAAATCGACTCCGACCGATCGACGAATAGCCCTAAGCATGCTTAACTTCAAAGTTCCTATG
ATTGAGCCATTGAAAAGGAAAGTGCACCTTGTTGGTATAGGTAGTTACTATCAATTCTCTTCAGCTTATCTTGACCATACTTTCATCTCCTCAGGATCCCTCTCATTCGG
ATGTGGTCTTAGTTCATTCATGTACCCCTCCTATCCTTGGGCGTTACGTTACATGCCCACCAACTTCTGCTTTGGTTCGTCCTCGAACCACATCCTATGGAGAGGTTTCA
CTCTGATACCATATGTAACGCCCCCAGATTTTAAGTCTAACCTAGTGACAAGAATAAGGTAA
Protein sequence
MFVGFFWFDWLRKPLSLYEKLNRKTHENEEVVESELLDTPESSDSNERERTAGAVEVVGAAGHERNDEGGEDEVKTKGGSASSLHLHLLGRLVLSSLRFLSKFPAMAITN
HLAFQLSICSTKTFIFGSFSATQKALPPFSSASPFKPSPKNSKSNNRRTVTITTPMQFNASARAIDVVTTEMVEQAEMEVAEGYTISQFCDKIIDIFLNEKPKTKEWRKF
LVFREEWKKYRESFYSHCQRRADWESDPNMKEKLLSLRRKVKRIDDEMEIHSELLKELQDSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDVLPSKFLSLLNH
SFSCILFSLLIDVGTKLSFTLYFLFSLPFFSAPHRASNPLPKTLENKIASHLSESQNRQNRPTDFREHPKKTIHRPVQTNSLVFGVGFRPKSTPTDRRIALSMLNFKVPM
IEPLKRKVHLVGIGSYYQFSSAYLDHTFISSGSLSFGCGLSSFMYPSYPWALRYMPTNFCFGSSSNHILWRGFTLIPYVTPPDFKSNLVTRIR
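
The protein sequence above can be related to the CDS by codon-wise translation. The following is a minimal sketch that assumes the standard genetic code and reading frame 1. The `translate` helper is hypothetical and is not how the database itself derives the protein. It is applied here only to the first ten codons and the last four codons of the CDS.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with ambiguous bases raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATG TTT GTT GGT TTT TTC TGG TTC GAC TGG"))  # MFVGFFWFDW
# Last four codons of the CDS, ending in the TAA stop.
print(translate("AGA ATA AGG TAA"))  # RIR
```

Both results agree with the start (`MFVGFFWFDW`) and end (`RIR`) of the protein sequence listed above.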