CuGenDBv2

Lag0015275 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015275
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein Ycf2-like
Genome location: chr12:9471813..9473555
RNA-Seq Expression: Lag0015275
Synteny: Lag0015275
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR015410 - Domain of unknown function DUF1985
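
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts as shown below; the function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr12:9471813..9473555")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1743 bp on chr12.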


Homology
GenBank top hits (e value, % identity, alignment)
KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa]; e value: 2.7e-65; % identity: 39.02
Query:  KASARLQAVGVTPKGRAIEDKTPIDLHSEEDTVDKTDEAVKSKKPESSKRKSKSTRDKKRKRDKKSPDKSEKRPGKILGKRERRFGRKLRKKAPAALRCD
        + S RL+A G+T   +++         S E+ ++         K ES +   K  R  + K+++ +  K ++    +  KR  R   K +K+    +   
Subjt:  KASARLQAVGVTPKGRAIEDKTPIDLHSEEDTVDKTDEAVKSKKPESSKRKSKSTRDKKRKRDKKSPDKSEKRPGKILGKRERRFGRKLRKKAPAALRCD

Query:  FFMTSVSVYTYSESSESATESDDTGE-QEDSASHSATGSRTRTVSTMRHEPRNTKNPQKKKD-MYLMPKARRTQPLKILLHAKPDVIEKIRTNLPER---
            +V   + S +SE   ++    + +E      A     +     R + +   NP+K +D +YLM   RR +PLKI LH K  VIEKI+ NL +R   
Subjt:  FFMTSVSVYTYSESSESATESDDTGE-QEDSASHSATGSRTRTVSTMRHEPRNTKNPQKKKD-MYLMPKARRTQPLKILLHAKPDVIEKIRTNLPER---

Query:  -------------------------VLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAF
                                 +++R C PKS +++ FLIGG+VL+FGLREFALITGL C   P I+ E + G   +K  YFE  KTV+R+YLN+ F
Subjt:  -------------------------VLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAF

Query:  NVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTE
        N++   TDDD +K+A LYFLESFL+ +QE  +V+ DHI MVDD+E+F+ YPWGR AFELL+ +M++ +  KG TGI+MGGFIF +LAWAY+VIPTLS+  
Subjt:  NVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTE

Query:  NCFARKMNARTPRIISWAVDVQPKWKDL
        N F  +++   PRII+ A D QPKWKDL
Subjt:  NCFARKMNARTPRIISWAVDVQPKWKDL

KAA0051382.1 protein Ycf2-like [Cucumis melo var. makuwa]; e value: 1.1e-61; % identity: 57.58
Query:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET
        C PKS +++ FLIGG+VL+FGLREFALITGL C   P I+ + +KG   +K  YFE  KTV R+YLN+ FN++   TDDD +K+A LYFLESFL+ +QE 
Subjt:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET

Query:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL
        ++V+ DHI MVDD+E+F+ YPWGR AFELL+ +M++ +  KG TGI++ GFIF +LAWAY+V PTLS+  N FA ++    PRII+WA D QPKWKDL
Subjt:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL

KAA0051565.1 protein Ycf2-like [Cucumis melo var. makuwa]; e value: 4.3e-55; % identity: 46.64
Query:  RHEPRNTKNPQKKKD-MYLMPKARRTQPLKILLHAKPDVIEKIRTNLPERVLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMK
        R + +   NP+K +D +YLM   RR +PLKI LH K  VIEKI+ NL +R++ R                                 C   P I+ E +K
Subjt:  RHEPRNTKNPQKKKD-MYLMPKARRTQPLKILLHAKPDVIEKIRTNLPERVLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMK

Query:  GDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTG
        G   +K  YFE  KTV+R+YLN+ FN++   TDDD +K+A LYFLE FL+ +QE ++V+ D+I MVDD+E+F+ YPWGR AFELL+ +M++A+  KG TG
Subjt:  GDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTG

Query:  ITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL
        I++ GFIF +LAWAY+VIPTLS+  N FA +++   PRII+WA D+QPKWKDL
Subjt:  ITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL

TYK09852.1 protein Ycf2-like [Cucumis melo var. makuwa]; e value: 1.1e-63; % identity: 59.3
Query:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET
        C PKS +++ FLIG +VL+FGLREFALITGL C   P I+ E +KG   +K  YFE  KTV+R+YLN+ FN++   TDDD +K+A LYFLESFL+ +QE 
Subjt:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET

Query:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITM-GGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL
        ++V+ DHI MVDD+E+F+ YPWGR AFELL+ +M++A+  KG TGI+M GGFIF +LAWAY+VIPTLS+  N FA +++   PRII+WA D QPKWKDL
Subjt:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITM-GGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL

TYK23840.1 protein Ycf2-like [Cucumis melo var. makuwa]; e value: 1.1e-61; % identity: 57.58
Query:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET
        C PKS +++ FLIGG+VL+FGLREFALITGL C   P I+ + +KG   +K  YFE  KTV R+YLN+ FN++   TDDD +K+A LYFLESFL+ +QE 
Subjt:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET

Query:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL
        ++V+ DHI MVDD+E+F+ YPWGR AFELL+ +M++ +  KG TGI++ GFIF +LAWAY+V PTLS+  N FA ++    PRII+WA D QPKWKDL
Subjt:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U047 Protein Ycf2-like; e value: 1.3e-65; % identity: 39.02
Query:  KASARLQAVGVTPKGRAIEDKTPIDLHSEEDTVDKTDEAVKSKKPESSKRKSKSTRDKKRKRDKKSPDKSEKRPGKILGKRERRFGRKLRKKAPAALRCD
        + S RL+A G+T   +++         S E+ ++         K ES +   K  R  + K+++ +  K ++    +  KR  R   K +K+    +   
Subjt:  KASARLQAVGVTPKGRAIEDKTPIDLHSEEDTVDKTDEAVKSKKPESSKRKSKSTRDKKRKRDKKSPDKSEKRPGKILGKRERRFGRKLRKKAPAALRCD

Query:  FFMTSVSVYTYSESSESATESDDTGE-QEDSASHSATGSRTRTVSTMRHEPRNTKNPQKKKD-MYLMPKARRTQPLKILLHAKPDVIEKIRTNLPER---
            +V   + S +SE   ++    + +E      A     +     R + +   NP+K +D +YLM   RR +PLKI LH K  VIEKI+ NL +R   
Subjt:  FFMTSVSVYTYSESSESATESDDTGE-QEDSASHSATGSRTRTVSTMRHEPRNTKNPQKKKD-MYLMPKARRTQPLKILLHAKPDVIEKIRTNLPER---

Query:  -------------------------VLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAF
                                 +++R C PKS +++ FLIGG+VL+FGLREFALITGL C   P I+ E + G   +K  YFE  KTV+R+YLN+ F
Subjt:  -------------------------VLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAF

Query:  NVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTE
        N++   TDDD +K+A LYFLESFL+ +QE  +V+ DHI MVDD+E+F+ YPWGR AFELL+ +M++ +  KG TGI+MGGFIF +LAWAY+VIPTLS+  
Subjt:  NVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTE

Query:  NCFARKMNARTPRIISWAVDVQPKWKDL
        N F  +++   PRII+ A D QPKWKDL
Subjt:  NCFARKMNARTPRIISWAVDVQPKWKDL

A0A5A7U6E1 Protein Ycf2-like; e value: 5.1e-62; % identity: 57.58
Query:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET
        C PKS +++ FLIGG+VL+FGLREFALITGL C   P I+ + +KG   +K  YFE  KTV R+YLN+ FN++   TDDD +K+A LYFLESFL+ +QE 
Subjt:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET

Query:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL
        ++V+ DHI MVDD+E+F+ YPWGR AFELL+ +M++ +  KG TGI++ GFIF +LAWAY+V PTLS+  N FA ++    PRII+WA D QPKWKDL
Subjt:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL

A0A5A7UD30 Protein Ycf2-like; e value: 2.1e-55; % identity: 46.64
Query:  RHEPRNTKNPQKKKD-MYLMPKARRTQPLKILLHAKPDVIEKIRTNLPERVLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMK
        R + +   NP+K +D +YLM   RR +PLKI LH K  VIEKI+ NL +R++ R                                 C   P I+ E +K
Subjt:  RHEPRNTKNPQKKKD-MYLMPKARRTQPLKILLHAKPDVIEKIRTNLPERVLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMK

Query:  GDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTG
        G   +K  YFE  KTV+R+YLN+ FN++   TDDD +K+A LYFLE FL+ +QE ++V+ D+I MVDD+E+F+ YPWGR AFELL+ +M++A+  KG TG
Subjt:  GDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTG

Query:  ITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL
        I++ GFIF +LAWAY+VIPTLS+  N FA +++   PRII+WA D+QPKWKDL
Subjt:  ITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL

A0A5D3CEX9 Protein Ycf2-like; e value: 5.5e-64; % identity: 59.3
Query:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET
        C PKS +++ FLIG +VL+FGLREFALITGL C   P I+ E +KG   +K  YFE  KTV+R+YLN+ FN++   TDDD +K+A LYFLESFL+ +QE 
Subjt:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET

Query:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITM-GGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL
        ++V+ DHI MVDD+E+F+ YPWGR AFELL+ +M++A+  KG TGI+M GGFIF +LAWAY+VIPTLS+  N FA +++   PRII+WA D QPKWKDL
Subjt:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITM-GGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL

A0A5D3DKA6 Protein Ycf2-like; e value: 5.1e-62; % identity: 57.58
Query:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET
        C PKS +++ FLIGG+VL+FGLREFALITGL C   P I+ + +KG   +K  YFE  KTV R+YLN+ FN++   TDDD +K+A LYFLESFL+ +QE 
Subjt:  CSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQET

Query:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL
        ++V+ DHI MVDD+E+F+ YPWGR AFELL+ +M++ +  KG TGI++ GFIF +LAWAY+V PTLS+  N FA ++    PRII+WA D QPKWKDL
Subjt:  VNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G31150.1 Domain of unknown function (DUF1985); e value: 1.7e-12; % identity: 30.82
Query:  VLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLK------------
        +L RQ   K   E+ F+ GG  ++F +REF ++TGL CG  PT D+        +KKH  +  K +S    N  F   +  T  D+L+            
Subjt:  VLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLK------------

Query:  --VALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAF
          +AL+  ++  ++   ++  V +D +EM++D + F  YPWGR AF
Subjt:  --VALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAF

AT2G06420.1 Domain of unknown function (DUF1985); e value: 6.1e-07; % identity: 26.03
Query:  VLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKE-QMKGDCHIKKHYFEG----EKTVSRRYLNMAFNVNKNATDDDMLKVALLYFL
        +L R    K   E  F++ G  +++G+ E ALI+G NC  Y  I    +++ +   KK +F+      + V  + + M   V       + L++ +LYFL
Subjt:  VLERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKE-QMKGDCHIKKHYFEG----EKTVSRRYLNMAFNVNKNATDDDMLKVALLYFL

Query:  ESFLLARQET----VNVNMDHIEMVDDEELFNAYPWGRCAFELLLG
         + ++A  +T      V+   ++ V D      + WGR +F+ +LG
Subjt:  ESFLLARQET----VNVNMDHIEMVDDEELFNAYPWGRCAFELLLG

AT4G08430.1 Ulp1 protease family protein; e value: 4.7e-07; % identity: 25.56
Query:  LIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVS----RRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDH
        LI  + ++F L EF  ITGLNC  +   D     G+   K  + E    +S       L   F ++K  + +  + V  L      +        V +  
Subjt:  LIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVS----RRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDH

Query:  IEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISW
         + V D   F  YPWGR AF+ LL  +    +        + G +  LL W Y+ +P +   E C  RK       ++ W
Subjt:  IEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISW

AT5G28810.1 Domain of unknown function (DUF1985); e value: 6.1e-07; % identity: 24.42
Query:  KVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEE
        K ++F L EF  ITGLNC  +   D E                       L   F ++K  + +  + V  L  L   +        V +   + V D  
Subjt:  KVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEE

Query:  LFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISW
         F  YPWGR AF+ LL  +   ++        + G +  LL W Y+ +P +   E C   K       ++ W
Subjt:  LFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISW

AT5G45570.1 Ulp1 protease family protein; e value: 8.5e-09; % identity: 25.64
Query:  LERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVS----RRYLNMAFNVNKNATDDDMLKVALLYFLES
        L  Q    +  E+  LI  + ++F L EF  ITGLNC  +   D     G+   K  + E   ++S       L   F ++K  + +  + V  L  L  
Subjt:  LERQCSPKSATEINFLIGGKVLKFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVS----RRYLNMAFNVNKNATDDDMLKVALLYFLES

Query:  FLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISW
         +        V +   + V D   F  YPWGR AF+ L   +   ++        + G +  LL W Y+ +P +   E C  RK N     ++ W
Subjt:  FLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFELLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISW


Sequences
CDS sequence
ATGGTTAAAGCAAGCGCAAGGCTGCAAGCCGTCGGAGTGACGCCGAAAGGAAGAGCTATTGAAGACAAGACGCCAATAGACTTACATAGTGAAGAAGACACCGTTGATAA
AACGGATGAAGCTGTTAAGTCTAAGAAACCAGAGAGTTCGAAACGAAAGTCGAAGTCAACAAGGGACAAGAAGAGGAAACGAGACAAAAAAAGTCCTGACAAAAGCGAAA
AGAGACCAGGAAAGATTCTAGGAAAAAGGGAAAGGAGGTTCGGAAGGAAGCTTCGAAAGAAAGCTCCAGCGGCACTGAGGTGTGATTTTTTTATGACGTCTGTTTCTGTA
TACACATATTCTGAGAGTAGCGAGAGCGCAACCGAAAGCGATGACACAGGCGAACAAGAGGATTCAGCAAGCCATTCGGCCACGGGGTCTAGGACTAGGACTGTGTCTAC
TATGAGACATGAGCCGCGCAACACAAAGAACCCCCAGAAGAAAAAAGATATGTACTTGATGCCAAAGGCAAGACGAACCCAACCCCTTAAGATCTTGTTGCATGCCAAGC
CTGATGTGATCGAGAAAATTCGGACAAATTTGCCAGAACGAGTTTTGGAGAGGCAGTGTAGCCCCAAAAGTGCAACTGAAATAAATTTCTTGATCGGAGGGAAGGTACTG
AAGTTCGGACTTAGAGAGTTCGCACTGATCACGGGGCTGAACTGCGGTCCATACCCCACCATAGACAAAGAGCAGATGAAAGGCGACTGTCACATCAAAAAACATTACTT
TGAGGGGGAGAAGACTGTCAGTCGAAGGTATTTGAATATGGCGTTTAACGTTAACAAAAACGCAACGGACGATGATATGCTGAAAGTGGCGCTCTTGTATTTCTTGGAAA
GCTTTCTTCTGGCCAGACAAGAGACTGTGAATGTCAATATGGATCACATAGAGATGGTTGACGACGAGGAGCTCTTTAACGCTTATCCTTGGGGAAGGTGCGCATTCGAG
TTATTACTAGGATACATGCACAAGGCCTTGATTGGTAAAGGCACAACTGGAATCACGATGGGTGGGTTTATATTCCGCCTACTTGCATGGGCATACAAGGTCATCCCGAC
CTTAAGTTCAACAGAGAATTGTTTTGCCCGCAAGATGAATGCACGCACGCCGAGGATAATCAGTTGGGCTGTAGACGTGCAACCAAAATGGAAAGACCTAGACGTGCACG
CCGAGGATAATCAGTTGTTTTGCCCCCTGTTACGATACGTAGACACTTGA
mRNA sequence
ATGGTTAAAGCAAGCGCAAGGCTGCAAGCCGTCGGAGTGACGCCGAAAGGAAGAGCTATTGAAGACAAGACGCCAATAGACTTACATAGTGAAGAAGACACCGTTGATAA
AACGGATGAAGCTGTTAAGTCTAAGAAACCAGAGAGTTCGAAACGAAAGTCGAAGTCAACAAGGGACAAGAAGAGGAAACGAGACAAAAAAAGTCCTGACAAAAGCGAAA
AGAGACCAGGAAAGATTCTAGGAAAAAGGGAAAGGAGGTTCGGAAGGAAGCTTCGAAAGAAAGCTCCAGCGGCACTGAGGTGTGATTTTTTTATGACGTCTGTTTCTGTA
TACACATATTCTGAGAGTAGCGAGAGCGCAACCGAAAGCGATGACACAGGCGAACAAGAGGATTCAGCAAGCCATTCGGCCACGGGGTCTAGGACTAGGACTGTGTCTAC
TATGAGACATGAGCCGCGCAACACAAAGAACCCCCAGAAGAAAAAAGATATGTACTTGATGCCAAAGGCAAGACGAACCCAACCCCTTAAGATCTTGTTGCATGCCAAGC
CTGATGTGATCGAGAAAATTCGGACAAATTTGCCAGAACGAGTTTTGGAGAGGCAGTGTAGCCCCAAAAGTGCAACTGAAATAAATTTCTTGATCGGAGGGAAGGTACTG
AAGTTCGGACTTAGAGAGTTCGCACTGATCACGGGGCTGAACTGCGGTCCATACCCCACCATAGACAAAGAGCAGATGAAAGGCGACTGTCACATCAAAAAACATTACTT
TGAGGGGGAGAAGACTGTCAGTCGAAGGTATTTGAATATGGCGTTTAACGTTAACAAAAACGCAACGGACGATGATATGCTGAAAGTGGCGCTCTTGTATTTCTTGGAAA
GCTTTCTTCTGGCCAGACAAGAGACTGTGAATGTCAATATGGATCACATAGAGATGGTTGACGACGAGGAGCTCTTTAACGCTTATCCTTGGGGAAGGTGCGCATTCGAG
TTATTACTAGGATACATGCACAAGGCCTTGATTGGTAAAGGCACAACTGGAATCACGATGGGTGGGTTTATATTCCGCCTACTTGCATGGGCATACAAGGTCATCCCGAC
CTTAAGTTCAACAGAGAATTGTTTTGCCCGCAAGATGAATGCACGCACGCCGAGGATAATCAGTTGGGCTGTAGACGTGCAACCAAAATGGAAAGACCTAGACGTGCACG
CCGAGGATAATCAGTTGTTTTGCCCCCTGTTACGATACGTAGACACTTGA
Protein sequence
MVKASARLQAVGVTPKGRAIEDKTPIDLHSEEDTVDKTDEAVKSKKPESSKRKSKSTRDKKRKRDKKSPDKSEKRPGKILGKRERRFGRKLRKKAPAALRCDFFMTSVSV
YTYSESSESATESDDTGEQEDSASHSATGSRTRTVSTMRHEPRNTKNPQKKKDMYLMPKARRTQPLKILLHAKPDVIEKIRTNLPERVLERQCSPKSATEINFLIGGKVL
KFGLREFALITGLNCGPYPTIDKEQMKGDCHIKKHYFEGEKTVSRRYLNMAFNVNKNATDDDMLKVALLYFLESFLLARQETVNVNMDHIEMVDDEELFNAYPWGRCAFE
LLLGYMHKALIGKGTTGITMGGFIFRLLAWAYKVIPTLSSTENCFARKMNARTPRIISWAVDVQPKWKDLDVHAEDNQLFCPLLRYVDT
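
The CDS and protein sequences above can be cross-checked against each other by translating the CDS. The sketch below is one way to do that; it assumes the standard genetic code (NCBI translation table 1) applies to this gene, and the helper name `translate` is hypothetical rather than part of any CuGenDB tool. Only short fragments copied from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with the bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons with ambiguous
    bases (for example N) raise KeyError in this simple sketch.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGTTAAAGCAAGCGCA"))  # MVKASA
# Last three codons of the CDS, ending in the TGA stop codon.
print(translate("GACACTTGA"))  # DT
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on a downloaded record.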