; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg035814 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg035814
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold5:40174164..40183147
RNA-Seq ExpressionSpg035814
SyntenySpg035814
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037444.1 hypothetical protein E6C27_scaffold277G00300 [Cucumis melo var. makuwa]3.2e-1334.36Show/hide
Query:  SLADVVKLGVSSKKSISLDG-STKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFSL
        S A VV  G  S  S S D   + ++   + NS++   + D+    LEN++V+ R   H  W  +   L +    S   NAF  +KAL+  +  I    L
Subjt:  SLADVVKLGVSSKKSISLDG-STKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFSL

Query:  --DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI
          +  W   G   ++ E WSS  H+ PK + SYG W   R +PL+LWN  +F+ IGK  GGLI
Subjt:  --DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI

KAA0047189.1 hypothetical protein E6C27_scaffold83G00690 [Cucumis melo var. makuwa]1.2e-1530.65Show/hide
Query:  NENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVG-ISDFSL-DGEWNKFGDLHLKLELWSSENHSQPKF
        ++   WV +N +V+  + EN  ++++L      + ++ +LE YF + I+IN   D+ ALI +  G I D    +G+W   G  +LK E W    +S+P  
Subjt:  NENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVG-ISDFSL-DGEWNKFGDLHLKLELWSSENHSQPKF

Query:  MKSYGRWIAIRNLPLNLW----NRASFEA---IGKNFGGLIGNKLE--------FSLRYGDINALEDRNSKFDLSKELSANDFSNSLDILRVKQVVLDE
        MK YG W+ I+NL   LW       S EA   +  N  G + + +E          L +GD   L   N  F  S  +  +DF  S+ +LR+ +V+ DE
Subjt:  MKSYGRWIAIRNLPLNLW----NRASFEA---IGKNFGGLIGNKLE--------FSLRYGDINALEDRNSKFDLSKELSANDFSNSLDILRVKQVVLDE

KAA0056565.1 hypothetical protein E6C27_scaffold288G00700 [Cucumis melo var. makuwa]2.1e-1234.15Show/hide
Query:  SYSLADVVKLGVSSKKSISLDGSTKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFS
        SY+ A V +   +S  S S D  T ++   + NS+    + D+    LEN++V+ R   H  W  +   L +    S   NAF  +KAL+  +  I    
Subjt:  SYSLADVVKLGVSSKKSISLDGSTKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFS

Query:  L--DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI
        L  +  W   G   +K E WSS  H+ PK + SYG W   R +PL+LWN  +F+ +GK  GGLI
Subjt:  L--DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia]5.5e-1336Show/hide
Query:  NWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQV-AVGISDFSLDGE-WNKFGDLHLKLELWSSENHSQPKFMKSYGRWIA
        N +V  ++ E ++V++R   H  W  +   ++E   SS +IN F  DKAL++  +  ++   L  + W  FG + +KLE W+   H +     SYG W+ 
Subjt:  NWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQV-AVGISDFSLDGE-WNKFGDLHLKLELWSSENHSQPKFMKSYGRWIA

Query:  IRNLPLNLWNRASFEAIGKNFGGLI
        IRN+PL+LW+ A+F+AIG   GG I
Subjt:  IRNLPLNLWNRASFEAIGKNFGGLI

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida]1.6e-1231.25Show/hide
Query:  SLDGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLIGNKLE--------------------------------
        ++ G+W KFG  HLK E W++  H +P +++ YG WI+I+NLPL+ W + +FEAIGK FGGL    +E                                
Subjt:  SLDGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLIGNKLE--------------------------------

Query:  ---FSLRYGDINALEDRNSKFDLSKELSANDFSNSLDILRVKQV
             L +GDI+     N    +  +L  +DF+N +D++R+ +V
Subjt:  ---FSLRYGDINALEDRNSKFDLSKELSANDFSNSLDILRVKQV

TrEMBL top hitse value%identityAlignment
A0A5A7U128 Uncharacterized protein5.8e-1630.65Show/hide
Query:  NENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVG-ISDFSL-DGEWNKFGDLHLKLELWSSENHSQPKF
        ++   WV +N +V+  + EN  ++++L      + ++ +LE YF + I+IN   D+ ALI +  G I D    +G+W   G  +LK E W    +S+P  
Subjt:  NENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVG-ISDFSL-DGEWNKFGDLHLKLELWSSENHSQPKF

Query:  MKSYGRWIAIRNLPLNLW----NRASFEA---IGKNFGGLIGNKLE--------FSLRYGDINALEDRNSKFDLSKELSANDFSNSLDILRVKQVVLDE
        MK YG W+ I+NL   LW       S EA   +  N  G + + +E          L +GD   L   N  F  S  +  +DF  S+ +LR+ +V+ DE
Subjt:  MKSYGRWIAIRNLPLNLW----NRASFEA---IGKNFGGLIGNKLE--------FSLRYGDINALEDRNSKFDLSKELSANDFSNSLDILRVKQVVLDE

A0A5A7UST0 DUF4283 domain-containing protein1.0e-1234.15Show/hide
Query:  SYSLADVVKLGVSSKKSISLDGSTKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFS
        SY+ A V +   +S  S S D  T ++   + NS+    + D+    LEN++V+ R   H  W  +   L +    S   NAF  +KAL+  +  I    
Subjt:  SYSLADVVKLGVSSKKSISLDGSTKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFS

Query:  L--DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI
        L  +  W   G   +K E WSS  H+ PK + SYG W   R +PL+LWN  +F+ +GK  GGLI
Subjt:  L--DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI

A0A5D3BSE0 DUF4283 domain-containing protein1.6e-1334.36Show/hide
Query:  SLADVVKLGVSSKKSISLDG-STKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFSL
        S A VV  G  S  S S D   + ++   + NS++   + D+    LEN++V+ R   H  W  +   L +    S   NAF  +KAL+  +  I    L
Subjt:  SLADVVKLGVSSKKSISLDG-STKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFSL

Query:  --DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI
          +  W   G   ++ E WSS  H+ PK + SYG W   R +PL+LWN  +F+ IGK  GGLI
Subjt:  --DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI

A0A5D3DKV0 DUF4283 domain-containing protein1.0e-1234.15Show/hide
Query:  SYSLADVVKLGVSSKKSISLDGSTKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFS
        SY+ A V +   +S  S S D  T ++   + NS+    + D+    LEN++V+ R   H  W  +   L +    S   NAF  +KAL+  +  I    
Subjt:  SYSLADVVKLGVSSKKSISLDGSTKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFS

Query:  L--DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI
        L  +  W   G   +K E WSS  H+ PK + SYG W   R +PL+LWN  +F+ +GK  GGLI
Subjt:  L--DGEWNKFGDLHLKLELWSSENHSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLI

A0A6J1D6X4 uncharacterized protein LOC1110181862.7e-1336Show/hide
Query:  NWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQV-AVGISDFSLDGE-WNKFGDLHLKLELWSSENHSQPKFMKSYGRWIA
        N +V  ++ E ++V++R   H  W  +   ++E   SS +IN F  DKAL++  +  ++   L  + W  FG + +KLE W+   H +     SYG W+ 
Subjt:  NWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQV-AVGISDFSLDGE-WNKFGDLHLKLELWSSENHSQPKFMKSYGRWIA

Query:  IRNLPLNLWNRASFEAIGKNFGGLI
        IRN+PL+LW+ A+F+AIG   GG I
Subjt:  IRNLPLNLWNRASFEAIGKNFGGLI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGAGTAGGAAATGGATACCATAAAGGAATTCTCGAGAATGAAGGTTATAGGTCTGATCCATCTATTGTTAGTCACTCGGAGGACGAATCGTTCTTATCTTCCCC
AGCTTTGAAAGCCCAAGATGATCCTAGCTCTAGGTTTTTGTTTGAACAAACCCAAGAGAATCCAACCGTGGGTCGAGAGGAGGAGAGCTCACGTTCGAAAGGGGGGATAG
GATCCATGCCAGCTCATGCGTCTTGTGCCGAGGGTAGAGGTGGATTAGTGGGAGGAGTCCAACCGTTTGCTTGTGAGGTTGTAGTGAATGATCCCAACTCCTTTTGTCCT
CAAGGGGCGGTGATTCCGCAGAAGGAAAAGGAAATTTCCACAGAAGGCAATCATAATTCCCCAAAGAAAGGTAAGGTCGGGAGTAAAGCGAGAGGGGGTGGGAAAATCAA
GGGAGCTGGTGAAGAGTTTGTGGAATCTGTTGCTGGGCGACTTGAGGGAGCGAATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACGTCACAGCTCGAACGCAAC
ACAAGACAGAAGGGGAGAACGGAGTTTTGAGAGGAGTTAGGGGAGCTCTATGGTTTATGTGTTTGGAGTTGTTCAGAGGTCTGGTTTTCAGGACGGCCTTCGAGGAGCTT
CTTTTTCATCCACCCGTTCGTGAAAAAAGGGCACTTTTTGTGGCAGGCTGGGGTTTGTACAATTATTTGGGGTCTCCTTCATCCGTCGGAACAGTGATCGTTTGTCAGTG
CTCTTTCCTCGCCATTCCCGTTCATAGTCTTTCTCCATTGAATCGTGTTCCTGCTATAGCCGACTTTTTGTTGTTGCTCTCGTCGTCTTTTTCTGGCTTTTTCATCCTCC
AGCAAATTCCGCCATTATCAAAAATATTGAAGGGTTATTCGCTTTTAAGTGATGAATCAGTAGAAAATGCCTCTTCATATTCCTTAGCTGATGTGGTCAAGCTAGGTGTC
TCTAGTAAGAAATCCATTTCTTTGGATGGTTCAACCAAAAATGCTAAGTTCTTTAATGAAAATTCTTATTGGGTTCAAAAAAATTGGGATGTGCTAGATATAGATTTGGA
AAACTCTCTTGTTGTCTCTAGATTGGCGGTCCATTACTCTTGGAAAGATGTTAAGATGGTCCTTGAGGAGTATTTTCATTCTTCAATCTTGATCAACGCTTTTATGGACG
ATAAAGCCTTAATTCAGGTGGCTGTTGGCATCTCTGACTTCTCTTTGGATGGTGAGTGGAACAAATTTGGGGACCTTCATTTGAAACTAGAACTTTGGTCTTCTGAGAAT
CACTCCCAACCGAAATTTATGAAAAGTTATGGACGATGGATTGCAATCAGGAATTTACCCTTAAATTTGTGGAATCGGGCATCCTTTGAAGCAATTGGAAAGAACTTTGG
AGGGTTGATTGGTAATAAGCTTGAATTCTCTCTTCGTTATGGTGATATTAACGCATTAGAGGATAGGAATTCTAAGTTTGATTTAAGTAAAGAGTTATCAGCTAATGACT
TTTCGAATTCCCTGGATATATTAAGGGTCAAGCAAGTTGTTTTGGATGAAGAGTTGGCTATTTTTAATGAAGGAGAGAGGGAGGCCGAATTGCCTTTTATTTCTTGTTAT
CAGGAGGAATTTAATGAGGCGTTGGGTTCTCCAAAAGTTGCATCGTCGCATGATGAGCATATTAATAACATGGGCTGTAATGGACCTCCTTCCATGAAGATTAATGACGG
TATATGCAATATAAAAGATGATTTCCAGCAGGCTTTGGGATTGCCAAATTTCAGTGCTAGAAATGGCTTTGTTCAGTCCAAGGATTTTATGGAATCCTCCATTCAGAGTC
CTAGGGAAAGAGACCTCTTTAAAGAGGCGTTGGGTTCCCCAATGGCGCTTCTTTGCATGAAGAGTGTATTAATATCGCTGGCTCCATCAGGAAGTCCAGTTTATCATCCT
TCCAAGAAGTTTAATGCTGTTAATGTTGTTAATTGCAATTTAATAGATGATGTCCAACAGGTAACATTAAAGACTTATTCTCGAAAAAAGGCTTCTTTTTCGGAGAATGT
ATTGCCTTTATGTTCTCGGTATCAAGGGGAAATTAATGAGGTTTTGGGTTCTCCAAAGGGTGCTTTGATGCATGAAGAGGGCATTAATAACGTTGGTTGTATGAGCTTTA
ATGATAGCATTCAAGAGATGGATCCCATTCTCCCTCCTTCTAAAGTTATTAATGATCATAAATGTTCCAGCCCTAAAGAAGTCCAGTTGCCATTGTTTGTTGATTCTCCT
CCCAAAGATATTAATGATGATGTTAGCATTGTAAATGCGTGCATTCAGCAGGCATCTTTAAAGACTTATTCTCGAAAAAAGGGGTCTCAGTTTTTGGTTGAAAAGGTCAA
TTTTAATGCTGATCATTTGGAATCTGAATGTACTAAAATGATTGTTTCAAAAAATGTGTTGGGATCTTCAAAAGTCAGTGGTGAAGAAAATAGGTTACCAAGGTCCAAGG
AATTTATTGAATCGTCTGTGCGAATCCCAGGGGTGAAAAATTATTTTGTCAGAGGGATTGCTTGTTCTTCCAACCCTAAAGTTCACTCTTCCTTGGATTCAGATGATGAG
TCTTCGGTTAGTGTGAGTAGTGAGGAATCTGAGTCTTCGCTTGATGAAGAAGATTGTGTGGAGCTCCTCCCAGAAGACCAAATTAGTGAGTCTTTGGCTTCTTTTTTTTT
TCCAGAGGATGATGATGGTGTAGGTAATCAAGATGTAATAACTCAAGATCCTTTGCTATCTCCTTCTCAAATTCCTAATCAGTTCTCTTTATTAGTGGAATCTTGTGGAC
TTCAATTGTGCAAAATCTCTCCTCATTCATCTAAAGTCTTAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGGAGTAGGAAATGGATACCATAAAGGAATTCTCGAGAATGAAGGTTATAGGTCTGATCCATCTATTGTTAGTCACTCGGAGGACGAATCGTTCTTATCTTCCCC
AGCTTTGAAAGCCCAAGATGATCCTAGCTCTAGGTTTTTGTTTGAACAAACCCAAGAGAATCCAACCGTGGGTCGAGAGGAGGAGAGCTCACGTTCGAAAGGGGGGATAG
GATCCATGCCAGCTCATGCGTCTTGTGCCGAGGGTAGAGGTGGATTAGTGGGAGGAGTCCAACCGTTTGCTTGTGAGGTTGTAGTGAATGATCCCAACTCCTTTTGTCCT
CAAGGGGCGGTGATTCCGCAGAAGGAAAAGGAAATTTCCACAGAAGGCAATCATAATTCCCCAAAGAAAGGTAAGGTCGGGAGTAAAGCGAGAGGGGGTGGGAAAATCAA
GGGAGCTGGTGAAGAGTTTGTGGAATCTGTTGCTGGGCGACTTGAGGGAGCGAATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACGTCACAGCTCGAACGCAAC
ACAAGACAGAAGGGGAGAACGGAGTTTTGAGAGGAGTTAGGGGAGCTCTATGGTTTATGTGTTTGGAGTTGTTCAGAGGTCTGGTTTTCAGGACGGCCTTCGAGGAGCTT
CTTTTTCATCCACCCGTTCGTGAAAAAAGGGCACTTTTTGTGGCAGGCTGGGGTTTGTACAATTATTTGGGGTCTCCTTCATCCGTCGGAACAGTGATCGTTTGTCAGTG
CTCTTTCCTCGCCATTCCCGTTCATAGTCTTTCTCCATTGAATCGTGTTCCTGCTATAGCCGACTTTTTGTTGTTGCTCTCGTCGTCTTTTTCTGGCTTTTTCATCCTCC
AGCAAATTCCGCCATTATCAAAAATATTGAAGGGTTATTCGCTTTTAAGTGATGAATCAGTAGAAAATGCCTCTTCATATTCCTTAGCTGATGTGGTCAAGCTAGGTGTC
TCTAGTAAGAAATCCATTTCTTTGGATGGTTCAACCAAAAATGCTAAGTTCTTTAATGAAAATTCTTATTGGGTTCAAAAAAATTGGGATGTGCTAGATATAGATTTGGA
AAACTCTCTTGTTGTCTCTAGATTGGCGGTCCATTACTCTTGGAAAGATGTTAAGATGGTCCTTGAGGAGTATTTTCATTCTTCAATCTTGATCAACGCTTTTATGGACG
ATAAAGCCTTAATTCAGGTGGCTGTTGGCATCTCTGACTTCTCTTTGGATGGTGAGTGGAACAAATTTGGGGACCTTCATTTGAAACTAGAACTTTGGTCTTCTGAGAAT
CACTCCCAACCGAAATTTATGAAAAGTTATGGACGATGGATTGCAATCAGGAATTTACCCTTAAATTTGTGGAATCGGGCATCCTTTGAAGCAATTGGAAAGAACTTTGG
AGGGTTGATTGGTAATAAGCTTGAATTCTCTCTTCGTTATGGTGATATTAACGCATTAGAGGATAGGAATTCTAAGTTTGATTTAAGTAAAGAGTTATCAGCTAATGACT
TTTCGAATTCCCTGGATATATTAAGGGTCAAGCAAGTTGTTTTGGATGAAGAGTTGGCTATTTTTAATGAAGGAGAGAGGGAGGCCGAATTGCCTTTTATTTCTTGTTAT
CAGGAGGAATTTAATGAGGCGTTGGGTTCTCCAAAAGTTGCATCGTCGCATGATGAGCATATTAATAACATGGGCTGTAATGGACCTCCTTCCATGAAGATTAATGACGG
TATATGCAATATAAAAGATGATTTCCAGCAGGCTTTGGGATTGCCAAATTTCAGTGCTAGAAATGGCTTTGTTCAGTCCAAGGATTTTATGGAATCCTCCATTCAGAGTC
CTAGGGAAAGAGACCTCTTTAAAGAGGCGTTGGGTTCCCCAATGGCGCTTCTTTGCATGAAGAGTGTATTAATATCGCTGGCTCCATCAGGAAGTCCAGTTTATCATCCT
TCCAAGAAGTTTAATGCTGTTAATGTTGTTAATTGCAATTTAATAGATGATGTCCAACAGGTAACATTAAAGACTTATTCTCGAAAAAAGGCTTCTTTTTCGGAGAATGT
ATTGCCTTTATGTTCTCGGTATCAAGGGGAAATTAATGAGGTTTTGGGTTCTCCAAAGGGTGCTTTGATGCATGAAGAGGGCATTAATAACGTTGGTTGTATGAGCTTTA
ATGATAGCATTCAAGAGATGGATCCCATTCTCCCTCCTTCTAAAGTTATTAATGATCATAAATGTTCCAGCCCTAAAGAAGTCCAGTTGCCATTGTTTGTTGATTCTCCT
CCCAAAGATATTAATGATGATGTTAGCATTGTAAATGCGTGCATTCAGCAGGCATCTTTAAAGACTTATTCTCGAAAAAAGGGGTCTCAGTTTTTGGTTGAAAAGGTCAA
TTTTAATGCTGATCATTTGGAATCTGAATGTACTAAAATGATTGTTTCAAAAAATGTGTTGGGATCTTCAAAAGTCAGTGGTGAAGAAAATAGGTTACCAAGGTCCAAGG
AATTTATTGAATCGTCTGTGCGAATCCCAGGGGTGAAAAATTATTTTGTCAGAGGGATTGCTTGTTCTTCCAACCCTAAAGTTCACTCTTCCTTGGATTCAGATGATGAG
TCTTCGGTTAGTGTGAGTAGTGAGGAATCTGAGTCTTCGCTTGATGAAGAAGATTGTGTGGAGCTCCTCCCAGAAGACCAAATTAGTGAGTCTTTGGCTTCTTTTTTTTT
TCCAGAGGATGATGATGGTGTAGGTAATCAAGATGTAATAACTCAAGATCCTTTGCTATCTCCTTCTCAAATTCCTAATCAGTTCTCTTTATTAGTGGAATCTTGTGGAC
TTCAATTGTGCAAAATCTCTCCTCATTCATCTAAAGTCTTAGCTTGA
Protein sequenceShow/hide protein sequence
MDGVGNGYHKGILENEGYRSDPSIVSHSEDESFLSSPALKAQDDPSSRFLFEQTQENPTVGREEESSRSKGGIGSMPAHASCAEGRGGLVGGVQPFACEVVVNDPNSFCP
QGAVIPQKEKEISTEGNHNSPKKGKVGSKARGGGKIKGAGEEFVESVAGRLEGANSVLQQNWEQNCHVTARTQHKTEGENGVLRGVRGALWFMCLELFRGLVFRTAFEEL
LFHPPVREKRALFVAGWGLYNYLGSPSSVGTVIVCQCSFLAIPVHSLSPLNRVPAIADFLLLLSSSFSGFFILQQIPPLSKILKGYSLLSDESVENASSYSLADVVKLGV
SSKKSISLDGSTKNAKFFNENSYWVQKNWDVLDIDLENSLVVSRLAVHYSWKDVKMVLEEYFHSSILINAFMDDKALIQVAVGISDFSLDGEWNKFGDLHLKLELWSSEN
HSQPKFMKSYGRWIAIRNLPLNLWNRASFEAIGKNFGGLIGNKLEFSLRYGDINALEDRNSKFDLSKELSANDFSNSLDILRVKQVVLDEELAIFNEGEREAELPFISCY
QEEFNEALGSPKVASSHDEHINNMGCNGPPSMKINDGICNIKDDFQQALGLPNFSARNGFVQSKDFMESSIQSPRERDLFKEALGSPMALLCMKSVLISLAPSGSPVYHP
SKKFNAVNVVNCNLIDDVQQVTLKTYSRKKASFSENVLPLCSRYQGEINEVLGSPKGALMHEEGINNVGCMSFNDSIQEMDPILPPSKVINDHKCSSPKEVQLPLFVDSP
PKDINDDVSIVNACIQQASLKTYSRKKGSQFLVEKVNFNADHLESECTKMIVSKNVLGSSKVSGEENRLPRSKEFIESSVRIPGVKNYFVRGIACSSNPKVHSSLDSDDE
SSVSVSSEESESSLDEEDCVELLPEDQISESLASFFFPEDDDGVGNQDVITQDPLLSPSQIPNQFSLLVESCGLQLCKISPHSSKVLA