CuGenDBv2

Tan0022847 (gene) of Snake gourd v1 genome

Gene ID: Tan0022847
Organism: Trichosanthes anguina (Snake gourd v1)
Description: N-(5-amino-5-carboxypentanoyl)-L-cysteinyl-D-valine synthase
Genome location: LG03:66578438..66581050
RNA-Seq Expression: Tan0022847
Synteny: Tan0022847
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
    IPR035979 - RNA-binding domain superfamily
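
The genome location field uses a `SEQID:START..END` layout. A minimal sketch of how such a string could be split and its span computed follows. The function names are hypothetical, and the assumption that coordinates are 1-based and inclusive is ours, since the page does not state its convention.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'SEQID:START..END' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates (an assumption)."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG03:66578438..66581050")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive assumption the gene spans 2613 bp on LG03, which is longer than the listed mRNA and would be consistent with the presence of introns.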


Homology
GenBank top hits    e-value    % identity    Alignment
KAG6595284.1 hypothetical protein SDJN03_11837, partial [Cucurbita argyrosperma subsp. sororia]    3.6e-112    86.31
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS
        MS L LLRK L SHFV+PSAR+NHGLPVFF QSPRFFSTEGEQPP E  ADPFLDTSKT GLVYGKL GITRNTLKTDIVNLLEGCNL+LDDVKVEYNRS
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKT--------------VLLQGIPRNASVEDVERFLSGCDYDATSINVFFRAS
        FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKT              VLLQGIPRNA VEDVERFL GCDYDATSIN+FFRAS
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKT--------------VLLQGIPRNASVEDVERFLSGCDYDATSINVFFRAS

Query:  FPDPIRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        FP+P+R+ATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  FPDPIRMATVLFPSPTQAMHAFLTKNRGFCLNNQILMRVLQ

XP_022963065.1 uncharacterized protein LOC111463377 [Cucurbita moschata]    2.9e-114    90.75
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS
        MS L LLRK L SHFV+PSAR+NHGLPVFF QSPRFFSTEGEQPP E  ADPFLDTSKT GLVYGKL GITRNTLKTDIVNLLEGCNL+LDDVKVEYNRS
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS
        FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWD+LSPY+GKTVLLQGIPRNA VEDVERFL GCDYDATSIN+FFRASFP+P+R+ATVLFPS
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS

Query:  PTQAMHAFLTKNRGFCLNNQILMRVLQ
        PTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  PTQAMHAFLTKNRGFCLNNQILMRVLQ

XP_022972751.1 uncharacterized protein LOC111471264 [Cucurbita maxima]    3.7e-117    92.51
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS
        MS L LLRK LGSHFV+PSAR+NHGLPVFF QSPRFFSTEGEQPP EP AD FLDTSKT GLVYGKLYGITRNTLKTDIVNLLEGCNL+LDDVKVEYNRS
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS
        FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWD+LSPYNGKTVLLQGIPRNA VEDVERFL GCDYDATSIN+FFRASFP+P+RMATVLFPS
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS

Query:  PTQAMHAFLTKNRGFCLNNQILMRVLQ
        PTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  PTQAMHAFLTKNRGFCLNNQILMRVLQ

XP_023518188.1 uncharacterized protein LOC111781730 [Cucurbita pepo subsp. pepo]    1.6e-117    92.51
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS
        MS L LLRK L SHFV+PSAR+NHGLPVFF QSPRFFSTEGEQPPSEP ADPFLDTSKT GLVYGKLYGITRNTLKTDIVNLLEGCNL+LDDVKVEYNRS
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS
        FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWD+LSPYNGKTVLLQGIPRNA VEDVERFL GCDYDATSIN+FFRASFP+P+R+ATVLFPS
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS

Query:  PTQAMHAFLTKNRGFCLNNQILMRVLQ
        PTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  PTQAMHAFLTKNRGFCLNNQILMRVLQ

XP_023544682.1 uncharacterized protein LOC111804194 [Cucurbita pepo subsp. pepo]    1.8e-111    88.55
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS
        M+ LNLLRK LGSHF+S SA  NHGLPVFF QS RFFSTEGEQ P E +AD FLDTS  TGLVYGKLYGITRN LKTDIVNLLEGCNLSLDDVKVEYNRS
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS
        FTPTSMMMQFPSR+AYDNAFRVIGR+GR+YRLERADRSQWDLLSPYNGKTVLLQGIPRNA+++DVERFLSGCDYDATSIN+FFRAS P+PIRMATVLFPS
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS

Query:  PTQAMHAFLTKNRGFCLNNQILMRVLQ
        PTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  PTQAMHAFLTKNRGFCLNNQILMRVLQ

TrEMBL top hits    e-value    % identity    Alignment
A0A6J1CQY2 uncharacterized protein LOC111013764    9.8e-108    86.46
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSP--RFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYN
        MS L+LLRK  GSHF+S + R++HG PVFF +SP  R FSTE EQPPSEP AD FLDTSK TGLVYGKLYGITRNTLKTDIVNLLEGCNL LDDVKV+YN
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSP--RFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYN

Query:  RSFTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLF
        RSFTPTSMMMQFPSRQAYDNA RVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQG+PRNA  EDVERFLSGC+YDATSIN+FFRAS P+P+RMATVLF
Subjt:  RSFTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLF

Query:  PSPTQAMHAFLTKNRGFCLNNQILMRVLQ
        PSPTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  PSPTQAMHAFLTKNRGFCLNNQILMRVLQ

A0A6J1GDZ1 uncharacterized protein LOC111453126    1.8e-109    86.78
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS
        M+ LNLLRK LGSHF S SA  NHGLPVFF QS RFFSTEGEQ P E +A+ FLDTS+T GLVYGKLYGITRN LKTDIVNLLEGCNLSLDDVKVEYNRS
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS
        FTPTSMMMQFPS +AYDNAFRVIGR+GR+YRLERADRSQWDLLSPYNGK +LLQGIPRNA+++DVERFLSGCDYDATSIN+FFRAS P+PIRMATVLFPS
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS

Query:  PTQAMHAFLTKNRGFCLNNQILMRVLQ
        PTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  PTQAMHAFLTKNRGFCLNNQILMRVLQ

A0A6J1HGY5 uncharacterized protein LOC111463377    1.4e-114    90.75
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS
        MS L LLRK L SHFV+PSAR+NHGLPVFF QSPRFFSTEGEQPP E  ADPFLDTSKT GLVYGKL GITRNTLKTDIVNLLEGCNL+LDDVKVEYNRS
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS
        FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWD+LSPY+GKTVLLQGIPRNA VEDVERFL GCDYDATSIN+FFRASFP+P+R+ATVLFPS
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS

Query:  PTQAMHAFLTKNRGFCLNNQILMRVLQ
        PTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  PTQAMHAFLTKNRGFCLNNQILMRVLQ

A0A6J1I6U3 uncharacterized protein LOC111471264    1.8e-117    92.51
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS
        MS L LLRK LGSHFV+PSAR+NHGLPVFF QSPRFFSTEGEQPP EP AD FLDTSKT GLVYGKLYGITRNTLKTDIVNLLEGCNL+LDDVKVEYNRS
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS
        FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWD+LSPYNGKTVLLQGIPRNA VEDVERFL GCDYDATSIN+FFRASFP+P+RMATVLFPS
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS

Query:  PTQAMHAFLTKNRGFCLNNQILMRVLQ
        PTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  PTQAMHAFLTKNRGFCLNNQILMRVLQ

A0A6J1INA4 uncharacterized protein LOC111477976    4.0e-109    86.34
Query:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS
        M+ LNLLRK LGSHF+S SA  NHGLPVFF QS RFFS EGEQ P E +A+ FLDT +T GLVYGKLYGITRN LKTDIVNLLEGCNLSLDDVKVEYNRS
Subjt:  MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRS

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS
        FTPTSMMMQFPSR++YDNAFRVIGR+GR+YRLERADRSQWDLLSPYNGKTVLLQGIPRNA+++DVERFLSGCDYDAT IN+FFRAS P+PIRMATVLFPS
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS

Query:  PTQAMHAFLTKNRGFCLNNQILMRVLQ
        PTQAMHAFLTKNRGFCLNNQILMRVLQ
Subjt:  PTQAMHAFLTKNRGFCLNNQILMRVLQ

SwissProt top hits    e-value    % identity    Alignment
No hits found

Arabidopsis top hits    e-value    % identity    Alignment
AT5G02740.1 Ribosomal protein S24e family protein    2.8e-46    45.37
Query:  TLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNR--S
        T + +R +L    + P   S   LP F  +  +  ST  EQPP      P    S   G  YGK  G +++ LKTDI+N+LEGC+L+ DD+K  Y R  +
Subjt:  TLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNR--S

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS
         TP ++ +QFPS  AYD A R I +KG+LYRLE+A R+QWD + PY GK V L GIP NA  +D++RFLSGC Y   SI            R+A V F S
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPS

Query:  PTQAMHAFLTKNRGFCLNNQILMRVLQ
         TQAM+A++TKNR F LN +I ++VLQ
Subjt:  PTQAMHAFLTKNRGFCLNNQILMRVLQ

AT5G02740.2 Ribosomal protein S24e family protein    7.5e-36    45.81
Query:  TLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNR--S
        T + +R +L    + P   S   LP F  +  +  ST  EQPP      P    S   G  YGK  G +++ LKTDI+N+LEGC+L+ DD+K  Y R  +
Subjt:  TLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNR--S

Query:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSI
         TP ++ +QFPS  AYD A R I +KG+LYRLE+A R+QWD + PY GK V L GIP NA  +D++RFLSGC Y   SI
Subjt:  FTPTSMMMQFPSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSI
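
The % identity column can be related to the alignment blocks: in BLAST-style pairwise output, the middle line carries a letter where the residues are identical, a `+` where the substitution is conservative, and a space otherwise. The sketch below counts these over one concatenated alignment. The function names are hypothetical, and it assumes the query line and the middle line have already been joined across blocks with their prefixes removed.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, alignment_length) for one alignment.

    `query` is the aligned query string (gaps included); `midline` is the
    match line between Query and Subjt. Trailing spaces that were lost in
    extraction are restored by padding the midline to the query length.
    """
    length = len(query)
    padded = midline.ljust(length)[:length]
    identities = sum(1 for ch in padded if ch.isalpha())
    positives = identities + padded.count("+")
    return identities, positives, length


def percent_identity(query: str, midline: str) -> float:
    """Identities divided by alignment length, as a percentage."""
    identities, _, length = midline_stats(query, midline)
    return round(100.0 * identities / length, 2) if length else 0.0


# Toy input, not taken from the hits above.
print(midline_stats("HFVSP", "HFV+P"))
print(percent_identity("MSTLNLLRK", "MS L LLRK"))
```

Gap columns in the query count toward the alignment length here, which follows the usual BLAST convention of reporting identity over the full aligned span.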


Sequences
CDS sequence
ATGTCGACCCTCAATCTGCTCCGTAAGACACTTGGATCGCACTTCGTTTCTCCTTCGGCGAGGTCAAATCATGGCCTCCCTGTGTTTTTCCTCCAATCTCCAAGATTCTT
CTCGACGGAAGGGGAACAACCGCCTTCGGAACCGGCCGCCGATCCATTTCTTGATACATCTAAAACAACAGGTTTGGTATATGGAAAATTGTATGGAATCACAAGGAATA
CACTAAAGACAGACATTGTCAATTTACTTGAAGGATGTAATTTGAGTTTGGATGATGTCAAAGTCGAATACAATCGGAGTTTCACACCTACCTCTATGATGATGCAATTC
CCCTCCCGACAGGCTTATGATAATGCTTTTCGAGTGATTGGAAGAAAAGGTCGCTTGTACAGATTGGAGCGGGCTGATCGTTCGCAGTGGGACCTTCTTTCACCTTACAA
TGGAAAAACTGTCCTTCTGCAAGGAATACCTCGAAATGCATCGGTAGAAGACGTCGAACGCTTCTTATCTGGCTGTGACTATGATGCAACCTCAATCAATGTGTTTTTCA
GGGCATCCTTTCCAGACCCCATCAGAATGGCCACAGTGCTGTTCCCTTCACCAACCCAAGCAATGCATGCATTTCTTACAAAGAACAGAGGCTTTTGTCTGAACAACCAA
ATTTTGATGCGGGTTCTCCAATAA
mRNA sequence
CAGCCACTGCCCCACAGCTCAGTGAAGGAGCCCTATAAACAAACTCAGCACTACACTCCCCTTCTTCGATCTTTCACAGACATCGCCTTCCTTGCCAAAAATGTCGACCC
TCAATCTGCTCCGTAAGACACTTGGATCGCACTTCGTTTCTCCTTCGGCGAGGTCAAATCATGGCCTCCCTGTGTTTTTCCTCCAATCTCCAAGATTCTTCTCGACGGAA
GGGGAACAACCGCCTTCGGAACCGGCCGCCGATCCATTTCTTGATACATCTAAAACAACAGGTTTGGTATATGGAAAATTGTATGGAATCACAAGGAATACACTAAAGAC
AGACATTGTCAATTTACTTGAAGGATGTAATTTGAGTTTGGATGATGTCAAAGTCGAATACAATCGGAGTTTCACACCTACCTCTATGATGATGCAATTCCCCTCCCGAC
AGGCTTATGATAATGCTTTTCGAGTGATTGGAAGAAAAGGTCGCTTGTACAGATTGGAGCGGGCTGATCGTTCGCAGTGGGACCTTCTTTCACCTTACAATGGAAAAACT
GTCCTTCTGCAAGGAATACCTCGAAATGCATCGGTAGAAGACGTCGAACGCTTCTTATCTGGCTGTGACTATGATGCAACCTCAATCAATGTGTTTTTCAGGGCATCCTT
TCCAGACCCCATCAGAATGGCCACAGTGCTGTTCCCTTCACCAACCCAAGCAATGCATGCATTTCTTACAAAGAACAGAGGCTTTTGTCTGAACAACCAAATTTTGATGC
GGGTTCTCCAATAATTAGTAACTGTTTATCTTCTGTATTCGACGCATATTTTGTGTTGGCTCAATGGCTGGGCGTTTTGTAGTAGTGTTATGAGTTATGACTTGAATCTT
CATTATCTCAAGCCTTCAGGTAGCAGTCACATTCATTTAAATTGACCTGAAATCTTTTTTTTTCCCACAAAGTTAGCAGTTTGTTTGGGCTAAAGAACTGCATTTGTGAG
GAATGATATTTAGATTTTGAAAGTTTTATGAAAAATAAGGTTGTTGAAGTTTGAA
Protein sequence
MSTLNLLRKTLGSHFVSPSARSNHGLPVFFLQSPRFFSTEGEQPPSEPAADPFLDTSKTTGLVYGKLYGITRNTLKTDIVNLLEGCNLSLDDVKVEYNRSFTPTSMMMQF
PSRQAYDNAFRVIGRKGRLYRLERADRSQWDLLSPYNGKTVLLQGIPRNASVEDVERFLSGCDYDATSINVFFRASFPDPIRMATVLFPSPTQAMHAFLTKNRGFCLNNQ
ILMRVLQ
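
The CDS and protein listings can be cross-checked by translating the coding sequence. The sketch below is a minimal illustration that assumes the standard nuclear genetic code and builds the codon table from the conventional TCAG ordering. The `translate` helper is hypothetical and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence wrapped
    over several lines can be pasted in directly. Codons containing
    unexpected characters are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above, followed by a stop codon.
print(translate("ATGTCGACCCTCAATCTGCTCCGTAAGACATAA"))
```

The first ten codons of the listed CDS translate to MSTLNLLRKT, which matches the start of the protein sequence. Passing the full CDS to the same function allows the whole listing to be compared.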