CuGenDBv2

Tan0000569 (gene) of Snake gourd v1 genome

Gene ID: Tan0000569
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Late embryogenesis abundant protein
Genome location: LG05:60269651..60270337
RNA-Seq Expression: Tan0000569
Synteny: Tan0000569
Gene Ontology terms: NA
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
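
As a quick sanity check on the genome location above, the span can be parsed and its length computed. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("LG05:60269651..60270337")
# Assumption: 1-based, inclusive coordinates, so both end points are counted.
span_length = end - start + 1
print(chrom, span_length)  # LG05 687
```

Under that assumption the span is 687 bp, or 229 codons, which is consistent with the 228-residue protein sequence listed for this gene plus a stop codon.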


Homology
GenBank top hits (e-value, % identity)
XP_008463664.1 PREDICTED: uncharacterized protein LOC103501757 [Cucumis melo] (e-value 1.4e-55; identity 52.61%)
Query:  MADTSTGSQQPPHPDPQPFTSWLHHPSPPPPPHNRC--TSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGF
        MA+T+     PPHP+ Q               H  C  +SPFLR FAAG+ ++L IA  IY++++L   P+LP  R+DSL L NFS AA +  + W+VGF
Subjt:  MADTSTGSQQPPHPDPQPFTSWLHHPSPPPPPHNRC--TSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGF

Query:  SVNNPSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFC
        S+NNP+KKLAIS  N++SSIYYKD I++QARI RF +  +NST +V PF+A S  D SVLNDINGDLARG INFTV VLG+  F+   W+ RG+  +V C
Subjt:  SVNNPSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFC

Query:  SDLAVGISSPPTPGGESGELAGGSRQCQVR
        SDL+VG S PP+  G SG+L GGS+QCQ++
Subjt:  SDLAVGISSPPTPGGESGELAGGSRQCQVR

XP_022943707.1 uncharacterized protein At1g08160-like [Cucurbita moschata] (e-value 1.3e-34; identity 42.27%)
Query:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA
        Q  P+  P P++   H+ +P    H      FLRA  AG+ ++ +I   I  + +LVL P LP+FRVDS  +TNFS A+ SLSALW VGFSV NP+KK+ 
Subjt:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA

Query:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAI-VTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS
        IS D +E +++YK++ LSQ R+  F   ++  TA+  T    ++  + S +N+IN D  RG + F V V   V F  G WR R  L RV C DL+VG+SS
Subjt:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAI-VTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS

Query:  PPTPGGESGELAGGSRQCQV
          +    SG+L G  R C+V
Subjt:  PPTPGGESGELAGGSRQCQV

XP_022957733.1 NDR1/HIN1-like protein 2 [Cucurbita moschata] (e-value 4.6e-35; identity 42.27%)
Query:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA
        Q  P+  P P++   H+ +P    H+     FLRA  AG+ ++ +I   I  + +LVL P LP+FRVDS  +TNFS A+ SLSA W VGFSV NP+KK++
Subjt:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA

Query:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS
        IS D ++S+++YK++ILS+ R+  F   ++  T +   F + ++  D S +N+IN D  RGA+ F V +   V F  G WR R  L RV C DL+VG+SS
Subjt:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS

Query:  PPTPGGESGELAGGSRQCQV
          +    SG+L G SR C+V
Subjt:  PPTPGGESGELAGGSRQCQV

XP_022995131.1 NDR1/HIN1-like protein 2 [Cucurbita maxima] (e-value 1.0e-34; identity 42.27%)
Query:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA
        Q  P+  P P++   H+ +P    H      FLRA  AG+ ++ +I   I  + +LVL P LP+FRVDS  +TNFS A+ SLSA W VGFSV NP+KK++
Subjt:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA

Query:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS
        IS D ++S+++YK++ILS+ R+  F   ++  T +   F + ++  D S +NDIN D  RGA+ F V +   V F  G WR R  L RV C DL+VG+S 
Subjt:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS

Query:  PPTPGGESGELAGGSRQCQV
          +    SG+L G SR C+V
Subjt:  PPTPGGESGELAGGSRQCQV

XP_023532895.1 NDR1/HIN1-like protein 2 [Cucurbita pepo subsp. pepo] (e-value 4.6e-35; identity 42.27%)
Query:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA
        Q  P+  P P++   H+ +P    H+     FLRA  AG+ ++ +I   I  + +LVL P LP+FRVDS  +TNFS A+ SLSA W VGFSV NP+KK++
Subjt:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA

Query:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS
        IS D ++S+++YK++ILS+ R+  F   ++  T +   F + ++  D S +N+IN D  RGA+ F V +   V F  G WR R  L RV C DL+VG+SS
Subjt:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS

Query:  PPTPGGESGELAGGSRQCQV
          +    SG+L G SR C+V
Subjt:  PPTPGGESGELAGGSRQCQV

TrEMBL top hits (e-value, % identity)
A0A0A0LT76 LEA_2 domain-containing protein (e-value 2.4e-53; identity 52.42%)
Query:  MADTSTGSQQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSV
        MA+++ G   PPHP+ Q              PH    SPF+R FAAG+T++L I   IY ++YL+  P+L   R+DSL   NFS  A + S  WVVGFS+
Subjt:  MADTSTGSQQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSV

Query:  NNPSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSD
        NNP+KKLAIS  N+ESSIYYKD I++QAR  RF +P +NST +V+PF+AD   D SVLNDI+GDL RG I+FTV VLG+   E G WR  G+  RV CSD
Subjt:  NNPSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSD

Query:  LAVGISSPPTPGGESGELAGGSRQCQV
        L+V  S PP   G SG+L GGSRQC +
Subjt:  LAVGISSPPTPGGESGELAGGSRQCQV

A0A1S3CJS5 uncharacterized protein LOC103501757 (e-value 6.7e-56; identity 52.61%)
Query:  MADTSTGSQQPPHPDPQPFTSWLHHPSPPPPPHNRC--TSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGF
        MA+T+     PPHP+ Q               H  C  +SPFLR FAAG+ ++L IA  IY++++L   P+LP  R+DSL L NFS AA +  + W+VGF
Subjt:  MADTSTGSQQPPHPDPQPFTSWLHHPSPPPPPHNRC--TSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGF

Query:  SVNNPSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFC
        S+NNP+KKLAIS  N++SSIYYKD I++QARI RF +  +NST +V PF+A S  D SVLNDINGDLARG INFTV VLG+  F+   W+ RG+  +V C
Subjt:  SVNNPSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFC

Query:  SDLAVGISSPPTPGGESGELAGGSRQCQVR
        SDL+VG S PP+  G SG+L GGS+QCQ++
Subjt:  SDLAVGISSPPTPGGESGELAGGSRQCQVR

A0A5D3E5N0 Protein YLS9 isoform X2 (e-value 6.7e-56; identity 52.61%)
Query:  MADTSTGSQQPPHPDPQPFTSWLHHPSPPPPPHNRC--TSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGF
        MA+T+     PPHP+ Q               H  C  +SPFLR FAAG+ ++L IA  IY++++L   P+LP  R+DSL L NFS AA +  + W+VGF
Subjt:  MADTSTGSQQPPHPDPQPFTSWLHHPSPPPPPHNRC--TSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGF

Query:  SVNNPSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFC
        S+NNP+KKLAIS  N++SSIYYKD I++QARI RF +  +NST +V PF+A S  D SVLNDINGDLARG INFTV VLG+  F+   W+ RG+  +V C
Subjt:  SVNNPSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFC

Query:  SDLAVGISSPPTPGGESGELAGGSRQCQVR
        SDL+VG S PP+  G SG+L GGS+QCQ++
Subjt:  SDLAVGISSPPTPGGESGELAGGSRQCQVR

A0A6J1H136 NDR1/HIN1-like protein 2 (e-value 2.2e-35; identity 42.27%)
Query:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA
        Q  P+  P P++   H+ +P    H+     FLRA  AG+ ++ +I   I  + +LVL P LP+FRVDS  +TNFS A+ SLSA W VGFSV NP+KK++
Subjt:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA

Query:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS
        IS D ++S+++YK++ILS+ R+  F   ++  T +   F + ++  D S +N+IN D  RGA+ F V +   V F  G WR R  L RV C DL+VG+SS
Subjt:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS

Query:  PPTPGGESGELAGGSRQCQV
          +    SG+L G SR C+V
Subjt:  PPTPGGESGELAGGSRQCQV

A0A6J1K154 NDR1/HIN1-like protein 2 (e-value 5.0e-35; identity 42.27%)
Query:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA
        Q  P+  P P++   H+ +P    H      FLRA  AG+ ++ +I   I  + +LVL P LP+FRVDS  +TNFS A+ SLSA W VGFSV NP+KK++
Subjt:  QQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLA

Query:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS
        IS D ++S+++YK++ILS+ R+  F   ++  T +   F + ++  D S +NDIN D  RGA+ F V +   V F  G WR R  L RV C DL+VG+S 
Subjt:  ISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVA-DSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISS

Query:  PPTPGGESGELAGGSRQCQV
          +    SG+L G SR C+V
Subjt:  PPTPGGESGELAGGSRQCQV

SwissProt top hits (e-value, % identity)
No hits found
Arabidopsis top hits (e-value, % identity)
AT1G54540.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e-value 9.0e-05; identity 26.59%)
Query:  LHHPSPPP--PPHNRCTSPFLRAFAAGLTIMLLIAVFI-YTVEYLVLGPLLPKFRVDSLHLTNFSVAAP-SLSALWVVGFSVNNPSKKLAISLD-NIESS
        +  P PPP  P  NR     +  +   L ++ LIA+ I   V Y V  P LP + V+SL +TN  +    SLSA + V  +  NP++K+ I  +      
Subjt:  LHHPSPPP--PPHNRCTSPFLRAFAAGLTIMLLIAVFI-YTVEYLVLGPLLPKFRVDSLHLTNFSVAAP-SLSALWVVGFSVNNPSKKLAISLD-NIESS

Query:  IYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLR
        ++Y    L +  I RF    +N T +       +    +VL  +      G +   + V   VA + G  +++
Subjt:  IYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLR

AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e-value 8.7e-16; identity 28.32%)
Query:  QQPP---HPDPQPFTSW---LHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNN
        QQPP   +P+P   T++    H+P   P P+ R      R F    T +LL+ + ++ + +L++ P LP   ++SL ++NF+V+   +S  W +     N
Subjt:  QQPP---HPDPQPFTSW---LHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNN

Query:  PSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAI-VTPFVADSDSDVSVLNDINGDLA-RGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSD
        P+ K+++  +    ++YY    LS+ R+  F   +K+ T +  T  V+ +  D  +++ I  + + +G + F + ++ +V F  GA+R R     V+C D
Subjt:  PSKKLAISLDNIESSIYYKDKILSQARIYRFAIPRKNSTAI-VTPFVADSDSDVSVLNDINGDLA-RGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSD

Query:  LAVGISSPPTPGGESGELAGGSRQCQ
        +AVG+   P   GE G++ G S++C+
Subjt:  LAVGISSPPTPGGESGELAGGSRQCQ

AT3G52460.1 hydroxyproline-rich glycoprotein family protein (e-value 4.8e-14; identity 25.89%)
Query:  GSQQPPHPDPQ----PFTSWLHHPSPP-------------PPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPS
        G  QPP P P     P+  + +  +PP             P      +S F+R    GL +++++     T+ +LVL P +P F V++  ++NF+V  P 
Subjt:  GSQQPPHPDPQ----PFTSWLHHPSPP-------------PPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPS

Query:  LSALWVVGFSVNNPSKKLAISLDNIESSIYY-----KDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVS----VLNDINGDLARGAINFTVAVLGHV
         SA W    ++ N + KL    D I+  +Y+     +D+ L+ A      +  K S  I     A           V++++  +   G + F++ +   V
Subjt:  LSALWVVGFSVNNPSKKLAISLDNIESSIYY-----KDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVS----VLNDINGDLARGAINFTVAVLGHV

Query:  AFETGAWRLRGSLFRVFCSDLAVG
         F+T  W  R S  +VFC  L VG
Subjt:  AFETGAWRLRGSLFRVFCSDLAVG

AT5G36970.1 NDR1/HIN1-like 25 (e-value 3.4e-04; identity 25.25%)
Query:  GSQQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVF---IYTVEYLVLGPLLPKFRVDSLHLTNFSVAAP-SLSALWVVGFSVNN
        GS +  H DP    +    P  PP       S + R     L ++ L+ V    I  + YLV  P  P + +D L LT F +    SLS  + V  +  N
Subjt:  GSQQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVF---IYTVEYLVLGPLLPKFRVDSLHLTNFSVAAP-SLSALWVVGFSVNN

Query:  PSKKLAISL-DNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDI-NGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFC
        P++K+ I   D  + S+ Y    +S   + +F    +N+T I+      + +  S++  +       G+I   + V   V  + G  +L    F V C
Subjt:  PSKKLAISL-DNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDI-NGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFC


Sequences
CDS sequence
ATGGCCGACACCTCAACCGGCAGCCAGCAGCCGCCGCATCCTGACCCCCAACCCTTTACCTCCTGGCTTCATCATCCTTCCCCACCGCCACCGCCTCATAACCGCTGCAC
CTCACCTTTCCTCCGAGCCTTCGCCGCCGGTTTGACCATCATGCTTTTGATCGCCGTCTTCATCTACACCGTCGAATATCTCGTCCTCGGCCCCCTCCTCCCCAAGTTCC
GAGTCGACTCACTTCATCTCACCAACTTCTCCGTCGCCGCCCCGTCCCTCTCCGCCTTATGGGTCGTCGGATTTTCCGTCAACAACCCCAGCAAGAAGCTCGCGATCTCA
TTGGACAACATCGAGTCCTCCATTTACTACAAAGATAAGATCCTCTCTCAGGCCCGGATCTACCGGTTCGCCATACCCCGAAAGAACTCGACGGCCATCGTCACCCCCTT
CGTCGCCGACTCGGATTCCGATGTGTCGGTTTTGAACGACATTAATGGAGACTTGGCGCGTGGAGCAATTAATTTCACCGTGGCGGTTCTTGGCCATGTCGCGTTCGAGA
CCGGTGCCTGGCGGTTGAGGGGTAGCTTGTTTCGGGTATTTTGCAGCGATTTGGCCGTTGGAATCTCTTCGCCGCCGACTCCCGGCGGCGAGTCCGGCGAGTTGGCTGGT
GGATCAAGGCAATGCCAGGTTCGATGA
mRNA sequence
ATGGCCGACACCTCAACCGGCAGCCAGCAGCCGCCGCATCCTGACCCCCAACCCTTTACCTCCTGGCTTCATCATCCTTCCCCACCGCCACCGCCTCATAACCGCTGCAC
CTCACCTTTCCTCCGAGCCTTCGCCGCCGGTTTGACCATCATGCTTTTGATCGCCGTCTTCATCTACACCGTCGAATATCTCGTCCTCGGCCCCCTCCTCCCCAAGTTCC
GAGTCGACTCACTTCATCTCACCAACTTCTCCGTCGCCGCCCCGTCCCTCTCCGCCTTATGGGTCGTCGGATTTTCCGTCAACAACCCCAGCAAGAAGCTCGCGATCTCA
TTGGACAACATCGAGTCCTCCATTTACTACAAAGATAAGATCCTCTCTCAGGCCCGGATCTACCGGTTCGCCATACCCCGAAAGAACTCGACGGCCATCGTCACCCCCTT
CGTCGCCGACTCGGATTCCGATGTGTCGGTTTTGAACGACATTAATGGAGACTTGGCGCGTGGAGCAATTAATTTCACCGTGGCGGTTCTTGGCCATGTCGCGTTCGAGA
CCGGTGCCTGGCGGTTGAGGGGTAGCTTGTTTCGGGTATTTTGCAGCGATTTGGCCGTTGGAATCTCTTCGCCGCCGACTCCCGGCGGCGAGTCCGGCGAGTTGGCTGGT
GGATCAAGGCAATGCCAGGTTCGATGA
Protein sequence
MADTSTGSQQPPHPDPQPFTSWLHHPSPPPPPHNRCTSPFLRAFAAGLTIMLLIAVFIYTVEYLVLGPLLPKFRVDSLHLTNFSVAAPSLSALWVVGFSVNNPSKKLAIS
LDNIESSIYYKDKILSQARIYRFAIPRKNSTAIVTPFVADSDSDVSVLNDINGDLARGAINFTVAVLGHVAFETGAWRLRGSLFRVFCSDLAVGISSPPTPGGESGELAG
GSRQCQVR
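
The protein sequence above can be cross-checked against the CDS by translating it with the standard genetic code. The sketch below is illustrative only: the helper `translate` is hypothetical, the standard code (NCBI translation table 1) is assumed, and only the first 30 and the last 27 nucleotides of the CDS are used to keep the example short. The final fragment is treated as in-frame, which assumes the 660 nucleotides preceding it form whole codons.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGCCGACACCTCAACCGGCAGCCAGCAG"  # first 30 nt of the CDS
cds_end = "GGATCAAGGCAATGCCAGGTTCGATGA"       # last 27 nt of the CDS

print(translate(cds_start))  # MADTSTGSQQ
print(translate(cds_end))    # GSRQCQVR*
```

Both fragments agree with the corresponding ends of the protein sequence, with the trailing `*` marking the TGA stop codon that is not shown in the protein record.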