; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002737 (gene) of Snake gourd v1 genome

Gene IDTan0002737
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionC2H2-type domain-containing protein
Genome locationLG04:81495450..81497567
RNA-Seq ExpressionTan0002737
SyntenyTan0002737
Gene Ontology termsGO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7036627.1 hypothetical protein SDJN02_00246 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-12589.41Show/hide
Query:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL
        MQSIH+LGFSHSLQFSQSHFH NPKN LL+   SHGS RK RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVPFL
Subjt:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL

Query:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF
        LG+FY TVGLIQL+IDE FSPN+ EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIMKF
Subjt:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF

Query:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA
        FHLWYYP+ANV+IFGEG+ISWTITCYFVYTPFLINLSRWLKSV+DSAAVKKDGSA
Subjt:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA

XP_022949544.1 uncharacterized protein LOC111452860 [Cucurbita moschata]4.1e-12589.02Show/hide
Query:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL
        MQSIH+LGFSHSLQFSQSHFH NPKN LL+   SHGS RK RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVPFL
Subjt:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL

Query:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF
        LG+FY TVGLIQL+IDE FSPN+ EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIMKF
Subjt:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF

Query:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA
        FHLWYYP+ANV+IFGEG+ISWTITCYFVYTPFLINLSRWLKSV+DS AVKKDGSA
Subjt:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA

XP_022998487.1 uncharacterized protein LOC111493103 [Cucurbita maxima]8.0e-12187.89Show/hide
Query:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL
        MQSIH+LGFSHSLQFSQSHFH NPKN LL+   +HGS R+ RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNI VPFL
Subjt:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL

Query:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF
        LGLFY TVGLIQL+IDE F PN+ EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIMKF
Subjt:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF

Query:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWL-KSVVDSAAVKKDGSA
        FHLWYYP+ANV+IFGEG+ISWTITCYFVYTPFLINLSRWL  SVVDSAAVKKDGSA
Subjt:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWL-KSVVDSAAVKKDGSA

XP_023524606.1 uncharacterized protein LOC111788502 [Cucurbita pepo subsp. pepo]1.9e-11787.8Show/hide
Query:  SHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYSTVG
        +HSLQFSQSHFH NPK  LL+   SHGS RK RTSLSLR +WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNI VPFLLG+FY TVG
Subjt:  SHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYSTVG

Query:  LIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKFFHLWYYPEA
        LIQL+IDE FSPN+ EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIMKFFHLWYYP+A
Subjt:  LIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKFFHLWYYPEA

Query:  NVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA
        NV+IFGEG+ISWTITCYFVYTPFLINLSRWLKSV+DSAAVKKDGSA
Subjt:  NVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA

XP_038904924.1 uncharacterized protein LOC120091134 [Benincasa hispida]5.6e-12287.6Show/hide
Query:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHC---SHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWV
        MQSI++LGFSHSLQFSQSHFHSN K  L KPHC   SHGS R+ RTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWV
Subjt:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHC---SHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWV

Query:  PFLLGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPI
        PFLLGLFY +VGLIQL++DE FSP KSEG  GRTVASLIAL LFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GLVCPLAEIPI
Subjt:  PFLLGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPI

Query:  MKFFHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA
        MKFFHLWYYP+AN++IFGEGI+SWTITCYFVYTPFLINLSRWLKSVVD+AA  +DGSA
Subjt:  MKFFHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA

TrEMBL top hitse value%identityAlignment
A0A0A0LBT1 Uncharacterized protein1.3e-11383.2Show/hide
Query:  MQSIHSLGFSHSLQF--SQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSI +LGFS SLQF  S SHFHSN +  + KPHCS    +K R SLSLRTTWPSISI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWVP
Subjt:  MQSIHSLGFSHSLQF--SQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM
        FLLGLFY TVGLIQL++DEKFS  +S+GSLG+TVASLIAL LFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GL CPLAEIPIM
Subjt:  FLLGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM

Query:  KFFHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGS
        KFFHLW YP+AN+ IFGEGIISWT+TCYFVYTPFLINLSRWLKSVVD+AAV +D S
Subjt:  KFFHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGS

A0A1S3BI47 uncharacterized protein LOC1034899065.1e-11381.3Show/hide
Query:  MQSIHSLGFSHSLQFSQSH----FHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIW
        MQSI +LGFSHSLQF  SH    FHSN +  L KPHC+     K R +LSLRTTWPSISI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIW
Subjt:  MQSIHSLGFSHSLQFSQSH----FHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIW

Query:  VPFLLGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIP
        VPFLLGLFY TVGLIQL++DEKFSP +S+GSL +TVASLIAL LFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GL CPLAEIP
Subjt:  VPFLLGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIP

Query:  IMKFFHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVD----SAAVKKDGS
        IMKFFHLW YP+AN++IFGEGIISWT+TCYFVYTPFLINLSRWLKSVVD    +AAV +DGS
Subjt:  IMKFFHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVD----SAAVKKDGS

A0A6J1DRY1 uncharacterized protein LOC1110238315.4e-11585.88Show/hide
Query:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL
        MQSI+ LG S  LQF Q  F S  K+  LKP CSHGS+ + RTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVY+TGSID+GPLHTNIWVPFL
Subjt:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL

Query:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF
        LGLFYSTVGL+QL+IDE FS N SEGSLGRTVASLIALALFIELSAEMYKAGVA NIEAYALFAGAE IWA LDSSLLGFSLACVVGL CPLAEIPIMKF
Subjt:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF

Query:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA
        FHLW YP+ANV+IFGEGIISW ITCYFVYTPFLINLSRWLKSVVD+AAVKKD SA
Subjt:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA

A0A6J1GD48 uncharacterized protein LOC1114528602.0e-12589.02Show/hide
Query:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL
        MQSIH+LGFSHSLQFSQSHFH NPKN LL+   SHGS RK RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVPFL
Subjt:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL

Query:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF
        LG+FY TVGLIQL+IDE FSPN+ EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIMKF
Subjt:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF

Query:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA
        FHLWYYP+ANV+IFGEG+ISWTITCYFVYTPFLINLSRWLKSV+DS AVKKDGSA
Subjt:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA

A0A6J1KGW6 uncharacterized protein LOC1114931033.9e-12187.89Show/hide
Query:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL
        MQSIH+LGFSHSLQFSQSHFH NPKN LL+   +HGS R+ RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNI VPFL
Subjt:  MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL

Query:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF
        LGLFY TVGLIQL+IDE F PN+ EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIMKF
Subjt:  LGLFYSTVGLIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKF

Query:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWL-KSVVDSAAVKKDGSA
        FHLWYYP+ANV+IFGEG+ISWTITCYFVYTPFLINLSRWL  SVVDSAAVKKDGSA
Subjt:  FHLWYYPEANVKIFGEGIISWTITCYFVYTPFLINLSRWL-KSVVDSAAVKKDGSA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01935.1 unknown protein4.7e-7958.13Show/hide
Query:  SNPKNPLLKPHCSHGSKR-----------KGRTSLSLRTTW-PSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYSTVG
        S+P   L+KP   +G  R           + ++  S   +W   +S++LFGSGF+LGPLLDG+HSRV+LVVY+ G+  IGPLHTNIWVPFLLGLFY TVG
Subjt:  SNPKNPLLKPHCSHGSKR-----------KGRTSLSLRTTW-PSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYSTVG

Query:  LIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKFFHLWYYPEA
        L+QL +DE  S +   GSL +TV SL+AL  F+ELSAEMYKAGV+ NIEAY LFA AEFIW  LD + + F++A ++G+ CPLAEIPIM+FFHLWYYPEA
Subjt:  LIQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKFFHLWYYPEA

Query:  NVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA
        N++IFG+G+++WT TCYFVYTPFLINL+RWL++V++   ++ D S+
Subjt:  NVKIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATCAATCCATTCATTGGGCTTCTCCCACTCTCTCCAATTTTCTCAATCCCATTTCCACTCCAACCCCAAAAATCCTCTGCTAAAACCCCATTGCAGCCATGGAAG
CAAGAGGAAGGGAAGAACAAGCCTCAGTCTCAGAACCACTTGGCCCTCCATCTCCATCGCCCTCTTCGGCTCAGGCTTTCTCTTAGGTCCTCTTCTCGATGGTCTCCATT
CTCGGGTGAATCTCGTCGTTTACCGAACAGGATCCATCGACATCGGCCCACTCCACACTAACATTTGGGTTCCTTTCTTGCTGGGATTGTTTTACTCTACTGTTGGGTTG
ATTCAACTCTTCATTGATGAGAAATTCTCACCAAACAAATCAGAGGGGAGTTTGGGCAGGACAGTAGCATCCTTAATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGA
AATGTACAAAGCTGGAGTGGCTGCCAACATTGAGGCCTATGCATTGTTTGCTGGGGCTGAGTTTATTTGGGCATTGCTTGATAGTTCATTGCTTGGTTTCTCTCTGGCTT
GTGTTGTTGGCCTTGTCTGCCCTCTGGCTGAGATCCCCATTATGAAGTTCTTCCATCTCTGGTATTATCCAGAAGCGAATGTGAAGATCTTTGGTGAGGGGATAATCAGC
TGGACAATCACTTGCTATTTTGTGTACACTCCATTCTTGATTAATTTATCTAGATGGCTCAAGTCTGTGGTGGATTCTGCTGCTGTAAAAAAAGATGGGTCTGCTTAG
mRNA sequenceShow/hide mRNA sequence
TTGGGCTCTAACAACCCTTCAATTTACCGTCCTTAAATTTTGGAAAATGAATTCTTGTGGATTACAAGGAAACAATCTTCAAAATTTCATCTGTTAGAATCCTCAAAAAT
GCAATCAATCCATTCATTGGGCTTCTCCCACTCTCTCCAATTTTCTCAATCCCATTTCCACTCCAACCCCAAAAATCCTCTGCTAAAACCCCATTGCAGCCATGGAAGCA
AGAGGAAGGGAAGAACAAGCCTCAGTCTCAGAACCACTTGGCCCTCCATCTCCATCGCCCTCTTCGGCTCAGGCTTTCTCTTAGGTCCTCTTCTCGATGGTCTCCATTCT
CGGGTGAATCTCGTCGTTTACCGAACAGGATCCATCGACATCGGCCCACTCCACACTAACATTTGGGTTCCTTTCTTGCTGGGATTGTTTTACTCTACTGTTGGGTTGAT
TCAACTCTTCATTGATGAGAAATTCTCACCAAACAAATCAGAGGGGAGTTTGGGCAGGACAGTAGCATCCTTAATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGAAA
TGTACAAAGCTGGAGTGGCTGCCAACATTGAGGCCTATGCATTGTTTGCTGGGGCTGAGTTTATTTGGGCATTGCTTGATAGTTCATTGCTTGGTTTCTCTCTGGCTTGT
GTTGTTGGCCTTGTCTGCCCTCTGGCTGAGATCCCCATTATGAAGTTCTTCCATCTCTGGTATTATCCAGAAGCGAATGTGAAGATCTTTGGTGAGGGGATAATCAGCTG
GACAATCACTTGCTATTTTGTGTACACTCCATTCTTGATTAATTTATCTAGATGGCTCAAGTCTGTGGTGGATTCTGCTGCTGTAAAAAAAGATGGGTCTGCTTAGTCAA
ATGTTTTATGTGCCTAGCA
Protein sequenceShow/hide protein sequence
MQSIHSLGFSHSLQFSQSHFHSNPKNPLLKPHCSHGSKRKGRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYSTVGL
IQLFIDEKFSPNKSEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKFFHLWYYPEANVKIFGEGIIS
WTITCYFVYTPFLINLSRWLKSVVDSAAVKKDGSA