CuGenDBv2

HG10019351 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10019351
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: C2H2-type domain-containing protein
Genome location: Chr04:20708855..20710891
RNA-Seq Expression: HG10019351
Synteny: HG10019351
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
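The genome location field can be split into chromosome, start and end programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the page itself does not state.

```python
import re

def parse_location(loc: str):
    """Split a 'Chr:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("Chr04:20708855..20710891")
print(chrom, start, end, span)
```

Under that assumption the gene spans 2037 bp on Chr04, which is longer than the 771-nt CDS listed further down the record.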


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7036627.1 hypothetical protein SDJN02_00246 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.3e-115 | % identity: 83.59
Query:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL
        MQSI+ALGFSHSLQFSQSHFH N + SL +   S   +++ARTSLSL T+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVPFL
Subjt:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL

Query:  LGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMKF
        LG+FYCTVGLIQLYIDENFS  R EGS G+T+ASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GL CPLAEIPIMKF
Subjt:  LGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMKF

Query:  FHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA
        FHLW YP+AN+EIFGEG++SWTITCYFVYTPFLINLSRWLKSV+++A AVK+DGSA
Subjt:  FHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA

XP_004150901.1 uncharacterized protein LOC101205226 [Cucumis sativus] | e value: 3.0e-123 | % identity: 88.33
Query:  MQSIYALGFSHSLQF--SQSHFHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSI+ALGFS SLQF  S SHFHSNSRIS+QKPHCSSHGSK+ R SLSL TTWPSISI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWVP
Subjt:  MQSIYALGFSHSLQF--SQSHFHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIM
        FLLGLFYCTVGLIQLY+DE FS K+S+GS G+T+ASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIM

Query:  KFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGS
        KFFHLW+YPKANI+IFGEGI+SWT+TCYFVYTPFLINLSRWLKSVV+AA AV ED S
Subjt:  KFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGS

XP_008447464.1 PREDICTED: uncharacterized protein LOC103489906 [Cucumis melo] | e value: 5.0e-123 | % identity: 86.26
Query:  MQSIYALGFSHSLQFSQSH----FHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIW
        MQSI+ALGFSHSLQF  SH    FHSNSRISLQKPHC+SHGS ++R +LSL TTWPSISI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIW
Subjt:  MQSIYALGFSHSLQFSQSH----FHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIW

Query:  VPFLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIP
        VPFLLGLFYCTVGLIQLY+DE FS K+S+GS  +T+ASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIP
Subjt:  VPFLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIP

Query:  IMKFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVN---AASAVKEDGS
        IMKFFHLW+YPKANIEIFGEGI+SWT+TCYFVYTPFLINLSRWLKSVV+   AA+AV EDGS
Subjt:  IMKFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVN---AASAVKEDGS

XP_022949544.1 uncharacterized protein LOC111452860 [Cucurbita moschata] | e value: 2.3e-115 | % identity: 84.82
Query:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKR-ARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPF
        MQSI+ALGFSHSLQFSQSHFH N + SL +   SSHGS R ARTSLSL T+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVPF
Subjt:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKR-ARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPF

Query:  LLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMK
        LLG+FYCTVGLIQLYIDENFS  R EGS G+T+ASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GL CPLAEIPIMK
Subjt:  LLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMK

Query:  FFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA
        FFHLW YP+AN+EIFGEG++SWTITCYFVYTPFLINLSRWLKSV+++  AVK+DGSA
Subjt:  FFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA

XP_038904924.1 uncharacterized protein LOC120091134 [Benincasa hispida] | e value: 2.3e-128 | % identity: 92.25
Query:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHC--SSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSIYALGFSHSLQFSQSHFHSNS+ISLQKPHC  SSHGS+RARTSLSL TTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWVP
Subjt:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHC--SSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIM
        FLLGLFYC+VGLIQLY+DENFS ++SEG FGRT+ASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGL CPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIM

Query:  KFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA
        KFFHLW YPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVV+AA+A  EDGSA
Subjt:  KFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LBT1 Uncharacterized protein | e value: 1.4e-123 | % identity: 88.33
Query:  MQSIYALGFSHSLQF--SQSHFHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSI+ALGFS SLQF  S SHFHSNSRIS+QKPHCSSHGSK+ R SLSL TTWPSISI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWVP
Subjt:  MQSIYALGFSHSLQF--SQSHFHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIM
        FLLGLFYCTVGLIQLY+DE FS K+S+GS G+T+ASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIM

Query:  KFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGS
        KFFHLW+YPKANI+IFGEGI+SWT+TCYFVYTPFLINLSRWLKSVV+AA AV ED S
Subjt:  KFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGS

A0A1S3BI47 uncharacterized protein LOC103489906 | e value: 2.4e-123 | % identity: 86.26
Query:  MQSIYALGFSHSLQFSQSH----FHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIW
        MQSI+ALGFSHSLQF  SH    FHSNSRISLQKPHC+SHGS ++R +LSL TTWPSISI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIW
Subjt:  MQSIYALGFSHSLQFSQSH----FHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIW

Query:  VPFLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIP
        VPFLLGLFYCTVGLIQLY+DE FS K+S+GS  +T+ASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIP
Subjt:  VPFLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIP

Query:  IMKFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVN---AASAVKEDGS
        IMKFFHLW+YPKANIEIFGEGI+SWT+TCYFVYTPFLINLSRWLKSVV+   AA+AV EDGS
Subjt:  IMKFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVN---AASAVKEDGS

A0A6J1DRY1 uncharacterized protein LOC111023831 | e value: 1.0e-113 | % identity: 84.38
Query:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL
        MQSIY LG S  LQF Q  F S S+ S  KP C SHGS+  RTSLSL TTWPSISIALFGSGFLLGPLLDGLHSRVNLVVY+TGSID+GPLHTNIWVPFL
Subjt:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFL

Query:  LGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMKF
        LGLFY TVGL+QLYIDENFS+  SEGS GRT+ASLIALALFIELSAEMYKAGVADNIEAYALFAGAE IWA LDSSLLGFSLACV+GLGCPLAEIPIMKF
Subjt:  LGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMKF

Query:  FHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA
        FHLW YP+AN+EIFGEGI+SW ITCYFVYTPFLINLSRWLKSVV+AA AVK+D SA
Subjt:  FHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA

A0A6J1GD48 uncharacterized protein LOC111452860 | e value: 1.1e-115 | % identity: 84.82
Query:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKR-ARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPF
        MQSI+ALGFSHSLQFSQSHFH N + SL +   SSHGS R ARTSLSL T+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVPF
Subjt:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKR-ARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPF

Query:  LLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMK
        LLG+FYCTVGLIQLYIDENFS  R EGS G+T+ASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GL CPLAEIPIMK
Subjt:  LLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMK

Query:  FFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA
        FFHLW YP+AN+EIFGEG++SWTITCYFVYTPFLINLSRWLKSV+++  AVK+DGSA
Subjt:  FFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA

A0A6J1KGW6 uncharacterized protein LOC111493103 | e value: 4.3e-112 | % identity: 82.88
Query:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKR-ARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPF
        MQSI+ALGFSHSLQFSQSHFH N + SL +    +HGS R ARTSLSL T+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNI VPF
Subjt:  MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKR-ARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPF

Query:  LLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMK
        LLGLFYCTVGLIQLYIDENF   R EGS G+T+ASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GL CPLAEIPIMK
Subjt:  LLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMK

Query:  FFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA
        FFHLW YP+AN+EIFGEG++SWTITCYFVYTPFLINLSRWL   V  ++AVK+DGSA
Subjt:  FFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G01935.1 unknown protein | e value: 6.2e-79 | % identity: 66.35
Query:  SKRARTSLSLTTTW-PSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLI
        S+R ++  S   +W   +S++LFGSGF+LGPLLDG+HSRV+LVVY+ G+  IGPLHTNIWVPFLLGLFYCTVGL+QL +DE  S     GS  +T+ SL+
Subjt:  SKRARTSLSLTTTW-PSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYCTVGLIQLYIDENFSQKRSEGSFGRTLASLI

Query:  ALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINL
        AL  F+ELSAEMYKAGV+DNIEAY LFA AEFIW  LD + + F++A +LG+ CPLAEIPIM+FFHLW YP+ANIEIFG+G+V+WT TCYFVYTPFLINL
Subjt:  ALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIVSWTITCYFVYTPFLINL

Query:  SRWLKSVV
        +RWL++V+
Subjt:  SRWLKSVV
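In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The following is a minimal sketch of how a percent identity could be computed for a single block from that line. The function name `block_identity` is hypothetical, and the reported % identity values are calculated over the whole alignment, so a single block will generally not reproduce them.

```python
def block_identity(query: str, midline: str) -> float:
    """Percent of alignment columns in one block that are identical.

    `query` is the aligned query segment (gaps included); `midline` is
    the match line beneath it. Trailing spaces stripped from the midline
    are treated as mismatches by padding it back to the query length.
    """
    if not query:
        raise ValueError("empty alignment block")
    padded = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)

# Last block of the Arabidopsis hit: 4 identical columns out of 8.
print(block_identity("SRWLKSVV", "+RWL++V+"))
```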


Sequences
CDS sequence
ATGCAATCAATCTATGCATTAGGCTTCTCCCACTCTCTCCAATTTTCCCAATCCCATTTCCACTCCAACTCCAGAATTTCTCTGCAAAAACCTCACTGCAGCAGCCATGG
AAGCAAGAGGGCTAGAACAAGCCTCAGTCTTACAACAACTTGGCCTTCCATCTCCATCGCCCTCTTCGGCTCAGGCTTTCTCTTAGGCCCTCTTCTCGATGGACTCCATT
CTCGCGTCAATCTCGTTGTTTATCGAACAGGATCCATCGACATCGGCCCCCTCCACACTAACATTTGGGTTCCTTTCTTGCTGGGATTGTTTTACTGTACAGTTGGTTTG
ATTCAACTCTACATAGATGAGAATTTTTCACAAAAAAGATCAGAGGGAAGCTTTGGCAGGACACTAGCTTCCTTAATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGA
AATGTATAAAGCTGGAGTAGCTGACAATATTGAGGCCTATGCATTGTTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCATTGCTTGGCTTCTCACTGGCTT
GTGTTCTTGGCCTCGGCTGCCCTCTGGCTGAGATTCCCATTATGAAGTTCTTCCATCTCTGGGATTATCCCAAAGCAAACATAGAGATCTTTGGTGAGGGGATAGTCAGC
TGGACAATCACATGCTATTTTGTGTACACTCCATTCTTGATTAACTTATCAAGGTGGCTCAAGTCTGTGGTGAATGCTGCTTCTGCTGTAAAAGAAGATGGGTCTGCATA
G
mRNA sequence
ATGCAATCAATCTATGCATTAGGCTTCTCCCACTCTCTCCAATTTTCCCAATCCCATTTCCACTCCAACTCCAGAATTTCTCTGCAAAAACCTCACTGCAGCAGCCATGG
AAGCAAGAGGGCTAGAACAAGCCTCAGTCTTACAACAACTTGGCCTTCCATCTCCATCGCCCTCTTCGGCTCAGGCTTTCTCTTAGGCCCTCTTCTCGATGGACTCCATT
CTCGCGTCAATCTCGTTGTTTATCGAACAGGATCCATCGACATCGGCCCCCTCCACACTAACATTTGGGTTCCTTTCTTGCTGGGATTGTTTTACTGTACAGTTGGTTTG
ATTCAACTCTACATAGATGAGAATTTTTCACAAAAAAGATCAGAGGGAAGCTTTGGCAGGACACTAGCTTCCTTAATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGA
AATGTATAAAGCTGGAGTAGCTGACAATATTGAGGCCTATGCATTGTTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCATTGCTTGGCTTCTCACTGGCTT
GTGTTCTTGGCCTCGGCTGCCCTCTGGCTGAGATTCCCATTATGAAGTTCTTCCATCTCTGGGATTATCCCAAAGCAAACATAGAGATCTTTGGTGAGGGGATAGTCAGC
TGGACAATCACATGCTATTTTGTGTACACTCCATTCTTGATTAACTTATCAAGGTGGCTCAAGTCTGTGGTGAATGCTGCTTCTGCTGTAAAAGAAGATGGGTCTGCATA
G
Protein sequence
MQSIYALGFSHSLQFSQSHFHSNSRISLQKPHCSSHGSKRARTSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYCTVGL
IQLYIDENFSQKRSEGSFGRTLASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIVS
WTITCYFVYTPFLINLSRWLKSVVNAASAVKEDGSA
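The relationship between the CDS and protein sequences can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies; the function name `translate` is hypothetical and the snippet is not part of the database record. Only the first and last few codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nt and last 15 nt of the CDS shown above.
print(translate("ATGCAATCAATCTATGCATTAGGC"))
print(translate("GATGGGTCTGCATAG"))
```

The two fragments translate to the start (`MQSIYALG`) and end (`DGSA`) of the listed protein sequence, consistent with a 771-nt CDS encoding 256 residues plus a stop codon.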