CuGenDBv2

Lag0031067 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031067
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: C2H2-type domain-containing protein
Genome location: chr11:4396945..4399438
RNA-Seq Expression: Lag0031067
Synteny: Lag0031067
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
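
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, the coordinates can be split and the gene span computed as below. The function names `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(parse_location("chr11:4396945..4399438"))
print(span_length("chr11:4396945..4399438"))  # 2494 under the inclusive assumption
```

The span covers the whole gene model on the chromosome, so it can be longer than the CDS listed further down if the gene contains introns or untranslated regions.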


Homology
GenBank top hits | e value | % identity | Alignment
KAG6606923.1 hypothetical protein SDJN03_00265, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.8e-115 | % identity: 89.17
Query:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSIHALGFSHSLQFSQSH H N KN LL+   SHGSN K+  RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVP
Subjt:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM
        FLLG+FYCTVGLIQLY+DE FSP R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM

Query:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSR
        KFFHLWYYPQANVEIFGEG+ISWTITCYFVYTPFLINLSR
Subjt:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSR

KAG7036627.1 hypothetical protein SDJN02_00246 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.8e-123 | % identity: 88.72
Query:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSIHALGFSHSLQFSQSH H N KN LL+   SHGSN K+  RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVP
Subjt:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM
        FLLG+FYCTVGLIQLY+DE FSP R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM

Query:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA
        KFFHLWYYPQANVEIFGEG+ISWTITCYFVYTPFLINLSRWLKSV+D+AAVKKD SA
Subjt:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA

XP_022949544.1 uncharacterized protein LOC111452860 [Cucurbita moschata] | e value: 1.4e-122 | % identity: 88.33
Query:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSIHALGFSHSLQFSQSH H N KN LL+   SHGSN K+  RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVP
Subjt:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM
        FLLG+FYCTVGLIQLY+DE FSP R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM

Query:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA
        KFFHLWYYPQANVEIFGEG+ISWTITCYFVYTPFLINLSRWLKSV+D+ AVKKD SA
Subjt:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA

XP_022998487.1 uncharacterized protein LOC111493103 [Cucurbita maxima] | e value: 2.8e-118 | % identity: 87.21
Query:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSIHALGFSHSLQFSQSH H N KN LL+   +HGSN ++  RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNI VP
Subjt:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM
        FLLGLFYCTVGLIQLY+DE F P R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM

Query:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWL-KSVVDAAAVKKDESA
        KFFHLWYYPQANVEIFGEG+ISWTITCYFVYTPFLINLSRWL  SVVD+AAVKKD SA
Subjt:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWL-KSVVDAAAVKKDESA

XP_038904924.1 uncharacterized protein LOC120091134 [Benincasa hispida] | e value: 6.5e-123 | % identity: 87.21
Query:  KMQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWV
        KMQSI+ALGFSHSLQFSQSH HSNSK  L KPHC+  S+     RTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWV
Subjt:  KMQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWV

Query:  PFLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPI
        PFLLGLFYC+VGLIQLY+DE FSP+++EG  GRTVASLIAL LFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GLVCPLAEIPI
Subjt:  PFLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPI

Query:  MKFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA
        MKFFHLWYYP+AN+EIFGEGI+SWTITCYFVYTPFLINLSRWLKSVVDAAA  +D SA
Subjt:  MKFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LBT1 Uncharacterized protein | e value: 2.4e-115 | % identity: 84.94
Query:  MQSIHALGFSHSLQF--SQSHLHSNSKNPLLKPHC-SHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNI
        MQSI ALGFS SLQF  S SH HSNS+  + KPHC SHGS      R SLSLRTTWPSISI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNI
Subjt:  MQSIHALGFSHSLQF--SQSHLHSNSKNPLLKPHC-SHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNI

Query:  WVPFLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEI
        WVPFLLGLFYCTVGLIQLY+DEKFS K+++GSLG+TVASLIAL LFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GL CPLAEI
Subjt:  WVPFLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEI

Query:  PIMKFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDES
        PIMKFFHLW YP+AN++IFGEGIISWT+TCYFVYTPFLINLSRWLKSVVDAAAV +DES
Subjt:  PIMKFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDES

A0A1S3BI47 uncharacterized protein LOC103489906 | e value: 2.7e-114 | % identity: 83.4
Query:  MQSIHALGFSHSLQFSQSH----LHSNSKNPLLKPHC-SHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHT
        MQSI ALGFSHSLQF  SH     HSNS+  L KPHC SHGSN     R +LSLRTTWPSISI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHT
Subjt:  MQSIHALGFSHSLQFSQSH----LHSNSKNPLLKPHC-SHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHT

Query:  NIWVPFLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLA
        NIWVPFLLGLFYCTVGLIQLY+DEKFSPK+++GSL +TVASLIAL LFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACV+GL CPLA
Subjt:  NIWVPFLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLA

Query:  EIPIMKFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVD----AAAVKKDES
        EIPIMKFFHLW YP+AN+EIFGEGIISWT+TCYFVYTPFLINLSRWLKSVVD    AAAV +D S
Subjt:  EIPIMKFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVD----AAAVKKDES

A0A6J1DRY1 uncharacterized protein LOC111023831 | e value: 3.1e-115 | % identity: 85.99
Query:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSI+ LG S  LQF Q    S SK+  LKP CSHGS S+   RTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVY+TGSID+GPLHTNIWVP
Subjt:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM
        FLLGLFY TVGL+QLY+DE FS   +EGSLGRTVASLIALALFIELSAEMYKAGVA NIEAYALFAGAE IWA LDSSLLGFSLACVVGL CPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM

Query:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA
        KFFHLW YPQANVEIFGEGIISW ITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA
Subjt:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA

A0A6J1GD48 uncharacterized protein LOC111452860 | e value: 7.0e-123 | % identity: 88.33
Query:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSIHALGFSHSLQFSQSH H N KN LL+   SHGSN K+  RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNIWVP
Subjt:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM
        FLLG+FYCTVGLIQLY+DE FSP R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM

Query:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA
        KFFHLWYYPQANVEIFGEG+ISWTITCYFVYTPFLINLSRWLKSV+D+ AVKKD SA
Subjt:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA

A0A6J1KGW6 uncharacterized protein LOC111493103 | e value: 1.4e-118 | % identity: 87.21
Query:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP
        MQSIHALGFSHSLQFSQSH H N KN LL+   +HGSN ++  RTSLSLRT+WPSISIALFGSGFLLGPLLDGLHSRVNLVVY+ GS+DIGPL TNI VP
Subjt:  MQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNSKSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVP

Query:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM
        FLLGLFYCTVGLIQLY+DE F P R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSLLGFSLACVVGL+CPLAEIPIM
Subjt:  FLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIM

Query:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWL-KSVVDAAAVKKDESA
        KFFHLWYYPQANVEIFGEG+ISWTITCYFVYTPFLINLSRWL  SVVD+AAVKKD SA
Subjt:  KFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWL-KSVVDAAAVKKDESA

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT4G01935.1 unknown protein | e value: 4.7e-79 | % identity: 58.94
Query:  SNSKNPLLKPHCSHGSN----SKSIC-----RTSLSLRTTW-PSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYCTVG
        S+    L+KP   +G N     K +C     ++  S   +W   +S++LFGSGF+LGPLLDG+HSRV+LVVY+ G+  IGPLHTNIWVPFLLGLFYCTVG
Subjt:  SNSKNPLLKPHCSHGSN----SKSIC-----RTSLSLRTTW-PSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYCTVG

Query:  LIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKFFHLWYYPQA
        L+QL +DE  S     GSL +TV SL+AL  F+ELSAEMYKAGV+ NIEAY LFA AEFIW  LD + + F++A ++G+ CPLAEIPIM+FFHLWYYP+A
Subjt:  LIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAEMYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKFFHLWYYPQA

Query:  NVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA
        N+EIFG+G+++WT TCYFVYTPFLINL+RWL++V++   ++ D S+
Subjt:  NVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA


Sequences
CDS sequence
ATGTGCAGGTTTTCAATGTTGCAGGCCACATCTTTCGAATCTCTCTATCTAATTTTGAAGTCCACACTCACTCATCAAAATCACAATCCATACACATTTGGGCTCAACAA
TCCTTCAATTTACAGCCCTCCAATTCGCCAAAATGAATTCTTGTGGATTGCAACGAAACAATTTTCAAAATTTCATCTATTCGGATCCTCAACGAACTCCAAAATGCAAT
CAATCCATGCATTGGGCTTCTCCCACTCTCTCCAATTTTCCCAATCCCATCTCCATTCGAACTCCAAAAATCCTCTGCTCAAACCCCATTGCAGCCATGGAAGCAACAGC
AAGAGCATATGCAGAACAAGCCTCAGTCTCAGAACCACTTGGCCCTCCATTTCCATCGCCCTCTTCGGCTCCGGCTTTCTCTTAGGCCCTCTTCTCGACGGACTCCATTC
GCGGGTGAATCTCGTCGTTTACCGAACAGGATCGATCGACATCGGCCCACTCCACACTAACATCTGGGTTCCTTTCTTGTTGGGATTGTTTTACTGTACTGTTGGGTTGA
TTCAACTCTACGTAGATGAGAAATTTTCGCCAAAAAGAACAGAGGGGAGTTTGGGCAGGACAGTAGCATCCTTGATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGAA
ATGTACAAAGCTGGAGTGGCAGCCAACATTGAGGCCTATGCATTGTTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCTTTGCTTGGTTTCTCACTGGCTTG
TGTTGTTGGCCTTGTCTGCCCTCTGGCTGAGATTCCCATTATGAAGTTCTTCCATCTCTGGTATTATCCACAAGCAAACGTCGAGATCTTTGGCGAGGGGATAATCAGCT
GGACAATCACTTGCTATTTTGTGTATACTCCATTCTTGATAAATTTATCAAGATGGCTCAAGTCTGTGGTGGATGCTGCTGCTGTAAAGAAAGATGAGTCTGCTTAG
mRNA sequence
ATGTGCAGGTTTTCAATGTTGCAGGCCACATCTTTCGAATCTCTCTATCTAATTTTGAAGTCCACACTCACTCATCAAAATCACAATCCATACACATTTGGGCTCAACAA
TCCTTCAATTTACAGCCCTCCAATTCGCCAAAATGAATTCTTGTGGATTGCAACGAAACAATTTTCAAAATTTCATCTATTCGGATCCTCAACGAACTCCAAAATGCAAT
CAATCCATGCATTGGGCTTCTCCCACTCTCTCCAATTTTCCCAATCCCATCTCCATTCGAACTCCAAAAATCCTCTGCTCAAACCCCATTGCAGCCATGGAAGCAACAGC
AAGAGCATATGCAGAACAAGCCTCAGTCTCAGAACCACTTGGCCCTCCATTTCCATCGCCCTCTTCGGCTCCGGCTTTCTCTTAGGCCCTCTTCTCGACGGACTCCATTC
GCGGGTGAATCTCGTCGTTTACCGAACAGGATCGATCGACATCGGCCCACTCCACACTAACATCTGGGTTCCTTTCTTGTTGGGATTGTTTTACTGTACTGTTGGGTTGA
TTCAACTCTACGTAGATGAGAAATTTTCGCCAAAAAGAACAGAGGGGAGTTTGGGCAGGACAGTAGCATCCTTGATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGAA
ATGTACAAAGCTGGAGTGGCAGCCAACATTGAGGCCTATGCATTGTTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCTTTGCTTGGTTTCTCACTGGCTTG
TGTTGTTGGCCTTGTCTGCCCTCTGGCTGAGATTCCCATTATGAAGTTCTTCCATCTCTGGTATTATCCACAAGCAAACGTCGAGATCTTTGGCGAGGGGATAATCAGCT
GGACAATCACTTGCTATTTTGTGTATACTCCATTCTTGATAAATTTATCAAGATGGCTCAAGTCTGTGGTGGATGCTGCTGCTGTAAAGAAAGATGAGTCTGCTTAG
Protein sequence
MCRFSMLQATSFESLYLILKSTLTHQNHNPYTFGLNNPSIYSPPIRQNEFLWIATKQFSKFHLFGSSTNSKMQSIHALGFSHSLQFSQSHLHSNSKNPLLKPHCSHGSNS
KSICRTSLSLRTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIDIGPLHTNIWVPFLLGLFYCTVGLIQLYVDEKFSPKRTEGSLGRTVASLIALALFIELSAE
MYKAGVAANIEAYALFAGAEFIWALLDSSLLGFSLACVVGLVCPLAEIPIMKFFHLWYYPQANVEIFGEGIISWTITCYFVYTPFLINLSRWLKSVVDAAAVKKDESA
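
The protein sequence above appears to be the translation of the CDS under the standard genetic code: the CDS opens with ATG TGC AGG TTT TCA (M C R F S) and ends with the stop codon TAG. A minimal sketch for checking this relationship follows. The `translate` helper and the codon table construction are illustrative assumptions, not CuGenDB tooling, and the sketch assumes the standard nuclear genetic code with the reading frame starting at the first base.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS from its first base, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons with unknown bases
    are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS listed in this record.
print(translate("ATGTGCAGGTTTTCA"))  # MCRFS
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the listed protein sequence is a quick way to confirm that the two sequence blocks belong to the same gene model.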