; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029199 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029199
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein ZGRF1 isoform X2
Genome locationscaffold12:35515178..35532372
RNA-Seq ExpressionSpg029199
SyntenySpg029199
Gene Ontology termsGO:0006302 - double-strand break repair (biological process)
GO:0035861 - site of double-strand break (cellular component)
InterPro domainsIPR018838 - Domain of unknown function DUF2439


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022928770.1 uncharacterized protein LOC111435594 isoform X2 [Cucurbita moschata]1.1e-12269.86Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRF+KKDETV+SGESIAFDAHLV+IGECER+HKPPKI L+QGSS GD GT VL+  KKCF+ENEISTGKEWHVLYT QITQKSKKYHNG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL
        II+ISSSGSHHMQVTLLNEDR ILSSKH+SLSK L  GE+LELPKYLVE+GEAC +VK                                     AHEIL
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL

Query:  SILQRPKARASLSSGHTS----VSVSSYNVPEPNL-AEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQS
        SILQRPKAR SLSSGH+     VSV S  VPEP+L AEA  LP+DDRS +KPSEN DTR+STKNAE NQSIALT S       TLTE++EIGHS+QLLQ+
Subjt:  SILQRPKARASLSSGHTS----VSVSSYNVPEPNL-AEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQS

Query:  DHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        +HVEA+SSS R+++SRTQGTS  AAC+LVNDEGK+CE+ITYERE   CPSFDLGI
Subjt:  DHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

XP_038874787.1 uncharacterized protein LOC120067307 isoform X1 [Benincasa hispida]7.8e-12670.17Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRFIKKDETV+SGESIAFDAHLVEIGECE+DHKPPKI  NQGSSSG+GGT VLHG+K CF+ENEISTGKEW+VLYT Q+TQKSKKYHNG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL
        II++SSSGSH  QVTLLNEDR+ILSSKH SLSKN+  GE+LELPKYLVE+GEACE+VK                                     AH+IL
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL

Query:  SILQRPKARASLSSGH----TSVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQ--------STFTGNARTLTEDVEIGH
        SILQRP+AR  LSSGH     SVSVSSYN PEP+LAEA  L IDD+S Q+PSE  D RESTKNAE NQSI LTQ        +TFTGNA TLTEDVEIGH
Subjt:  SILQRPKARASLSSGH----TSVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQ--------STFTGNARTLTEDVEIGH

Query:  SSQLLQSDHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        SSQLL+SDH EA+  S RNS+SRT+ +S +AAC LVNDEGKICE+ITYERE+DACPSFDLGI
Subjt:  SSQLLQSDHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

XP_038874788.1 uncharacterized protein LOC120067307 isoform X2 [Benincasa hispida]6.0e-12670.36Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRFIKKDETV+SGESIAFDAHLVEIGECE+DHKPPKI  NQGSSSG+GGT VLHG+K CF+ENEISTGKEW+VLYT Q+TQKSKKYHNG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK------------------------------------AHEILS
        II++SSSGSH  QVTLLNEDR+ILSSKH SLSKN+  GE+LELPKYLVE+GEACE+VK                                    AH+ILS
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK------------------------------------AHEILS

Query:  ILQRPKARASLSSGH----TSVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQ--------STFTGNARTLTEDVEIGHS
        ILQRP+AR  LSSGH     SVSVSSYN PEP+LAEA  L IDD+S Q+PSE  D RESTKNAE NQSI LTQ        +TFTGNA TLTEDVEIGHS
Subjt:  ILQRPKARASLSSGH----TSVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQ--------STFTGNARTLTEDVEIGHS

Query:  SQLLQSDHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        SQLL+SDH EA+  S RNS+SRT+ +S +AAC LVNDEGKICE+ITYERE+DACPSFDLGI
Subjt:  SQLLQSDHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

XP_038874789.1 uncharacterized protein LOC120067307 isoform X3 [Benincasa hispida]8.3e-12871.75Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRFIKKDETV+SGESIAFDAHLVEIGECE+DHKPPKI  NQGSSSG+GGT VLHG+K CF+ENEISTGKEW+VLYT Q+TQKSKKYHNG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL
        II++SSSGSH  QVTLLNEDR+ILSSKH SLSKN+  GE+LELPKYLVE+GEACE+VK                                     AH+IL
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL

Query:  SILQRPKARASLSSGH----TSVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQSD
        SILQRP+AR  LSSGH     SVSVSSYN PEP+LAEA  L IDD+S Q+PSE  D RESTKNAE NQSI LTQ TFTGNA TLTEDVEIGHSSQLL+SD
Subjt:  SILQRPKARASLSSGH----TSVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQSD

Query:  HVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        H EA+  S RNS+SRT+ +S +AAC LVNDEGKICE+ITYERE+DACPSFDLGI
Subjt:  HVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

XP_038874791.1 uncharacterized protein LOC120067307 isoform X5 [Benincasa hispida]7.8e-12670.17Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRFIKKDETV+SGESIAFDAHLVEIGECE+DHKPPKI  NQGSSSG+GGT VLHG+K CF+ENEISTGKEW+VLYT Q+TQKSKKYHNG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL
        II++SSSGSH  QVTLLNEDR+ILSSKH SLSKN+  GE+LELPKYLVE+GEACE+VK                                     AH+IL
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL

Query:  SILQRPKARASLSSGH----TSVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQ--------STFTGNARTLTEDVEIGH
        SILQRP+AR  LSSGH     SVSVSSYN PEP+LAEA  L IDD+S Q+PSE  D RESTKNAE NQSI LTQ        +TFTGNA TLTEDVEIGH
Subjt:  SILQRPKARASLSSGH----TSVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQ--------STFTGNARTLTEDVEIGH

Query:  SSQLLQSDHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        SSQLL+SDH EA+  S RNS+SRT+ +S +AAC LVNDEGKICE+ITYERE+DACPSFDLGI
Subjt:  SSQLLQSDHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

TrEMBL top hitse value%identityAlignment
A0A6J1CVV0 uncharacterized protein LOC111015274 isoform X13.8e-11868.93Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRFIKKDE V+SGES+AF+AHLVEIGECERD KPPKIALNQGS+SGDGGT + HGQKK  NENEISTGKEWHVLYT QITQKSKKY NG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL
        IIRISSSGSHH+QVTLLNEDR ILSSKHISLSK+LM G +LELPKYLVEVGEACESVK                                     AHEIL
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL

Query:  SILQRPKARASLSSGHT----SVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQSD
        SILQRPKAR +LSSGHT    SVSVSSY  PE +  +   + +DDRS  +PS N D R+S KNAE NQSI LTQSTFT   RTL EDVEI HSSQLLQSD
Subjt:  SILQRPKARASLSSGHT----SVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQSD

Query:  HVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        +VE +S S RNS +  QG   SAACDLV+DE KI E+ T +R+IDACPSFDLGI
Subjt:  HVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

A0A6J1CXU9 uncharacterized protein LOC111015274 isoform X22.9e-11869.12Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRFIKKDE V+SGES+AF+AHLVEIGECERD KPPKIALNQGS+SGDGGT + HGQKK  NENEISTGKEWHVLYT QITQKSKKY NG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK------------------------------------AHEILS
        IIRISSSGSHH+QVTLLNEDR ILSSKHISLSK+LM G +LELPKYLVEVGEACESVK                                    AHEILS
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK------------------------------------AHEILS

Query:  ILQRPKARASLSSGHT----SVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQSDH
        ILQRPKAR +LSSGHT    SVSVSSY  PE +  +   + +DDRS  +PS N D R+S KNAE NQSI LTQSTFT   RTL EDVEI HSSQLLQSD+
Subjt:  ILQRPKARASLSSGHT----SVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQSDH

Query:  VEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        VE +S S RNS +  QG   SAACDLV+DE KI E+ T +R+IDACPSFDLGI
Subjt:  VEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

A0A6J1ESH9 uncharacterized protein LOC111435594 isoform X25.1e-12369.86Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRF+KKDETV+SGESIAFDAHLV+IGECER+HKPPKI L+QGSS GD GT VL+  KKCF+ENEISTGKEWHVLYT QITQKSKKYHNG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL
        II+ISSSGSHHMQVTLLNEDR ILSSKH+SLSK L  GE+LELPKYLVE+GEAC +VK                                     AHEIL
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL

Query:  SILQRPKARASLSSGHTS----VSVSSYNVPEPNL-AEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQS
        SILQRPKAR SLSSGH+     VSV S  VPEP+L AEA  LP+DDRS +KPSEN DTR+STKNAE NQSIALT S       TLTE++EIGHS+QLLQ+
Subjt:  SILQRPKARASLSSGHTS----VSVSSYNVPEPNL-AEASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQS

Query:  DHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        +HVEA+SSS R+++SRTQGTS  AAC+LVNDEGK+CE+ITYERE   CPSFDLGI
Subjt:  DHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

A0A6J1HXZ5 protein ZGRF1 isoform X44.9e-11868.73Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRF+KKDE V+SGESIAFDAHLV+IGECER+HKPPKI ++QGSS GD GT VLH  KKCF+ENEISTGKEWHVLYT QITQKSKKYHNG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL
        II+ISSSGSHHMQVTLLNEDR ILSSKHISLSK L  GE+LELPKYLVE+GEACE+VK                                     AHEIL
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL

Query:  SILQRPKARASLSSGHT----SVSVSSYNVPEPNLA-EASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQS
        SILQRPKAR SLSSG +    SVSVSS  VPEP+LA EA  LP+D+RS QKPSEN DTRESTKNAE NQS ALTQST T        ++EIGHS+   Q+
Subjt:  SILQRPKARASLSSGHT----SVSVSSYNVPEPNLA-EASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQS

Query:  DHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        ++VEA+SSS R+++S TQGTS  AAC LVNDEGK+CE+ITYERE   CPSFDLGI
Subjt:  DHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

A0A6J1I1P0 protein ZGRF1 isoform X22.8e-12169.58Show/hide
Query:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG
        MLFDENRKLLDSRF+KKDE V+SGESIAFDAHLV+IGECER+HKPPKI ++QGSS GD GT VLH  KKCF+ENEISTGKEWHVLYT QITQKSKKYHNG
Subjt:  MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNG

Query:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL
        II+ISSSGSHHMQVTLLNEDR ILSSKHISLSK L  GE+LELPKYLVE+GEACE+VK                                     AHEIL
Subjt:  IIRISSSGSHHMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVK-------------------------------------AHEIL

Query:  SILQRPKARASLSSGHT----SVSVSSYNVPEPNLA-EASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQS
        SILQRPKAR SLSSG +    SVSVSS  VPEP+LA EA  LP+D+RS QKPSEN DTRESTKNAE NQS ALTQST T        ++EIGHS+QLLQ+
Subjt:  SILQRPKARASLSSGHT----SVSVSSYNVPEPNLA-EASQLPIDDRSLQKPSENFDTRESTKNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQS

Query:  DHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI
        ++VEA+SSS R+++S TQGTS  AAC LVNDEGK+CE+ITYERE   CPSFDLGI
Subjt:  DHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGGTTTATTAAGAAAGATGAAACAGTAAGATCTGGAGAATCAATAGCCTTTGATGCTCATTTAGTGGAAATTGG
AGAATGTGAAAGGGACCATAAGCCTCCTAAAATTGCTTTAAATCAAGGTAGCAGTTCTGGAGATGGGGGAACCGGGGTACTGCATGGACAGAAAAAATGTTTCAATGAAA
ATGAAATATCAACTGGAAAAGAATGGCATGTTTTGTACACTGGCCAGATAACTCAGAAGTCCAAGAAATATCACAATGGGATCATCAGAATTTCCTCCTCTGGCTCTCAC
CATATGCAGGTTACTTTACTGAATGAAGATAGAGCTATATTAAGCAGCAAACACATCAGTTTATCTAAAAATTTGATGACGGGGGAGATGCTTGAGCTACCAAAATACTT
GGTGGAGGTTGGTGAGGCATGTGAAAGTGTTAAAGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGGCTAGAGCGAGCCTTTCTTCAGGTCATACTAGCGTATCAG
TTTCGTCATACAACGTTCCTGAACCTAACCTTGCGGAGGCATCGCAACTTCCAATAGATGACCGATCTCTTCAAAAGCCAAGTGAAAACTTTGACACGAGGGAATCAACT
AAGAATGCAGAAAAGAACCAATCCATTGCTCTAACTCAATCGACATTCACTGGTAATGCCAGAACGTTGACAGAAGATGTTGAAATTGGACACTCCAGCCAGCTTCTTCA
GTCAGACCACGTGGAAGCCAAAAGTAGTTCTCCTAGAAATTCAGTTTCTAGGACGCAAGGTACGAGTGGCTCTGCTGCTTGTGACCTTGTTAATGATGAAGGGAAAATCT
GCGAGGACATTACATATGAAAGAGAAATAGATGCATGCCCAAGTTTTGATCTTGGAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTTTGATGAAAACAGGAAACTATTGGATAGCAGGTTTATTAAGAAAGATGAAACAGTAAGATCTGGAGAATCAATAGCCTTTGATGCTCATTTAGTGGAAATTGG
AGAATGTGAAAGGGACCATAAGCCTCCTAAAATTGCTTTAAATCAAGGTAGCAGTTCTGGAGATGGGGGAACCGGGGTACTGCATGGACAGAAAAAATGTTTCAATGAAA
ATGAAATATCAACTGGAAAAGAATGGCATGTTTTGTACACTGGCCAGATAACTCAGAAGTCCAAGAAATATCACAATGGGATCATCAGAATTTCCTCCTCTGGCTCTCAC
CATATGCAGGTTACTTTACTGAATGAAGATAGAGCTATATTAAGCAGCAAACACATCAGTTTATCTAAAAATTTGATGACGGGGGAGATGCTTGAGCTACCAAAATACTT
GGTGGAGGTTGGTGAGGCATGTGAAAGTGTTAAAGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGGCTAGAGCGAGCCTTTCTTCAGGTCATACTAGCGTATCAG
TTTCGTCATACAACGTTCCTGAACCTAACCTTGCGGAGGCATCGCAACTTCCAATAGATGACCGATCTCTTCAAAAGCCAAGTGAAAACTTTGACACGAGGGAATCAACT
AAGAATGCAGAAAAGAACCAATCCATTGCTCTAACTCAATCGACATTCACTGGTAATGCCAGAACGTTGACAGAAGATGTTGAAATTGGACACTCCAGCCAGCTTCTTCA
GTCAGACCACGTGGAAGCCAAAAGTAGTTCTCCTAGAAATTCAGTTTCTAGGACGCAAGGTACGAGTGGCTCTGCTGCTTGTGACCTTGTTAATGATGAAGGGAAAATCT
GCGAGGACATTACATATGAAAGAGAAATAGATGCATGCCCAAGTTTTGATCTTGGAATTTGA
Protein sequenceShow/hide protein sequence
MLFDENRKLLDSRFIKKDETVRSGESIAFDAHLVEIGECERDHKPPKIALNQGSSSGDGGTGVLHGQKKCFNENEISTGKEWHVLYTGQITQKSKKYHNGIIRISSSGSH
HMQVTLLNEDRAILSSKHISLSKNLMTGEMLELPKYLVEVGEACESVKAHEILSILQRPKARASLSSGHTSVSVSSYNVPEPNLAEASQLPIDDRSLQKPSENFDTREST
KNAEKNQSIALTQSTFTGNARTLTEDVEIGHSSQLLQSDHVEAKSSSPRNSVSRTQGTSGSAACDLVNDEGKICEDITYEREIDACPSFDLGI