; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039724 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039724
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionMitochondrial transcription termination factor family protein
Genome locationscaffold10:47010819..47014195
RNA-Seq ExpressionSpg039724
SyntenySpg039724
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578327.1 hypothetical protein SDJN03_22775, partial [Cucurbita argyrosperma subsp. sororia]1.4e-10680.08Show/hide
Query:  QSSIVSMASDLTC---GTDFERDPGMLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLIL
        Q  + ++  D  C    T   +  GMLGKSLVSP+STVDS TRLCFST CI TA+ +AVCSNV FSY   KYMI DVRRQ P PRKWEIHSTAQVESL+L
Subjt:  QSSIVSMASDLTC---GTDFERDPGMLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLIL

Query:  SDEDQKTWEACRQALSMFNFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDK
        SDED+KTWEACRQALSMF+FS EEQDKMLGKAFGHIHSPYWGEDRKKEVPNI+TVNEILEYLRTLGLS++DL KLLKKFPE LGC+LEQELK NVQLL+K
Subjt:  SDEDQKTWEACRQALSMFNFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDK

Query:  EWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAKCTRCWVRF
        EWGIQGKSLR+LLLRNPKVLGYNVDCKGDC+AKCTRCWVRF
Subjt:  EWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAKCTRCWVRF

XP_022152523.1 uncharacterized protein LOC111020227 isoform X2 [Momordica charantia]7.9e-10284.72Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ
        MLGK+LVSP+STVDSATR CFST   VTA+PD  CSNV FS    K + KD RR  P PRKWEIHSTAQVESLILSDED+KTWEACRQALSMF+FS EEQ
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP IE VN+ILEYLRTLGLSD+DL KLLKKFPE LGC+LEQELK N+QLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDCMAKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

XP_022938853.1 uncharacterized protein LOC111444935 [Cucurbita moschata]1.2e-10587.04Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ
        MLGKSLVSP+STVDS TRLCFST CI TA+ +AVCSNV FSY   KYMI DVRRQ P PRKWEIHSTAQVESL+LSDED+KTWEACRQALSMF+FS EEQ
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVPNI+TVNEILEYLRTLGLS++DL KLLKKFPE LGC+LEQELK NVQLL+KEWGIQGKSLR+LLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

XP_022993737.1 uncharacterized protein LOC111489650 [Cucurbita maxima]3.4e-10586.57Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ
        MLGKSLVSP+STVDS TRLCFST CI TA+  AVCSNV FSY   KYMI DVRRQ P PRKWEIHST+QVESL+LSD+D+KTWEACRQALSMF+FS EEQ
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVPNI+TVNEILEYLRTLGLS++DL KLLKKFPE LGC+LEQELK NVQLL+KEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

XP_023549996.1 uncharacterized protein LOC111808316 [Cucurbita pepo subsp. pepo]5.9e-10586.57Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ
        MLGKSLVSP+STVDS TRLCFST C  TA+ +AVCSNV FSY   KYMI DVRRQ P PRKWEIHSTAQVESL+LSDED+KTWEACRQALS F+FS EEQ
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVPNI+TVNEILEYLRTLGLS++DL KLLKKFPE LGC+LEQELK NVQLL+KEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

TrEMBL top hitse value%identityAlignment
A0A1S3B2X0 uncharacterized protein LOC1034854185.5e-10183.33Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ
        MLGKSL SP+ST+DSATR C ST C  TA  DAVCSNV FSYH  K+ IK VRRQ P PRKWE+ S+ QVESLILSDED+KTWEACRQALS+F+FSVEEQ
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKK+VPNIE VN+ILEYLRTLGLS++DL KLLKKFPE LGC+LEQELK NVQLLDKEWGIQGKSLRNLLLRNPKVLGY VD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

A0A6J1DF32 uncharacterized protein LOC111020227 isoform X23.8e-10284.72Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ
        MLGK+LVSP+STVDSATR CFST   VTA+PD  CSNV FS    K + KD RR  P PRKWEIHSTAQVESLILSDED+KTWEACRQALSMF+FS EEQ
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP IE VN+ILEYLRTLGLSD+DL KLLKKFPE LGC+LEQELK N+QLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDCMAKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

A0A6J1DI00 uncharacterized protein LOC111020227 isoform X12.7e-10081.06Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCI-----------VTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQA
        MLGK+LVSP+STVDSATR CFST  I           VTA+PD  CSNV FS    K + KD RR  P PRKWEIHSTAQVESLILSDED+KTWEACRQA
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCI-----------VTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQA

Query:  LSMFNFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLL
        LSMF+FS EEQDKMLGKAFGHIHSPYWGEDRKKEVP IE VN+ILEYLRTLGLSD+DL KLLKKFPE LGC+LEQELK N+QLLDKEWGIQGKSLRNLLL
Subjt:  LSMFNFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLL

Query:  RNPKVLGYNVDCKGDCMAKCTRCWVRF
        RNPKVLGYNVDCKGDCMAKCTRCWVRF
Subjt:  RNPKVLGYNVDCKGDCMAKCTRCWVRF

A0A6J1FK31 uncharacterized protein LOC1114449355.7e-10687.04Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ
        MLGKSLVSP+STVDS TRLCFST CI TA+ +AVCSNV FSY   KYMI DVRRQ P PRKWEIHSTAQVESL+LSDED+KTWEACRQALSMF+FS EEQ
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVPNI+TVNEILEYLRTLGLS++DL KLLKKFPE LGC+LEQELK NVQLL+KEWGIQGKSLR+LLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

A0A6J1JTP6 uncharacterized protein LOC1114896501.7e-10586.57Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ
        MLGKSLVSP+STVDS TRLCFST CI TA+  AVCSNV FSY   KYMI DVRRQ P PRKWEIHST+QVESL+LSD+D+KTWEACRQALSMF+FS EEQ
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWEACRQALSMFNFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVPNI+TVNEILEYLRTLGLS++DL KLLKKFPE LGC+LEQELK NVQLL+KEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

SwissProt top hitse value%identityAlignment
F4IHL3 Transcription termination factor MTERF2, chloroplastic7.9e-0427.19Show/hide
Query:  SVEEQDKMLGKAFGHIHSPYWGEDRKKEVPNI-------ETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLL
        S+EE+ K L K F ++  P  G  R   V  I       +T+   + +L+ +G+ +E +  +L KFP  L   L ++++P V  L    G+  K +  ++
Subjt:  SVEEQDKMLGKAFGHIHSPYWGEDRKKEVPNI-------ETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLL

Query:  LRNPKVLGYNVDCK
          +P +LG ++  K
Subjt:  LRNPKVLGYNVDCK

Q9ZT96 Transcription termination factor MTERF4, chloroplastic1.9e-0530.38Show/hide
Query:  VNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK
        +  ++EYL  LG+      +L++K P  LG +L+  +KPNVQ+L +++ ++  SL +++ + P+++G ++  K D   K
Subjt:  VNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK

Arabidopsis top hitse value%identityAlignment
AT2G21710.1 Mitochondrial transcription termination factor family protein5.6e-0527.19Show/hide
Query:  SVEEQDKMLGKAFGHIHSPYWGEDRKKEVPNI-------ETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLL
        S+EE+ K L K F ++  P  G  R   V  I       +T+   + +L+ +G+ +E +  +L KFP  L   L ++++P V  L    G+  K +  ++
Subjt:  SVEEQDKMLGKAFGHIHSPYWGEDRKKEVPNI-------ETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLL

Query:  LRNPKVLGYNVDCK
          +P +LG ++  K
Subjt:  LRNPKVLGYNVDCK

AT2G44020.1 Mitochondrial transcription termination factor family protein8.1e-0430.14Show/hide
Query:  VNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCK
        +  +++YL ++GL  + + ++L+K    +G +LE+ +KPNV  L   +G++ + L  L+ + P++LG  V  K
Subjt:  VNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCK

AT4G02990.1 Mitochondrial transcription termination factor family protein1.3e-0630.38Show/hide
Query:  VNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK
        +  ++EYL  LG+      +L++K P  LG +L+  +KPNVQ+L +++ ++  SL +++ + P+++G ++  K D   K
Subjt:  VNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK

AT4G09620.1 Mitochondrial transcription termination factor family protein1.8e-6758.26Show/hide
Query:  MLGKSLVSPLSTVDSATRLCFSTHCI-VTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIH-STAQVESLILSDEDQKTWEACRQALSMFNFSVE
        M+G SL SPL+T+ SA   CF    + +    + +   +  SYH    +   VR       KW +  ST QVE+   S+ED   WE C++ALS F+FSVE
Subjt:  MLGKSLVSPLSTVDSATRLCFSTHCI-VTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIH-STAQVESLILSDEDQKTWEACRQALSMFNFSVE

Query:  EQDKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYN
        E+DK+LGKAFGHIHSPYW E+R KE P +ET+N+ILE+LR+LGLSDEDL K++KKFPE LGC LE+E+KPN+ +L+ +WGI GK LRNLLLRNPKVLGYN
Subjt:  EQDKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVLGYN

Query:  VDCKGDCMAKCTRCWVRF
        VDCKGDC+A+CTRCWVRF
Subjt:  VDCKGDCMAKCTRCWVRF

AT5G54180.1 plastid transcriptionally active 154.8e-0429.58Show/hide
Query:  NIETVNEILEYLRTL-GLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVL
        ++E +N  +E+L++  GL+ E +FK++  FP  +    E++L+P ++ L KE G     +   L + P +L
Subjt:  NIETVNEILEYLRTL-GLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCTTTTCAAAGCTCTATTGTGAGCATGGCCTCCGATCTTACCTGTGGAACTGACTTTGAAAGAGATCCTGGGATGCTAGGAAAATCATTGGTGTCTCCCCTATC
AACAGTTGATTCTGCAACTCGTCTCTGCTTTTCTACTCACTGCATTGTCACAGCCCAACCAGATGCTGTATGTTCAAATGTGAGGTTCTCCTATCATCTTCCGAAGTATA
TGATAAAGGATGTGAGGAGACAAATTCCCCGTCCTAGGAAGTGGGAAATCCATTCAACTGCTCAAGTTGAAAGCTTAATATTAAGTGATGAGGATCAGAAGACGTGGGAA
GCTTGTCGACAAGCTCTGTCCATGTTCAACTTCAGTGTTGAGGAGCAAGATAAGATGCTCGGAAAGGCATTTGGCCACATTCATTCACCCTACTGGGGCGAAGACAGAAA
GAAGGAAGTCCCAAATATTGAAACTGTAAATGAGATACTGGAATATCTGAGGACACTTGGCCTTTCTGATGAAGATCTCTTTAAGCTGCTCAAAAAATTCCCAGAAGCTC
TTGGCTGCGATCTTGAGCAGGAGCTGAAACCCAACGTACAATTGTTGGACAAGGAGTGGGGAATTCAAGGAAAATCACTCAGGAATCTTCTTCTGCGTAATCCCAAGGTA
TTGGGTTACAATGTTGATTGTAAAGGAGATTGCATGGCAAAATGCACGAGATGCTGGGTTCGATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATTCTTTTCAAAGCTCTATTGTGAGCATGGCCTCCGATCTTACCTGTGGAACTGACTTTGAAAGAGATCCTGGGATGCTAGGAAAATCATTGGTGTCTCCCCTATC
AACAGTTGATTCTGCAACTCGTCTCTGCTTTTCTACTCACTGCATTGTCACAGCCCAACCAGATGCTGTATGTTCAAATGTGAGGTTCTCCTATCATCTTCCGAAGTATA
TGATAAAGGATGTGAGGAGACAAATTCCCCGTCCTAGGAAGTGGGAAATCCATTCAACTGCTCAAGTTGAAAGCTTAATATTAAGTGATGAGGATCAGAAGACGTGGGAA
GCTTGTCGACAAGCTCTGTCCATGTTCAACTTCAGTGTTGAGGAGCAAGATAAGATGCTCGGAAAGGCATTTGGCCACATTCATTCACCCTACTGGGGCGAAGACAGAAA
GAAGGAAGTCCCAAATATTGAAACTGTAAATGAGATACTGGAATATCTGAGGACACTTGGCCTTTCTGATGAAGATCTCTTTAAGCTGCTCAAAAAATTCCCAGAAGCTC
TTGGCTGCGATCTTGAGCAGGAGCTGAAACCCAACGTACAATTGTTGGACAAGGAGTGGGGAATTCAAGGAAAATCACTCAGGAATCTTCTTCTGCGTAATCCCAAGGTA
TTGGGTTACAATGTTGATTGTAAAGGAGATTGCATGGCAAAATGCACGAGATGCTGGGTTCGATTCTAG
Protein sequenceShow/hide protein sequence
MNSFQSSIVSMASDLTCGTDFERDPGMLGKSLVSPLSTVDSATRLCFSTHCIVTAQPDAVCSNVRFSYHLPKYMIKDVRRQIPRPRKWEIHSTAQVESLILSDEDQKTWE
ACRQALSMFNFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPNIETVNEILEYLRTLGLSDEDLFKLLKKFPEALGCDLEQELKPNVQLLDKEWGIQGKSLRNLLLRNPKV
LGYNVDCKGDCMAKCTRCWVRF