; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014724 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014724
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionMitochondrial transcription termination factor family protein
Genome locationtig00001047:406506..409882
RNA-Seq ExpressionSgr014724
SyntenySgr014724
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578327.1 hypothetical protein SDJN03_22775, partial [Cucurbita argyrosperma subsp. sororia]4.7e-10485.39Show/hide
Query:  LAWMLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSV
        ++ MLGKSL SPISTVDS  RLCFSTP + TAK +AVCSNVSFSY+  K +I DVRR SP PRK EIHSTAQVESL+LSD+DKKTWEACRQALSMFSFS 
Subjt:  LAWMLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSV

Query:  EEQDKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGY
        EEQDKMLGKAFGHIHSPYWGEDRKKEVP I+TVN+ILEYL TLGLS+DDLSKLLKKFPEVLGCNLEQELKTNVQ+L+KEWGIQGKSLR+LLLRNPKVLGY
Subjt:  EEQDKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGY

Query:  NVDCKGDCMAKCTRCWVRF
        NVDCKGDC+AKCTRCWVRF
Subjt:  NVDCKGDCMAKCTRCWVRF

XP_022152521.1 uncharacterized protein LOC111020227 isoform X1 [Momordica charantia]1.7e-10685.46Show/hide
Query:  MLGKSLASPISTVDSAIRLCFST-----------PGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQA
        MLGK+L SPISTVDSA R CFST           PG  TAKPD  CSNVSFS R  KNV KD RRHSPCPRK EIHSTAQVESLILSD+DKKTWEACRQA
Subjt:  MLGKSLASPISTVDSAIRLCFST-----------PGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQA

Query:  LSMFSFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLL
        LSMFSFS EEQDKMLGKAFGHIHSPYWGEDRKKEVP+IE VNDILEYL TLGLSDDDL KLLKKFPEVLGCNLEQELKTN+Q+LDKEWGIQGKSLRNLLL
Subjt:  LSMFSFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLL

Query:  RNPKVLGYNVDCKGDCMAKCTRCWVRF
        RNPKVLGYNVDCKGDCMAKCTRCWVRF
Subjt:  RNPKVLGYNVDCKGDCMAKCTRCWVRF

XP_022152523.1 uncharacterized protein LOC111020227 isoform X2 [Momordica charantia]4.9e-10989.81Show/hide
Query:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ
        MLGK+L SPISTVDSA R CFSTPG  TAKPD  CSNVSFS R  KNV KD RRHSPCPRK EIHSTAQVESLILSD+DKKTWEACRQALSMFSFS EEQ
Subjt:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP+IE VNDILEYL TLGLSDDDL KLLKKFPEVLGCNLEQELKTN+Q+LDKEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDCMAKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

XP_022938853.1 uncharacterized protein LOC111444935 [Cucurbita moschata]6.1e-10486.57Show/hide
Query:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ
        MLGKSL SPISTVDS  RLCFSTP + TAK +AVCSNVSFSY+  K +I DVRR SP PRK EIHSTAQVESL+LSD+DKKTWEACRQALSMFSFS EEQ
Subjt:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP I+TVN+ILEYL TLGLS+DDLSKLLKKFPEVLGCNLEQELKTNVQ+L+KEWGIQGKSLR+LLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

XP_022993737.1 uncharacterized protein LOC111489650 [Cucurbita maxima]2.8e-10487.04Show/hide
Query:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ
        MLGKSL SPISTVDS  RLCFSTP + TAK  AVCSNVSFSY+  K +I DVRR SP PRK EIHST+QVESL+LSDDDKKTWEACRQALSMFSFS EEQ
Subjt:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP I+TVN+ILEYL TLGLS+DDLSKLLKKFPEVLGCNLEQELKTNVQ+L+KEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

TrEMBL top hitse value%identityAlignment
A0A1S3B2X0 uncharacterized protein LOC1034854187.6e-10084.72Show/hide
Query:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ
        MLGKSLASPIST+DSA R C ST   ATA  DAVCSNVSFSY   K+ IK VRR +P PRK E+ S+ QVESLILSD+DKKTWEACRQALS+FSFSVEEQ
Subjt:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKK+VP IE VNDILEYL TLGLS+DDLSKLLKKFPEVLGCNLEQELKTNVQ+LDKEWGIQGKSLRNLLLRNPKVLGY VD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

A0A6J1DF32 uncharacterized protein LOC111020227 isoform X22.4e-10989.81Show/hide
Query:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ
        MLGK+L SPISTVDSA R CFSTPG  TAKPD  CSNVSFS R  KNV KD RRHSPCPRK EIHSTAQVESLILSD+DKKTWEACRQALSMFSFS EEQ
Subjt:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP+IE VNDILEYL TLGLSDDDL KLLKKFPEVLGCNLEQELKTN+Q+LDKEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDCMAKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

A0A6J1DI00 uncharacterized protein LOC111020227 isoform X18.4e-10785.46Show/hide
Query:  MLGKSLASPISTVDSAIRLCFST-----------PGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQA
        MLGK+L SPISTVDSA R CFST           PG  TAKPD  CSNVSFS R  KNV KD RRHSPCPRK EIHSTAQVESLILSD+DKKTWEACRQA
Subjt:  MLGKSLASPISTVDSAIRLCFST-----------PGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQA

Query:  LSMFSFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLL
        LSMFSFS EEQDKMLGKAFGHIHSPYWGEDRKKEVP+IE VNDILEYL TLGLSDDDL KLLKKFPEVLGCNLEQELKTN+Q+LDKEWGIQGKSLRNLLL
Subjt:  LSMFSFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLL

Query:  RNPKVLGYNVDCKGDCMAKCTRCWVRF
        RNPKVLGYNVDCKGDCMAKCTRCWVRF
Subjt:  RNPKVLGYNVDCKGDCMAKCTRCWVRF

A0A6J1FK31 uncharacterized protein LOC1114449353.0e-10486.57Show/hide
Query:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ
        MLGKSL SPISTVDS  RLCFSTP + TAK +AVCSNVSFSY+  K +I DVRR SP PRK EIHSTAQVESL+LSD+DKKTWEACRQALSMFSFS EEQ
Subjt:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP I+TVN+ILEYL TLGLS+DDLSKLLKKFPEVLGCNLEQELKTNVQ+L+KEWGIQGKSLR+LLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

A0A6J1JTP6 uncharacterized protein LOC1114896501.3e-10487.04Show/hide
Query:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ
        MLGKSL SPISTVDS  RLCFSTP + TAK  AVCSNVSFSY+  K +I DVRR SP PRK EIHST+QVESL+LSDDDKKTWEACRQALSMFSFS EEQ
Subjt:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP I+TVN+ILEYL TLGLS+DDLSKLLKKFPEVLGCNLEQELKTNVQ+L+KEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

SwissProt top hitse value%identityAlignment
Q9ZT96 Transcription termination factor MTERF4, chloroplastic5.2e-0529.11Show/hide
Query:  VNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK
        +  ++EYL  LG+     ++L++K P +LG  L+  +K NVQ+L +++ ++  SL +++ + P+++G ++  K D   K
Subjt:  VNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK

Arabidopsis top hitse value%identityAlignment
AT2G44020.1 Mitochondrial transcription termination factor family protein4.5e-0430.14Show/hide
Query:  VNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVDCK
        +  +++YL ++GL    ++++L+K   ++G NLE+ +K NV  L   +G++ + L  L+ + P++LG  V  K
Subjt:  VNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVDCK

AT4G02990.1 Mitochondrial transcription termination factor family protein3.7e-0629.11Show/hide
Query:  VNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK
        +  ++EYL  LG+     ++L++K P +LG  L+  +K NVQ+L +++ ++  SL +++ + P+++G ++  K D   K
Subjt:  VNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK

AT4G09620.1 Mitochondrial transcription termination factor family protein3.0e-6457.73Show/hide
Query:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKN---VIKDVRRHSPCPRKSEIH-STAQVESLILSDDDKKTWEACRQALSMFSFS
        M+G SLASP++T+ SA   CF    V     D V        RL  +   V   VR  S    K  +  ST QVE+   S++D   WE C++ALS F FS
Subjt:  MLGKSLASPISTVDSAIRLCFSTPGVATAKPDAVCSNVSFSYRLGKN---VIKDVRRHSPCPRKSEIH-STAQVESLILSDDDKKTWEACRQALSMFSFS

Query:  VEEQDKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLG
        VEE+DK+LGKAFGHIHSPYW E+R KE P++ET+N ILE+L +LGLSD+DL K++KKFPEVLGC+LE+E+K N+ +L+ +WGI GK LRNLLLRNPKVLG
Subjt:  VEEQDKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEYLSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLG

Query:  YNVDCKGDCMAKCTRCWVRF
        YNVDCKGDC+A+CTRCWVRF
Subjt:  YNVDCKGDCMAKCTRCWVRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCGATGGCGAACATGGCTATTGGCCGCAAGCAAAGGACAAGAACTAAGAAGAGTGCTCTCTCAGCCTCAGCCAACCTGTGGTCCGCATCCGCCTAATTGTAGCGG
GCTGACCCTTCTTCTTCAACCTCCGCTGCACAATCGCAACTCTACGCTTGACTTGAATCTCTCTGCGAGTAATTGCCCGCCGAGCTCCTCAAATTTCAGTTTTTATCGAA
ATTCCAATGCTCAAAAGCTCCTCCTGGCTTGGATGCTAGGAAAATCATTGGCGTCTCCCATATCAACAGTTGATTCTGCAATTCGCCTCTGCTTTTCTACTCCTGGCGTT
GCCACAGCCAAGCCAGATGCTGTATGTTCAAATGTGAGTTTCTCCTATCGTCTTGGAAAGAATGTGATAAAGGATGTCAGGAGACATAGTCCCTGTCCCAGGAAGTCGGA
AATCCATTCAACTGCTCAAGTTGAAAGCTTGATATTAAGTGATGATGATAAGAAGACATGGGAAGCTTGCCGTCAAGCTCTGTCCATGTTCAGCTTCAGTGTTGAGGAGC
AAGATAAGATGCTAGGAAAGGCGTTCGGCCACATTCATTCACCCTACTGGGGCGAAGACAGAAAGAAGGAAGTCCCACAGATTGAAACTGTAAATGACATACTGGAATAC
CTGAGTACACTTGGCCTTTCTGATGATGATCTCTCTAAGCTGCTAAAAAAATTCCCTGAAGTTCTTGGATGCAATCTTGAGCAGGAGCTGAAAACCAACGTACAGGTGTT
GGATAAGGAGTGGGGAATTCAAGGCAAATCACTCAGGAATCTTCTTCTGCGTAATCCCAAGGTATTGGGTTACAATGTTGATTGTAAAGGAGACTGTATGGCAAAATGCA
CCAGATGCTGGGTTCGATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGCGATGGCGAACATGGCTATTGGCCGCAAGCAAAGGACAAGAACTAAGAAGAGTGCTCTCTCAGCCTCAGCCAACCTGTGGTCCGCATCCGCCTAATTGTAGCGG
GCTGACCCTTCTTCTTCAACCTCCGCTGCACAATCGCAACTCTACGCTTGACTTGAATCTCTCTGCGAGTAATTGCCCGCCGAGCTCCTCAAATTTCAGTTTTTATCGAA
ATTCCAATGCTCAAAAGCTCCTCCTGGCTTGGATGCTAGGAAAATCATTGGCGTCTCCCATATCAACAGTTGATTCTGCAATTCGCCTCTGCTTTTCTACTCCTGGCGTT
GCCACAGCCAAGCCAGATGCTGTATGTTCAAATGTGAGTTTCTCCTATCGTCTTGGAAAGAATGTGATAAAGGATGTCAGGAGACATAGTCCCTGTCCCAGGAAGTCGGA
AATCCATTCAACTGCTCAAGTTGAAAGCTTGATATTAAGTGATGATGATAAGAAGACATGGGAAGCTTGCCGTCAAGCTCTGTCCATGTTCAGCTTCAGTGTTGAGGAGC
AAGATAAGATGCTAGGAAAGGCGTTCGGCCACATTCATTCACCCTACTGGGGCGAAGACAGAAAGAAGGAAGTCCCACAGATTGAAACTGTAAATGACATACTGGAATAC
CTGAGTACACTTGGCCTTTCTGATGATGATCTCTCTAAGCTGCTAAAAAAATTCCCTGAAGTTCTTGGATGCAATCTTGAGCAGGAGCTGAAAACCAACGTACAGGTGTT
GGATAAGGAGTGGGGAATTCAAGGCAAATCACTCAGGAATCTTCTTCTGCGTAATCCCAAGGTATTGGGTTACAATGTTGATTGTAAAGGAGACTGTATGGCAAAATGCA
CCAGATGCTGGGTTCGATTCTAG
Protein sequenceShow/hide protein sequence
MGRWRTWLLAASKGQELRRVLSQPQPTCGPHPPNCSGLTLLLQPPLHNRNSTLDLNLSASNCPPSSSNFSFYRNSNAQKLLLAWMLGKSLASPISTVDSAIRLCFSTPGV
ATAKPDAVCSNVSFSYRLGKNVIKDVRRHSPCPRKSEIHSTAQVESLILSDDDKKTWEACRQALSMFSFSVEEQDKMLGKAFGHIHSPYWGEDRKKEVPQIETVNDILEY
LSTLGLSDDDLSKLLKKFPEVLGCNLEQELKTNVQVLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAKCTRCWVRF