; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0022878 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0022878
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionMitochondrial transcription termination factor family protein isoform 2
Genome locationchr03:970686..975838
RNA-Seq ExpressionPay0022878
SyntenyPay0022878
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0042646 - plastid nucleoid (cellular component)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578327.1 hypothetical protein SDJN03_22775, partial [Cucurbita argyrosperma subsp. sororia]1.9e-10583.7Show/hide
Query:  YSLSIPQISGMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQAL
        ++  +PQISGMLGKSL SPIST+DS TR C ST C  TA S+AVCSNVSFSY PAK+ I  VRRQ+PYPRKWE+ S+ QVESL+LSDEDKKTWEACRQAL
Subjt:  YSLSIPQISGMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQAL

Query:  SVFSFSVEEQDKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLL
        S+FSFS EEQDKMLGKAFGHIHSPYWGEDRKK+VPNI+ VN+ILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLL+KEWGIQGKSLR+LLL
Subjt:  SVFSFSVEEQDKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLL

Query:  RNPKVLGYYVDCKGDCIAKCTRCWVRF
        RNPKVLGY VDCKGDCIAKCTRCWVRF
Subjt:  RNPKVLGYYVDCKGDCIAKCTRCWVRF

XP_004138649.1 uncharacterized protein LOC101218603 isoform X2 [Cucumis sativus]1.4e-11696.31Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCS+VSFS + AKHTIKGVRRQNPYPRKWE+CSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
        DKMLGKAFGHIHSPYWGEDR+KKVPNIEIVNDILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
Subjt:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV

Query:  DCKGDCIAKCTRCWVRF
        DCKGDCIAKCTRCWVRF
Subjt:  DCKGDCIAKCTRCWVRF

XP_008441218.1 PREDICTED: uncharacterized protein LOC103485418 [Cucumis melo]1.0e-11998.62Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSY+PAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
        DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
Subjt:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV

Query:  DCKGDCIAKCTRCWVRF
        DCKGDCIAKCTRCWVRF
Subjt:  DCKGDCIAKCTRCWVRF

XP_011649898.1 uncharacterized protein LOC101218603 isoform X1 [Cucumis sativus]1.7e-11795.45Show/hide
Query:  ISGMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSV
        ++GMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCS+VSFS + AKHTIKGVRRQNPYPRKWE+CSSTQVESLILSDEDKKTWEACRQALSVFSFSV
Subjt:  ISGMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSV

Query:  EEQDKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG
        EEQDKMLGKAFGHIHSPYWGEDR+KKVPNIEIVNDILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG
Subjt:  EEQDKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG

Query:  YYVDCKGDCIAKCTRCWVRF
        YYVDCKGDCIAKCTRCWVRF
Subjt:  YYVDCKGDCIAKCTRCWVRF

XP_038885056.1 uncharacterized protein LOC120075592 [Benincasa hispida]1.8e-10890.78Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSL SPISTIDSATRFCCSTR  ATAISDA  SNVSFSY+PAKH IK VRRQ PYP KWE+CS+TQVESL LSDED+KTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
        DKMLGKAF HIHSPYWGEDRKK+VPNIE VNDILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGY V
Subjt:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV

Query:  DCKGDCIAKCTRCWVRF
        DCKGDCIAKCTRCWVRF
Subjt:  DCKGDCIAKCTRCWVRF

TrEMBL top hitse value%identityAlignment
A0A0A0LQN8 Uncharacterized protein6.8e-11796.31Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCS+VSFS + AKHTIKGVRRQNPYPRKWE+CSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
        DKMLGKAFGHIHSPYWGEDR+KKVPNIEIVNDILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
Subjt:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV

Query:  DCKGDCIAKCTRCWVRF
        DCKGDCIAKCTRCWVRF
Subjt:  DCKGDCIAKCTRCWVRF

A0A1S3B2X0 uncharacterized protein LOC1034854185.0e-12098.62Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSY+PAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
        DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
Subjt:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV

Query:  DCKGDCIAKCTRCWVRF
        DCKGDCIAKCTRCWVRF
Subjt:  DCKGDCIAKCTRCWVRF

A0A5A7T0X2 Mitochondrial transcription termination factor family protein isoform 21.5e-10398.45Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSY+PAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPK
        DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLLDKEWGIQGKSLRNLLLRNPK
Subjt:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPK

A0A6J1FK31 uncharacterized protein LOC1114449352.8e-10285.25Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSL SPIST+DS TR C ST C  TA S+AVCSNVSFSY PAK+ I  VRRQ+PYPRKWE+ S+ QVESL+LSDEDKKTWEACRQALS+FSFS EEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
        DKMLGKAFGHIHSPYWGEDRKK+VPNI+ VN+ILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLL+KEWGIQGKSLR+LLLRNPKVLGY V
Subjt:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV

Query:  DCKGDCIAKCTRCWVRF
        DCKGDCIAKCTRCWVRF
Subjt:  DCKGDCIAKCTRCWVRF

A0A6J1JTP6 uncharacterized protein LOC1114896502.8e-10285.25Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSL SPIST+DS TR C ST C  TA S AVCSNVSFSY PAK+ I  VRRQ+PYPRKWE+ S++QVESL+LSD+DKKTWEACRQALS+FSFS EEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV
        DKMLGKAFGHIHSPYWGEDRKK+VPNI+ VN+ILEYLR TLGLSNDDLSKLLKKFPEVLGCNLE+ELKTNVQLL+KEWGIQGKSLRNLLLRNPKVLGY V
Subjt:  DKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYV

Query:  DCKGDCIAKCTRCWVRF
        DCKGDCIAKCTRCWVRF
Subjt:  DCKGDCIAKCTRCWVRF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G09620.1 Mitochondrial transcription termination factor family protein1.0e-6457.99Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPR-KWEV-CSSTQVESLILSDEDKKTWEACRQALSVFSFSVE
        M+G SLASP++T+ SA  F  S     T  ++ +   ++ SY    H + G  R + Y R KW V CS+TQVE+   S+ED   WE C++ALS F FSVE
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPR-KWEV-CSSTQVESLILSDEDKKTWEACRQALSVFSFSVE

Query:  EQDKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGY
        E+DK+LGKAFGHIHSPYW E+R K+ P +E +N ILE+LR +LGLS++DL K++KKFPEVLGC+LE E+K N+ +L+ +WGI GK LRNLLLRNPKVLGY
Subjt:  EQDKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGY

Query:  YVDCKGDCIAKCTRCWVRF
         VDCKGDC+A+CTRCWVRF
Subjt:  YVDCKGDCIAKCTRCWVRF

AT5G54180.1 plastid transcriptionally active 153.4e-0428.17Show/hide
Query:  NIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVL
        ++E +N  +E+L++  GL+++ + K++  FP V+  + ER+L+  ++ L KE G     +   L + P +L
Subjt:  NIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTATTCACTTTCAATCCCTCAAATTTCAGGGATGCTAGGAAAGTCGTTGGCATCCCCTATATCAACCATTGATTCTGCTACTCGTTTCTGCTGTTCTACTCGCTG
CACTGCCACAGCCATATCAGATGCTGTATGTTCAAATGTGAGTTTCTCCTATTATCCTGCAAAGCATACGATAAAGGGTGTGAGAAGACAAAATCCCTATCCTAGGAAGT
GGGAAGTCTGTTCATCTACTCAAGTTGAAAGCTTAATATTAAGTGATGAAGATAAGAAGACATGGGAAGCTTGTCGGCAAGCTCTGTCTGTGTTCAGCTTCAGTGTTGAG
GAGCAAGATAAGATGCTTGGAAAGGCGTTCGGCCACATTCATTCACCCTACTGGGGTGAAGACAGAAAGAAGAAAGTTCCTAATATTGAAATTGTAAATGATATACTGGA
ATATCTGAGGACGACACTTGGCCTTTCTAATGACGATCTCTCTAAGCTGCTAAAGAAATTCCCGGAAGTTCTTGGCTGCAATCTTGAGCGGGAGCTGAAAACCAACGTAC
AATTGTTGGACAAAGAGTGGGGAATTCAAGGCAAATCACTAAGGAATCTTCTTCTGCGTAATCCCAAGGTACTGGGCTATTATGTTGATTGTAAAGGAGACTGCATAGCA
AAATGCACCAGATGCTGGGTTCGATTCTAG
mRNA sequenceShow/hide mRNA sequence
TCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTATTCTTCTTCTTCTTTTCTTCAACCTCCGCCTTACAATCCAAACTCTAAATGAGCTATTCACTTTCAATCCCTCAAATT
TCAGGGATGCTAGGAAAGTCGTTGGCATCCCCTATATCAACCATTGATTCTGCTACTCGTTTCTGCTGTTCTACTCGCTGCACTGCCACAGCCATATCAGATGCTGTATG
TTCAAATGTGAGTTTCTCCTATTATCCTGCAAAGCATACGATAAAGGGTGTGAGAAGACAAAATCCCTATCCTAGGAAGTGGGAAGTCTGTTCATCTACTCAAGTTGAAA
GCTTAATATTAAGTGATGAAGATAAGAAGACATGGGAAGCTTGTCGGCAAGCTCTGTCTGTGTTCAGCTTCAGTGTTGAGGAGCAAGATAAGATGCTTGGAAAGGCGTTC
GGCCACATTCATTCACCCTACTGGGGTGAAGACAGAAAGAAGAAAGTTCCTAATATTGAAATTGTAAATGATATACTGGAATATCTGAGGACGACACTTGGCCTTTCTAA
TGACGATCTCTCTAAGCTGCTAAAGAAATTCCCGGAAGTTCTTGGCTGCAATCTTGAGCGGGAGCTGAAAACCAACGTACAATTGTTGGACAAAGAGTGGGGAATTCAAG
GCAAATCACTAAGGAATCTTCTTCTGCGTAATCCCAAGGTACTGGGCTATTATGTTGATTGTAAAGGAGACTGCATAGCAAAATGCACCAGATGCTGGGTTCGATTCTAG
AGCTATAATGGTTCTCAATCTATGGTTGAATATTCCCAAAAGATAGGTATGCTTCTAACTAAATTTATATCAATTGAATATTTTGTTAAAGAATATTCAGAAGTATGTGC
GAATATTAAAAGGACCCCGAAGGAAAAATGGGAGTATAGATGCGGCAAGGCTTCAAGAAGCTGAATATTTGCATAAGAGATGGGGAGATGGCTGCCAAAGATCACTTAAA
TATATTATCAATTAGAAAATTGGTCCTTATGGAAAAAAATACAGCACTGATTTTTATTCCAATTGTAAGCAACACAAGCCACTTAGATATAGTACAACAGAGGTGGGAAA
ATCAACATAATCAAAAAGAGAGAATGCAAGAGAAACCCTTGGCTTCAATATATTGATAGGTAAATAACACCAATCCTATATAGTTGGCCTGGGAGGAAAACTCTTGCAAA
ATACACTCACCCACAATAAATTATCATCATTAACAGAAGGAATCCTTCTAGAAGAATTCCATAACATAATGAATTTTGAGAAGGAAATCAAGATACTCAAACCCAAAATC
CAATTAAGGCTAATGAGTTGTCTTAACTGGAAGGTTGAAACATCTTGATCAATAACACATTTATATGCCAGATGAGCAATGCTCACCTTGGCATTTTTATACACATTAAT
AGTATATAAACATAAATGGAAAATACAAAGAATTAAATGACCTCCAATATATAAAAGGAAATAGTGTAGAAGTAGGGTTGGAAGAGTAGAGTGGAAATTAGGGGCACTTG
GGCTTATTGCCATGGGTCCTGAGGCGAGCATAGCAAGGGCAAGCTTGGTGGTTGCCGTAAGTGCCGGGCGGGACGCAGTGACACCGACTGCAGCAAGTCCCACAAGCACG
CATACATATCTTCTTTCTGAACACTTTGCTGCATCTCCTTGAACATTCACTTTTGCAATCTACCCCAAACCAATTCAACCCAACCTGATTGTTAATTTCACTCCAAATTT
TGTGTTATAAGTTTCTTTATATAAAAAAAGAAAAAAAAATGTTCTGTTACGTGTTTAGTTTTTAAAATCTCCGTTGGGTGACTATTTTGTTCTTGACGTTTATGCTTGTT
TTCTCCTAAATTGCTTACGGTTATTTTCAGATTTCTTCAAGAACCACTTGAAACTTTTGTCAAATTGTATGAATGAAAAGAAGTTTTTGAAAACTATAGTTCCGTTTTGT
AATTTTTTGTTTTTTGAAAATTAAGCATATAAACAATACCCATTTTGCACCAATTTTTATTGTTTTGTTATCTATTTTATATTAAGGTTTTTGAAAATCGCGTTCGATAA
CCATTTGGTTTTTGTTTTTTTTTTTTTTAAATATGACTACAAATTAAAAGTTTAATGTTCTATTAGATACAAAATTCACTCTCCGTAGAATTGCTAATTTAAAAATTACA
ATTTCAAGGATGCAGAAAACAATAATTTATACCTATATTAATCGAAAGTTTAAAGACTTATTTGACCGTCCTAAAATTTAAATAGGTTATACCTATATTAATTGAAAGTT
TAAAGACCTATTTGACTAACTTACGATTTGAAAACCTATTCTAACACTCATTTAAGTTGAAAGAATAAATTTGTAATTCTCTATATGTATTATTACTATTATTATATTTT
ATGGCAACTTACTGATTTTTGGCTGATAGTGATAC
Protein sequenceShow/hide protein sequence
MSYSLSIPQISGMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSNVSFSYYPAKHTIKGVRRQNPYPRKWEVCSSTQVESLILSDEDKKTWEACRQALSVFSFSVE
EQDKMLGKAFGHIHSPYWGEDRKKKVPNIEIVNDILEYLRTTLGLSNDDLSKLLKKFPEVLGCNLERELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVDCKGDCIA
KCTRCWVRF