CuGenDBv2

CaUC01G018830 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC01G018830
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Plant-specific TFIIB-related protein 1
Genome location: Ciama_Chr01:32094627..32097251
RNA-Seq Expression: CaUC01G018830
Synteny: CaUC01G018830
Gene Ontology terms:
GO:0070897 - transcription preinitiation complex assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0009527 - plastid outer membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0097550 - transcriptional preinitiation complex (cellular component)
GO:0000182 - rDNA binding (molecular function)
GO:0017025 - TBP-class protein binding (molecular function)
InterPro domains:
IPR000812 - Transcription factor TFIIB
IPR013150 - Transcription factor TFIIB, cyclin-like domain
IPR013763 - Cyclin-like
IPR036915 - Cyclin-like superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6603983.1 hypothetical protein SDJN03_04592, partial [Cucurbita argyrosperma subsp. sororia] | e-value 4.6e-77 | identity 93.12%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSL+R LPVN LN  CKPANPRT +K+QAMAKEGSESEGGIAET+AIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WS YTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQYFEQGYIPGPLPADQCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

XP_008440700.1 PREDICTED: uncharacterized protein LOC103485040 [Cucumis melo] | e-value 1.3e-79 | identity 96.88%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSLLRSLPV   NLACKPANPRTG+KVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WSLYTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQYFEQGYIPGPLPADQCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

XP_022950150.1 uncharacterized protein LOC111453326 [Cucurbita moschata] | e-value 3.5e-77 | identity 93.12%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSL+R+LPVN LN  CKPANPRT +K+QAMAKEGSESEGGIAET+AIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WS YTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQYFEQGYIPGPLPADQCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

XP_031744055.1 uncharacterized protein LOC101218070 [Cucumis sativus] | e-value 1.1e-78 | identity 95.62%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSLL+SLPVN  NLACKPANPRTG+KVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WSLYTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQY +QGYIPGPLPADQCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

XP_038882163.1 uncharacterized protein LOC120073396 [Benincasa hispida] | e-value 5.8e-80 | identity 96.88%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSLLRSLPVN LNLACKPANPRT +KVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEG+SYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WSLYTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQYFEQGYIPGPLPADQCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KKC0 Uncharacterized protein | e-value 5.3e-79 | identity 95.62%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSLL+SLPVN  NLACKPANPRTG+KVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WSLYTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQY +QGYIPGPLPADQCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

A0A1S3B1A0 uncharacterized protein LOC103485040 | e-value 6.3e-80 | identity 96.88%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSLLRSLPV   NLACKPANPRTG+KVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WSLYTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQYFEQGYIPGPLPADQCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

A0A5A7T239 Uncharacterized protein | e-value 6.3e-80 | identity 96.88%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSLLRSLPV   NLACKPANPRTG+KVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WSLYTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQYFEQGYIPGPLPADQCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

A0A6J1GEW2 uncharacterized protein LOC111453326 | e-value 1.7e-77 | identity 93.12%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSL+R+LPVN LN  CKPANPRT +K+QAMAKEGSESEGGIAET+AIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WS YTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQYFEQGYIPGPLPADQCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

A0A6J1IN27 uncharacterized protein LOC111478511 | e-value 8.5e-77 | identity 92.5%
Query:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
        MLLSSSSL+R+LPVN LN  CKPANPRT +K+QAMAKEGSESEGGIAET+AIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
Subjt:  MLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG

Query:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
        WS+YTKTKTGSGLPNGPFGLLGAVEGLSYL+LLAILVVFGLQYFEQGYIPGPLPA QCFG
Subjt:  WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG

SwissProt top hits (e-value, % identity, alignment)
O23215 Plant-specific TFIIB-related protein 1 | e-value 1.3e-37 | identity 51.28%
Query:  FTTVLSFIDPFKDLAFLIQPQLPKHCASEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPERA
        F T+L      ++LA  I   +   C         NPISISAAAIYLACQLEDKRKTQAEICK+TGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPE+A
Subjt:  FTTVLSFIDPFKDLAFLIQPQLPKHCASEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPERA

Query:  FPTTVIASGRSAAPK-VDAFEGASLEKDKPMETKPNISTEISEMVHPSRVKEDSESKFVSRGMYNPVTNKSSTFCQPQPPKGDSIPGERPSVLKQ
        FPTT I++ RS  P+ VD  E + +EKDKP   KP I T         + KED + KF    ++   +  +      +P K +++  E+  + KQ
Subjt:  FPTTVIASGRSAAPK-VDAFEGASLEKDKPMETKPNISTEISEMVHPSRVKEDSESKFVSRGMYNPVTNKSSTFCQPQPPKGDSIPGERPSVLKQ

P50387 Transcription initiation factor IIB | e-value 4.4e-06 | identity 48.15%
Query:  NPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPS
        +P  ++AAAIY+A  L D+R+TQ EI +V G+TEVT+R  YKEL +     +P+
Subjt:  NPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPS

P58111 Transcription initiation factor IIB 1 | e-value 4.4e-06 | identity 48.15%
Query:  NPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPS
        +P  ++AAAIY+A  L D+R+TQ EI +V G+TEVT+R  YKEL +     +P+
Subjt:  NPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPS

Q54FD6 Transcription initiation factor IIB | e-value 4.0e-07 | identity 45.45%
Query:  NPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSN
        +PIS++AA+IY+  QL  +++TQ +I  V+G++EVT+R  YK+L    D L+PS+
Subjt:  NPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSN

Q9UWN6 Transcription initiation factor IIB | e-value 1.5e-06 | identity 32.04%
Query:  REVLNEPFIKISKSFTTVLSFIDPFKDLAFLIQPQLPKHCASEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDL
        RE+  E  +   K + T +  +     +      ++ +   +  L    +P  ++AAAIY+A  L D+R+TQ EI +V G+TEVT+R  YKEL +     
Subjt:  REVLNEPFIKISKSFTTVLSFIDPFKDLAFLIQPQLPKHCASEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDL

Query:  LPS
        +PS
Subjt:  LPS

Arabidopsis top hits (e-value, % identity, alignment)
AT2G41630.1 transcription factor IIB | e-value 2.7e-06 | identity 42.19%
Query:  SEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNY
        SE   I  +PISI+A  IY+  QL D +KT  +I   TG+ E T+R  YK+L  +   + PS Y
Subjt:  SEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNY

AT3G10330.1 Cyclin-like family protein | e-value 3.5e-06 | identity 40.62%
Query:  SEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNY
        SE   I  +PISI+AA IY+  QL D++K   +I   TG+ E T+R  YK+L  +   ++P+ Y
Subjt:  SEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNY

AT3G50685.1 unknown protein | e-value 2.0e-54 | identity 78.46%
Query:  KVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVGWSLYTKTKTGSGLPNGPFGLLGAVEGLSYL
        +V A AK+ +++ GG  ET AIAG LV+TPVIGWSLYTLKTTGCGLPPGP G +GALEGVSYL VVGIVGWSLYTKTKTGSGLPNGPFGLLGAVEGLSYL
Subjt:  KVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVGWSLYTKTKTGSGLPNGPFGLLGAVEGLSYL

Query:  TLLAILVVFGLQYFEQGYIPGPLPADQCFG
        ++LAILVVFG+Q+ + G +PGPLP+DQCFG
Subjt:  TLLAILVVFGLQYFEQGYIPGPLPADQCFG

AT4G36650.1 plant-specific TFIIB-related protein | e-value 9.0e-39 | identity 51.28%
Query:  FTTVLSFIDPFKDLAFLIQPQLPKHCASEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPERA
        F T+L      ++LA  I   +   C         NPISISAAAIYLACQLEDKRKTQAEICK+TGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPE+A
Subjt:  FTTVLSFIDPFKDLAFLIQPQLPKHCASEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPERA

Query:  FPTTVIASGRSAAPK-VDAFEGASLEKDKPMETKPNISTEISEMVHPSRVKEDSESKFVSRGMYNPVTNKSSTFCQPQPPKGDSIPGERPSVLKQ
        FPTT I++ RS  P+ VD  E + +EKDKP   KP I T         + KED + KF    ++   +  +      +P K +++  E+  + KQ
Subjt:  FPTTVIASGRSAAPK-VDAFEGASLEKDKPMETKPNISTEISEMVHPSRVKEDSESKFVSRGMYNPVTNKSSTFCQPQPPKGDSIPGERPSVLKQ

AT4G36650.2 plant-specific TFIIB-related protein | e-value 1.9e-12 | identity 39.85%
Query:  FTTVLSFIDPFKDLAFLIQPQLPKHCASEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVY---------KELLENWDDLLPSNY
        F T+L      ++LA  I   +   C         NPISISAAAIYLACQLEDKRKTQAEICK+TGLTE   RK +         ++LLE W  L   ++
Subjt:  FTTVLSFIDPFKDLAFLIQPQLPKHCASEVLAITMNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVY---------KELLENWDDLLPSNY

Query:  TPAVPPERAFPTTVIASGRSAAPKVDAFEGASL
            P       ++I +    A K+ +    SL
Subjt:  TPAVPPERAFPTTVIASGRSAAPKVDAFEGASL
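
The % identity values listed for each hit can be approximated from the alignment midline (the unlabeled row between Query and Subjt), where a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The sketch below is illustrative only: the function name `percent_identity` is hypothetical, and it assumes identity is counted as identical columns divided by the full alignment length, with multi-line alignments concatenated before the call.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate percent identity from one alignment's query row and midline.

    Assumption: a letter in the midline marks an identical residue, and the
    alignment length equals the length of the query row (gaps included).
    """
    if not query:
        raise ValueError("query row is empty")
    # Extraction may strip trailing spaces from the midline; pad it back.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded[: len(query)] if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


if __name__ == "__main__":
    # Toy alignment: 3 identical columns out of 5.
    print(percent_identity("ACDEF", "AC+ F"))
```

For alignments that wrap over several Query/Subjt blocks, join the query rows and the midline rows in order before calling the function.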


Sequences
CDS sequence
ATGTTCTGCGTAAAAAGCCTTGTGGATTTTACAGAACTTTTATGTTCTGCCAATGGTAGATGGTTGATGAGGAATTATAGGATCTTCATGGATAAATTTAGAGGG
CTAGAGGATGTTCGGGAGAAGTACATGTGGACAAAAATGATACCCTCCCACCTAACAAAACCAAGGGAAGTTCTGAATGAGCCCTTCATTAAAATTTCCAAATCT
TTCACAACTGTGCTAAGTTTTATTGATCCCTTCAAAGACTTGGCATTTCTTATTCAACCACAGCTTCCGAAACATTGTGCTTCTGAGGTTTTAGCAATTACAATG
AATCCCATCAGCATCTCAGCCGCTGCTATATATTTAGCTTGCCAATTAGAAGACAAGCGAAAGACGCAAGCAGAAATTTGTAAGGTTACAGGTCTCACTGAAGTC
ACCCTCCGGAAAGTCTACAAGGAGCTACTAGAAAATTGGGATGATTTGCTTCCGTCTAATTATACTCCTGCTGTTCCTCCAGAAAGGGCATTTCCCACCACTGTA
ATTGCTTCAGGCCGTTCTGCAGCTCCTAAAGTTGATGCATTTGAAGGGGCTTCTTTAGAGAAAGACAAGCCGATGGAGACCAAACCTAATATATCCACCGAGATC
TCAGAAATGGTTCATCCATCCAGAGTCAAAGAAGATAGTGAGAGTAAATTTGTATCCCGTGGGATGTATAACCCTGTAACTAACAAGTCCTCAACATTTTGTCAA
CCACAGCCTCCTAAAGGGGATTCTATCCCAGGAGAACGACCCTCTGTTTTAAAGCAAATAGATTCGAAGTTCCCAAATGTAAGGCCTCGGCAATTCGGTTATCAA
GGAGAATGCACTGTTTCCAACAGGATTAAGGTGGAAAGTCTTTACAGAACCACCACTTTTCTCTCTCGCCCTCTCTCGCACACACACACAAGAGACAGACAGTGG
AGGAAGTTGGGAAAGATGCTACTGTCATCATCCTCTTTATTGCGATCATTGCCGGTTAATCATTTGAATTTGGCATGCAAACCGGCAAATCCACGGACGGGATTG
AAGGTTCAGGCCATGGCAAAGGAAGGAAGCGAGAGCGAAGGGGGCATTGCAGAGACGGTGGCTATAGCCGGTGGGTTAGTGGCGACCCCAGTGATCGGTTGGTCG
CTGTACACCCTGAAGACAACCGGGTGCGGGCTGCCGCCGGGGCCAGGTGGGTCACTCGGCGCACTGGAAGGCGTCAGCTACTTGGCGGTGGTGGGCATCGTGGGT
TGGTCTCTATACACGAAAACGAAAACTGGATCGGGTCTGCCGAACGGCCCTTTCGGGCTACTGGGCGCCGTCGAAGGTCTTTCATACTTGACTTTGTTGGCCATC
TTGGTCGTTTTTGGGTTGCAGTACTTTGAACAGGGCTACATCCCCGGCCCTCTTCCGGCCGATCAGTGCTTTGGTTGA
mRNA sequence
ATGTTCTGCGTAAAAAGCCTTGTGGATTTTACAGAACTTTTATGTTCTGCCAATGGTAGATGGTTGATGAGGAATTATAGGATCTTCATGGATAAATTTAGAGGG
CTAGAGGATGTTCGGGAGAAGTACATGTGGACAAAAATGATACCCTCCCACCTAACAAAACCAAGGGAAGTTCTGAATGAGCCCTTCATTAAAATTTCCAAATCT
TTCACAACTGTGCTAAGTTTTATTGATCCCTTCAAAGACTTGGCATTTCTTATTCAACCACAGCTTCCGAAACATTGTGCTTCTGAGGTTTTAGCAATTACAATG
AATCCCATCAGCATCTCAGCCGCTGCTATATATTTAGCTTGCCAATTAGAAGACAAGCGAAAGACGCAAGCAGAAATTTGTAAGGTTACAGGTCTCACTGAAGTC
ACCCTCCGGAAAGTCTACAAGGAGCTACTAGAAAATTGGGATGATTTGCTTCCGTCTAATTATACTCCTGCTGTTCCTCCAGAAAGGGCATTTCCCACCACTGTA
ATTGCTTCAGGCCGTTCTGCAGCTCCTAAAGTTGATGCATTTGAAGGGGCTTCTTTAGAGAAAGACAAGCCGATGGAGACCAAACCTAATATATCCACCGAGATC
TCAGAAATGGTTCATCCATCCAGAGTCAAAGAAGATAGTGAGAGTAAATTTGTATCCCGTGGGATGTATAACCCTGTAACTAACAAGTCCTCAACATTTTGTCAA
CCACAGCCTCCTAAAGGGGATTCTATCCCAGGAGAACGACCCTCTGTTTTAAAGCAAATAGATTCGAAGTTCCCAAATGTAAGGCCTCGGCAATTCGGTTATCAA
GGAGAATGCACTGTTTCCAACAGGATTAAGGTGGAAAGTCTTTACAGAACCACCACTTTTCTCTCTCGCCCTCTCTCGCACACACACACAAGAGACAGACAGTGG
AGGAAGTTGGGAAAGATGCTACTGTCATCATCCTCTTTATTGCGATCATTGCCGGTTAATCATTTGAATTTGGCATGCAAACCGGCAAATCCACGGACGGGATTG
AAGGTTCAGGCCATGGCAAAGGAAGGAAGCGAGAGCGAAGGGGGCATTGCAGAGACGGTGGCTATAGCCGGTGGGTTAGTGGCGACCCCAGTGATCGGTTGGTCG
CTGTACACCCTGAAGACAACCGGGTGCGGGCTGCCGCCGGGGCCAGGTGGGTCACTCGGCGCACTGGAAGGCGTCAGCTACTTGGCGGTGGTGGGCATCGTGGGT
TGGTCTCTATACACGAAAACGAAAACTGGATCGGGTCTGCCGAACGGCCCTTTCGGGCTACTGGGCGCCGTCGAAGGTCTTTCATACTTGACTTTGTTGGCCATC
TTGGTCGTTTTTGGGTTGCAGTACTTTGAACAGGGCTACATCCCCGGCCCTCTTCCGGCCGATCAGTGCTTTGGTTGACGATTCTATCTTTTGATTTATGTGGCT
GTTGTTTCTCCAAGTGTGTAAAATCGTAGGGCTCTCCCTTCCAGTTTGGTTGTGGATCTGTTTTCTTCTTTTTCTGTTTACTCTTTCATTGCGATTCTGCATAGT
ACTTCCCCTGCTGCTTGTTTGAACTGAAAGCATTTTACTTTCTTTCGGTTTCGAATCTATTACCCATTATGATTAAACGTTTAAA
Protein sequence
MFCVKSLVDFTELLCSANGRWLMRNYRIFMDKFRGLEDVREKYMWTKMIPSHLTKPREVLNEPFIKISKSFTTVLSFIDPFKDLAFLIQPQLPKHCASEVLAITM
NPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSAAPKVDAFEGASLEKDKPMETKPNISTEI
SEMVHPSRVKEDSESKFVSRGMYNPVTNKSSTFCQPQPPKGDSIPGERPSVLKQIDSKFPNVRPRQFGYQGECTVSNRIKVESLYRTTTFLSRPLSHTHTRDRQW
RKLGKMLLSSSSLLRSLPVNHLNLACKPANPRTGLKVQAMAKEGSESEGGIAETVAIAGGLVATPVIGWSLYTLKTTGCGLPPGPGGSLGALEGVSYLAVVGIVG
WSLYTKTKTGSGLPNGPFGLLGAVEGLSYLTLLAILVVFGLQYFEQGYIPGPLPADQCFG
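
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes an uppercase or lowercase DNA string containing only A, C, G, and T, read in frame from the first base.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    # Remove line breaks and spaces left over from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons of the CDS listed above.
    print(translate("ATGTTCTGCGTAAAAAGC"))
```

Applied to the first 18 bases of the CDS, this yields MFCVKS, which matches the start of the protein sequence. The last 18 bases (GATCAGTGCTTTGGTTGA) give DQCFG followed by a stop codon, matching its end.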