CuGenDBv2

Lsi09G019050 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi09G019050
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Protein PXR1
Genome location: chr09:27904466..27910842
RNA-Seq Expression: Lsi09G019050
Synteny: Lsi09G019050
Gene Ontology terms: GO:0016874 - ligase activity (molecular function)
InterPro domains: NA
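The genome location above follows a `chromosome:start..end` layout. As a minimal sketch, the span of the gene model could be derived as follows. The function name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr09:27904466..27910842")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model covers 6377 bp on chr09, which includes introns and untranslated regions as well as the coding sequence.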


Homology
GenBank top hits | e value | % identity
KAA0040007.1 protein PXR1 [Cucumis melo var. makuwa] | 1.1e-103 | 92.37
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQ DTLPSKLRKLMSFTS   QESE VSEDIQRKRK +AVNTEKKS++KDA GRSDVKSK NGGNSQMPQ 
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGSD D DVQSKSSEKKNKKRKRKQVTDLRFE SLEESSRRLKKRERRKKYQEAKKNKHKKAKTEE LDFP+HEKIKFGDVVEAPLKLLAVPKAFKSAQV
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA
        ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTI+PA
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA

KAG6574789.1 hypothetical protein SDJN03_25428, partial [Cucurbita argyrosperma subsp. sororia] | 1.9e-100 | 86.08
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKG+RRRERNYRAAHGGYDRLPPPP+ SQVDTLPSKLRKLM+FTSP PQESEKVSED+QRKRK EAV+TEKK H K+A G+SD+KS+GNGGNSQMPQL
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGS  D+DVQSKSSEKKNKKRK+KQVTDLRFE   E S+RRLKKRER++KY EAKKNKHKK KT+EDLDFP+HEKIKFGDVVEAPLKL AVPKAFKSAQV
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV
        ASQERKRLQAI EYRNRKGWTSRPG+QIPSMTISPAV
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV

XP_016902506.1 PREDICTED: LOW QUALITY PROTEIN: protein PXR1 [Cucumis melo] | 4.5e-102 | 91.1
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQ DTLPSKLRKLMSFTS   QESE VSED+QRKRK +A NTEKKS++KDA GRSDVKSK NGGNSQMPQ 
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGSD D DVQSKSSEKKNKKRKRKQVTDLRFE SLEESSRRLKKRERRKKYQEAKKNKHKKAKTEE LDFP+HE IKFGDVVEAPLKLLAVPKAFKSAQV
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA
        ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTI+PA
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA

XP_022959194.1 uncharacterized protein LOC111460256 [Cucurbita moschata] | 1.5e-100 | 86.5
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKG+RRRERNYRAAHGGYDRLPPPP+ SQVDTLPSKLRKLM+FTSP PQESEKVSED+QRKRK EAV+TEKK H K+A G+SD+KSKGNGGNSQMPQL
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGS  D+DVQSK SEKKNKKRK+KQVTDLRFE   E S+RRLKKRER+KKY EAKKNKHKK KT+EDLDFP+HEKIKFGDVVEAPLKL AVPKAFKSAQV
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV
        ASQERKRLQAI EYRNRKGWTSRPG+QIPSMTISPAV
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV

XP_038907272.1 protein PXR1 [Benincasa hispida] | 4.0e-106 | 92.41
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPP+ SQVDTLPSKLRKLMSFTS GPQESEKVS+DIQRKRK +AVNTEKKSHRKDALGRSDVKSKGNG  SQ PQL
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGS+GD DVQSKSSEKKNKK+KRKQVTDLRFE S EESSRRLKKRERRKKYQEAKKNKHKKA+TEEDLDFP+HEKIKFGDVVEAPLKL+AVPKAFKSAQV
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV
        ASQERKR QAINEYRNRKGWTSRPGIQIPSMTISPAV
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV

TrEMBL top hits | e value | % identity
A0A0A0KEN1 Uncharacterized protein | 4.6e-100 | 89.41
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQ DTLPSKLRKLMSFTS   QE EKVSEDIQRKRK EAVNT+KKS++KDA G     SK NGGNSQMPQ 
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
         GSD D +V SKSSEKKNKKRKRKQVTDLRFE SLEESSRRLKKRER KKYQEAKKNKHKKAKTEE LDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA
        ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA

A0A1S4E2Q2 LOW QUALITY PROTEIN: protein PXR1 | 2.2e-102 | 91.1
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQ DTLPSKLRKLMSFTS   QESE VSED+QRKRK +A NTEKKS++KDA GRSDVKSK NGGNSQMPQ 
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGSD D DVQSKSSEKKNKKRKRKQVTDLRFE SLEESSRRLKKRERRKKYQEAKKNKHKKAKTEE LDFP+HE IKFGDVVEAPLKLLAVPKAFKSAQV
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA
        ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTI+PA
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA

A0A5A7TEP7 Protein PXR1 | 5.2e-104 | 92.37
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQ DTLPSKLRKLMSFTS   QESE VSEDIQRKRK +AVNTEKKS++KDA GRSDVKSK NGGNSQMPQ 
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGSD D DVQSKSSEKKNKKRKRKQVTDLRFE SLEESSRRLKKRERRKKYQEAKKNKHKKAKTEE LDFP+HEKIKFGDVVEAPLKLLAVPKAFKSAQV
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA
        ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTI+PA
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA

A0A6J1H3V7 uncharacterized protein LOC111460256 | 7.1e-101 | 86.5
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKG+RRRERNYRAAHGGYDRLPPPP+ SQVDTLPSKLRKLM+FTSP PQESEKVSED+QRKRK EAV+TEKK H K+A G+SD+KSKGNGGNSQMPQL
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGS  D+DVQSK SEKKNKKRK+KQVTDLRFE   E S+RRLKKRER+KKY EAKKNKHKK KT+EDLDFP+HEKIKFGDVVEAPLKL AVPKAFKSAQV
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV
        ASQERKRLQAI EYRNRKGWTSRPG+QIPSMTISPAV
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV

A0A6J1KUZ8 probable H/ACA ribonucleoprotein complex subunit 4 | 8.7e-99 | 85.71
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVN-TEKKSHRKDALGRSDVKSKGNGGNSQMPQ
        MGGKG+RRRERNYRAAHGGYDRLPPPP+ SQVDTLPSKLRKLMSFTSP PQESEKVSED+QRKRK EAV+ TEKK H K+A G+SD+KSKGNGGNSQMPQ
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVN-TEKKSHRKDALGRSDVKSKGNGGNSQMPQ

Query:  LTGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQ
        LTGS  D+DVQSKS+EKKNKKRK+KQVTDLRFE   E S+RRLKKRER+KKY EAKKNKHKK  ++EDLDFP+HEKIKFGDVVEAPLKL AVPKAFKSAQ
Subjt:  LTGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQ

Query:  VASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV
        VASQERKRLQAI EYRNRKGWTSRPG+QIPSMTISPAV
Subjt:  VASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPAV

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT2G45520.1 unknown protein | 1.6e-49 | 52.79
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL
        MGGKG +RRE+NY AAHGG  RLPPPPD S+ D +PS LR LM++TSP P +S                 T++   +K+ L +++V          +   
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQL

Query:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDL--DFPKHEKIKFGDVVEAPLKLLAVPKAFKSA
        T SDGD+ V     EKK KKRKR Q+TDLRFE+ L E   R K++ER+KKY EAKK K  K KTE+ L  +FPKHE+I+FGDVV+APLKL  VPKA KS 
Subjt:  TGSDGDEDVQSKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDL--DFPKHEKIKFGDVVEAPLKLLAVPKAFKSA

Query:  QVASQERKRLQAINEYRNRKGWTSRPGIQIPSM
          ASQER RLQAI+ YR+RKGWT+RPG+ IP++
Subjt:  QVASQERKRLQAINEYRNRKGWTSRPGIQIPSM
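In the alignments above, the middle line of each block marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space, which is the usual BLAST pairwise layout. As a rough sketch of how a % identity value relates to those lines (the helper name `block_identity` is hypothetical, and the identity is assumed to be identical positions divided by alignment length):

```python
def block_identity(query: str, midline: str) -> tuple:
    """Return (identical positions, aligned columns) for one alignment block."""
    # Trailing spaces in the midline are often lost in text extraction,
    # so pad it back to the width of the query line.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query)

# Last block of the KAA0040007.1 alignment.
identical, columns = block_identity(
    "ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTISPA",
    "ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTI+PA",
)
print(f"{identical}/{columns} = {100.0 * identical / columns:.2f}%")
```

Summing the two counts over every block of a hit and dividing at the end gives the identity for the whole alignment. A midline that lost interior spaces during extraction would skew the count, so the values reported in the hit tables remain the reference.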


Sequences
CDS sequence
ATGGGAGGTAAAGGAATGCGGAGAAGAGAGCGGAATTACAGAGCCGCACATGGAGGCTACGATCGTCTCCCACCGCCGCCGGACACTTCCCAAGTCGATACTCTACCTTC
CAAACTCCGCAAGTTAATGTCCTTCACTTCCCCTGGACCTCAAGAGTCTGAGAAGGTCTCGGAGGATATTCAACGAAAGCGCAAGGGAGAAGCCGTTAATACTGAAAAGA
AATCCCATCGAAAGGATGCTTTGGGAAGATCTGATGTAAAGAGCAAAGGTAATGGTGGGAATTCACAAATGCCTCAGCTTACAGGTAGTGATGGTGATGAAGACGTGCAA
AGCAAGTCCAGTGAGAAGAAAAACAAAAAACGTAAGAGAAAGCAGGTTACTGACCTTCGTTTTGAACACTCGTTGGAGGAATCAAGTCGACGTTTAAAGAAACGGGAACG
TCGGAAAAAATATCAGGAGGCGAAGAAAAACAAACATAAAAAAGCCAAGACAGAGGAGGATCTAGACTTCCCAAAACATGAAAAAATAAAATTTGGAGACGTGGTTGAAG
CTCCACTGAAGTTGCTTGCAGTTCCGAAGGCGTTTAAATCTGCACAAGTTGCTTCTCAAGAGAGGAAGCGGTTGCAGGCTATAAATGAATATAGAAACCGCAAGGGATGG
ACCTCAAGGCCAGGGATACAGATACCTTCAATGACTATATCGCCAGCTGTTTAA
mRNA sequence
TTGTAAACCTATTTCGTTTTCTGCCTCTCACATAAAAAACAATCGGAAGAGTTCCCACTGGTCTTCGTCGGCGGCGGCGATTGAGGCTATACTCCGGTGTCCAACTCCGA
CGTAGGGCTGCAAATTTTGTTATCGAGAGAAAAAGGAGAGATGGGAGGTAAAGGAATGCGGAGAAGAGAGCGGAATTACAGAGCCGCACATGGAGGCTACGATCGTCTCC
CACCGCCGCCGGACACTTCCCAAGTCGATACTCTACCTTCCAAACTCCGCAAGTTAATGTCCTTCACTTCCCCTGGACCTCAAGAGTCTGAGAAGGTCTCGGAGGATATT
CAACGAAAGCGCAAGGGAGAAGCCGTTAATACTGAAAAGAAATCCCATCGAAAGGATGCTTTGGGAAGATCTGATGTAAAGAGCAAAGGTAATGGTGGGAATTCACAAAT
GCCTCAGCTTACAGGTAGTGATGGTGATGAAGACGTGCAAAGCAAGTCCAGTGAGAAGAAAAACAAAAAACGTAAGAGAAAGCAGGTTACTGACCTTCGTTTTGAACACT
CGTTGGAGGAATCAAGTCGACGTTTAAAGAAACGGGAACGTCGGAAAAAATATCAGGAGGCGAAGAAAAACAAACATAAAAAAGCCAAGACAGAGGAGGATCTAGACTTC
CCAAAACATGAAAAAATAAAATTTGGAGACGTGGTTGAAGCTCCACTGAAGTTGCTTGCAGTTCCGAAGGCGTTTAAATCTGCACAAGTTGCTTCTCAAGAGAGGAAGCG
GTTGCAGGCTATAAATGAATATAGAAACCGCAAGGGATGGACCTCAAGGCCAGGGATACAGATACCTTCAATGACTATATCGCCAGCTGTTTAATTCAGCAGATTTTGTT
CCATTAGGGGCCATAAGCTGGCAGTGGGAATCTCTCTCTACATCACTAAGTACAGGAGTGTTATCAGAATGTGTGCTCTATAAGGGAAGATTTGTTCCAAAAATTACCAA
GCATTCAACTGAGAAATCAAAGGCAAGATTCTGAAAATCCTATCAGAAACAGATTCTGTGATGAAAGCCAACCATCTTGTGCTCTATTGCTTCATATGAGAATAATAGCC
TTGTTTGTGAAAGTTTATGGTTCGAATTTTGGATGAGAATGTTTTTCTTTTTGGTTAAATTGTAATTGATGTTTTGTTGTAGTTCTTATTTTATTCGATTTATCTATTCA
AATGATCTTTTTCAATAGCGAGTAAGGTTGGCCTAATGATCGATAATAGTCATGTAACTTGTAAGTAATGAAAAGGCTTAGAAGGAAGTGGGTCAAGCCATGACGAATGA
CTTGCACACTTATGAATGTTAAAGAGAATAATTAATAGAATTCTGTCTTGTTTCTT
Protein sequence
MGGKGMRRRERNYRAAHGGYDRLPPPPDTSQVDTLPSKLRKLMSFTSPGPQESEKVSEDIQRKRKGEAVNTEKKSHRKDALGRSDVKSKGNGGNSQMPQLTGSDGDEDVQ
SKSSEKKNKKRKRKQVTDLRFEHSLEESSRRLKKRERRKKYQEAKKNKHKKAKTEEDLDFPKHEKIKFGDVVEAPLKLLAVPKAFKSAQVASQERKRLQAINEYRNRKGW
TSRPGIQIPSMTISPAV
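The protein sequence is the conceptual translation of the CDS. The sketch below illustrates that relationship on short fragments copied from the start and end of the CDS above. It assumes the standard genetic code (NCBI translation table 1), which is typical for nuclear plant genes but is not stated on this page, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS: matches the protein start MGGKGMRRRE.
print(translate("ATGGGAGGTAAAGGAATGCGGAGAAGAGAG"))
# Last 12 nucleotides of the CDS: matches the protein end PAV, then a stop.
print(translate("CCAGCTGTTTAA"))
```

Applying `translate` to the full CDS with line breaks removed should give the protein sequence listed above, provided the assumed genetic code holds.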