; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011345 (gene) of Snake gourd v1 genome

Gene IDTan0011345
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein PXR1
Genome locationLG06:71198452..71207422
RNA-Seq ExpressionTan0011345
SyntenyTan0011345
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040007.1 protein PXR1 [Cucumis melo var. makuwa]1.2e-9787.71Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPPDTS+ D LPSKLRKLMSF  SR QESE V EDIQRKRKR+AVNTEKK + KDA GRSD +SK NG N QMPQ 
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL

Query:  TGSD-DNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGSD D+VQSKSSE KKNKKRKRKQVTDLRFEDSLE+S+RRLK+RERRKKYQEAKKNKHKKAKTEE LDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV
Subjt:  TGSD-DNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPS
        ASQERKRLQAINEYRNRKGWTSRPGIQIPSMT++P+
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPS

KAG6574789.1 hypothetical protein SDJN03_25428, partial [Cucurbita argyrosperma subsp. sororia]5.7e-9783.47Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL
        MGGKG+RRRERNYRAAHGGYDRLPPPP+ S+ D LPSKLRKLM+F   RPQESEKV ED+QRKRKREAV+TEKK HPK+A G+SD +S+GNG N QMPQL
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL

Query:  TGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQVA
        TGSDD+VQSKSSE KKNKKRK+KQVTDLRFED  E+SNRRLK+RER++KY EAKKNKHKK KT+EDLDFPRHEKIKFGDVVEAPLKL AVPKAFKSAQVA
Subjt:  TGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQVA

Query:  SQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV
        SQERKRLQAI EYRNRKGWTSRPG+QIPSMT+SP+V
Subjt:  SQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV

XP_022959194.1 uncharacterized protein LOC111460256 [Cucurbita moschata]4.4e-9783.9Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL
        MGGKG+RRRERNYRAAHGGYDRLPPPP+ S+ D LPSKLRKLM+F   RPQESEKV ED+QRKRKREAV+TEKK HPK+A G+SD +SKGNG N QMPQL
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL

Query:  TGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQVA
        TGSDD+VQSK SE KKNKKRK+KQVTDLRFED  E+SNRRLK+RER+KKY EAKKNKHKK KT+EDLDFPRHEKIKFGDVVEAPLKL AVPKAFKSAQVA
Subjt:  TGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQVA

Query:  SQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV
        SQERKRLQAI EYRNRKGWTSRPG+QIPSMT+SP+V
Subjt:  SQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV

XP_023547832.1 protein PXR1 [Cucurbita pepo subsp. pepo]1.7e-9683.47Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL
        MGGKG+RRRERNYRAAHGGYDRLPPPP+ S+ D LPSKLRKLM+F   RPQESEKV ED+QRKRKREAV+TEKK HPK+A G+SD +SKG+G N Q+PQL
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL

Query:  TGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQVA
        TGSDD+VQSKSSEKKK KKRK+KQVTDLRFED  E+SNRRLK+RER+KKY EAKKNKHKK KT+EDLDFPRHEKIKFGDVVEAPLKL AVPKAFKSAQVA
Subjt:  TGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQVA

Query:  SQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV
        SQERKRLQAI EYRNRKGWTSRPG+QIPSMT+SP+V
Subjt:  SQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV

XP_038907272.1 protein PXR1 [Benincasa hispida]1.5e-9786.5Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPP+ S+ D LPSKLRKLMSF  S PQESEKV +DIQRKRKR+AVNTEKK H KDALGRSD +SKGNGE  Q PQL
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL

Query:  TGSD-DNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGS+ D+VQSKSSE KKNKK+KRKQVTDLRFEDS E+S+RRLK+RERRKKYQEAKKNKHKKA+TEEDLDFPRHEKIKFGDVVEAPLKL+AVPKAFKSAQV
Subjt:  TGSD-DNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV
        ASQERKR QAINEYRNRKGWTSRPGIQIPSMT+SP+V
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV

TrEMBL top hitse value%identityAlignment
A0A0A0KEN1 Uncharacterized protein3.7e-9485.23Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPPDTS+ D LPSKLRKLMSF  SR QE EKV EDIQRKRKREAVNT+KK + KDA G     SK NG N QMPQ 
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL

Query:  TGSDD--NVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQ
         GSDD  NV SKSSE KKNKKRKRKQVTDLRFEDSLE+S+RRLK+RER KKYQEAKKNKHKKAKTEE LDFP+HEKIKFGDVVEAPLKLLAVPKAFKSAQ
Subjt:  TGSDD--NVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQ

Query:  VASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPS
        VASQERKRLQAINEYRNRKGWTSRPGIQIPSMT+SP+
Subjt:  VASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPS

A0A1S4E2Q2 LOW QUALITY PROTEIN: protein PXR11.8e-9686.44Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPPDTS+ D LPSKLRKLMSF  SR QESE V ED+QRKRKR+A NTEKK + KDA GRSD +SK NG N QMPQ 
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL

Query:  TGSD-DNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGSD D+VQSKSSE KKNKKRKRKQVTDLRFEDSLE+S+RRLK+RERRKKYQEAKKNKHKKAKTEE LDFPRHE IKFGDVVEAPLKLLAVPKAFKSAQV
Subjt:  TGSD-DNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPS
        ASQERKRLQAINEYRNRKGWTSRPGIQIPSMT++P+
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPS

A0A5A7TEP7 Protein PXR15.6e-9887.71Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL
        MGGKGMRRRERNYRAAHGGYDRLPPPPDTS+ D LPSKLRKLMSF  SR QESE V EDIQRKRKR+AVNTEKK + KDA GRSD +SK NG N QMPQ 
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL

Query:  TGSD-DNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        TGSD D+VQSKSSE KKNKKRKRKQVTDLRFEDSLE+S+RRLK+RERRKKYQEAKKNKHKKAKTEE LDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV
Subjt:  TGSD-DNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPS
        ASQERKRLQAINEYRNRKGWTSRPGIQIPSMT++P+
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPS

A0A6J1H3V7 uncharacterized protein LOC1114602562.1e-9783.9Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL
        MGGKG+RRRERNYRAAHGGYDRLPPPP+ S+ D LPSKLRKLM+F   RPQESEKV ED+QRKRKREAV+TEKK HPK+A G+SD +SKGNG N QMPQL
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQL

Query:  TGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQVA
        TGSDD+VQSK SE KKNKKRK+KQVTDLRFED  E+SNRRLK+RER+KKY EAKKNKHKK KT+EDLDFPRHEKIKFGDVVEAPLKL AVPKAFKSAQVA
Subjt:  TGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQVA

Query:  SQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV
        SQERKRLQAI EYRNRKGWTSRPG+QIPSMT+SP+V
Subjt:  SQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV

A0A6J1KUZ8 probable H/ACA ribonucleoprotein complex subunit 42.6e-9583.12Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVN-TEKKPHPKDALGRSDGESKGNGENLQMPQ
        MGGKG+RRRERNYRAAHGGYDRLPPPP+ S+ D LPSKLRKLMSF   RPQESEKV ED+QRKRKREAV+ TEKK HPK+A G+SD +SKGNG N QMPQ
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVN-TEKKPHPKDALGRSDGESKGNGENLQMPQ

Query:  LTGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV
        LTGSDD+VQSKS+E KKNKKRK+KQVTDLRFED  E+SNRRLK+RER+KKY EAKKNKHKK  ++EDLDFPRHEKIKFGDVVEAPLKL AVPKAFKSAQV
Subjt:  LTGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQV

Query:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV
        ASQERKRLQAI EYRNRKGWTSRPG+QIPSMT+SP+V
Subjt:  ASQERKRLQAINEYRNRKGWTSRPGIQIPSMTLSPSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G45520.1 unknown protein6.4e-4651.49Show/hide
Query:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEK-VLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQ
        MGGKG +RRE+NY AAHGG  RLPPPPD SK D +PS LR LM++    P +S K V+E  ++ +K E V    +         SDG             
Subjt:  MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEK-VLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQ

Query:  LTGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDL--DFPRHEKIKFGDVVEAPLKLLAVPKAFKSA
            DD+V      +KK KKRKR Q+TDLRFE+ L + + R KR+ER+KKY EAKK K  K KTE+ L  +FP+HE+I+FGDVV+APLKL  VPKA KS 
Subjt:  LTGSDDNVQSKSSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDL--DFPRHEKIKFGDVVEAPLKLLAVPKAFKSA

Query:  QVASQERKRLQAINEYRNRKGWTSRPGIQIPSMTL
          ASQER RLQAI+ YR+RKGWT+RPG+ IP++ +
Subjt:  QVASQERKRLQAINEYRNRKGWTSRPGIQIPSMTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGCAAAGGAATGAGGAGGAGAGAGCGGAATTACAGAGCCGCACATGGAGGTTACGATCGTCTCCCACCGCCGCCGGACACTTCCAAAGGGGATGCTCTACCTTC
CAAACTCCGCAAGTTAATGTCCTTCGCTCCCTCTCGACCTCAAGAGTCTGAGAAGGTCTTGGAGGATATTCAGCGAAAGCGCAAGAGAGAAGCCGTTAATACTGAGAAGA
AACCCCATCCAAAGGATGCTCTGGGAAGATCTGACGGGGAGAGCAAGGGTAATGGTGAGAATTTACAAATGCCTCAGCTTACAGGTAGTGATGATAACGTGCAAAGCAAG
TCTAGTGAGAAGAAGAAAAACAAAAAACGAAAGAGAAAGCAGGTTACTGACCTTCGTTTTGAAGACTCGTTGGAGGATTCAAATCGACGTTTAAAGAGACGGGAACGCCG
GAAAAAATATCAGGAGGCAAAGAAAAATAAACATAAAAAAGCTAAGACAGAGGAGGATCTAGACTTCCCCAGACATGAAAAAATCAAATTTGGAGACGTGGTTGAGGCTC
CACTGAAGTTACTTGCAGTTCCGAAGGCGTTTAAATCTGCACAAGTTGCTTCTCAAGAGAGGAAGCGATTGCAGGCTATAAATGAATATAGAAACCGCAAGGGGTGGACC
TCAAGGCCAGGGATACAAATACCTTCAATGACTTTATCGCCATCTGTTTAA
mRNA sequenceShow/hide mRNA sequence
GTCTGATCGGCACCAAAACTCGCCAAAGCCAAAACCCCCAACCGAGTTCCTCGTCGTCGGCGGCGGCGCTTGAGGCTATACTCCGTCTTCCAACTCCGACGTACGCTGCA
AATTTCATTCCAGAGAGATGGGAGGCAAAGGAATGAGGAGGAGAGAGCGGAATTACAGAGCCGCACATGGAGGTTACGATCGTCTCCCACCGCCGCCGGACACTTCCAAA
GGGGATGCTCTACCTTCCAAACTCCGCAAGTTAATGTCCTTCGCTCCCTCTCGACCTCAAGAGTCTGAGAAGGTCTTGGAGGATATTCAGCGAAAGCGCAAGAGAGAAGC
CGTTAATACTGAGAAGAAACCCCATCCAAAGGATGCTCTGGGAAGATCTGACGGGGAGAGCAAGGGTAATGGTGAGAATTTACAAATGCCTCAGCTTACAGGTAGTGATG
ATAACGTGCAAAGCAAGTCTAGTGAGAAGAAGAAAAACAAAAAACGAAAGAGAAAGCAGGTTACTGACCTTCGTTTTGAAGACTCGTTGGAGGATTCAAATCGACGTTTA
AAGAGACGGGAACGCCGGAAAAAATATCAGGAGGCAAAGAAAAATAAACATAAAAAAGCTAAGACAGAGGAGGATCTAGACTTCCCCAGACATGAAAAAATCAAATTTGG
AGACGTGGTTGAGGCTCCACTGAAGTTACTTGCAGTTCCGAAGGCGTTTAAATCTGCACAAGTTGCTTCTCAAGAGAGGAAGCGATTGCAGGCTATAAATGAATATAGAA
ACCGCAAGGGGTGGACCTCAAGGCCAGGGATACAAATACCTTCAATGACTTTATCGCCATCTGTTTAATTGGGCAGCAGCAGATATCGTTCCATTAGGAGGGCCATAAGC
TGGCAATGGGAATCTCTCTCTCTGCATCACTAACCAATTACAGGAGAGATATCAGAATGTGCTCTAAAAGGTAGGATTTGTTCTAAAAATTACCAAGCGTTCAACTGAGA
AATCAAAGGTAAGATTCTGAAAATCTTATCAGAAACAGATTCTTTCATGGAAGCTAACCATCTTATGCTCCGTTGCTTCATATGAGAGTAATAGGCTTGTTTATGAAGTT
GTTTATCCATTGAAATTATGATTTGAATTTATATGGAAGTTTTGTAGTTATGTATCGAAAGAGTAGTATGAGATAAAAAAACATTTCCGATTCTAGCGTGCTTTCAATTG
TTGTATCAAATTGCGTCTTAAACATTGGTAAGTGTTGCACTTTTTGTCTTAAA
Protein sequenceShow/hide protein sequence
MGGKGMRRRERNYRAAHGGYDRLPPPPDTSKGDALPSKLRKLMSFAPSRPQESEKVLEDIQRKRKREAVNTEKKPHPKDALGRSDGESKGNGENLQMPQLTGSDDNVQSK
SSEKKKNKKRKRKQVTDLRFEDSLEDSNRRLKRRERRKKYQEAKKNKHKKAKTEEDLDFPRHEKIKFGDVVEAPLKLLAVPKAFKSAQVASQERKRLQAINEYRNRKGWT
SRPGIQIPSMTLSPSV