; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G016350 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G016350
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptiontranscription factor UNE10
Genome locationchr09:24665950..24667138
RNA-Seq ExpressionLsi09G016350
SyntenyLsi09G016350
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR031066 - Basic helix-loop-helix (bHLH) transcription factors ALC-like, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK24759.1 transcription factor UNE10 [Cucumis melo var. makuwa]5.0e-11180.35Show/hide
Query:  QCVPNWDLSEPPPSAAA-DPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVNQ
        QCVPNWDLSEPPPS+AA    PFHSSSAA DVVP+FEYEVAELTWENGQLAMHGLGLPRVTGK+ N SG  GGGGG GSK+TWDNKPARASGTLESLVNQ
Subjt:  QCVPNWDLSEPPPSAAA-DPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVNQ

Query:  GTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARVVH
        GTR+GK NISFDIN DD   GGANDL PWF+DHHRQT TAS A   DAMVPCDGDK+A  GG   SA +SSDIP  AREEDEDC VIHGKR RVVARVVH
Subjt:  GTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARVVH

Query:  ASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
        AS EWS CRNQISVSG+RES QKVTL  TR RNFAAV       ATTATSQGSLDNT+SDKPCVKNTT+ TTTDDHDSVCHSTHQ
Subjt:  ASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

XP_008459727.1 PREDICTED: transcription factor UNE10 [Cucumis melo]4.5e-11280.49Show/hide
Query:  MSQCVPNWDLSEPPPSAAA-DPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLV
        MSQCVPNWDLSEPPPS+AA    PFHSSSAA DVVP+FEYEVAELTWENGQLAMHGLGLPRVTGK+ N SG  GGGGG GSK+TWDNKPARASGTLESLV
Subjt:  MSQCVPNWDLSEPPPSAAA-DPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLV

Query:  NQGTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARV
        NQGTR+GK NISFDIN DD   GGANDL PWF+DHHRQT TAS A   DAMVPCDGDK+A  GG   SA +SSDIP  AREEDEDC VIHGKR RVVARV
Subjt:  NQGTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARV

Query:  VHASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
        VHAS EWS CRNQISVSG+RES QKVTL  TR RNFAAV       ATTATSQGSLDNT+SDKPCVKNTT+ TTTDDHDSVCHSTHQ
Subjt:  VHASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

XP_011656866.1 transcription factor UNE10 [Cucumis sativus]8.5e-11179.51Show/hide
Query:  MSQCVPNWDLSEPPP-SAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLV
        MSQCVPNWDLSEPPP SAAA   PF SSS+A DVVP+FEYEVAELTWENGQL+MHGLGLPRVTGKI N     GGGGG GSKYTWDNKPARASGTLESLV
Subjt:  MSQCVPNWDLSEPPP-SAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLV

Query:  NQGTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAA-AGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVAR
        NQGTR+GK NISFDIN DD   GGANDLVPWF+DHHRQTPTAS A   DAMVPCDG+K+A  GGG D    +SSDIP AAR+EDEDCRVIHGKR +VVAR
Subjt:  NQGTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAA-AGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVAR

Query:  VVHASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
        VVHA  EWS CRNQISVSG+RES QKVTL  +RDRNF AV       ATTATSQGSLDNTSSDKPCVKNTT+ TTTDDHDSVCHSTHQ
Subjt:  VVHASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

XP_022958803.1 transcription factor UNE10 isoform X1 [Cucurbita moschata]5.0e-9573.4Show/hide
Query:  MSQCVPNWDLSEPPPS--AAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESL
        MSQCVPNWD+S+PPPS  AAAD  P+HSSSAA DVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKI +G    GGGGG GSKYTW+NKPARASGTLE L
Subjt:  MSQCVPNWDLSEPPPS--AAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESL

Query:  VNQGTRYGKNISFDINADDADDGGANDLVPWFADHHR---QTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVV
        VNQGTR+GK I FD+N DDA  GG NDLVPWF+DHH+   QTP ASAAT MDAMVPCDGDK+AA GGA  + VESSD P A   EDED RV   KR RVV
Subjt:  VNQGTRYGKNISFDINADDADDGGANDLVPWFADHHR---QTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVV

Query:  ARVVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
        AR VHA REWS C+NQISVSGS       TL T DRNFAA A+  TS GSLDNTSS K CV      TTTDDHDSVCHST+Q
Subjt:  ARVVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

XP_038875721.1 transcription factor UNE10 [Benincasa hispida]5.3e-12185.31Show/hide
Query:  MSQCVPNWDLSEPPPSAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVN
        MSQCVPNWDLSEPP SAAAD   FHSSSAA DVVPMFEYEVAELTWENGQLAMHG+GLPRVTGKI NG G  GGG GGG KYTWDNKPARASGTLESLVN
Subjt:  MSQCVPNWDLSEPPPSAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVN

Query:  QGTRYGKNISFDINADDAD-DGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAG-GGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARV
        QGTR+GKNI FDINADD D DG ANDLVPWF+DHHRQTPTASAATAMDAMVPCDGDKAAA  GGA  S VESSDIP AARE DEDCRVIHGKR RVVARV
Subjt:  QGTRYGKNISFDINADDAD-DGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAG-GGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARV

Query:  VHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
        VHASREWSGCRNQISVSGSRE+ QK  L TRDRNFAAV       ATTATSQGSLD TSSD  CVKNTTI TTTDDHDSVCHSTHQ
Subjt:  VHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

TrEMBL top hitse value%identityAlignment
A0A0A0KCL1 BHLH domain-containing protein4.1e-11179.51Show/hide
Query:  MSQCVPNWDLSEPPP-SAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLV
        MSQCVPNWDLSEPPP SAAA   PF SSS+A DVVP+FEYEVAELTWENGQL+MHGLGLPRVTGKI N     GGGGG GSKYTWDNKPARASGTLESLV
Subjt:  MSQCVPNWDLSEPPP-SAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLV

Query:  NQGTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAA-AGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVAR
        NQGTR+GK NISFDIN DD   GGANDLVPWF+DHHRQTPTAS A   DAMVPCDG+K+A  GGG D    +SSDIP AAR+EDEDCRVIHGKR +VVAR
Subjt:  NQGTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAA-AGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVAR

Query:  VVHASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
        VVHA  EWS CRNQISVSG+RES QKVTL  +RDRNF AV       ATTATSQGSLDNTSSDKPCVKNTT+ TTTDDHDSVCHSTHQ
Subjt:  VVHASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

A0A1S3CAX9 transcription factor UNE102.2e-11280.49Show/hide
Query:  MSQCVPNWDLSEPPPSAAA-DPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLV
        MSQCVPNWDLSEPPPS+AA    PFHSSSAA DVVP+FEYEVAELTWENGQLAMHGLGLPRVTGK+ N SG  GGGGG GSK+TWDNKPARASGTLESLV
Subjt:  MSQCVPNWDLSEPPPSAAA-DPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLV

Query:  NQGTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARV
        NQGTR+GK NISFDIN DD   GGANDL PWF+DHHRQT TAS A   DAMVPCDGDK+A  GG   SA +SSDIP  AREEDEDC VIHGKR RVVARV
Subjt:  NQGTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARV

Query:  VHASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
        VHAS EWS CRNQISVSG+RES QKVTL  TR RNFAAV       ATTATSQGSLDNT+SDKPCVKNTT+ TTTDDHDSVCHSTHQ
Subjt:  VHASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

A0A5D3DN77 Transcription factor UNE102.4e-11180.35Show/hide
Query:  QCVPNWDLSEPPPSAAA-DPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVNQ
        QCVPNWDLSEPPPS+AA    PFHSSSAA DVVP+FEYEVAELTWENGQLAMHGLGLPRVTGK+ N SG  GGGGG GSK+TWDNKPARASGTLESLVNQ
Subjt:  QCVPNWDLSEPPPSAAA-DPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVNQ

Query:  GTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARVVH
        GTR+GK NISFDIN DD   GGANDL PWF+DHHRQT TAS A   DAMVPCDGDK+A  GG   SA +SSDIP  AREEDEDC VIHGKR RVVARVVH
Subjt:  GTRYGK-NISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARVVH

Query:  ASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
        AS EWS CRNQISVSG+RES QKVTL  TR RNFAAV       ATTATSQGSLDNT+SDKPCVKNTT+ TTTDDHDSVCHSTHQ
Subjt:  ASREWSGCRNQISVSGSRESSQKVTL-KTRDRNFAAV-------ATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

A0A6J1H440 transcription factor UNE10 isoform X12.4e-9573.4Show/hide
Query:  MSQCVPNWDLSEPPPS--AAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESL
        MSQCVPNWD+S+PPPS  AAAD  P+HSSSAA DVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKI +G    GGGGG GSKYTW+NKPARASGTLE L
Subjt:  MSQCVPNWDLSEPPPS--AAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESL

Query:  VNQGTRYGKNISFDINADDADDGGANDLVPWFADHHR---QTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVV
        VNQGTR+GK I FD+N DDA  GG NDLVPWF+DHH+   QTP ASAAT MDAMVPCDGDK+AA GGA  + VESSD P A   EDED RV   KR RVV
Subjt:  VNQGTRYGKNISFDINADDADDGGANDLVPWFADHHR---QTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVV

Query:  ARVVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
        AR VHA REWS C+NQISVSGS       TL T DRNFAA A+  TS GSLDNTSS K CV      TTTDDHDSVCHST+Q
Subjt:  ARVVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

A0A6J1H666 transcription factor UNE10 isoform X24.7e-9171.43Show/hide
Query:  MSQCVPNWDLSEPPPSAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVN
        MSQCVPNWD+S+PPPSAAAD           DVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKI +G    GGGGG GSKYTW+NKPARASGTLE LVN
Subjt:  MSQCVPNWDLSEPPPSAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVN

Query:  QGTRYGKNISFDINADDADDGGANDLVPWFADHHR---QTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVAR
        QGTR+GK I FD+N DDA  GG NDLVPWF+DHH+   QTP ASAAT MDAMVPCDGDK+AA GGA  + VESSD P A   EDED RV   KR RVVAR
Subjt:  QGTRYGKNISFDINADDADDGGANDLVPWFADHHR---QTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVAR

Query:  VVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
         VHA REWS C+NQISVSGS       TL T DRNFAA A+  TS GSLDNTSS K CV      TTTDDHDSVCHST+Q
Subjt:  VVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ

SwissProt top hitse value%identityAlignment
Q8GZ38 Transcription factor UNE102.5e-2036.92Show/hide
Query:  MSQCVPNWDLSEPPPSAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVN
        MSQCVPN  + + P +A         S+ AAD +P+ +YEVAELTWENGQL +HGLG PRVT                 +KY+       A GTLES+V+
Subjt:  MSQCVPNWDLSEPPPSAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVN

Query:  QGTRYGKNISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDG---DKAAAGGGADNSAVESSDIPGAAREEDEDCRVI-HGKRARVVA
        Q TR              +    ++LVPWF  HHR   ++ AA AMDA+VPC     ++ +  GG  ++ V S            D R +  GKRARV  
Subjt:  QGTRYGKNISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDG---DKAAAGGGADNSAVESSDIPGAAREEDEDCRVI-HGKRARVVA

Query:  RVVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCV
             + EWSG             SQ++T+ T D  F     T+TS GS DNT  D   V
Subjt:  RVVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCV

Arabidopsis top hitse value%identityAlignment
AT4G00050.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.8e-2437.01Show/hide
Query:  MSQCVPNWDLSEPPPSAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVN
        MSQCVPN  + + P +A         S+ AAD +P+ +YEVAELTWENGQL +HGLG PRVT                 +KY+       A GTLES+V+
Subjt:  MSQCVPNWDLSEPPPSAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVN

Query:  QGTRYGKNISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDG---DKAAAGGGADNSAVESSDIPGAAREEDEDCRVI-HGKRARVVA
        Q TR              +    ++LVPWF  HHR   ++ AA AMDA+VPC     ++ +  GG  ++ V S            D R +  GKRARV  
Subjt:  QGTRYGKNISFDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDG---DKAAAGGGADNSAVESSDIPGAAREEDEDCRVI-HGKRARVVA

Query:  RVVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ
             + EWSG             SQ++T+ T D  F     T+TS GS DN               T DDHDSVCHS  Q
Subjt:  RVVHASREWSGCRNQISVSGSRESSQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCAGTGTGTTCCTAATTGGGACCTCTCCGAACCGCCGCCCTCCGCCGCAGCCGACCCTTCCCCTTTCCACTCCTCATCCGCCGCTGCCGACGTCGTTCCCATGTT
TGAGTATGAAGTGGCGGAGCTGACATGGGAAAATGGGCAATTGGCTATGCATGGGCTGGGGCTGCCGAGGGTAACCGGCAAAATTCCGAACGGCAGCGGCAGTGTTGGTG
GTGGTGGTGGTGGTGGTTCTAAGTACACGTGGGATAATAAGCCGGCACGTGCGAGTGGCACGCTTGAGTCTTTGGTGAACCAAGGAACTCGCTATGGTAAGAATATTAGT
TTTGATATTAACGCCGACGACGCCGACGATGGCGGTGCCAATGATTTGGTGCCGTGGTTTGCCGACCACCATAGGCAAACGCCGACGGCTTCCGCTGCAACGGCTATGGA
CGCGATGGTTCCATGCGACGGCGACAAGGCGGCGGCGGGTGGTGGCGCCGACAATTCGGCGGTGGAGTCGAGTGATATTCCTGGGGCGGCGCGTGAGGAGGACGAGGACT
GTAGAGTGATCCATGGGAAACGAGCAAGGGTAGTGGCGCGTGTAGTTCACGCGTCAAGGGAGTGGAGTGGCTGCCGGAATCAGATCAGCGTGAGTGGCAGCCGTGAAAGT
AGTCAGAAAGTGACGTTAAAAACTCGCGATAGGAATTTCGCCGCCGTAGCCACCACCGCGACGTCACAAGGGTCGCTTGATAATACAAGCTCGGACAAGCCGTGCGTTAA
AAACACCACCATCACCACTACCACCGACGACCATGATTCTGTCTGCCATAGCACACATCAGGCAAGATTTTTATTAACTAACATCAAAATCTAA
mRNA sequenceShow/hide mRNA sequence
GGACAAATAAGTCCATCAGCCTCGATCTAAAACACAGCAGAGAATAGCACTCAATTTTATATTTTCAAAATATTATTATTATTATCCAAAATTTCATCCAAATTCCAACC
TTAATTAATTAATTATATTCATTCTTTTCTTATAACACAAATACACCGTCTCTCTTTTTTCTCCACCACCAAAACAAAAAAACACACACAACACCAAAAAAAAAAAAAAA
AAATGAGTCAGTGTGTTCCTAATTGGGACCTCTCCGAACCGCCGCCCTCCGCCGCAGCCGACCCTTCCCCTTTCCACTCCTCATCCGCCGCTGCCGACGTCGTTCCCATG
TTTGAGTATGAAGTGGCGGAGCTGACATGGGAAAATGGGCAATTGGCTATGCATGGGCTGGGGCTGCCGAGGGTAACCGGCAAAATTCCGAACGGCAGCGGCAGTGTTGG
TGGTGGTGGTGGTGGTGGTTCTAAGTACACGTGGGATAATAAGCCGGCACGTGCGAGTGGCACGCTTGAGTCTTTGGTGAACCAAGGAACTCGCTATGGTAAGAATATTA
GTTTTGATATTAACGCCGACGACGCCGACGATGGCGGTGCCAATGATTTGGTGCCGTGGTTTGCCGACCACCATAGGCAAACGCCGACGGCTTCCGCTGCAACGGCTATG
GACGCGATGGTTCCATGCGACGGCGACAAGGCGGCGGCGGGTGGTGGCGCCGACAATTCGGCGGTGGAGTCGAGTGATATTCCTGGGGCGGCGCGTGAGGAGGACGAGGA
CTGTAGAGTGATCCATGGGAAACGAGCAAGGGTAGTGGCGCGTGTAGTTCACGCGTCAAGGGAGTGGAGTGGCTGCCGGAATCAGATCAGCGTGAGTGGCAGCCGTGAAA
GTAGTCAGAAAGTGACGTTAAAAACTCGCGATAGGAATTTCGCCGCCGTAGCCACCACCGCGACGTCACAAGGGTCGCTTGATAATACAAGCTCGGACAAGCCGTGCGTT
AAAAACACCACCATCACCACTACCACCGACGACCATGATTCTGTCTGCCATAGCACACATCAGGCAAGATTTTTATTAACTAACATCAAAATCTAA
Protein sequenceShow/hide protein sequence
MSQCVPNWDLSEPPPSAAADPSPFHSSSAAADVVPMFEYEVAELTWENGQLAMHGLGLPRVTGKIPNGSGSVGGGGGGGSKYTWDNKPARASGTLESLVNQGTRYGKNIS
FDINADDADDGGANDLVPWFADHHRQTPTASAATAMDAMVPCDGDKAAAGGGADNSAVESSDIPGAAREEDEDCRVIHGKRARVVARVVHASREWSGCRNQISVSGSRES
SQKVTLKTRDRNFAAVATTATSQGSLDNTSSDKPCVKNTTITTTTDDHDSVCHSTHQARFLLTNIKI