; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041297 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041297
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionHomeobox-leucine zipper protein
Genome locationchr13:15180740..15189011
RNA-Seq ExpressionLag0041297
SyntenyLag0041297
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR009057 - Homeobox-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022942450.1 homeobox-leucine zipper protein HOX3-like [Cucurbita moschata]6.5e-5194.87Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSF-SSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL
        MAVLPT SSRLDLTISVPGF SF SSPLPPS  RDLDMNRAPDEEEWM+GSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL
Subjt:  MAVLPTTSSRLDLTISVPGFDSF-SSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL

Query:  KLKPRQVEVWFQNRRAR
        KLKPRQVEVWFQNRRAR
Subjt:  KLKPRQVEVWFQNRRAR

XP_022976582.1 homeobox-leucine zipper protein HOX3-like [Cucurbita maxima]9.4e-5093.16Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSF-SSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL
        MAVLPT SSRLDLTISVPG  SF SSPLPPS  RDLDMNRAPDEEEWM+GSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKE+LAEVL
Subjt:  MAVLPTTSSRLDLTISVPGFDSF-SSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL

Query:  KLKPRQVEVWFQNRRAR
        KLKPRQVEVWFQNRRAR
Subjt:  KLKPRQVEVWFQNRRAR

XP_031741099.1 homeobox-leucine zipper protein HOX3 isoform X1 [Cucumis sativus]6.1e-4986.55Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE--NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV
        MA+LPT++SRLDL+ISVPGF+SFSS LPPSVGRDLDMN+APDEEEWM+G+MEEDEE  NN  ++PRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV
Subjt:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE--NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV

Query:  LKLKPRQVEVWFQNRRARW
        LKLKPRQ+EVWFQNRRARW
Subjt:  LKLKPRQVEVWFQNRRARW

XP_031741103.1 homeobox-leucine zipper protein ATHB-17 isoform X7 [Cucumis sativus]2.2e-5970.49Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE--NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV
        MA+LPT++SRLDL+ISVPGF+SFSS LPPSVGRDLDMN+APDEEEWM+G+MEEDEE  NN  ++PRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV
Subjt:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE--NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV

Query:  LKLKPRQVEVWFQNRRARWEQAKANRDGMRVSKEMVRAIDRTEQETAEGGGGVEGDEGGAADSYFPAQLRATAGVQPHNVPSL
        LKLKPRQ+EVWFQNRRA                       RT+QET +GGGG + +EGGA D  F AQLRA AGV+ HNVP+L
Subjt:  LKLKPRQVEVWFQNRRARWEQAKANRDGMRVSKEMVRAIDRTEQETAEGGGGVEGDEGGAADSYFPAQLRATAGVQPHNVPSL

XP_038893065.1 homeobox-leucine zipper protein HOX3-like [Benincasa hispida]3.2e-5092.31Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGS-HPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL
        MA+LPTTSSRLDL+ISVPGF+SFSSPLPPSV RDLDMN+APDEEEWM+GSMEEDEENN+GS +PRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL
Subjt:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGS-HPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL

Query:  KLKPRQVEVWFQNRRAR
        KLKPRQVEVWFQNRRAR
Subjt:  KLKPRQVEVWFQNRRAR

TrEMBL top hitse value%identityAlignment
A0A0A0KJE2 Homeobox domain-containing protein5.5e-4886.44Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE--NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV
        MA+LPT++SRLDL+ISVPGF+SFSS LPPSVGRDLDMN+APDEEEWM+G+MEEDEE  NN  ++PRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV
Subjt:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE--NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV

Query:  LKLKPRQVEVWFQNRRAR
        LKLKPRQ+EVWFQNRRAR
Subjt:  LKLKPRQVEVWFQNRRAR

A0A1S3BYV1 homeobox-leucine zipper protein HOX3-like2.8e-4484.75Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE--NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV
        MA+LPT +SRLDL+ISVPGF+SFSS L PS GRDLDMN+APDEEEWM+G+MEEDEE  NN  ++PRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV
Subjt:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE--NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEV

Query:  LKLKPRQVEVWFQNRRAR
        L LKPRQVEVWFQNRRAR
Subjt:  LKLKPRQVEVWFQNRRAR

A0A6J1D7F8 homeobox-leucine zipper protein HOX3-like6.3e-4483.33Show/hide
Query:  LHQQTPMAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE----NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPK
        LHQQ PMA     SSRLDLTISVPGFDSF S LP SV RDLDMNRAP+EEEWM+GSMEEDEE    NN G HPRKKLRLTKEQSHLLEQ+FRQNHTLNPK
Subjt:  LHQQTPMAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEE----NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPK

Query:  QKETLAEVLKLKPRQVEVWFQNRRAR
        QKE LAE+LKLKPRQVEVWFQNRRAR
Subjt:  QKETLAEVLKLKPRQVEVWFQNRRAR

A0A6J1FWC2 homeobox-leucine zipper protein HOX3-like3.1e-5194.87Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSF-SSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL
        MAVLPT SSRLDLTISVPGF SF SSPLPPS  RDLDMNRAPDEEEWM+GSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL
Subjt:  MAVLPTTSSRLDLTISVPGFDSF-SSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL

Query:  KLKPRQVEVWFQNRRAR
        KLKPRQVEVWFQNRRAR
Subjt:  KLKPRQVEVWFQNRRAR

A0A6J1IG52 homeobox-leucine zipper protein HOX3-like4.5e-5093.16Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSF-SSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL
        MAVLPT SSRLDLTISVPG  SF SSPLPPS  RDLDMNRAPDEEEWM+GSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKE+LAEVL
Subjt:  MAVLPTTSSRLDLTISVPGFDSF-SSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVL

Query:  KLKPRQVEVWFQNRRAR
        KLKPRQVEVWFQNRRAR
Subjt:  KLKPRQVEVWFQNRRAR

SwissProt top hitse value%identityAlignment
P46603 Homeobox-leucine zipper protein HAT92.4e-1653.4Show/hide
Query:  TISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQVEVWFQNR
        T S  G  SFSS       RD     +P+EEE     + +  E+  G   RKKLRLTK+QS LLE+SF+ + TLNPKQK+ LA  L L+PRQVEVWFQNR
Subjt:  TISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQVEVWFQNR

Query:  RAR
        RAR
Subjt:  RAR

Q0JKX1 Homeobox-leucine zipper protein HOX31.0e-2256.8Show/hide
Query:  TTSSRLDLTISVPGFDSFSSP----LPPSVG-----RDLDMNR---APDEEEWMVGSMEEDEENN--SGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQ
        T+ S L+LT++VPG  S  S        + G     RDLD+N+     +EEE+ +GS+EEDEE     G H  KKLRL+KEQS LLE+SFR NHTL PKQ
Subjt:  TTSSRLDLTISVPGFDSFSSP----LPPSVG-----RDLDMNR---APDEEEWMVGSMEEDEENN--SGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQ

Query:  KETLAEVLKLKPRQVEVWFQNRRAR
        KE LA  LKL+PRQVEVWFQNRRAR
Subjt:  KETLAEVLKLKPRQVEVWFQNRRAR

Q8GXM7 Homeobox-leucine zipper protein ATHB-X1.7e-2558.73Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVG-----RDLDMNRAP---DEEEWMVGSME--EDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPK
        MA+ P +SS LDLTIS+P F    SP  PS+G     RD D+N+ P   ++ EWM+G+     ++++NSG   RKKLRLTKEQSHLLE+SF QNHTL PK
Subjt:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVG-----RDLDMNRAP---DEEEWMVGSME--EDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPK

Query:  QKETLAEVLKLKPRQVEVWFQNRRAR
        QK+ LA  LKL  RQVEVWFQNRRAR
Subjt:  QKETLAEVLKLKPRQVEVWFQNRRAR

Q8S9N6 Homeobox-leucine zipper protein ATHB-172.8e-2855Show/hide
Query:  TTPVCRFFYLTGHVFPSQTNLS---LVARESHLHQQTPMAVLPTTSSRLDLTISVPGFDSFSSPLP---PSVGRD---LDMNRAPDEEEWMVGSMEEDEE
        T  VC  FY+   VF S    S   L  +  +      MA+LP  SS LDLTISVPGF   SSPL       GRD   LDMNR P  E+   G  EE   
Subjt:  TTPVCRFFYLTGHVFPSQTNLS---LVARESHLHQQTPMAVLPTTSSRLDLTISVPGFDSFSSPLP---PSVGRD---LDMNRAPDEEEWMVGSMEEDEE

Query:  NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQVEVWFQNRRAR
        ++  + PRKKLRLT+EQS LLE SFRQNHTLNPKQKE LA+ L L+PRQ+EVWFQNRRAR
Subjt:  NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQVEVWFQNRRAR

Q9XH38 Homeobox-leucine zipper protein HOX31.0e-2256.8Show/hide
Query:  TTSSRLDLTISVPGFDSFSSP----LPPSVG-----RDLDMNR---APDEEEWMVGSMEEDEENN--SGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQ
        T+ S L+LT++VPG  S  S        + G     RDLD+N+     +EEE+ +GS+EEDEE     G H  KKLRL+KEQS LLE+SFR NHTL PKQ
Subjt:  TTSSRLDLTISVPGFDSFSSP----LPPSVG-----RDLDMNR---APDEEEWMVGSMEEDEENN--SGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQ

Query:  KETLAEVLKLKPRQVEVWFQNRRAR
        KE LA  LKL+PRQVEVWFQNRRAR
Subjt:  KETLAEVLKLKPRQVEVWFQNRRAR

Arabidopsis top hitse value%identityAlignment
AT1G70920.1 homeobox-leucine zipper protein 181.2e-2658.73Show/hide
Query:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVG-----RDLDMNRAP---DEEEWMVGSME--EDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPK
        MA+ P +SS LDLTIS+P F    SP  PS+G     RD D+N+ P   ++ EWM+G+     ++++NSG   RKKLRLTKEQSHLLE+SF QNHTL PK
Subjt:  MAVLPTTSSRLDLTISVPGFDSFSSPLPPSVG-----RDLDMNRAP---DEEEWMVGSME--EDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPK

Query:  QKETLAEVLKLKPRQVEVWFQNRRAR
        QK+ LA  LKL  RQVEVWFQNRRAR
Subjt:  QKETLAEVLKLKPRQVEVWFQNRRAR

AT2G01430.1 homeobox-leucine zipper protein 172.0e-2955Show/hide
Query:  TTPVCRFFYLTGHVFPSQTNLS---LVARESHLHQQTPMAVLPTTSSRLDLTISVPGFDSFSSPLP---PSVGRD---LDMNRAPDEEEWMVGSMEEDEE
        T  VC  FY+   VF S    S   L  +  +      MA+LP  SS LDLTISVPGF   SSPL       GRD   LDMNR P  E+   G  EE   
Subjt:  TTPVCRFFYLTGHVFPSQTNLS---LVARESHLHQQTPMAVLPTTSSRLDLTISVPGFDSFSSPLP---PSVGRD---LDMNRAPDEEEWMVGSMEEDEE

Query:  NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQVEVWFQNRRAR
        ++  + PRKKLRLT+EQS LLE SFRQNHTLNPKQKE LA+ L L+PRQ+EVWFQNRRAR
Subjt:  NNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQVEVWFQNRRAR

AT2G22800.1 Homeobox-leucine zipper protein family1.7e-1753.4Show/hide
Query:  TISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQVEVWFQNR
        T S  G  SFSS       RD     +P+EEE     + +  E+  G   RKKLRLTK+QS LLE+SF+ + TLNPKQK+ LA  L L+PRQVEVWFQNR
Subjt:  TISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQVEVWFQNR

Query:  RAR
        RAR
Subjt:  RAR

AT4G37790.1 Homeobox-leucine zipper protein family9.4e-1648.18Show/hide
Query:  TISVPGFDSFSSPLPPSVGRDLDMNRAPDEEE-------WMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQV
        T S  G  SFSS     V R+ +++    EEE        +   + +D ++  G   RKKLRLTK+QS LLE +F+ + TLNPKQK+ LA  L L+PRQV
Subjt:  TISVPGFDSFSSPLPPSVGRDLDMNRAPDEEE-------WMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQV

Query:  EVWFQNRRAR
        EVWFQNRRAR
Subjt:  EVWFQNRRAR

AT5G06710.1 homeobox from Arabidopsis thaliana2.1e-1542.22Show/hide
Query:  SLVARESHLHQQTP-MAVLP----TTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSF
        ++V  E    +  P M+V P    T+S +LD  I   G++  S+       RD+D     DE E        ++ ++     RKKLRL+K+QS  LE SF
Subjt:  SLVARESHLHQQTP-MAVLP----TTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSF

Query:  RQNHTLNPKQKETLAEVLKLKPRQVEVWFQNRRAR
        +++ TLNPKQK  LA+ L L+PRQVEVWFQNRRAR
Subjt:  RQNHTLNPKQKETLAEVLKLKPRQVEVWFQNRRAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTAGTTTTGGACCACTTCGATGTACAAGGAGTTTACGAGGACAATCCTACATGAGCCCGAAAATAGGACCGGAAAATGGACTTCGAAGGCGAAACGGGCCAGAGGG
TCGGGCCAAGGTCAGAGGGATCGGGCCTGACCCGACCCCTACTCGGCCTCAGCCTTGGGTCGGGTCGAGGTCTTCTGTCTCCGTCTGGTCCCTGCTTTCTGACTTAAGCA
TCGGAGACAGTGTGGCAAGCACCACACCAGTGTGCAGGTTTTTCTATCTTACAGGCCATGTCTTCCCCTCTCAAACAAATTTATCGTTGGTGGCACGTGAAAGTCATCTT
CACCAACAGACTCCCATGGCGGTTCTACCAACCACCTCCTCAAGATTGGATCTCACCATCTCTGTTCCTGGCTTCGATTCTTTCTCATCACCACTTCCTCCATCTGTAGG
GAGGGATTTGGACATGAATAGAGCTCCAGATGAAGAGGAATGGATGGTGGGAAGCATGGAGGAAGATGAAGAAAATAACAGTGGAAGCCATCCAAGAAAGAAGCTGCGTT
TGACAAAGGAACAGTCTCATCTTCTTGAACAAAGCTTCAGACAAAACCATACCTTAAATCCAAAACAAAAGGAGACTCTGGCAGAAGTGTTGAAGTTGAAGCCAAGGCAG
GTTGAGGTTTGGTTTCAGAACCGAAGGGCCAGGTGGGAGCAAGCTAAAGCAAACAGAGATGGAATGCGAGTATCTAAAGAGATGGTTCGGGCTATTGACAGAACAGAACA
AGAGACTGCAGAAGGAGGTGGAGGAGTTGAGGGCGATGAAGGTGGCGCCGCCGACAGTTATTTCCCCGCACAGCTGCGAGCCACTGCCGGCGTCCAACCTCACAATGTGC
CCTCGCTGCGAGCGCGTGACCACCACTGCCCACGACAAGACCCGCATTGTCGTATATGGAAGTGTAGCTTTGAATATGTAACGCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTAGTTTTGGACCACTTCGATGTACAAGGAGTTTACGAGGACAATCCTACATGAGCCCGAAAATAGGACCGGAAAATGGACTTCGAAGGCGAAACGGGCCAGAGGG
TCGGGCCAAGGTCAGAGGGATCGGGCCTGACCCGACCCCTACTCGGCCTCAGCCTTGGGTCGGGTCGAGGTCTTCTGTCTCCGTCTGGTCCCTGCTTTCTGACTTAAGCA
TCGGAGACAGTGTGGCAAGCACCACACCAGTGTGCAGGTTTTTCTATCTTACAGGCCATGTCTTCCCCTCTCAAACAAATTTATCGTTGGTGGCACGTGAAAGTCATCTT
CACCAACAGACTCCCATGGCGGTTCTACCAACCACCTCCTCAAGATTGGATCTCACCATCTCTGTTCCTGGCTTCGATTCTTTCTCATCACCACTTCCTCCATCTGTAGG
GAGGGATTTGGACATGAATAGAGCTCCAGATGAAGAGGAATGGATGGTGGGAAGCATGGAGGAAGATGAAGAAAATAACAGTGGAAGCCATCCAAGAAAGAAGCTGCGTT
TGACAAAGGAACAGTCTCATCTTCTTGAACAAAGCTTCAGACAAAACCATACCTTAAATCCAAAACAAAAGGAGACTCTGGCAGAAGTGTTGAAGTTGAAGCCAAGGCAG
GTTGAGGTTTGGTTTCAGAACCGAAGGGCCAGGTGGGAGCAAGCTAAAGCAAACAGAGATGGAATGCGAGTATCTAAAGAGATGGTTCGGGCTATTGACAGAACAGAACA
AGAGACTGCAGAAGGAGGTGGAGGAGTTGAGGGCGATGAAGGTGGCGCCGCCGACAGTTATTTCCCCGCACAGCTGCGAGCCACTGCCGGCGTCCAACCTCACAATGTGC
CCTCGCTGCGAGCGCGTGACCACCACTGCCCACGACAAGACCCGCATTGTCGTATATGGAAGTGTAGCTTTGAATATGTAACGCCCTAG
Protein sequenceShow/hide protein sequence
MGSFGPLRCTRSLRGQSYMSPKIGPENGLRRRNGPEGRAKVRGIGPDPTPTRPQPWVGSRSSVSVWSLLSDLSIGDSVASTTPVCRFFYLTGHVFPSQTNLSLVARESHL
HQQTPMAVLPTTSSRLDLTISVPGFDSFSSPLPPSVGRDLDMNRAPDEEEWMVGSMEEDEENNSGSHPRKKLRLTKEQSHLLEQSFRQNHTLNPKQKETLAEVLKLKPRQ
VEVWFQNRRARWEQAKANRDGMRVSKEMVRAIDRTEQETAEGGGGVEGDEGGAADSYFPAQLRATAGVQPHNVPSLRARDHHCPRQDPHCRIWKCSFEYVTP