; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G000080 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G000080
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionEnhancer of polycomb-like protein
Genome locationchr06:88693..99340
RNA-Seq ExpressionLsi06G000080
SyntenyLsi06G000080
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0016573 - histone acetylation (biological process)
GO:0032777 - Piccolo NuA4 histone acetyltransferase complex (cellular component)
GO:0004402 - histone acetyltransferase activity (molecular function)
InterPro domainsIPR024943 - Enhancer of polycomb protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK10318.1 EPL1 domain-containing protein [Cucumis melo var. makuwa]4.7e-9687.44Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQR+HLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEF++LD+IAISR PRIRTSGS VEAN IMLPTE VKQEYRQQQLP GWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

XP_004135576.1 uncharacterized protein LOC101217797 isoform X1 [Cucumis sativus]5.6e-9788.37Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEFV+LD+IAISR PRIRTSGS VEAN IMLPTE VKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAA IVPPSDSSTRNSVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

XP_031739585.1 uncharacterized protein LOC101217797 isoform X2 [Cucumis sativus]5.6e-9788.37Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEFV+LD+IAISR PRIRTSGS VEAN IMLPTE VKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAA IVPPSDSSTRNSVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

XP_038878544.1 uncharacterized protein LOC120070744 isoform X1 [Benincasa hispida]4.3e-9788.37Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEF+ELDDIAISR PRIRTSGSFVEANTIM PTE VKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLI EKL AAAIVPPSDSSTR+SVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

XP_038878545.1 uncharacterized protein LOC120070744 isoform X2 [Benincasa hispida]4.3e-9788.37Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEF+ELDDIAISR PRIRTSGSFVEANTIM PTE VKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLI EKL AAAIVPPSDSSTR+SVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

TrEMBL top hitse value%identityAlignment
A0A0A0LW95 Enhancer of polycomb-like protein2.7e-9788.37Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEFV+LD+IAISR PRIRTSGS VEAN IMLPTE VKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAA IVPPSDSSTRNSVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

A0A1S3BQD4 Enhancer of polycomb-like protein2.3e-9687.44Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQR+HLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEF++LD+IAISR PRIRTSGS VEAN IMLPTE VKQEYRQQQLP GWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

A0A5A7U548 Enhancer of polycomb-like protein2.3e-9687.44Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQR+HLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEF++LD+IAISR PRIRTSGS VEAN IMLPTE VKQEYRQQQLP GWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

A0A5D3CGG8 Enhancer of polycomb-like protein2.3e-9687.44Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQR+HLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEF++LD+IAISR PRIRTSGS VEAN IMLPTE VKQEYRQQQLP GWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

A0A6J1DXJ8 Enhancer of polycomb-like protein1.2e-8982.79Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLHTRR                VRRNLEQAKTLLEALIKREEKKRDLM+S+V LQRVHLKYKHETELLEESLALPRF
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS
        LPFSCKFGSSEDEFVELDD  ISR PRIRTSGSF++AN +MLP+E +K EYR QQLPHGWLHKMDP+EPVLLFAKPL+ EKLAAAAIVPPSDSS RNSVS
Subjt:  LPFSCKFGSSEDEFVELDDIAISRAPRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVS

Query:  MGSHKFRGRIGRGSR
        MGSHKFRGRIGRG R
Subjt:  MGSHKFRGRIGRGSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16690.1 Enhancer of polycomb-like transcription factor protein4.1e-5354.34Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREKAHRLH RR                VRRNL+QAKT+LEALIKREEKKRD M S+V LQR+ LKYK+ETELLE+SLAL  F
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  -LPFSCKFGSSEDEFVELDDIAISRA---PRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTR
         L  + +FGSSEDEF++ DD   ++    P       F ++N        +KQE R+     GWLHK++P EPV+LF K L+ +KLAAA I+PPSD+ + 
Subjt:  -LPFSCKFGSSEDEFVELDDIAISRA---PRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTR

Query:  NSVSMGSHKFRGRIGRGSR
             G  +F+GR+GRG R
Subjt:  NSVSMGSHKFRGRIGRGSR

AT1G79020.1 Enhancer of polycomb-like transcription factor protein1.2e-5755.91Show/hide
Query:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF
        +QPPPPVNDTNPYNVFRPREK HRLHTRR                VRRNL QA+++LEALIKREEKKRD+M+S+V LQR+ L+Y+HETELLE+SLA+P F
Subjt:  MQPPPPVNDTNPYNVFRPREKAHRLHTRR----------------VRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRF

Query:  LP--FSCKFGSSEDEFVELDDIAISRA---PRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSST
         P   S KFGSS+DE ++ DD   +R    P +  S  F   N       G+KQE R++Q  HGWLH++DP EPV+LF KPL+ +KLAAA IVPP+  S 
Subjt:  LP--FSCKFGSSEDEFVELDDIAISRA---PRIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSST

Query:  RNSVSMGSHKFRGRIGRGSR
          S      +F+GRIGRG R
Subjt:  RNSVSMGSHKFRGRIGRGSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCCGCCTCCACCTGTCAATGATACAAACCCGTACAATGTCTTTAGGCCAAGGGAGAAAGCCCATAGACTTCACACAAGAAGGGTTAGGCGCAACCTGGAACAAGC
TAAAACCTTATTGGAGGCTTTAATTAAGAGGGAAGAGAAAAAGAGAGATCTGATGGAAAGTGATGTTGGCCTTCAGAGGGTTCATTTGAAGTATAAGCATGAAACTGAAC
TTCTGGAAGAGAGCTTAGCACTTCCCAGGTTCTTACCCTTCTCTTGTAAGTTTGGTTCAAGCGAGGATGAATTTGTGGAGTTGGATGATATAGCAATCAGTCGCGCCCCA
CGAATACGAACTTCTGGATCTTTCGTGGAGGCAAATACAATTATGCTCCCAACTGAAGGCGTGAAGCAAGAGTACAGGCAACAACAGTTGCCACATGGTTGGCTTCACAA
AATGGATCCACTGGAGCCAGTTTTGTTGTTTGCAAAACCTCTGATTACAGAGAAGTTGGCAGCTGCAGCGATAGTACCCCCATCAGATTCTTCGACAAGGAATAGTGTGT
CAATGGGTTCTCATAAATTCCGGGGAAGAATCGGCCGAGGCAGTCGAGGTTGGGCAAGAACCAGGCTTGCTCCGACGTCCCATCAGTGTCAAGCTAGGGCCAGTCTTAAG
GCTAACCCCGACTTGATTGGGGCATTAAGAGGATAG
mRNA sequenceShow/hide mRNA sequence
CCTGGTTCATGAGCGCCTGGAATGTCGCGGGGGCGTTGGTCAACCCGAATGGCATCACCAAAAACTCGTAATGTCCCTCGTGGGTCCAAAATGCTGTTTTCCTCACGTCT
TCATCGCGAACTCTGATCTGGTGGTAGCCTGATTTTAGATCGATCTTTGAGAAAACCTTCGCCCCATTCAGCTCATCCAACAGTTCATCAATCATCGGTATTGGAAATTT
GTCCGGCACGGTCACCGGTTTAACGCTCGGTAGTCTACACAGAAGCGCCAGCTTCCATCCTTCTTCTTCACTAATATCACGGGGCTGGAAAAGGGACTGATGCTGGGTCG
GATAATTCCCGTGGATAGCATCTCGTTGATTAGTCGCTCGATCTCGTTCTTCTGAGCATGGGGGTATCGATACGGCCTTACGTTGACAGGGTCGGCGCCCGTTTTTAATT
GGATTCGGTGGTCAATTCGCCTCATCAGTGGCAGCCCCTCAGGCATGTCAAACACGTCTGCATATTCCTGTTGTAATTGGTCAATCTCCAGTTGCAGCTCTTCAATTTCT
CCTCCCATTAATATGCCCCTTTCCTCTCTCGGAATTCCCAATGTTCGGAAGTCGACGAGGAAACCTTGGTCCTCGGGTTGCCACGACTTGGCCAGCACCTTCAGGGATAT
CTCCATTCGGGACAGGGTGGGGTCCCCTCTGATCATGATCTGCGAATCTCCTACCGCAAAGGTCATAGTCAAATTCTTCCAATCAACGGTCATCGAGCCCTGTTTCTGGA
GCCATTGCGTTCCCAAAACAACATCCAGATTTCCCAACTCCAGTGGTAGAAAATCATCAGCAAAAGAGAGCTCCGGTAGAGTCATGGATAAATTTCTACATATTCCTTTA
CCTTGTACTGCTTCTCCCGACACCATGATGACACCGTAGCTTGTAGTTTCCGCCAATGGTAGGTTCAGACTCTCGACAAGCCTTTGCGAAACGAAATTGTGGGTAGCTCC
ACAGTCCACTAGCACGATGATTTCTTGGTCCCCTAAATTCCCTTTCAACTTGAACGTACACGGAGTTGTTAGTCCCACCACCGAATTCAGCGACAATTCGATTACTGGGC
TTACCTCTAACATTGGACTGTCCATCTCCACATCCTCCATCACCACGTCGTCGAGTTCGTCCGCCACCACGTACAGTCGGAGTTCTTTATTTTTGCAGCGATGTCCTTTG
CTAAAAGGTTCGTCACACCGGTAGCATAGCCCTTTGTCTCTTCTGGCTTGCAATTCTGAATCTGTCCAGCGTCTCCAGGGAGGTTTCGTAGTCGGTGCTGGTGGGGGATT
TAGGATCTTTTCAGCCAGGGTGACGGTACGTGTCGTCATGCTCTCCGTCGTTTTGCCAGAGGCTTTAGGGTTGGGCTTCCAATCCTTTCCATATGGGCCTTGGGCCGTGC
GAGCTGTTTCAATCCGATCCTCTGCTAGTTGGGCCGCATCCATCATATCCTCCAAGCCCACAGCTCTCATGGAGAAAACCTCCGTTCGAATAACCGGGTCCAGCCCATTT
GTAAACGTGTTTATCAAAACGTTTTCAGCTAGGTCGGGCAGCGGCGCCGATAACTCCTCGAAGTTCTGGAGATAATCCATTACCGTCCCTTCTTGCTTAATGGCTAAGAA
CCGCGCGCATGGTGTTCCTCGCTGTCGAGGACGAAATCGGACATAAATCCTCTGTTTTAACTCCTTCCACGATCGAAATCGTTTTCTATTCTCTGCCCATCGGAACCAGT
TAAGAACCTTCCCTTCCAGACTGACAATAGCAATTTTTAACTTCTCTTTCTCCGTTAGCAAGTGCAACTGGAAATAATGTTCCGCTCTAAAGAACCACCCGTCCGGGTCA
TCACCGTTGAAAACCGGCATCTCCAGTTTTTTGAATTTGATTTTCTCTTGCCGGTCGGAGCTTTCCTTCATCTGGGACGATTCCCCCTCTTCCATCTCATCGTCCTCCAC
TTCTTCATCCGCGCGTATCTTCCTTTTCCCTATTGTGATTTCCGTTATTTCAGACGATCCTTGGGGCTTAGGTCGATCCCCATAGACGTCCTAGAACAACGAGTGTAAGG
CCCGAATGTCCTCAGCCATTTTTTCCACATTTTTCTCAACTATCGGTAGCCTTTTCAGATCTTGCTTCATGCTCCCGATTTCCGCCTCTGTTGCACTTAAACGTTCTTCC
ATTTGCTTCTGCGTCATCTTCTTCCACCTCCCCAGCGTTTCAACGGCTCTGATACCACTTGTAAGGATTCTTCCTCACTTTAATATTGATATCAAAACAGTTCACCAAAA
TACAGGGGAATGATCTCCGATTACGATTATAAAACAGAAAATATTAGCAACTACTACAGATAACCAACAGCCAAATAAGAGGGCTGCTCTCTCCTCTCTATGAACACTGT
TCATTCACCAAATTCAAGATAACTAACCCTAACCCAAAAAGACAAATATATATGCTAACATTCCCCTCAACAGCCATCCTTTTCTGGTCCCCACCCCTTCTAGGCAGTGC
TTGTCTGGGATGTAATCCCCTTCTTGCCCTTCCTTGAATAAACTTTCGTGATAGGAGGCCTAGCATAACATCAGCAAATTGTAAGAGTGAAATCTAAACAATTTTCCTAA
CCTCAAAATCTTCGAAATCACCATTTTGAAGCGCCTTGTTTAGCAAGAAACATAGCACTTTTTACCATTTTGAAGCGCCTTGTGATAGTGGATCCCCTTTTATCTTAAAC
TGAAGTTGTATAACTGGTGACGGTAAATCAAAGTGTGTAGGTAAAGACTTGGATGATACTATAGGATCGGTATCACAACCCTAGAGGGGGTGAATAGGGTTTATTTAAAC
TAATGAAAACTTTTTACTAAAGGTGGGCCCAATTAAACAAATTAATACACTTTTTGCTAATAAGTAATTTAAGCAAGAATTGTAGGTAAATAAATTCAATCCAAAATAAT
CAAGAAATTAAAACTAACACTTTTTAAGCAACCTATTTAATTCTAATAATAAGATTGGTAAATGATATGAAATTTTAAGAACTACCAATCAACATAAACAAATACGTGCA
ATTTATTTCAAGAATTAAAAAAAATCATGCAATGAAATATAATTACAAAAATAAACTAAGCAAGGGATAGAGAAATTGACACCGTGATTTTATAGCGGTTCGGCACAACT
CGACCTACATCCAGTTCCCAAGATTCTCTTGGATATGTCACTACGAACTTGACTCTTTCTAGGGGGGCTTAGAGTCGAACTGCTACAATGTTATTTTTTCGGGAGCAAGA
TAAAACCCAATCATTTCCATGATTGAGGATCAAACTGTTACAACACTCTTGTTATGGGTTTAAGAGTAACCCCTTACAATAGATTTAGAAATGAAGGACAACTTGACAAA
ACTCTATAAAAGAGTGAATTTACAAATTTTGAGCTCACAACAAATCAATCCTCACCATACATAAATCTCTCTCAAGAATAAGATAAAAAGAAGAAAATTGAAGCTTGGAG
AGAGCAACAATGGAGGCTTTGTGTTTTATGGAGGATTGAAAATCATGTAAAATTGTTATAGAAATGTGGAGAAGATGATGAGTTTAAAAAGAGAGGGAAGTTGGTGAAAT
CCAAATTCAAATCAGGCCATTGGATGTTGTTAAAAGAATATTGAATGGTTGAGATTAAAAGTAAAAAGTAATACCTTTTTTTTTTAATTTTAAAGTCAACTAGCCGTTAG
ACACAAACATGTATAATATTAAATTCATTTTCTTTTAAAGTCAATTAGCCGTTAGACATAAACATATATAATATTATATTCATTTTCTTTTTAAAACCAATTCAAAATCA
AAAGCCCACCATGTGTCATTCCTCTCATGCTCCAAGTGGCATCTTTCAATTGGGTCCACTTTGATGAATAAGCTGCCACATCATCACTTCGGGTATTTTTTTGTTAAACT
TGTTTTGGCTTTATGTTCATCGTTTGAGCTCCGATTTGAGTGATTCAAATTGTGTTGGAATCGTTGTCTCTGAGCTTTATCTAATAGACACTTCAAAATACTAAAATTGT
TAGTGAATAAAAACAGATTTGTTATCATCAAAATGTTAAATAATTAAATGATTAAAATTAATTAATTTGAGATTAAGGACCAAAAAGCATACCATTTGGATCAGGAGCTA
GTAGGAGTAGAAGATTGTTTATTAGTCTCCTAGTAGGAGATTTTTAAGATTTTGCAGCCTAGGAATATTTTCAATAGTCTTAATCGAAGCCTAATTGAAGCCTGATTGGA
AGTCCAAAAGATGTCTAGTGCAGGCAAATGAGTGCCATGATGCCAAACTCAAAGACAAGTTAGTATCCGAGAGGCTTAGGCATCAGGGAACAATAATGGTAGTACCACGA
CACCAGATGAATTAAAAATGAATCAGATAGAACACTTTACACACGTCTCAATGCAAAGAGCTATATCATGGAACGCAAATTCAGCAGAATGTAACTCTAGGTCCTTGTGT
CAATATCAGTTTGTCTTTTTTCTCGTTCAAAAGAAAAAGGAAAAAAGAATCACAGTGGTTATCTAGCTCATTATTGATCCTGTAGTCCTAGATCTTAAGAACATTTTTTC
AGGAAAGGTTTCAGTGGCCTTTTATGATTTTCATCTCAGGATTCTTTGGAGTAGGGAACATGATCTTGTCATCGCACAGGCTTTTGTGTTTTAGTCTACTTCAATTGTGA
TTTCTGTGTGATGCACTCTAGTCTAAATGACATTCGTGAGTTAATTGCTCCCATTTCTTATGCAGCCGCCTCCACCTGTCAATGATACAAACCCGTACAATGTCTTTAGG
CCAAGGGAGAAAGCCCATAGACTTCACACAAGAAGGGTTAGGCGCAACCTGGAACAAGCTAAAACCTTATTGGAGGCTTTAATTAAGAGGGAAGAGAAAAAGAGAGATCT
GATGGAAAGTGATGTTGGCCTTCAGAGGGTTCATTTGAAGTATAAGCATGAAACTGAACTTCTGGAAGAGAGCTTAGCACTTCCCAGGTTCTTACCCTTCTCTTGTAAGT
TTGGTTCAAGCGAGGATGAATTTGTGGAGTTGGATGATATAGCAATCAGTCGCGCCCCACGAATACGAACTTCTGGATCTTTCGTGGAGGCAAATACAATTATGCTCCCA
ACTGAAGGCGTGAAGCAAGAGTACAGGCAACAACAGTTGCCACATGGTTGGCTTCACAAAATGGATCCACTGGAGCCAGTTTTGTTGTTTGCAAAACCTCTGATTACAGA
GAAGTTGGCAGCTGCAGCGATAGTACCCCCATCAGATTCTTCGACAAGGAATAGTGTGTCAATGGGTTCTCATAAATTCCGGGGAAGAATCGGCCGAGGCAGTCGAGGTT
GGGCAAGAACCAGGCTTGCTCCGACGTCCCATCAGTGTCAAGCTAGGGCCAGTCTTAAGGCTAACCCCGACTTGATTGGGGCATTAAGAGGATAG
Protein sequenceShow/hide protein sequence
MQPPPPVNDTNPYNVFRPREKAHRLHTRRVRRNLEQAKTLLEALIKREEKKRDLMESDVGLQRVHLKYKHETELLEESLALPRFLPFSCKFGSSEDEFVELDDIAISRAP
RIRTSGSFVEANTIMLPTEGVKQEYRQQQLPHGWLHKMDPLEPVLLFAKPLITEKLAAAAIVPPSDSSTRNSVSMGSHKFRGRIGRGSRGWARTRLAPTSHQCQARASLK
ANPDLIGALRG