CuGenDBv2

Lag0004681 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0004681
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CAAX amino terminal protease
Genome location: chr6:6053563..6056119
RNA-Seq Expression: Lag0004681
Synteny: Lag0004681
Gene Ontology terms: GO:0071586 - CAAX-box protein processing (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
InterPro domains: IPR003675 - Type II CAAX prenyl endopeptidase Rce1-like
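
The genome location string above follows a `seqid:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it and reports the span. The names `Region` and `parse_location` are hypothetical helpers introduced here for illustration, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Matches strings such as "chr6:6053563..6056119".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Region:
    """Split a 'seqid:start..end' string into a Region."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(match["seqid"], start, end)


region = parse_location("chr6:6053563..6056119")
print(region.seqid, region.start, region.end, region.span)
```

Under the inclusive-coordinate assumption, the gene spans 2557 bp on chr6.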


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004143052.1 uncharacterized protein LOC101207590 isoform X1 [Cucumis sativus] | e value: 3.1e-112 | % identity: 88.58
Query:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV
        RS+VK KV ++RKSAR+LER REEVS TSSSAD NA++VKMNSSD S KN LINI SRSSVLQAC ITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEV
Subjt:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV

Query:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK
        SFSFE+ QLQLIIGLVVLISSSR+ LLK WPDFAESSEAANRQVLTSLQPLDY VVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFG+LHLGGGRK
Subjt:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK

Query:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        YSFAIWATFVGLAYGYATIE+SSIVVPMASHALNNLVGGILW YESRSL+N +D
Subjt:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD

XP_008444403.1 PREDICTED: uncharacterized protein LOC103487740 isoform X2 [Cucumis melo] | e value: 1.5e-111 | % identity: 87.8
Query:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV
        RS+VKPKV+++RKSAR+LER REE S TSSSAD NA++VKMNSSD S KN LINI SRSSVLQAC ITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEV
Subjt:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV

Query:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK
        SFSFE+ QLQLIIGLV LISSSR +LLKIWPDFAESSEAANRQVLTSL+PLDY VVA LPGISEELLFRGALIPLLGFNWASVVVTAAIFG+LHLGGGRK
Subjt:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK

Query:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        YSFAIWATFVGLAYGYATIE+SSIVVPMASHALNNLVGGILWRYES SL+N +D
Subjt:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD

XP_011649506.1 uncharacterized protein LOC101207590 isoform X2 [Cucumis sativus] | e value: 3.1e-112 | % identity: 88.58
Query:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV
        RS+VK KV ++RKSAR+LER REEVS TSSSAD NA++VKMNSSD S KN LINI SRSSVLQAC ITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEV
Subjt:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV

Query:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK
        SFSFE+ QLQLIIGLVVLISSSR+ LLK WPDFAESSEAANRQVLTSLQPLDY VVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFG+LHLGGGRK
Subjt:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK

Query:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        YSFAIWATFVGLAYGYATIE+SSIVVPMASHALNNLVGGILW YESRSL+N +D
Subjt:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD

XP_022131409.1 uncharacterized protein LOC111004632 isoform X1 [Momordica charantia] | e value: 5.2e-112 | % identity: 88.67
Query:  SSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVS
        S+VKP VY+RRKSARKLER  EEVSETS SAD NA+DVKMNSSD S KNSL NI SRSSVLQACTITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEVS
Subjt:  SSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVS

Query:  FSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISE---ELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGG
        FSFEM QLQLI GLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQP+DY VVAFLPGISE   ELLFRGALIPLLGFNWASV+VTAAIFGVLHLGGG
Subjt:  FSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISE---ELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGG

Query:  RKYSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        RKYSFAIWAT VGLAYGYATIE++S+VVPMASHALNNLVGGILW  ESRSL+NR+D
Subjt:  RKYSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD

XP_022131410.1 uncharacterized protein LOC111004632 isoform X2 [Momordica charantia] | e value: 1.2e-113 | % identity: 89.72
Query:  SSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVS
        S+VKP VY+RRKSARKLER  EEVSETS SAD NA+DVKMNSSD S KNSL NI SRSSVLQACTITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEVS
Subjt:  SSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVS

Query:  FSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRKY
        FSFEM QLQLI GLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQP+DY VVAFLPGISEELLFRGALIPLLGFNWASV+VTAAIFGVLHLGGGRKY
Subjt:  FSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRKY

Query:  SFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        SFAIWAT VGLAYGYATIE++S+VVPMASHALNNLVGGILW  ESRSL+NR+D
Subjt:  SFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
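
In BLAST-style alignments like those above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how identity could be tallied from that line, the snippet below counts the columns of one alignment block (the final block of the XP_004143052.1 hit). The function name `midline_stats` is a hypothetical helper. The page's reported % identity covers the whole alignment, so a single block is not expected to reproduce it.

```python
def midline_stats(midline: str) -> dict:
    """Tally identities and positives from a BLAST-style midline.

    Assumes the midline has the same number of columns as the aligned
    query and subject lines (gap columns appear as spaces).
    """
    length = len(midline)
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# Final alignment block of the XP_004143052.1 hit, copied from the record above.
QUERY = "YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD"
MIDLINE = "YSFAIWATFVGLAYGYATIE+SSIVVPMASHALNNLVGGILW YESRSL+N +D"

stats = midline_stats(MIDLINE)
print(stats)
```

To approximate the figure reported for a whole hit, the midlines of all of its blocks would be concatenated before counting.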

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LKL7 Uncharacterized protein | e value: 1.5e-112 | % identity: 88.58
Query:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV
        RS+VK KV ++RKSAR+LER REEVS TSSSAD NA++VKMNSSD S KN LINI SRSSVLQAC ITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEV
Subjt:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV

Query:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK
        SFSFE+ QLQLIIGLVVLISSSR+ LLK WPDFAESSEAANRQVLTSLQPLDY VVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFG+LHLGGGRK
Subjt:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK

Query:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        YSFAIWATFVGLAYGYATIE+SSIVVPMASHALNNLVGGILW YESRSL+N +D
Subjt:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD

A0A1S3BAB1 uncharacterized protein LOC103487740 isoform X2 | e value: 7.4e-112 | % identity: 87.8
Query:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV
        RS+VKPKV+++RKSAR+LER REE S TSSSAD NA++VKMNSSD S KN LINI SRSSVLQAC ITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEV
Subjt:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV

Query:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK
        SFSFE+ QLQLIIGLV LISSSR +LLKIWPDFAESSEAANRQVLTSL+PLDY VVA LPGISEELLFRGALIPLLGFNWASVVVTAAIFG+LHLGGGRK
Subjt:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK

Query:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        YSFAIWATFVGLAYGYATIE+SSIVVPMASHALNNLVGGILWRYES SL+N +D
Subjt:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD

A0A5A7V132 Uncharacterized protein | e value: 7.4e-112 | % identity: 87.8
Query:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV
        RS+VKPKV+++RKSAR+LER REE S TSSSAD NA++VKMNSSD S KN LINI SRSSVLQAC ITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEV
Subjt:  RSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEV

Query:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK
        SFSFE+ QLQLIIGLV LISSSR +LLKIWPDFAESSEAANRQVLTSL+PLDY VVA LPGISEELLFRGALIPLLGFNWASVVVTAAIFG+LHLGGGRK
Subjt:  SFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRK

Query:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        YSFAIWATFVGLAYGYATIE+SSIVVPMASHALNNLVGGILWRYES SL+N +D
Subjt:  YSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD

A0A6J1BPM6 uncharacterized protein LOC111004632 isoform X2 | e value: 6.0e-114 | % identity: 89.72
Query:  SSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVS
        S+VKP VY+RRKSARKLER  EEVSETS SAD NA+DVKMNSSD S KNSL NI SRSSVLQACTITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEVS
Subjt:  SSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVS

Query:  FSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRKY
        FSFEM QLQLI GLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQP+DY VVAFLPGISEELLFRGALIPLLGFNWASV+VTAAIFGVLHLGGGRKY
Subjt:  FSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRKY

Query:  SFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        SFAIWAT VGLAYGYATIE++S+VVPMASHALNNLVGGILW  ESRSL+NR+D
Subjt:  SFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD

A0A6J1BQ63 uncharacterized protein LOC111004632 isoform X1 | e value: 2.5e-112 | % identity: 88.67
Query:  SSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVS
        S+VKP VY+RRKSARKLER  EEVSETS SAD NA+DVKMNSSD S KNSL NI SRSSVLQACTITSGLIAALGVIIRQVSHVAS+EGLPVIDCTSEVS
Subjt:  SSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVS

Query:  FSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISE---ELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGG
        FSFEM QLQLI GLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQP+DY VVAFLPGISE   ELLFRGALIPLLGFNWASV+VTAAIFGVLHLGGG
Subjt:  FSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISE---ELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGG

Query:  RKYSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
        RKYSFAIWAT VGLAYGYATIE++S+VVPMASHALNNLVGGILW  ESRSL+NR+D
Subjt:  RKYSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G26085.1 CAAX amino terminal protease family protein | e value: 2.1e-71 | % identity: 56.86
Query:  KRSSVKPKVYSRRKSARKLERTREEVSE-------TSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLP
        +R     +  S RKS +KL+R  ++  +           +    E+ +++SS       +     R  VLQACT+TSGL+AALG+IIR+ SHVAS EGL 
Subjt:  KRSSVKPKVYSRRKSARKLERTREEVSE-------TSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLP

Query:  VIDCTSEVSFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGV
        V DC+ +V F FE   L LI G+VV ISSSR++LLK WPDFA+SSEAANRQ+LTSL+PLDY VVA LPGISEELLFRGAL+PL G NW  +V    IFG+
Subjt:  VIDCTSEVSFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGV

Query:  LHLGGGRKYSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESR
        LHLG GRKYSFA+WA+ VG+ YGYA + +SS++VPMASHALNNLVGG+LWRY S+
Subjt:  LHLGGGRKYSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESR

AT3G26085.2 CAAX amino terminal protease family protein | e value: 9.6e-72 | % identity: 58.78
Query:  SRRKSARKLERTREEVSE-------TSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVSF
        S RKS +KL+R  ++  +           +    E+ +++SS       +     R  VLQACT+TSGL+AALG+IIR+ SHVAS EGL V DC+ +V F
Subjt:  SRRKSARKLERTREEVSE-------TSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVSF

Query:  SFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRKYS
         FE   L LI G+VV ISSSR++LLK WPDFA+SSEAANRQ+LTSL+PLDY VVA LPGISEELLFRGAL+PL G NW  +V    IFG+LHLG GRKYS
Subjt:  SFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRKYS

Query:  FAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESR
        FA+WA+ VG+ YGYA + +SS++VPMASHALNNLVGG+LWRY S+
Subjt:  FAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESR

AT3G26085.3 CAAX amino terminal protease family protein | e value: 2.1e-71 | % identity: 56.86
Query:  KRSSVKPKVYSRRKSARKLERTREEVSE-------TSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLP
        +R     +  S RKS +KL+R  ++  +           +    E+ +++SS       +     R  VLQACT+TSGL+AALG+IIR+ SHVAS EGL 
Subjt:  KRSSVKPKVYSRRKSARKLERTREEVSE-------TSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLP

Query:  VIDCTSEVSFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGV
        V DC+ +V F FE   L LI G+VV ISSSR++LLK WPDFA+SSEAANRQ+LTSL+PLDY VVA LPGISEELLFRGAL+PL G NW  +V    IFG+
Subjt:  VIDCTSEVSFSFEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGV

Query:  LHLGGGRKYSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESR
        LHLG GRKYSFA+WA+ VG+ YGYA + +SS++VPMASHALNNLVGG+LWRY S+
Subjt:  LHLGGGRKYSFAIWATFVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESR


Sequences
CDS sequence
ATGAAGAGGTCGAGTGTAAAGCCAAAGGTTTATTCAAGGCGGAAATCCGCGAGGAAATTGGAAAGAACGCGTGAGGAAGTTTCTGAAACGTCCTCTTCTGCTGAT
GGTAATGCTGAAGATGTGAAGATGAACTCTTCTGATGGTTCTATCAAGAACAGCTTGATTAATATCCCCTCAAGGAGTTCTGTGCTTCAGGCTTGCACAATTACT
TCTGGTTTGATTGCTGCTCTGGGTGTAATAATTCGACAGGTATCTCATGTTGCATCGGTAGAGGGACTGCCAGTGATTGACTGCACCTCGGAGGTATCATTTAGT
TTTGAGATGGAGCAACTTCAGTTGATTATAGGACTGGTTGTTCTAATATCTTCATCTCGATATATACTGTTGAAGATATGGCCAGACTTTGCCGAGTCTAGTGAA
GCGGCCAATCGACAGGTGCTCACTTCTCTTCAACCATTAGATTACGGGGTAGTTGCATTTTTGCCCGGGATTAGCGAGGAATTGCTTTTCCGTGGCGCATTGATA
CCGCTCTTGGGATTCAACTGGGCAAGTGTCGTGGTGACAGCCGCCATTTTTGGCGTTCTACACTTGGGTGGTGGCCGGAAGTATTCATTTGCAATATGGGCAACT
TTTGTTGGACTTGCATATGGTTATGCGACTATTGAAACCTCCAGCATCGTTGTGCCGATGGCTTCTCATGCATTGAATAATCTAGTTGGAGGAATTCTGTGGCGC
TACGAATCAAGGTCTTTGAAGAATCGTGACGATTAA
mRNA sequence
ATGAAGAGGTCGAGTGTAAAGCCAAAGGTTTATTCAAGGCGGAAATCCGCGAGGAAATTGGAAAGAACGCGTGAGGAAGTTTCTGAAACGTCCTCTTCTGCTGAT
GGTAATGCTGAAGATGTGAAGATGAACTCTTCTGATGGTTCTATCAAGAACAGCTTGATTAATATCCCCTCAAGGAGTTCTGTGCTTCAGGCTTGCACAATTACT
TCTGGTTTGATTGCTGCTCTGGGTGTAATAATTCGACAGGTATCTCATGTTGCATCGGTAGAGGGACTGCCAGTGATTGACTGCACCTCGGAGGTATCATTTAGT
TTTGAGATGGAGCAACTTCAGTTGATTATAGGACTGGTTGTTCTAATATCTTCATCTCGATATATACTGTTGAAGATATGGCCAGACTTTGCCGAGTCTAGTGAA
GCGGCCAATCGACAGGTGCTCACTTCTCTTCAACCATTAGATTACGGGGTAGTTGCATTTTTGCCCGGGATTAGCGAGGAATTGCTTTTCCGTGGCGCATTGATA
CCGCTCTTGGGATTCAACTGGGCAAGTGTCGTGGTGACAGCCGCCATTTTTGGCGTTCTACACTTGGGTGGTGGCCGGAAGTATTCATTTGCAATATGGGCAACT
TTTGTTGGACTTGCATATGGTTATGCGACTATTGAAACCTCCAGCATCGTTGTGCCGATGGCTTCTCATGCATTGAATAATCTAGTTGGAGGAATTCTGTGGCGC
TACGAATCAAGGTCTTTGAAGAATCGTGACGATTAA
Protein sequence
MKRSSVKPKVYSRRKSARKLERTREEVSETSSSADGNAEDVKMNSSDGSIKNSLINIPSRSSVLQACTITSGLIAALGVIIRQVSHVASVEGLPVIDCTSEVSFS
FEMEQLQLIIGLVVLISSSRYILLKIWPDFAESSEAANRQVLTSLQPLDYGVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWAT
FVGLAYGYATIETSSIVVPMASHALNNLVGGILWRYESRSLKNRDD
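
The protein sequence above should follow from the CDS under the standard genetic code. As a minimal, dependency-free sketch of that relationship, the snippet below translates the first 27 and the last 15 nucleotides of the CDS. The function `translate` and the constants are hypothetical names introduced for illustration, and the standard codon table is an assumption, since the record does not name a translation table.

```python
from itertools import product

# Standard genetic code, laid out in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence codon by codon.

    Stops at the first stop codon when to_stop is True; otherwise stop
    codons are emitted as '*'.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unrecognised codon {codon!r} at position {i}")
        aa = CODON_TABLE[codon]
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


# Fragments copied from the CDS above: its first 27 nt and its last 15 nt.
CDS_START = "ATGAAGAGGTCGAGTGTAAAGCCAAAG"
CDS_END = "AATCGTGACGATTAA"

print(translate(CDS_START))
print(translate(CDS_END))
```

Passing the full CDS (with line breaks removed) to `translate` allows a direct comparison against the listed protein sequence.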