; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC01G013300 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC01G013300
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionRING-type E3 ubiquitin transferase
Genome locationCmU531Chr01:26440172..26445854
RNA-Seq ExpressionCmUC01G013300
SyntenyCmUC01G013300
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]2.4e-6548.81Show/hide
Query:  FIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDLLYDYYFPDIVKDDKETEF
        FI++FKRYGPPTF G SE+ATAAE+WI +LE+ + YL CED  KV+GA+FMLR EA                      FKDLLYDYY+ + VKD KE EF
Subjt:  FIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDLLYDYYFPDIVKDDKETEF

Query:  LHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSRWEASASSEFKRKSPPTLS
        LHL QG++SV Q ERKFTELSRFA +L+  +  +IKRF++GL + IRG V L+ P ++A A+R  LIMDK+ + K     S  E  +SS  KRK  PT +
Subjt:  LHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSRWEASASSEFKRKSPPTLS

Query:  DQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPT-ATQGGKQKAHIFALTKKEAEDTD
        D + +A    +      P+C  C K H  QCW   + CF+CG+E HFAR CP    ANT +   +  PT +TQG  Q+A +FALT+KEA D +
Subjt:  DQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPT-ATQGGKQKAHIFALTKKEAEDTD

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]4.5e-5645.55Show/hide
Query:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL
        GA  QQ +   I  E   FIR+FKR+GPP F G SE+ TAAE+W+ +LE+L+ YL C D  KVRGA+FML+ EA                      FKDL
Subjt:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL

Query:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR
        LY+YYFP  V+++K  EFL LTQ S+ V Q ERKFTELSRF    + T + +I +FI GL  +I+G++ LKEPTT+AAA+R  L+MDK   ++PQ   S+
Subjt:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR

Query:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPTATQGG
            +SS  KRK     S Q S+ H     +  T P C  C K+H   CW+ +RIC++C KEGHFAR C   G +NT     +   TAT  G
Subjt:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPTATQGG

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]1.5e-6746.73Show/hide
Query:  MPP-HGWRPRGGLDMPALPGDGANRQQRANPP----IPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA-
        MPP H  R R   D    P  G     +A PP     P     FI++FKRYGPPTF G SE+ATA E+WI +LE+L+ YL CED  KV+GA+FMLR EA 
Subjt:  MPP-HGWRPRGGLDMPALPGDGANRQQRANPP----IPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA-

Query:  ---------------------FKDLLYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTT
                             FK+LLYDYY+P+ VKD KE EFLHL QG++SV Q ERKFTELSRFA +L+ T   +IKRF++GL + IRG V L+ PTT
Subjt:  ---------------------FKDLLYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTT

Query:  FAAALRATLIMDKNAAKKPQATHSRWEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEA
        +A A+R  L+MDK+ + K        E  +SS  KRK P T +D   +A    +      P+C  C K H  QCW   + CF+CG+EGHFAR CP    A
Subjt:  FAAALRATLIMDKNAAKKPQATHSRWEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEA

Query:  NTDKPTPK-ALPTATQGGKQKAHIFALTKKEAEDTD
        NT +   +   P +TQG  Q+A +FALT+KEA D +
Subjt:  NTDKPTPK-ALPTATQGGKQKAHIFALTKKEAEDTD

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.9e-6246.33Show/hide
Query:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL
        GA  QQ     IP +   FIR+FK +GPP F G SE+ TAAE+W+ +LE+L+ YL C D  KVRGA+FMLR EA                      FKDL
Subjt:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL

Query:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR
        LY+YYFP I +++K  EFL LTQGS++V Q ERKFTELSRF    V T + +I +FI GL  +I+G++ LKEPTT+AAA+R  L+MDK   ++PQ   S+
Subjt:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR

Query:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANT---DKPTPKALPTATQGGKQKAH
            ++S  KRK     + Q+S+ H   + +    P+C  C K+H   CWL ++ICFKC KEGHF R C   G +NT    + TP A  TATQGG Q A 
Subjt:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANT---DKPTPKALPTATQGGKQKAH

Query:  IFALTKKEAEDTD
        +FALT+ + E  +
Subjt:  IFALTKKEAEDTD

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]2.2e-5543.23Show/hide
Query:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL
        GA  QQ     IP +   FIR+FKR+GPP F G SE+ TA E+W+ +LE+L+ YL C D  KVRGA+FMLR EA                      FKDL
Subjt:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL

Query:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR
        LY+YYFP  V+++K  EFL LTQGS++V Q ERKFTELSRF    + T + +I +FI GL  +I+G++ +KEPTT+AAA+R  L+MDK   ++PQ   S+
Subjt:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR

Query:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPTATQGGKQKAHIFA
            +SS  KRK     S Q+S+ H     +    P+C  C K+H   CWL +RICF+C K                   TP A   A QGG Q+A +FA
Subjt:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPTATQGGKQKAHIFA

Query:  LTKKEAEDTD
        LT+ + E  +
Subjt:  LTKKEAEDTD

TrEMBL top hitse value%identityAlignment
A0A6J1DL73 uncharacterized protein LOC1110221441.1e-6548.81Show/hide
Query:  FIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDLLYDYYFPDIVKDDKETEF
        FI++FKRYGPPTF G SE+ATAAE+WI +LE+ + YL CED  KV+GA+FMLR EA                      FKDLLYDYY+ + VKD KE EF
Subjt:  FIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDLLYDYYFPDIVKDDKETEF

Query:  LHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSRWEASASSEFKRKSPPTLS
        LHL QG++SV Q ERKFTELSRFA +L+  +  +IKRF++GL + IRG V L+ P ++A A+R  LIMDK+ + K     S  E  +SS  KRK  PT +
Subjt:  LHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSRWEASASSEFKRKSPPTLS

Query:  DQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPT-ATQGGKQKAHIFALTKKEAEDTD
        D + +A    +      P+C  C K H  QCW   + CF+CG+E HFAR CP    ANT +   +  PT +TQG  Q+A +FALT+KEA D +
Subjt:  DQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPT-ATQGGKQKAHIFALTKKEAEDTD

A0A6J1DNV8 uncharacterized protein LOC1110229252.2e-5645.55Show/hide
Query:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL
        GA  QQ +   I  E   FIR+FKR+GPP F G SE+ TAAE+W+ +LE+L+ YL C D  KVRGA+FML+ EA                      FKDL
Subjt:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL

Query:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR
        LY+YYFP  V+++K  EFL LTQ S+ V Q ERKFTELSRF    + T + +I +FI GL  +I+G++ LKEPTT+AAA+R  L+MDK   ++PQ   S+
Subjt:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR

Query:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPTATQGG
            +SS  KRK     S Q S+ H     +  T P C  C K+H   CW+ +RIC++C KEGHFAR C   G +NT     +   TAT  G
Subjt:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPTATQGG

A0A6J1DQB9 Reverse transcriptase9.0e-6346.33Show/hide
Query:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL
        GA  QQ     IP +   FIR+FK +GPP F G SE+ TAAE+W+ +LE+L+ YL C D  KVRGA+FMLR EA                      FKDL
Subjt:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL

Query:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR
        LY+YYFP I +++K  EFL LTQGS++V Q ERKFTELSRF    V T + +I +FI GL  +I+G++ LKEPTT+AAA+R  L+MDK   ++PQ   S+
Subjt:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR

Query:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANT---DKPTPKALPTATQGGKQKAH
            ++S  KRK     + Q+S+ H   + +    P+C  C K+H   CWL ++ICFKC KEGHF R C   G +NT    + TP A  TATQGG Q A 
Subjt:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANT---DKPTPKALPTATQGGKQKAH

Query:  IFALTKKEAEDTD
        +FALT+ + E  +
Subjt:  IFALTKKEAEDTD

A0A6J1DTA8 uncharacterized protein LOC1110241141.1e-5543.23Show/hide
Query:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL
        GA  QQ     IP +   FIR+FKR+GPP F G SE+ TA E+W+ +LE+L+ YL C D  KVRGA+FMLR EA                      FKDL
Subjt:  GANRQQRANPPIPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA----------------------FKDL

Query:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR
        LY+YYFP  V+++K  EFL LTQGS++V Q ERKFTELSRF    + T + +I +FI GL  +I+G++ +KEPTT+AAA+R  L+MDK   ++PQ   S+
Subjt:  LYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSR

Query:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPTATQGGKQKAHIFA
            +SS  KRK     S Q+S+ H     +    P+C  C K+H   CWL +RICF+C K                   TP A   A QGG Q+A +FA
Subjt:  WEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEANTDKPTPKALPTATQGGKQKAHIFA

Query:  LTKKEAEDTD
        LT+ + E  +
Subjt:  LTKKEAEDTD

A0A6J1DUM2 uncharacterized protein LOC1110232477.2e-6846.73Show/hide
Query:  MPP-HGWRPRGGLDMPALPGDGANRQQRANPP----IPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA-
        MPP H  R R   D    P  G     +A PP     P     FI++FKRYGPPTF G SE+ATA E+WI +LE+L+ YL CED  KV+GA+FMLR EA 
Subjt:  MPP-HGWRPRGGLDMPALPGDGANRQQRANPP----IPPEVPPFIRNFKRYGPPTFGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEA-

Query:  ---------------------FKDLLYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTT
                             FK+LLYDYY+P+ VKD KE EFLHL QG++SV Q ERKFTELSRFA +L+ T   +IKRF++GL + IRG V L+ PTT
Subjt:  ---------------------FKDLLYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFIRGLCEKIRGVVALKEPTT

Query:  FAAALRATLIMDKNAAKKPQATHSRWEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEA
        +A A+R  L+MDK+ + K        E  +SS  KRK P T +D   +A    +      P+C  C K H  QCW   + CF+CG+EGHFAR CP    A
Subjt:  FAAALRATLIMDKNAAKKPQATHSRWEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKEGHFARMCPSKGEA

Query:  NTDKPTPK-ALPTATQGGKQKAHIFALTKKEAEDTD
        NT +   +   P +TQG  Q+A +FALT+KEA D +
Subjt:  NTDKPTPK-ALPTATQGGKQKAHIFALTKKEAEDTD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAATTTAGAAAGGCAAGATAGAAAGCTTCCTTCTCAACCTGAGTCCGATGAGGATCATACTCCACTTGAGCCCACTTCCAGAGGAGGACCACCTCCTAAA
TCTACAAACGGAGTAAAATTTGACCCTCCTGTTTCTCTGAACTCTAATGTGTCTAAAGCTTCTTTTTCTTCTAGGTTGACAAAACCCAAGGTGGATGAAGTTGAC
AGAGAACTTCTTGGGTCATGCTTGATTCAGTTGAATGAATGTTCGTTTATTCATCCTTTAGGTGTAGTAGAAGACGTTTTAGTGCAAGTAAATGAGCTAATATTT
CCTAAAGATTTTTACATGCTAAAAATGGAAGAGTCTAGTTCTCCTTCATCTCCATCCATTTTGCTTGGTCGCCCCTTCATGAAGACGACCAAAACAAAAATAGAT
GTTGATGAAGGAACGTTATTGGTCGAGTTTGACAGGGAGATTATTGGCGTTATAGATTCTTTGGTACAGGAAGTAATTTGGGACACCTACGATGATGAGGATGAG
GATGAAAATTATGGAGAATGCACGATGCTCCTACTAGAATTAAAACAACTCCCTGAGCACTTGAAGAATGCTTATCTTAGAGAGGAGATAGCATTAAAAGATGGG
TCCAAGCCATGCCATCAGTCCCAAAGGCGTTTAAACTTAGCCTTGAGGGAGGTCGTGATGAAAAAGATCTTTAAGTTGCAAGAAGCAGGTAGTATCTATCTTATT
TCTGACAGTGAATGGAAACCAGGTATCCCTGTAGTTAGAAATGAAAAGCTAGATGTTCCTGTTAAGTTTCAAAACGAGTGGAGAATGTGTATTGATTTTCGGAAG
TTGAATAGTGGAAGCTCTTTTCCTGTTTCCTTGATGGTTTTTCAAGATTTTACCAAATTCCAATTTATCGGGATGATTAGGAAAAGACGACATTCACTTGTCCAT
TTGGAACTTTTGCATTCAGATGGATTCCTTTCGTTCTATGCAATGCCCCAGGCATATTTCAGAGATGACATGTTATTTCTGAACAGGGAATTGAAGTTGATTAAG
CAAAGATTGATGTTATTGTTAGCCTCTCATATGCCACAAATGTTCATTAAACATTTCAATAAGATTGCATTGCCGATGACCACCTTGCTGCAAAAAGATATAGAG
TTTAGTTTCAATGATGAATTCAAGCAAGCGTTTGGCAAAATTAAGGCGGCCCTAGATAGTGCTCCAATTGTGCAAGCTCCTAGGTGGGATTTCCCATTTAAGATT
ATGTGCAATGTGAGCAACTACGCTGTGGGAGCTGCCTTGGGCCAAAGGGAGTGCTTCTCTGAAACAGGTTATAGGAAGAATTTGGGATTTGTTGTGGCTGCTAGA
TATGTGGTTAAGTCGCCTAGAGTTAATGGTCTTCCTCAATCTCCTCTCCATCACCAGACGATGCCGCCACATGGATGGAGACCTAGAGGAGGCTTAGACATGCCT
GCGCTTCCCGGTGACGGAGCAAACAGGCAGCAAAGGGCGAACCCTCCCATCCCCCCAGAAGTTCCCCCATTTATAAGGAATTTCAAGCGCTATGGGCCTCCGACC
TTCGGCGGTGGGTCAGAGAAAGCTACGGCAGCTGAGCAGTGGATTGTAAAGCTGGAGTCATTGTTTGACTACCTAAATTGCGAGGATCATCTTAAGGTCAGAGGA
GCAATTTTCATGCTTCGAGACGAGGCGTTTAAAGACCTTCTATATGACTACTACTTTCCCGACATAGTGAAGGATGACAAGGAAACAGAGTTCCTGCATTTGACC
CAGGGTAGCATGTCGGTGATCCAAGATGAAAGGAAGTTCACTGAGCTGTCTCGTTTTGCACCCGACCTGGTGAGTACGTCAGAGAGGAGGATTAAGAGGTTCATC
AGGGGCCTGTGCGAGAAAATTAGAGGTGTGGTCGCTTTAAAGGAGCCGACGACTTTTGCTGCAGCGCTTAGGGCCACCCTGATCATGGACAAAAATGCGGCTAAG
AAACCTCAGGCGACACACTCACGTTGGGAGGCTAGCGCCTCATCTGAATTTAAAAGGAAGTCTCCCCCAACTCTGTCAGATCAAACTTCCAAGGCCCATCATCCG
ACCTCGGGTCAAGCTATCACCCTCCCATTGTGTAGCTTGTGCAACAAGCATCACTTGTGGCAATGCTGGCTAGACCAGAGGATTTGCTTCAAGTGTGGAAAGGAA
GGTCACTTTGCAAGAATGTGCCCAAGTAAAGGGGAGGCCAACACAGACAAGCCGACCCCGAAAGCCCTACCAACAGCTACTCAAGGAGGAAAACAAAAGGCACAC
ATCTTTGCACTGACCAAAAAGGAGGCTGAGGATACGGATTTTGGAACAACGGTTCCAATGCAATTTGAATCACTCAAATCGGAGTTCAAACGAAGAAGATATGAC
CAAAACAAGCTTAATGGGAAATTCTCGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAATTTAGAAAGGCAAGATAGAAAGCTTCCTTCTCAACCTGAGTCCGATGAGGATCATACTCCACTTGAGCCCACTTCCAGAGGAGGACCACCTCCTAAA
TCTACAAACGGAGTAAAATTTGACCCTCCTGTTTCTCTGAACTCTAATGTGTCTAAAGCTTCTTTTTCTTCTAGGTTGACAAAACCCAAGGTGGATGAAGTTGAC
AGAGAACTTCTTGGGTCATGCTTGATTCAGTTGAATGAATGTTCGTTTATTCATCCTTTAGGTGTAGTAGAAGACGTTTTAGTGCAAGTAAATGAGCTAATATTT
CCTAAAGATTTTTACATGCTAAAAATGGAAGAGTCTAGTTCTCCTTCATCTCCATCCATTTTGCTTGGTCGCCCCTTCATGAAGACGACCAAAACAAAAATAGAT
GTTGATGAAGGAACGTTATTGGTCGAGTTTGACAGGGAGATTATTGGCGTTATAGATTCTTTGGTACAGGAAGTAATTTGGGACACCTACGATGATGAGGATGAG
GATGAAAATTATGGAGAATGCACGATGCTCCTACTAGAATTAAAACAACTCCCTGAGCACTTGAAGAATGCTTATCTTAGAGAGGAGATAGCATTAAAAGATGGG
TCCAAGCCATGCCATCAGTCCCAAAGGCGTTTAAACTTAGCCTTGAGGGAGGTCGTGATGAAAAAGATCTTTAAGTTGCAAGAAGCAGGTAGTATCTATCTTATT
TCTGACAGTGAATGGAAACCAGGTATCCCTGTAGTTAGAAATGAAAAGCTAGATGTTCCTGTTAAGTTTCAAAACGAGTGGAGAATGTGTATTGATTTTCGGAAG
TTGAATAGTGGAAGCTCTTTTCCTGTTTCCTTGATGGTTTTTCAAGATTTTACCAAATTCCAATTTATCGGGATGATTAGGAAAAGACGACATTCACTTGTCCAT
TTGGAACTTTTGCATTCAGATGGATTCCTTTCGTTCTATGCAATGCCCCAGGCATATTTCAGAGATGACATGTTATTTCTGAACAGGGAATTGAAGTTGATTAAG
CAAAGATTGATGTTATTGTTAGCCTCTCATATGCCACAAATGTTCATTAAACATTTCAATAAGATTGCATTGCCGATGACCACCTTGCTGCAAAAAGATATAGAG
TTTAGTTTCAATGATGAATTCAAGCAAGCGTTTGGCAAAATTAAGGCGGCCCTAGATAGTGCTCCAATTGTGCAAGCTCCTAGGTGGGATTTCCCATTTAAGATT
ATGTGCAATGTGAGCAACTACGCTGTGGGAGCTGCCTTGGGCCAAAGGGAGTGCTTCTCTGAAACAGGTTATAGGAAGAATTTGGGATTTGTTGTGGCTGCTAGA
TATGTGGTTAAGTCGCCTAGAGTTAATGGTCTTCCTCAATCTCCTCTCCATCACCAGACGATGCCGCCACATGGATGGAGACCTAGAGGAGGCTTAGACATGCCT
GCGCTTCCCGGTGACGGAGCAAACAGGCAGCAAAGGGCGAACCCTCCCATCCCCCCAGAAGTTCCCCCATTTATAAGGAATTTCAAGCGCTATGGGCCTCCGACC
TTCGGCGGTGGGTCAGAGAAAGCTACGGCAGCTGAGCAGTGGATTGTAAAGCTGGAGTCATTGTTTGACTACCTAAATTGCGAGGATCATCTTAAGGTCAGAGGA
GCAATTTTCATGCTTCGAGACGAGGCGTTTAAAGACCTTCTATATGACTACTACTTTCCCGACATAGTGAAGGATGACAAGGAAACAGAGTTCCTGCATTTGACC
CAGGGTAGCATGTCGGTGATCCAAGATGAAAGGAAGTTCACTGAGCTGTCTCGTTTTGCACCCGACCTGGTGAGTACGTCAGAGAGGAGGATTAAGAGGTTCATC
AGGGGCCTGTGCGAGAAAATTAGAGGTGTGGTCGCTTTAAAGGAGCCGACGACTTTTGCTGCAGCGCTTAGGGCCACCCTGATCATGGACAAAAATGCGGCTAAG
AAACCTCAGGCGACACACTCACGTTGGGAGGCTAGCGCCTCATCTGAATTTAAAAGGAAGTCTCCCCCAACTCTGTCAGATCAAACTTCCAAGGCCCATCATCCG
ACCTCGGGTCAAGCTATCACCCTCCCATTGTGTAGCTTGTGCAACAAGCATCACTTGTGGCAATGCTGGCTAGACCAGAGGATTTGCTTCAAGTGTGGAAAGGAA
GGTCACTTTGCAAGAATGTGCCCAAGTAAAGGGGAGGCCAACACAGACAAGCCGACCCCGAAAGCCCTACCAACAGCTACTCAAGGAGGAAAACAAAAGGCACAC
ATCTTTGCACTGACCAAAAAGGAGGCTGAGGATACGGATTTTGGAACAACGGTTCCAATGCAATTTGAATCACTCAAATCGGAGTTCAAACGAAGAAGATATGAC
CAAAACAAGCTTAATGGGAAATTCTCGAACTGA
Protein sequenceShow/hide protein sequence
MSNLERQDRKLPSQPESDEDHTPLEPTSRGGPPPKSTNGVKFDPPVSLNSNVSKASFSSRLTKPKVDEVDRELLGSCLIQLNECSFIHPLGVVEDVLVQVNELIF
PKDFYMLKMEESSSPSSPSILLGRPFMKTTKTKIDVDEGTLLVEFDREIIGVIDSLVQEVIWDTYDDEDEDENYGECTMLLLELKQLPEHLKNAYLREEIALKDG
SKPCHQSQRRLNLALREVVMKKIFKLQEAGSIYLISDSEWKPGIPVVRNEKLDVPVKFQNEWRMCIDFRKLNSGSSFPVSLMVFQDFTKFQFIGMIRKRRHSLVH
LELLHSDGFLSFYAMPQAYFRDDMLFLNRELKLIKQRLMLLLASHMPQMFIKHFNKIALPMTTLLQKDIEFSFNDEFKQAFGKIKAALDSAPIVQAPRWDFPFKI
MCNVSNYAVGAALGQRECFSETGYRKNLGFVVAARYVVKSPRVNGLPQSPLHHQTMPPHGWRPRGGLDMPALPGDGANRQQRANPPIPPEVPPFIRNFKRYGPPT
FGGGSEKATAAEQWIVKLESLFDYLNCEDHLKVRGAIFMLRDEAFKDLLYDYYFPDIVKDDKETEFLHLTQGSMSVIQDERKFTELSRFAPDLVSTSERRIKRFI
RGLCEKIRGVVALKEPTTFAAALRATLIMDKNAAKKPQATHSRWEASASSEFKRKSPPTLSDQTSKAHHPTSGQAITLPLCSLCNKHHLWQCWLDQRICFKCGKE
GHFARMCPSKGEANTDKPTPKALPTATQGGKQKAHIFALTKKEAEDTDFGTTVPMQFESLKSEFKRRRYDQNKLNGKFSN