CuGenDBv2

Moc04g23650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g23650
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:17084380..17089730
RNA-Seq Expression: Moc04g23650
Synteny: Moc04g23650
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
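
The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for gene models, but not stated on this page), the span of the locus could be computed as below. `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a string such as 'chr4:17084380..17089730' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end

def span_length(location):
    """Length of the interval in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1

print(span_length("chr4:17084380..17089730"))
```

Under the inclusive-coordinate assumption the locus spans 5351 bp; with half-open coordinates the result would be one less.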


Homology
GenBank top hits (e value, % identity, alignment)
XP_022131661.1 uncharacterized protein LOC111004786 [Momordica charantia]; e value 3.7e-35; identity 59.42%
Query:  SFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFPPRHWLPPSGSDVKVNSDAACCSTLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGIL
        S  S PAL+ +WI++YFQ+Y+ AQ NK   PQ+ S   P   WLPP  S +KVNSDAAC ST TGLG I++DH G L+VAKS+FLP+ L PL AEIRGIL
Subjt:  SFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFPPRHWLPPSGSDVKVNSDAACCSTLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGIL

Query:  EALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDEL
        EALKLA SR+Y  LV+ES CQEAIR ++G+  ++ +EL
Subjt:  EALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDEL
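
In BLAST-style pairwise output such as the alignment above, the unlabeled middle row annotates each column: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below assumes that convention, and assumes percent identity is identities divided by alignment columns (the page does not state its formula). It works from the middle rows only, because the `Subjt` rows shown on this page repeat the `Query` rows. The function name is illustrative.

```python
def alignment_stats(query_rows, midline_rows):
    """Return (identities, positives, columns) for one alignment.

    query_rows   -- Query segments with the 'Query:' label removed
    midline_rows -- the matching middle rows with leading padding removed
    """
    columns = sum(len(row) for row in query_rows)  # gaps ('-') count as columns
    identities = sum(ch.isalpha() for row in midline_rows for ch in row)
    positives = identities + sum(row.count("+") for row in midline_rows)
    return identities, positives, columns

# Rows copied from the XP_022131661.1 alignment in this record.
query = (
    "SFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFPPRHWLPPSGSDVKVNSDAACCSTLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGIL",
    "EALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDEL",
)
midline = (
    "S  S PAL+ +WI++YFQ+Y+ AQ NK   PQ+ S   P   WLPP  S +KVNSDAAC ST TGLG I++DH G L+VAKS+FLP+ L PL AEIRGIL",
    "EALKLA SR+Y  LV+ES CQEAIR ++G+  ++ +EL",
)

identities, positives, columns = alignment_stats(query, midline)
print(f"{identities}/{columns} identical = {100 * identities / columns:.2f}%")
```

For this hit the middle rows hold 82 identities over 138 columns, which agrees with the 59.42 % identity reported for it.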

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]; e value 2.6e-33; identity 39.24%
Query:  NILESYGRASAPFGPHDFKVADLINTQGEWDVH--LVSNSFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFP-PRHWLPPSGSDVKVNSDAACC
        + LE +   +    P D  +A  I   G W+    L+     S    K  W+  +  S++ AQ +  + P+ +S+  P  ++W P S   +K+N+DAAC 
Subjt:  NILESYGRASAPFGPHDFKVADLINTQGEWDVH--LVSNSFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFP-PRHWLPPSGSDVKVNSDAACC

Query:  STLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGILEALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDELNWILEIRSLAIVFESISFIHAP
           T  G I++D S  L+ A SI +P  L+PLLAEIR ILE LK A + N+ HL +ES    AI+ I+   HT  DE NW++EI++L   F  ISF H+ 
Subjt:  STLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGILEALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDELNWILEIRSLAIVFESISFIHAP

Query:  HSCNVSVHFLAKWGI-SASTTCLWQSFFSTWLMDLVK
          CN + H LAKWGI S S T  W   F TWL+DLV+
Subjt:  HSCNVSVHFLAKWGI-SASTTCLWQSFFSTWLMDLVK

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]; e value 3.4e-33; identity 23.93%
Query:  MSASNLLEEWKNLKLTSNEEETIVDVDTTTSVEVNSKLYLSLIGKIFLKRHISCPVLKNTLRNAWKMDINSFDVAN------------------------
        M+  +LLEEWKN KLTS EEET +DVD +      S+L   L+GK+F+KR I+CPV+KNT+R AWK++ N+F+V +                        
Subjt:  MSASNLLEEWKNLKLTSNEEETIVDVDTTTSVEVNSKLYLSLIGKIFLKRHISCPVLKNTLRNAWKMDINSFDVAN------------------------

Query:  -------------------------------------IACRTKAMVIRLGNVLGSFEEADCD-----------VMRLISVGDPACGKLVLSQKPSPRAVA
                                             + C T+ M IRLGN LG FEEADCD           V  ++ +  P    + L+    P   A
Subjt:  -------------------------------------IACRTKAMVIRLGNVLGSFEEADCD-----------VMRLISVGDPACGKLVLSQKPSPRAVA

Query:  TLSRSFKT----CFTCSV----------------GIFSASSPADKQHSSDV-------------------SYGINTP--------PLELDLLKELKAKYA
         +   ++     C+ C +                G    + P  KQ   D+                   S G+ +         P+E  + +  K    
Subjt:  TLSRSFKT----CFTCSV----------------GIFSASSPADKQHSSDV-------------------SYGINTP--------PLELDLLKELKAKYA

Query:  VTNQGKDHILIEE-----NI-------PAIQSLSDQAHLLSNPSITKMDLN----------------------------------------GSQPKFLWE
         + QGK  +LI+E     N+       P ++S +       + S+T+MDL+                                        G     LW 
Subjt:  VTNQGKDHILIEE-----NI-------PAIQSLSDQAHLLSNPSITKMDLN----------------------------------------GSQPKFLWE

Query:  KRS---------------------------FKPFRFEEC---------------WTINQECEKIIYDI-GCWS---------------------------
          +                           FK   F  C               +  N     +  D  G WS                           
Subjt:  KRS---------------------------FKPFRFEEC---------------WTINQECEKIIYDI-GCWS---------------------------

Query:  ----------------------ATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSL
                                +F IIH+L++DLAGL+EL+EI+WKQRSRE+WLKWG                                         
Subjt:  ----------------------ATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSL

Query:  VEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIVG
                  +  ++ DI++I++L+PT I+ E++ +LL PYT+EEI++A++Q+ PTK+ G DGFP LFYQTYW++VG
Subjt:  VEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIVG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]; e value 1.4e-31; identity 31.94%
Query:  IHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTS
        I  L+  +  ++  +E +WKQRSR  WLK GD NTK+FH KAS R+  N I GIE   G+WT+   E+E    E+F+ +FT+S  S  +I + ++ +   
Subjt:  IHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTS

Query:  ISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIV---------------------------------------GVHFGPFSPSISH
        +S  ++ +L  P+T E+I  A+ Q+ PTK+PG DG P  FYQ +WN+V                                       G+ FG    +ISH
Subjt:  ISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIV---------------------------------------GVHFGPFSPSISH

Query:  LQFANDSLIFLKSTEAECLAIKNILESYGRA---------SAPFGPHDFKVADLINTQGEWDVHLVS--NSFFSNPALKSSWINTYFQ
        L FA+DSLIF++++  +C  +K I +SY  A         S+ F   + KV ++   Q  + +++VS    +   P++     +++FQ
Subjt:  LQFANDSLIFLKSTEAECLAIKNILESYGRA---------SAPFGPHDFKVADLINTQGEWDVHLVS--NSFFSNPALKSSWINTYFQ

XP_022158772.1 uncharacterized protein LOC111025237 [Momordica charantia]; e value 3.8e-40; identity 55.35%
Query:  KRSFKPFRFEECWTINQECEKIIYDIGCWSATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTEN
        KRS KP RFEE W  N ECE++I  +      +F IIH +++DLAGL+EL+EI+WKQRSRE+WLKWGD N KWFHKKA+MR++CN I+GIE   G WTE 
Subjt:  KRSFKPFRFEECWTINQECEKIIYDIGCWSATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTEN

Query:  LGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVK
          EI    + +F ++FTS++    DIDSI   +PT I+ EI+A+LL PY  EEI+IA+K
Subjt:  LGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVK

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]; e value 1.4e-31; identity 34.67%
Query:  IHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTS
        I S ++ +  ++  +E+YWKQRSR +WLK GD NTK+FH KAS R+  N I G+      W ++   +E+   E+F+++FT+S+ S   I++ +  +   
Subjt:  IHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTS

Query:  ISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIV--------------------------------GVHFGPFSPSISHLQFANDS
        ++LE++ +L  P+T+EE+  A+ Q+ PTK+PG DG P  F+Q +W+ V                                G+HFG     +SHL FA+DS
Subjt:  ISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIV--------------------------------GVHFGPFSPSISHLQFANDS

Query:  LIFLKSTEAECLAIKNILESYGRAS
        LIF ++   +C  +K + E YG+AS
Subjt:  LIFLKSTEAECLAIKNILESYGRAS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BQ49 uncharacterized protein LOC111004786; e value 1.8e-35; identity 59.42%
Query:  SFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFPPRHWLPPSGSDVKVNSDAACCSTLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGIL
        S  S PAL+ +WI++YFQ+Y+ AQ NK   PQ+ S   P   WLPP  S +KVNSDAAC ST TGLG I++DH G L+VAKS+FLP+ L PL AEIRGIL
Subjt:  SFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFPPRHWLPPSGSDVKVNSDAACCSTLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGIL

Query:  EALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDEL
        EALKLA SR+Y  LV+ES CQEAIR ++G+  ++ +EL
Subjt:  EALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDEL

A0A6J1DX30 uncharacterized protein LOC111024874; e value 1.3e-33; identity 39.24%
Query:  NILESYGRASAPFGPHDFKVADLINTQGEWDVH--LVSNSFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFP-PRHWLPPSGSDVKVNSDAACC
        + LE +   +    P D  +A  I   G W+    L+     S    K  W+  +  S++ AQ +  + P+ +S+  P  ++W P S   +K+N+DAAC 
Subjt:  NILESYGRASAPFGPHDFKVADLINTQGEWDVH--LVSNSFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFP-PRHWLPPSGSDVKVNSDAACC

Query:  STLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGILEALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDELNWILEIRSLAIVFESISFIHAP
           T  G I++D S  L+ A SI +P  L+PLLAEIR ILE LK A + N+ HL +ES    AI+ I+   HT  DE NW++EI++L   F  ISF H+ 
Subjt:  STLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGILEALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDELNWILEIRSLAIVFESISFIHAP

Query:  HSCNVSVHFLAKWGI-SASTTCLWQSFFSTWLMDLVK
          CN + H LAKWGI S S T  W   F TWL+DLV+
Subjt:  HSCNVSVHFLAKWGI-SASTTCLWQSFFSTWLMDLVK

A0A6J1DX30 uncharacterized protein LOC111024874; e value 1.7e-33; identity 23.93%
Query:  MSASNLLEEWKNLKLTSNEEETIVDVDTTTSVEVNSKLYLSLIGKIFLKRHISCPVLKNTLRNAWKMDINSFDVAN------------------------
        M+  +LLEEWKN KLTS EEET +DVD +      S+L   L+GK+F+KR I+CPV+KNT+R AWK++ N+F+V +                        
Subjt:  MSASNLLEEWKNLKLTSNEEETIVDVDTTTSVEVNSKLYLSLIGKIFLKRHISCPVLKNTLRNAWKMDINSFDVAN------------------------

Query:  -------------------------------------IACRTKAMVIRLGNVLGSFEEADCD-----------VMRLISVGDPACGKLVLSQKPSPRAVA
                                             + C T+ M IRLGN LG FEEADCD           V  ++ +  P    + L+    P   A
Subjt:  -------------------------------------IACRTKAMVIRLGNVLGSFEEADCD-----------VMRLISVGDPACGKLVLSQKPSPRAVA

Query:  TLSRSFKT----CFTCSV----------------GIFSASSPADKQHSSDV-------------------SYGINTP--------PLELDLLKELKAKYA
         +   ++     C+ C +                G    + P  KQ   D+                   S G+ +         P+E  + +  K    
Subjt:  TLSRSFKT----CFTCSV----------------GIFSASSPADKQHSSDV-------------------SYGINTP--------PLELDLLKELKAKYA

Query:  VTNQGKDHILIEE-----NI-------PAIQSLSDQAHLLSNPSITKMDLN----------------------------------------GSQPKFLWE
         + QGK  +LI+E     N+       P ++S +       + S+T+MDL+                                        G     LW 
Subjt:  VTNQGKDHILIEE-----NI-------PAIQSLSDQAHLLSNPSITKMDLN----------------------------------------GSQPKFLWE

Query:  KRS---------------------------FKPFRFEEC---------------WTINQECEKIIYDI-GCWS---------------------------
          +                           FK   F  C               +  N     +  D  G WS                           
Subjt:  KRS---------------------------FKPFRFEEC---------------WTINQECEKIIYDI-GCWS---------------------------

Query:  ----------------------ATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSL
                                +F IIH+L++DLAGL+EL+EI+WKQRSRE+WLKWG                                         
Subjt:  ----------------------ATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSL

Query:  VEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIVG
                  +  ++ DI++I++L+PT I+ E++ +LL PYT+EEI++A++Q+ PTK+ G DGFP LFYQTYW++VG
Subjt:  VEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIVG

A0A6J1DX30 uncharacterized protein LOC111024874; e value 3.1e-32; identity 29.69%
Query:  QSLSDQAHLLSNP--SITKMDLNGSQPKFL---WEKRS-----FKPFRFEECWTINQECEKIIYD------------------------IGCWS------
        + L++   L  NP   +  +D+  S  K L   WE  +      KPFRFEE WT ++ CE  I +                        +G WS      
Subjt:  QSLSDQAHLLSNP--SITKMDLNGSQPKFL---WEKRS-----FKPFRFEECWTINQECEKIIYD------------------------IGCWS------

Query:  ---------------------ATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSLV
                             + N   + SL+ ++  L   +E  W+QRSR NWL+ GD NT++FH +AS R+  N I G+   DGNW ++  E+   L 
Subjt:  ---------------------ATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSLV

Query:  EHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIV-------------------------
         ++ ++F   TS    ID+ +  VP  I+  ++A L   ++  E+  A+KQ+ P  +PG DG PPLFYQ YW++V                         
Subjt:  EHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIV-------------------------

Query:  ------------GVHFGPFSPSISHLQFANDSLIFLKSTEAECLAIKNILESYGRAS
                    GV      P I+HL FANDSL+F ++T  +C  I+ IL +Y RAS
Subjt:  ------------GVHFGPFSPSISHLQFANDSLIFLKSTEAECLAIKNILESYGRAS

A0A6J1DY29 uncharacterized protein LOC111025237; e value 1.8e-40; identity 55.35%
Query:  KRSFKPFRFEECWTINQECEKIIYDIGCWSATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTEN
        KRS KP RFEE W  N ECE++I  +      +F IIH +++DLAGL+EL+EI+WKQRSRE+WLKWGD N KWFHKKA+MR++CN I+GIE   G WTE 
Subjt:  KRSFKPFRFEECWTINQECEKIIYDIGCWSATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTEN

Query:  LGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVK
          EI    + +F ++FTS++    DIDSI   +PT I+ EI+A+LL PY  EEI+IA+K
Subjt:  LGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVK

M5XQU7 Uncharacterized protein; e value 4.5e-31; identity 39.58%
Query:  WEKRSFKPFRFEECWTINQECEKIIYDIGCWSATNFMI-----IHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERT
        W +RSF  FRFE  WT +++CE II +    S T  ++      H L   L  L+  +E +WKQRS+ +WLK GD NT++FH++AS R+  N + G+   
Subjt:  WEKRSFKPFRFEECWTINQECEKIIYDIGCWSATNFMI-----IHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERT

Query:  DGNWTENLGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIVG
         G W E+   ++  ++++F+++FTSS S S      +D V + ++ +++  LL  Y   EI  AV Q+ PTK+PG DG PP+F+Q YW+IVG
Subjt:  DGNWTENLGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIVG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein; e value 1.1e-10; identity 31.11%
Query:  EIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSLVEHFSNMFTS-STSSSPD-IDSIMDLVPTSISLEISARLLMPY
        E +++Q+SR  WL+ GD NT++FHK     QA N I  +   D    EN+ +++  +V +++++  S S   +PD +  I D+ P   +  +++RL    
Subjt:  EIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNSINGIERTDGNWTENLGEIERSLVEHFSNMFTS-STSSSPD-IDSIMDLVPTSISLEISARLLMPY

Query:  TQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIV
        + +EI  AV  +   K+PG D F   F+   W +V
Subjt:  TQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIV


Sequences
CDS sequence
ATGTCGGCCTCAAATTTGTTGGAGGAATGGAAGAATCTCAAGCTAACCTCTAATGAGGAGGAGACTATTGTAGATGTGGACACTACAACCTCTGTGGAGGTAAAT
TCAAAACTCTATCTAAGTTTAATTGGCAAAATTTTTCTCAAAAGACATATCTCTTGCCCTGTGTTGAAGAATACTCTGAGGAATGCATGGAAGATGGATATCAAC
TCTTTTGATGTAGCAAACATTGCTTGTAGGACTAAAGCAATGGTCATTCGTCTTGGTAATGTATTGGGCAGTTTTGAAGAAGCTGACTGTGATGTAATGAGGTTG
ATCTCTGTTGGGGATCCAGCTTGCGGGAAACTTGTTCTCTCGCAAAAACCATCCCCACGGGCTGTGGCTACGCTATCAAGGTCCTTTAAAACTTGTTTCACTTGC
TCTGTAGGAATCTTCTCCGCAAGCAGTCCCGCCGACAAACAGCACAGCTCCGACGTTAGTTACGGAATTAATACCCCACCTTTGGAGCTTGATTTGTTGAAGGAA
TTAAAGGCAAAATATGCTGTTACTAATCAAGGGAAAGATCATATTTTGATAGAAGAGAATATTCCAGCTATCCAGTCACTAAGTGACCAGGCACATTTATTATCA
AATCCCTCAATTACCAAAATGGATCTCAATGGATCTCAGCCCAAATTTCTCTGGGAAAAAAGGTCTTTTAAGCCTTTCCGGTTTGAGGAATGTTGGACCATAAAC
CAGGAATGTGAGAAAATTATTTATGATATTGGATGTTGGTCAGCCACAAATTTCATGATAATTCATTCTCTTCAGGACGATCTAGCAGGACTGATTGAATTGAAG
GAAATTTATTGGAAGCAACGGTCCAGGGAGAATTGGCTAAAATGGGGCGATGGAAACACTAAATGGTTTCATAAGAAAGCCTCTATGAGACAAGCCTGTAACTCT
ATAAATGGTATTGAGCGTACCGATGGCAACTGGACTGAGAATTTGGGCGAAATAGAGAGAAGTTTAGTTGAGCACTTTAGCAACATGTTTACCTCATCTACTTCT
TCATCTCCAGATATTGATTCCATTATGGACCTCGTTCCCACTTCAATATCTCTAGAGATTAGTGCTAGGCTTTTAATGCCTTACACACAAGAGGAGATTCAGATT
GCTGTTAAACAGATACTTCCTACAAAATCTCCGGGTCGTGATGGTTTCCCTCCACTATTTTATCAAACCTACTGGAATATTGTAGGTGTTCACTTTGGACCTTTT
AGTCCAAGCATTTCTCACCTTCAATTCGCTAATGATAGCCTTATCTTTCTTAAATCAACAGAAGCTGAATGTCTAGCCATTAAAAATATCTTGGAGAGTTATGGA
AGGGCTTCAGCTCCCTTTGGTCCTCATGATTTTAAAGTGGCAGATCTTATCAATACTCAAGGAGAATGGGACGTCCATCTAGTCTCTAACTCTTTCTTTTCCAAT
CCTGCTCTAAAGAGTTCTTGGATCAATACTTATTTTCAGAGTTACACACATGCTCAGGCAAATAAGGGAACCTTACCTCAGCGGAGATCAGACTCTTTTCCACCC
CGTCATTGGCTTCCTCCTTCCGGCTCAGATGTGAAGGTGAATTCTGATGCAGCTTGCTGTTCTACATTAACTGGATTAGGTTTTATTATGAAAGACCATTCGGGC
GATTTGATGGTGGCGAAATCCATTTTTCTACCGCTATCGTTGACCCCCCTTTTGGCTGAAATCAGAGGAATTTTGGAAGCCCTTAAGCTTGCTTGTTCGAGGAAT
TACCGTCACCTGGTTATGGAATCCGTTTGCCAAGAGGCTATTCGCCCGATCAAAGGAAATTTTCACACTCTAAGAGACGAATTGAATTGGATTTTGGAAATTCGT
TCGCTTGCCATAGTCTTCGAATCCATTTCTTTCATTCATGCTCCCCATTCTTGTAATGTTAGTGTGCACTTCTTAGCCAAATGGGGAATTTCTGCCTCTACCACG
TGTTTATGGCAATCTTTTTTCTCCACTTGGTTAATGGATCTTGTAAAAGGGATTCCCATTCAACCGCTGGCCTGGTGA
mRNA sequence
ATGTCGGCCTCAAATTTGTTGGAGGAATGGAAGAATCTCAAGCTAACCTCTAATGAGGAGGAGACTATTGTAGATGTGGACACTACAACCTCTGTGGAGGTAAAT
TCAAAACTCTATCTAAGTTTAATTGGCAAAATTTTTCTCAAAAGACATATCTCTTGCCCTGTGTTGAAGAATACTCTGAGGAATGCATGGAAGATGGATATCAAC
TCTTTTGATGTAGCAAACATTGCTTGTAGGACTAAAGCAATGGTCATTCGTCTTGGTAATGTATTGGGCAGTTTTGAAGAAGCTGACTGTGATGTAATGAGGTTG
ATCTCTGTTGGGGATCCAGCTTGCGGGAAACTTGTTCTCTCGCAAAAACCATCCCCACGGGCTGTGGCTACGCTATCAAGGTCCTTTAAAACTTGTTTCACTTGC
TCTGTAGGAATCTTCTCCGCAAGCAGTCCCGCCGACAAACAGCACAGCTCCGACGTTAGTTACGGAATTAATACCCCACCTTTGGAGCTTGATTTGTTGAAGGAA
TTAAAGGCAAAATATGCTGTTACTAATCAAGGGAAAGATCATATTTTGATAGAAGAGAATATTCCAGCTATCCAGTCACTAAGTGACCAGGCACATTTATTATCA
AATCCCTCAATTACCAAAATGGATCTCAATGGATCTCAGCCCAAATTTCTCTGGGAAAAAAGGTCTTTTAAGCCTTTCCGGTTTGAGGAATGTTGGACCATAAAC
CAGGAATGTGAGAAAATTATTTATGATATTGGATGTTGGTCAGCCACAAATTTCATGATAATTCATTCTCTTCAGGACGATCTAGCAGGACTGATTGAATTGAAG
GAAATTTATTGGAAGCAACGGTCCAGGGAGAATTGGCTAAAATGGGGCGATGGAAACACTAAATGGTTTCATAAGAAAGCCTCTATGAGACAAGCCTGTAACTCT
ATAAATGGTATTGAGCGTACCGATGGCAACTGGACTGAGAATTTGGGCGAAATAGAGAGAAGTTTAGTTGAGCACTTTAGCAACATGTTTACCTCATCTACTTCT
TCATCTCCAGATATTGATTCCATTATGGACCTCGTTCCCACTTCAATATCTCTAGAGATTAGTGCTAGGCTTTTAATGCCTTACACACAAGAGGAGATTCAGATT
GCTGTTAAACAGATACTTCCTACAAAATCTCCGGGTCGTGATGGTTTCCCTCCACTATTTTATCAAACCTACTGGAATATTGTAGGTGTTCACTTTGGACCTTTT
AGTCCAAGCATTTCTCACCTTCAATTCGCTAATGATAGCCTTATCTTTCTTAAATCAACAGAAGCTGAATGTCTAGCCATTAAAAATATCTTGGAGAGTTATGGA
AGGGCTTCAGCTCCCTTTGGTCCTCATGATTTTAAAGTGGCAGATCTTATCAATACTCAAGGAGAATGGGACGTCCATCTAGTCTCTAACTCTTTCTTTTCCAAT
CCTGCTCTAAAGAGTTCTTGGATCAATACTTATTTTCAGAGTTACACACATGCTCAGGCAAATAAGGGAACCTTACCTCAGCGGAGATCAGACTCTTTTCCACCC
CGTCATTGGCTTCCTCCTTCCGGCTCAGATGTGAAGGTGAATTCTGATGCAGCTTGCTGTTCTACATTAACTGGATTAGGTTTTATTATGAAAGACCATTCGGGC
GATTTGATGGTGGCGAAATCCATTTTTCTACCGCTATCGTTGACCCCCCTTTTGGCTGAAATCAGAGGAATTTTGGAAGCCCTTAAGCTTGCTTGTTCGAGGAAT
TACCGTCACCTGGTTATGGAATCCGTTTGCCAAGAGGCTATTCGCCCGATCAAAGGAAATTTTCACACTCTAAGAGACGAATTGAATTGGATTTTGGAAATTCGT
TCGCTTGCCATAGTCTTCGAATCCATTTCTTTCATTCATGCTCCCCATTCTTGTAATGTTAGTGTGCACTTCTTAGCCAAATGGGGAATTTCTGCCTCTACCACG
TGTTTATGGCAATCTTTTTTCTCCACTTGGTTAATGGATCTTGTAAAAGGGATTCCCATTCAACCGCTGGCCTGGTGA
Protein sequence
MSASNLLEEWKNLKLTSNEEETIVDVDTTTSVEVNSKLYLSLIGKIFLKRHISCPVLKNTLRNAWKMDINSFDVANIACRTKAMVIRLGNVLGSFEEADCDVMRL
ISVGDPACGKLVLSQKPSPRAVATLSRSFKTCFTCSVGIFSASSPADKQHSSDVSYGINTPPLELDLLKELKAKYAVTNQGKDHILIEENIPAIQSLSDQAHLLS
NPSITKMDLNGSQPKFLWEKRSFKPFRFEECWTINQECEKIIYDIGCWSATNFMIIHSLQDDLAGLIELKEIYWKQRSRENWLKWGDGNTKWFHKKASMRQACNS
INGIERTDGNWTENLGEIERSLVEHFSNMFTSSTSSSPDIDSIMDLVPTSISLEISARLLMPYTQEEIQIAVKQILPTKSPGRDGFPPLFYQTYWNIVGVHFGPF
SPSISHLQFANDSLIFLKSTEAECLAIKNILESYGRASAPFGPHDFKVADLINTQGEWDVHLVSNSFFSNPALKSSWINTYFQSYTHAQANKGTLPQRRSDSFPP
RHWLPPSGSDVKVNSDAACCSTLTGLGFIMKDHSGDLMVAKSIFLPLSLTPLLAEIRGILEALKLACSRNYRHLVMESVCQEAIRPIKGNFHTLRDELNWILEIR
SLAIVFESISFIHAPHSCNVSVHFLAKWGISASTTCLWQSFFSTWLMDLVKGIPIQPLAW
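
The CDS and protein sequences listed above can be cross-checked by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code (the page does not name the translation table used). `translate` is an illustrative helper, and the two fragments are the first ten codons and the final wrapped line of the CDS as displayed here, on the assumption that the final line starts on a codon boundary.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGTCGGCCTCAAATTTGTTGGAGGAATGG"
cds_end = "TGTTTATGGCAATCTTTTTTCTCCACTTGGTTAATGGATCTTGTAAAAGGGATTCCCATTCAACCGCTGGCCTGGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

Both fragments translate to the corresponding ends of the listed protein (`MSASNLLEEW` at the start, `...GIPIQPLAW` followed by a stop at the end), consistent with a 2073 nt CDS encoding 690 residues plus a stop codon.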