CuGenDBv2

Moc08g17540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g17540
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr8:13285469..13289364
RNA-Seq Expression: Moc08g17540
Synteny: Moc08g17540
Gene Ontology terms: NA
InterPro domains: NA
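
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and reports the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'chr:start..end' into (chromosome, start, end, span length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr8:13285469..13289364"))
```

Under the inclusive-coordinate assumption, the locus for Moc08g17540 spans 3896 bp on chr8.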


Homology
GenBank top hits | e value | %identity
KAE8668513.1 hypothetical protein F3Y22_tig00112304pilonHSYRG00002 [Hibiscus syriacus] | 4.7e-23 | 29.68
Query:  LILLDLEAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDY
        L  L L+AQ    I  I  + R P +SHLF+AD+SL+F K S+TE+   K IL  YEKASGQKVN++KS+++FSP      +      + +    + G Y
Subjt:  LILLDLEAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDY

Query:  LGLPSSLTRSKSRDFDSIKDRTPTFSVLICPRRRQGFGR-----GRRKRRITSIGEA---------GKRCA---GRKRV------------------RWS
        LGLP  + + K   F+ IKDRT         +R QG+ +     G R+  I S+ +A         G  CA   G + V                  RW 
Subjt:  LGLPSSLTRSKSRDFDSIKDRTPTFSVLICPRRRQGFGR-----GRRKRRITSIGEA---------GKRCA---GRKRV------------------RWS

Query:  RDLLLLGLRKVVGS---GTTIDFYRDP---WIPRESRFRRFSSNPNPETQVWVKDF---------ITPW-----FSWNI---PKLGEVVLDADMELIKKI
         D ++  LR         T I   R+    W    S F    S P     +W+            I  W      SW I    KL     D  + L+  I
Subjt:  RDLLLLGLRKVVGS---GTTIDFYRDP---WIPRESRFRRFSSNPNPETQVWVKDF---------ITPW-----FSWNI---PKLGEVVLDADMELIKKI

Query:  PIGSQALWNDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLR-LRSQAVRHPRANSWELPPEGLNKINVDG-FCGNNR-LGIGVVVRDADLELSATMV
               WN RN  V  G++    +        VEEF    ++  +    VR  R   W  P +   K+NVDG FC  NR   IGVV RD+   + A + 
Subjt:  PIGSQALWNDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLR-LRSQAVRHPRANSWELPPEGLNKINVDG-FCGNNR-LGIGVVVRDADLELSATMV

Query:  DV--------HAKLYAIQEGLSFARSLNKRRVVVEADS
         +         A++ A  EG+  A      RV++E D+
Subjt:  DV--------HAKLYAIQEGLSFARSLNKRRVVVEADS

VVA36912.1 PREDICTED: reverse mRNAase [Prunus dulcis] | 9.5e-24 | 26.15
Query:  RRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRSK
        +RI  +S A   P +S+LFFAD+S++FC A ++++     IL  YE+ASGQ +NFDKSA  F+P   P  K  +  ++ +++V     Y+GLP+   R+K
Subjt:  RRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRSK

Query:  SRDFDSIKD--------------------RTPTFSVLICPRRRQG--------FGRGRRKRRITSIGE------------------------AGKRCAGR
        ++ F+ ++D                    RT   S + C     G         G+G R     S+ E                         G+    R
Subjt:  SRDFDSIKD--------------------RTPTFSVLICPRRRQG--------FGRGRRKRRITSIGE------------------------AGKRCAGR

Query:  --------------------KRVRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELI
                            K + W R+++   L   VGSG +I  ++D WIP+   F     N    +  WV D ITP+  WN   L     + D E I
Subjt:  --------------------KRVRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELI

Query:  KKIPIGSQALWNDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLRLRSQAVRHPRANSWELPPEGLNKINVDGFCGNNRLGIGVVVRDADLELSATMV
         KI +G+ A   D   W                      F+ N QL+L    +R  R +   +      +  V G   +  L    +  D DL+  A + 
Subjt:  KKIPIGSQALWNDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLRLRSQAVRHPRANSWELPPEGLNKINVDGFCGNNRLGIGVVVRDADLELSATMV

Query:  DVHAKLYAIQEGLSFARSLNKRRVVVEADSRIADSV
         +   L AI+EGL   R+    R V+ +DS+ A S+
Subjt:  DVHAKLYAIQEGLSFARSLNKRRVVVEADSRIADSV

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | 4.1e-27 | 31.79
Query:  SRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRS
        S R++ I        I+HL FAD+SL+F ++  +E  A + +L  Y +ASGQ +NF KSAL FSP VHP  + Y++ ++++K+V   G+YLGLPS  TR 
Subjt:  SRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRS

Query:  KSRDFDSIKDRTPTFSVLICP--------RRRQGFGRGRRKRRI----------------------TSIGEAGKRCAGR---KRVRWSRDLLLLGLRKVV
        +    +S K     +  +  P        R  +GF +    + +                      TS+ +A          K   W RDLL+ GLR  V
Subjt:  KSRDFDSIKDRTPTFSVLICP--------RRRQGFGRGRRKRRI----------------------TSIGEAGKRCAGR---KRVRWSRDLLLLGLRKVV

Query:  GSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELIKKIPIGSQALWNDRNSWV
        G+G+TI  + DPW+PR + F+    N N      V  FIT   +W++  +     + D +LI  +PI S   +N ++SW+
Subjt:  GSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELIKKIPIGSQALWNDRNSWV

XP_030483193.1 uncharacterized protein LOC115699787 [Cannabis sativa] | 2.0e-26 | 26.92
Query:  EAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSS
        + Q  R    I +A R P ISHLF AD+SL+F  AS       K IL DY  ASGQ VNF KSAL+FSP   P  +N + +++ + V  ++  YLGLP +
Subjt:  EAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSS

Query:  LTRSKSRDFDSIKDRT----------------------------PTFSVLICPRRRQGF-------------GRGRRKRRI------TSIGEAGKRCAGR
          R+K   F+ IKDR                             P+++ + C R    F             G     R+I       + G+  +     
Subjt:  LTRSKSRDFDSIKDRT----------------------------PTFSVLICPRRRQGF-------------GRGRRKRRI------TSIGEAGKRCAGR

Query:  KRVRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESR--FRRFSSNPNPETQVWVKDFITPWFSWN---IPKLGEVVLDADMELIKKIPIGSQAL----W
          + W ++LL  GLR+ +G GTTI  + D WIP   +  + R+ S+ N    + V D +TP   WN   I           M  +  +P+ +  L    W
Subjt:  KRVRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESR--FRRFSSNPNPETQVWVKDFITPWFSWN---IPKLGEVVLDADMELIKKIPIGSQAL----W

Query:  NDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLRLRSQAVRHP---RANSWELPPEGLNKINVDGFCGN--NRLGIGVVVR----DADLELSATMVDV
        N RNSW+  G           +  Y++++ S N+    ++ ++        S    P   ++++VD    +  N++G G+ ++    +  L L+     +
Subjt:  NDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLRLRSQAVRHP---RANSWELPPEGLNKINVDGFCGN--NRLGIGVVVR----DADLELSATMVDV

Query:  HAKLYAIQEGLSFARS
        +  L    + L FA S
Subjt:  HAKLYAIQEGLSFARS

XP_030505962.1 uncharacterized protein LOC115720894 [Cannabis sativa] | 7.5e-21 | 30
Query:  LLDLEAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLG
        LL  E ++  R+   +++ + P ISHLFFAD+SL+F +A        K  L  Y +ASGQ++N DKS + FSP   P  +N  ++++ M + +    YLG
Subjt:  LLDLEAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLG

Query:  LPSSLTRSKSRDFDSIKDR----------------------------TPTFSVLIC-----------PRRRQGFGRGRRKRRITSIGEAGKRCAGRKR--
        L +   R K + F+ IK+R                             PT+ V+ C            +    F  G  K +  +     K     K   
Subjt:  LPSSLTRSKSRDFDSIKDR----------------------------TPTFSVLIC-----------PRRRQGFGRGRRKRRITSIGEAGKRCAGRKR--

Query:  -VRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELIKKIPIGS-----QALWNDRNS
         + W RDLL+ GLR  +G G+++    DPWIPR S F  F  + +P  Q  V  +IT    WN+P L       D++ I  IP+ S     Q +W+  NS
Subjt:  -VRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELIKKIPIGS-----QALWNDRNS

TrEMBL top hits | e value | %identity
A0A2N9H680 Reverse transcriptase domain-containing protein | 1.6e-21 | 29.01
Query:  EAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSS
        +A+  ++I  I++  R P +SHLFFAD+S++FC+AS  +      IL  YE+ASGQK+N +K+A +FS     + ++ + S+           YLGLP  
Subjt:  EAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSS

Query:  LTRSKSRDFDSIKDR----------------------------TPTFSVL-----------ICP-RRRQGFGRGRRKRRI--------TSIGEAGKRCAG
        L RSK R F+ IKDR                             P +++            IC    R  +G+   +R+I        TS  EA  + +G
Subjt:  LTRSKSRDFDSIKDR----------------------------TPTFSVL-----------ICP-RRRQGFGRGRRKRRI--------TSIGEAGKRCAG

Query:  RKRVRW-----SRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITP-WFSWNIPKLGEVVLDADMELIKKIPIG-----SQ
             W     +R +L  GLR  VG+GT I  ++D W+P  S FR  S   +  ++  V   I      W++  L ++ L  D+E+IK+IP+       +
Subjt:  RKRVRW-----SRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITP-WFSWNIPKLGEVVLDADMELIKKIPIG-----SQ

Query:  ALWNDRNSWVSKGKVPDVVSKATW
         +W    S +    VP  V    W
Subjt:  ALWNDRNSWVSKGKVPDVVSKATW

A0A5E4GBC8 PREDICTED: reverse mRNAase | 4.6e-24 | 26.15
Query:  RRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRSK
        +RI  +S A   P +S+LFFAD+S++FC A ++++     IL  YE+ASGQ +NFDKSA  F+P   P  K  +  ++ +++V     Y+GLP+   R+K
Subjt:  RRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRSK

Query:  SRDFDSIKD--------------------RTPTFSVLICPRRRQG--------FGRGRRKRRITSIGE------------------------AGKRCAGR
        ++ F+ ++D                    RT   S + C     G         G+G R     S+ E                         G+    R
Subjt:  SRDFDSIKD--------------------RTPTFSVLICPRRRQG--------FGRGRRKRRITSIGE------------------------AGKRCAGR

Query:  --------------------KRVRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELI
                            K + W R+++   L   VGSG +I  ++D WIP+   F     N    +  WV D ITP+  WN   L     + D E I
Subjt:  --------------------KRVRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELI

Query:  KKIPIGSQALWNDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLRLRSQAVRHPRANSWELPPEGLNKINVDGFCGNNRLGIGVVVRDADLELSATMV
         KI +G+ A   D   W                      F+ N QL+L    +R  R +   +      +  V G   +  L    +  D DL+  A + 
Subjt:  KKIPIGSQALWNDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLRLRSQAVRHPRANSWELPPEGLNKINVDGFCGNNRLGIGVVVRDADLELSATMV

Query:  DVHAKLYAIQEGLSFARSLNKRRVVVEADSRIADSV
         +   L AI+EGL   R+    R V+ +DS+ A S+
Subjt:  DVHAKLYAIQEGLSFARSLNKRRVVVEADSRIADSV

A0A6A2X1A4 Uncharacterized protein | 2.3e-23 | 29.68
Query:  LILLDLEAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDY
        L  L L+AQ    I  I  + R P +SHLF+AD+SL+F K S+TE+   K IL  YEKASGQKVN++KS+++FSP      +      + +    + G Y
Subjt:  LILLDLEAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDY

Query:  LGLPSSLTRSKSRDFDSIKDRTPTFSVLICPRRRQGFGR-----GRRKRRITSIGEA---------GKRCA---GRKRV------------------RWS
        LGLP  + + K   F+ IKDRT         +R QG+ +     G R+  I S+ +A         G  CA   G + V                  RW 
Subjt:  LGLPSSLTRSKSRDFDSIKDRTPTFSVLICPRRRQGFGR-----GRRKRRITSIGEA---------GKRCA---GRKRV------------------RWS

Query:  RDLLLLGLRKVVGS---GTTIDFYRDP---WIPRESRFRRFSSNPNPETQVWVKDF---------ITPW-----FSWNI---PKLGEVVLDADMELIKKI
         D ++  LR         T I   R+    W    S F    S P     +W+            I  W      SW I    KL     D  + L+  I
Subjt:  RDLLLLGLRKVVGS---GTTIDFYRDP---WIPRESRFRRFSSNPNPETQVWVKDF---------ITPW-----FSWNI---PKLGEVVLDADMELIKKI

Query:  PIGSQALWNDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLR-LRSQAVRHPRANSWELPPEGLNKINVDG-FCGNNR-LGIGVVVRDADLELSATMV
               WN RN  V  G++    +        VEEF    ++  +    VR  R   W  P +   K+NVDG FC  NR   IGVV RD+   + A + 
Subjt:  PIGSQALWNDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLR-LRSQAVRHPRANSWELPPEGLNKINVDG-FCGNNR-LGIGVVVRDADLELSATMV

Query:  DV--------HAKLYAIQEGLSFARSLNKRRVVVEADS
         +         A++ A  EG+  A      RV++E D+
Subjt:  DV--------HAKLYAIQEGLSFARSLNKRRVVVEADS

A0A6J1DX30 uncharacterized protein LOC111024874 | 2.0e-27 | 31.79
Query:  SRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRS
        S R++ I        I+HL FAD+SL+F ++  +E  A + +L  Y +ASGQ +NF KSAL FSP VHP  + Y++ ++++K+V   G+YLGLPS  TR 
Subjt:  SRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRS

Query:  KSRDFDSIKDRTPTFSVLICP--------RRRQGFGRGRRKRRI----------------------TSIGEAGKRCAGR---KRVRWSRDLLLLGLRKVV
        +    +S K     +  +  P        R  +GF +    + +                      TS+ +A          K   W RDLL+ GLR  V
Subjt:  KSRDFDSIKDRTPTFSVLICP--------RRRQGFGRGRRKRRI----------------------TSIGEAGKRCAGR---KRVRWSRDLLLLGLRKVV

Query:  GSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELIKKIPIGSQALWNDRNSWV
        G+G+TI  + DPW+PR + F+    N N      V  FIT   +W++  +     + D +LI  +PI S   +N ++SW+
Subjt:  GSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELIKKIPIGSQALWNDRNSWV

A0A803P400 Uncharacterized protein | 1.9e-22 | 30.3
Query:  ISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRSKSRDFD
        + +A   P ISHLFFAD+SL+F +A+       K +L DY  ASGQ VNFDKS+L+FSP    + K+ + +++ + +   +  YLGLP S  RSK   F+
Subjt:  ISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKVVQDLGDYLGLPSSLTRSKSRDFD

Query:  SIKDRTPTFSVLICPRRRQGFGRGRRKRRITSIGEAGKRCAGRKRVRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFI
         + DR       +C                               + W ++LL +G+RK +G GTT   + + WIP   +    +   +  ++  V D I
Subjt:  SIKDRTPTFSVLICPRRRQGFGRGRRKRRITSIGEAGKRCAGRKRVRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFRRFSSNPNPETQVWVKDFI

Query:  TPWFSWNIPKLGEVV---LDADMELIKKIPI
        TP   W++P +  +    +   ++ I  IPI
Subjt:  TPWFSWNIPKLGEVV---LDADMELIKKIPI
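
In the alignments above, the unlabeled line between each Query and Subjt line is the BLAST-style match line: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a rough sketch, the snippet below tallies these for a single alignment block. The function name `midline_stats` is hypothetical, and the percentages it returns cover only the block passed in, so they are not expected to reproduce the whole-alignment %identity values listed for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    `query` is the aligned query line (gaps included) and `midline` is
    the match line beneath it. Trailing blanks on the match line are
    often lost in plain text, so it is padded to the query length.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment block")
    padded = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in padded)   # letters = identical residues
    positives = identities + padded.count("+")        # '+' = conservative substitution
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100 * identities / length,
    }


if __name__ == "__main__":
    # Toy block, not taken from the record above.
    print(midline_stats("ACDEF", "AC+ F"))
```

To score a full hit, the counts from every block of that hit would need to be summed before dividing by the total alignment length.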

SwissProt top hits | e value | %identity
No hits found
Arabidopsis top hits | e value | %identity
No hits found

Sequences
CDS sequence
ATGCCAGTAGACTTGAATGCTGTCTTAAGGAGTGTGCTGATACTCTTGGACCTTGAGGCCCAAATGTCAAGGCGGATTTCAGATATTTCTCTTGCCCCTAGATGT
CCTTTTATATCGCATCTCTTTTTTGCAGACAACAGTTTGGTTTTTTGTAAGGCTTCCATAACGGAGTTGTATGCCTTTAAATTGATTTTGATCGATTATGAGAAA
GCATCTGGCCAGAAGGTTAATTTTGATAAATCGGCATTGTGGTTTTCTCCATATGTCCATCCAAATTTTAAAAATTATATGCGCTCAGTTATGGACATGAAGGTT
GTTCAAGACCTGGGAGATTATCTGGGGCTTCCTTCGAGCCTTACAAGAAGTAAAAGCAGGGATTTTGATTCGATAAAGGACAGGACTCCCACATTCTCTGTGCTG
ATTTGTCCAAGGCGAAGGCAAGGTTTTGGTAGGGGACGACGGAAACGAAGAATCACATCCATCGGCGAAGCTGGGAAAAGATGTGCTGGCCGAAAGAGGGTGAGA
TGGAGCAGAGATCTGTTATTGTTGGGCTTAAGAAAGGTTGTTGGATCTGGCACTACCATTGATTTTTATAGGGATCCTTGGATTCCTCGGGAGTCAAGGTTTAGG
CGTTTTTCCTCAAATCCTAATCCGGAGACGCAAGTTTGGGTAAAGGATTTTATAACTCCTTGGTTTTCTTGGAATATTCCAAAACTAGGGGAGGTTGTTCTTGAT
GCAGATATGGAGCTTATAAAAAAAATCCCTATTGGCAGCCAAGCTCTTTGGAATGATCGAAACTCCTGGGTTTCTAAGGGGAAGGTTCCTGATGTGGTATCCAAA
GCCACCTGGATAATGGGATATGTGGAGGAGTTTTCTTCGAACAATCAGCTGAGATTGAGATCTCAGGCCGTGAGACACCCTCGGGCTAATAGTTGGGAACTCCCT
CCGGAAGGGTTAAATAAAATCAATGTTGATGGCTTTTGTGGGAACAATCGGTTGGGCATTGGGGTTGTTGTCCGTGATGCAGATTTGGAACTTTCGGCGACCATG
GTGGATGTTCATGCCAAACTTTATGCAATCCAAGAAGGGTTGTCGTTTGCAAGAAGCCTAAACAAAAGGAGGGTTGTTGTGGAAGCTGACTCGAGGATTGCGGAT
TCTGTATATCATCGTAAAGGCTTCTCCGACTTCAGATTAGGCCGTTCCGTCCAGGGTCTTACTCACGGCCTCTCGCCTAAACTGCCCAAGGAACACCTTCGATCT
TCGACCAGAGTTGCCTTAAGCTTGACATCTACTTCATCGGAATTCCTTCTACAATTCTTCACAAAAACAGACTGGGATAGTGATGATAAGATCGGAATACAATGA
mRNA sequence
ATGCCAGTAGACTTGAATGCTGTCTTAAGGAGTGTGCTGATACTCTTGGACCTTGAGGCCCAAATGTCAAGGCGGATTTCAGATATTTCTCTTGCCCCTAGATGT
CCTTTTATATCGCATCTCTTTTTTGCAGACAACAGTTTGGTTTTTTGTAAGGCTTCCATAACGGAGTTGTATGCCTTTAAATTGATTTTGATCGATTATGAGAAA
GCATCTGGCCAGAAGGTTAATTTTGATAAATCGGCATTGTGGTTTTCTCCATATGTCCATCCAAATTTTAAAAATTATATGCGCTCAGTTATGGACATGAAGGTT
GTTCAAGACCTGGGAGATTATCTGGGGCTTCCTTCGAGCCTTACAAGAAGTAAAAGCAGGGATTTTGATTCGATAAAGGACAGGACTCCCACATTCTCTGTGCTG
ATTTGTCCAAGGCGAAGGCAAGGTTTTGGTAGGGGACGACGGAAACGAAGAATCACATCCATCGGCGAAGCTGGGAAAAGATGTGCTGGCCGAAAGAGGGTGAGA
TGGAGCAGAGATCTGTTATTGTTGGGCTTAAGAAAGGTTGTTGGATCTGGCACTACCATTGATTTTTATAGGGATCCTTGGATTCCTCGGGAGTCAAGGTTTAGG
CGTTTTTCCTCAAATCCTAATCCGGAGACGCAAGTTTGGGTAAAGGATTTTATAACTCCTTGGTTTTCTTGGAATATTCCAAAACTAGGGGAGGTTGTTCTTGAT
GCAGATATGGAGCTTATAAAAAAAATCCCTATTGGCAGCCAAGCTCTTTGGAATGATCGAAACTCCTGGGTTTCTAAGGGGAAGGTTCCTGATGTGGTATCCAAA
GCCACCTGGATAATGGGATATGTGGAGGAGTTTTCTTCGAACAATCAGCTGAGATTGAGATCTCAGGCCGTGAGACACCCTCGGGCTAATAGTTGGGAACTCCCT
CCGGAAGGGTTAAATAAAATCAATGTTGATGGCTTTTGTGGGAACAATCGGTTGGGCATTGGGGTTGTTGTCCGTGATGCAGATTTGGAACTTTCGGCGACCATG
GTGGATGTTCATGCCAAACTTTATGCAATCCAAGAAGGGTTGTCGTTTGCAAGAAGCCTAAACAAAAGGAGGGTTGTTGTGGAAGCTGACTCGAGGATTGCGGAT
TCTGTATATCATCGTAAAGGCTTCTCCGACTTCAGATTAGGCCGTTCCGTCCAGGGTCTTACTCACGGCCTCTCGCCTAAACTGCCCAAGGAACACCTTCGATCT
TCGACCAGAGTTGCCTTAAGCTTGACATCTACTTCATCGGAATTCCTTCTACAATTCTTCACAAAAACAGACTGGGATAGTGATGATAAGATCGGAATACAATGA
Protein sequence
MPVDLNAVLRSVLILLDLEAQMSRRISDISLAPRCPFISHLFFADNSLVFCKASITELYAFKLILIDYEKASGQKVNFDKSALWFSPYVHPNFKNYMRSVMDMKV
VQDLGDYLGLPSSLTRSKSRDFDSIKDRTPTFSVLICPRRRQGFGRGRRKRRITSIGEAGKRCAGRKRVRWSRDLLLLGLRKVVGSGTTIDFYRDPWIPRESRFR
RFSSNPNPETQVWVKDFITPWFSWNIPKLGEVVLDADMELIKKIPIGSQALWNDRNSWVSKGKVPDVVSKATWIMGYVEEFSSNNQLRLRSQAVRHPRANSWELP
PEGLNKINVDGFCGNNRLGIGVVVRDADLELSATMVDVHAKLYAIQEGLSFARSLNKRRVVVEADSRIADSVYHRKGFSDFRLGRSVQGLTHGLSPKLPKEHLRS
STRVALSLTSTSSEFLLQFFTKTDWDSDDKIGIQ
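
The protein sequence above should correspond to a translation of the CDS. As a minimal sketch for checking this, the snippet below translates a nucleotide string with the standard genetic code and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard code (rather than an alternative translation table) is an assumption, although it is the usual choice for plant nuclear genes.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First and last 24 nt of the CDS listed above.
    print(translate("ATGCCAGTAGACTTGAATGCTGTC"))
    print(translate("GATGATAAGATCGGAATACAATGA"))
```

Translating the first eight codons of the CDS gives `MPVDLNAV`, and the final codons give `DDKIGIQ` followed by a stop, which agrees with the start and end of the listed protein sequence.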