; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g10290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g10290
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr1:6386555..6390989
RNA-Seq ExpressionMoc01g10290
SyntenyMoc01g10290
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8501049.1 hypothetical protein CXB51_003148 [Gossypium anomalum]1.7e-7244.39Show/hide
Query:  ESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGK----------------------
        E+WE FK LL++C HHGIP CIQ+ET+Y GL+  T +V+DASANGA+L+K Y EA+ I+ERI+SN++ W   RA  G+                      
Subjt:  ESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGK----------------------

Query:  -------STGASAGKAN-VNQIQDISCSFCEGDHHYNNCPGNPER------------TQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKL
               + G+++  A   NQ ++I+C +C   H +  CP NPE              Q R +  P F Q+V  P             Q+E S  SLE L
Subjt:  -------STGASAGKAN-VNQIQDISCSFCEGDHHYNNCPGNPER------------TQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKL

Query:  MKQYMANNDATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEV
        +K YMA NDAT       LRNLE QVGQLAT+L++R  GALPSDTK  +  GKE CKAL LRS + + P   I  A  KE ++        E+ + P  V
Subjt:  MKQYMANNDATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEV

Query:  VVPTPLKQ-IAEQPKRPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKKR-MEEYETVVMNKAYSTILTSKIPAKMKD
         +P  + Q +     +P PPYP+RLQK++Q  QFKKFLDVLKQLH+NIPLVEALEQ+PNYV F+K+I +KKR + E+ETV + K  +T L  K+P K+KD
Subjt:  VVPTPLKQ-IAEQPKRPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKKR-MEEYETVVMNKAYSTILTSKIPAKMKD

Query:  P
        P
Subjt:  P

KAG8501049.1 hypothetical protein CXB51_003148 [Gossypium anomalum]1.3e-0638.79Show/hide
Query:  VSFLKEIFTKKRMEEYETVVMNKAYSTILTSKIPAKMKDPRKELEDVQSECMNANE-GF-VKRMYESLDLTNLELRLLKPSIEEPPVLEFKAFPQHLKYA
        VS L+++  +K +   E ++       ILT      +  P  E ED     + AN+ GF  +  +ESLDL   +    K SIEEPP LE K  P HLKY 
Subjt:  VSFLKEIFTKKRMEEYETVVMNKAYSTILTSKIPAKMKDPRKELEDVQSECMNANE-GF-VKRMYESLDLTNLELRLLKPSIEEPPVLEFKAFPQHLKYA

Query:  YLGSSETLPIINSSRL
        YLG++ TLP+I S+ L
Subjt:  YLGSSETLPIINSSRL

KAG8501049.1 hypothetical protein CXB51_003148 [Gossypium anomalum]7.2e-7141.59Show/hide
Query:  ESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGK----------------------
        ++WE FK LL++C HHGI  CIQ+ET+Y GL+  T +V+DASANGALL+K Y +A+ ILE I++ N+ WS  RA  GK                      
Subjt:  ESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGK----------------------

Query:  ------STGASAGK-----ANVNQIQDISCSFCEGDHHYNNCPGNPE------------------------------RTQCRTSNTPAFQQKVSYPAGFA
              S G +  K     + +NQ +++SC FC   H Y++CP NPE                                Q   S T     K +YP GF+
Subjt:  ------STGASAGK-----ANVNQIQDISCSFCEGDHHYNNCPGNPE------------------------------RTQCRTSNTPAFQQKVSYPAGFA

Query:  NQGQMVAQKQSEGSFASLEKLMKQYMANNDAT-------VQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHP
         Q       QS     SLE ++K+Y+  N+A+       VQSQA SLRNLE QVGQLA +L++R  G LPSDT+  K  G E CKA+ L+SGK L   + 
Subjt:  NQGQMVAQKQSEGSFASLEKLMKQYMANNDAT-------VQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHP

Query:  IAPASIKEPIQVDQGETQLEQDSEPTEVVVPTPLKQIAEQPKRPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKK-R
        +  A   + ++   G  ++  D E     V +P K   +   +P PP+P+R QK++QN QFKKFLDVLKQLH+NIPLVEALEQ+PNYV F+K+I TKK R
Subjt:  IAPASIKEPIQVDQGETQLEQDSEPTEVVVPTPLKQIAEQPKRPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKK-R

Query:  MEEYETVVMNKAYSTILTSKIPAKMKDP
        + E+ETV + K  S+ L  K+P KMKDP
Subjt:  MEEYETVVMNKAYSTILTSKIPAKMKDP

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]2.8e-8364.54Show/hide
Query:  KRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTG---ASAGKANVNQIQDISCSFCEGD
        KRL QRC +HGIP  IQIETYY GLD+AT LVIDAS NGALL KPYA+A NILERISS+NHSWSD RA++GKS+     S     +N   +I       +
Subjt:  KRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTG---ASAGKANVNQIQDISCSFCEGD

Query:  HHYNNC-------------PGNPERTQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANNDATVQSQATSLRNLELQVGQLATD
          Y+N               GN        SN P FQQKVSYP GFA QGQMV   QS+GS  SLE +MKQYMANNDATVQSQA SLRNLELQVGQLA D
Subjt:  HHYNNC-------------PGNPERTQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANNDATVQSQATSLRNLELQVGQLATD

Query:  LKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVPTPLKQIAEQPK
        LKSR VGALPSDT+V KRD KEQC AL LRSGKALPP HP AP   KEP Q+ QGE Q EQDSEP EVVVP P +QIAEQPK
Subjt:  LKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVPTPLKQIAEQPK

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]8.5e-8050.8Show/hide
Query:  SESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANG-ALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTGASAGKANVNQIQDISCS
        SESWECFKRLLQ C HHGIPRCIQIETYY  L+DAT L    +  G +      +E++  L     N         +Q  S GA  G ANVNQIQ ISCS
Subjt:  SESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANG-ALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTGASAGKANVNQIQDISCS

Query:  FCEGDHHYNNCPGNPERT-------------------------------------QCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQ
        FCEGDHHYNNCPGNPE                                          TS+ PAFQ KVSYP GF NQGQMVA++QSEGS ASLEKLMKQ
Subjt:  FCEGDHHYNNCPGNPERT-------------------------------------QCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQ

Query:  YMANNDATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVP
        YMANNDATVQSQATSLRNL+LQVGQLATDLKS+                                            PI+V +   Q E ++ P E    
Subjt:  YMANNDATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVP

Query:  TPLKQIAEQPKRPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKKR-MEEYETV
                    P PPYPKRLQKKE+N QF KFLDVLKQLHVNIPLVEALEQ+PNYV FLKEI  KKR + EY+T+
Subjt:  TPLKQIAEQPKRPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKKR-MEEYETV

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]4.4e-7647.81Show/hide
Query:  SESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTGASAGKANVNQIQDISCSF
        S++WE FK LL++C HHGIP CIQ+ET+Y GL+ A+ +V+DASANGA+L+K Y EAF ILE I+SNN+ WS+ RA   +        A V ++  I+   
Subjt:  SESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTGASAGKANVNQIQDISCSF

Query:  CEGDHHYNNCPGNPERTQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANNDATVQSQATSLRNLELQVGQLATDLKSRLVGAL
         +     NN P      Q  +S+T   Q + +YP GF+ Q +  +Q       +SLE LM+ YMA NDA +QSQA SLRNLELQ+G LA +LK+R  G+L
Subjt:  CEGDHHYNNCPGNPERTQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANNDATVQSQATSLRNLELQVGQLATDLKSRLVGAL

Query:  PSDTKVSKRDGKEQCKALILRSGKALPPAHPI-----APASIKEPIQVDQGETQLEQDSEPTEVV--VPTPLKQIAEQPKRPVPPYPKRLQKKEQNFQFK
        PSDT+  +RDGKEQCK++ LRSGK L  +         P SI+   ++ +   Q   D+ P +      +  +Q A    +P  P+P+R +K++Q+ QFK
Subjt:  PSDTKVSKRDGKEQCKALILRSGKALPPAHPI-----APASIKEPIQVDQGETQLEQDSEPTEVV--VPTPLKQIAEQPKRPVPPYPKRLQKKEQNFQFK

Query:  KFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKK-RMEEYETVVMNKAYSTILTSKIPAKMKDP
        KFLDVLKQLH+NIPLVEALEQ+PNYV FLK+I TKK R+ E+ETV + +  S +L SKIP K+KDP
Subjt:  KFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKK-RMEEYETVVMNKAYSTILTSKIPAKMKDP

TrEMBL top hitse value%identityAlignment
A0A6J1CS22 uncharacterized protein LOC1110138055.4e-6471.84Show/hide
Query:  MANNDATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVPT
        MANNDATVQSQ  SLRNLELQVGQLA DLKSR +GA PSDT+V KRDGKEQCKAL LRSGKALPP H  APA  KE  +VD GETQ EQDS+P EV++ T
Subjt:  MANNDATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVPT

Query:  PLKQIAEQPK----------RPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKKRM-EEYETVVMNKAYSTILTSKIP
        P +Q A QPK           PV        KKEQNF+FKKFLDVLKQLHVN+PLVEALEQ+PNYV FLKEI TKKRM  EYE V M KA STILTSKIP
Subjt:  PLKQIAEQPK----------RPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKKRM-EEYETVVMNKAYSTILTSKIP

Query:  AKMKDP
        AKMKDP
Subjt:  AKMKDP

A0A6J1DRG1 uncharacterized protein LOC1110236691.2e-6354.17Show/hide
Query:  QAIYMHFSESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGK--------------
        Q +    +ESWE FK+LLQ+C HHGIPRCIQIE YY GLDDAT LVIDAS NGALL KPYAEAFNILERISSNNHSWSDPRA+QG+              
Subjt:  QAIYMHFSESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGK--------------

Query:  -------------------STGASAGKANVNQIQDISCSFCEGDHHYNNCPGNPER-------------------------------------TQCRTSN
                           + GAS GKANV+ IQ ISCSFCEG+HHYNN P NPE                                          TSN
Subjt:  -------------------STGASAGKANVNQIQDISCSFCEGDHHYNNCPGNPER-------------------------------------TQCRTSN

Query:  TPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANNDATVQSQATSLRNLELQVGQ
         PA+QQK SYP  F+NQGQ+  Q  SEGSFASLE LMK+ M  ND TVQSQA SLRNLE+QVGQ
Subjt:  TPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANNDATVQSQATSLRNLELQVGQ

A0A6J1DWK1 uncharacterized protein LOC1110250531.4e-8364.54Show/hide
Query:  KRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTG---ASAGKANVNQIQDISCSFCEGD
        KRL QRC +HGIP  IQIETYY GLD+AT LVIDAS NGALL KPYA+A NILERISS+NHSWSD RA++GKS+     S     +N   +I       +
Subjt:  KRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTG---ASAGKANVNQIQDISCSFCEGD

Query:  HHYNNC-------------PGNPERTQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANNDATVQSQATSLRNLELQVGQLATD
          Y+N               GN        SN P FQQKVSYP GFA QGQMV   QS+GS  SLE +MKQYMANNDATVQSQA SLRNLELQVGQLA D
Subjt:  HHYNNC-------------PGNPERTQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANNDATVQSQATSLRNLELQVGQLATD

Query:  LKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVPTPLKQIAEQPK
        LKSR VGALPSDT+V KRD KEQC AL LRSGKALPP HP AP   KEP Q+ QGE Q EQDSEP EVVVP P +QIAEQPK
Subjt:  LKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVPTPLKQIAEQPK

A0A6J1DXK5 uncharacterized protein LOC1110255002.5e-6957.72Show/hide
Query:  SESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGK---------------------
        SESWE FKRL+Q+ L+ GIPRCIQI+TYY GLDDAT LVIDASANGALLAKPYAEAFNILERISSNN SWSDPRA+ GK                     
Subjt:  SESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGK---------------------

Query:  ------------STGASAGKANVNQIQDISCSFCEGDHHYNNCPGNPERTQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANN
                    + GASAGKANV+ IQ ISCSFC G++ YNNCPGNPE      +         S    +A  G ++   + E      ++ M +YM NN
Subjt:  ------------STGASAGKANVNQIQDISCSFCEGDHHYNNCPGNPERTQCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQYMANN

Query:  DATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKE
        D TVQSQA SLRNLE+QVGQLATDLKS+  G LPSD KV KRDGKEQC AL LRSGK LP AHP A    KE
Subjt:  DATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKE

A0A6J1E1F3 uncharacterized protein LOC1110250654.1e-8050.8Show/hide
Query:  SESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANG-ALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTGASAGKANVNQIQDISCS
        SESWECFKRLLQ C HHGIPRCIQIETYY  L+DAT L    +  G +      +E++  L     N         +Q  S GA  G ANVNQIQ ISCS
Subjt:  SESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDATHLVIDASANG-ALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTGASAGKANVNQIQDISCS

Query:  FCEGDHHYNNCPGNPERT-------------------------------------QCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQ
        FCEGDHHYNNCPGNPE                                          TS+ PAFQ KVSYP GF NQGQMVA++QSEGS ASLEKLMKQ
Subjt:  FCEGDHHYNNCPGNPERT-------------------------------------QCRTSNTPAFQQKVSYPAGFANQGQMVAQKQSEGSFASLEKLMKQ

Query:  YMANNDATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVP
        YMANNDATVQSQATSLRNL+LQVGQLATDLKS+                                            PI+V +   Q E ++ P E    
Subjt:  YMANNDATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQGETQLEQDSEPTEVVVP

Query:  TPLKQIAEQPKRPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKKR-MEEYETV
                    P PPYPKRLQKKE+N QF KFLDVLKQLHVNIPLVEALEQ+PNYV FLKEI  KKR + EY+T+
Subjt:  TPLKQIAEQPKRPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKKR-MEEYETV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGCTCTTGACATTTATTCCCAAGTTACCGTGCCCCCTCCTTTTACAGTACCTGATGTTCGACCTTGCCTTTTATTCCAAGGCCTAAACTGTGCACCGGTC
CTTGCCATTTATTCCCAAGGCCTAAACTGTGCACCGTCCTTGCCGTTTATCCCCAAGTCACTACGCAAGGCTTTGAACCTCCAAGCGATTTATATGCATTTCAGT
GAATCCTGGGAGTGTTTCAAGCGATTGTTGCAGAGATGTCTACACCATGGGATCCCAAGATGCATCCAAATCGAGACATATTATACAGGTCTTGATGATGCCACA
CACTTAGTAATTGATGCGTCTGCAAATGGGGCTTTGCTAGCAAAACCCTATGCTGAAGCATTCAATATTTTAGAAAGAATATCATCAAACAATCACTCATGGTCT
GATCCTAGAGCTGTTCAAGGAAAATCAACTGGAGCATCAGCTGGTAAGGCTAATGTCAATCAAATTCAAGACATTTCTTGTTCTTTCTGCGAGGGAGACCACCAT
TATAACAACTGCCCTGGAAATCCCGAGAGGACACAATGTAGAACATCCAATACTCCAGCATTTCAGCAGAAGGTAAGTTATCCTGCTGGTTTTGCGAATCAAGGA
CAAATGGTAGCACAAAAGCAATCAGAAGGATCATTTGCATCATTGGAAAAGCTGATGAAGCAATACATGGCCAATAATGACGCCACTGTGCAAAGCCAAGCTACA
TCATTAAGGAACCTAGAGTTGCAAGTAGGCCAGTTAGCGACGGATTTGAAGAGCAGACTGGTTGGAGCATTACCCAGCGATACAAAAGTGTCAAAGAGAGACGGT
AAGGAACAATGCAAGGCCCTCATACTTCGAAGTGGGAAAGCATTACCTCCGGCACACCCAATTGCTCCAGCATCGATCAAGGAACCTATTCAAGTTGACCAAGGA
GAAACTCAGTTAGAACAGGACAGTGAGCCAACAGAAGTAGTTGTACCTACTCCACTAAAGCAGATAGCTGAACAACCAAAAAGGCCAGTGCCACCATATCCTAAA
CGGTTGCAGAAGAAAGAGCAGAATTTTCAATTCAAGAAGTTTTTAGATGTCTTGAAGCAACTGCATGTGAACATACCGTTAGTGGAAGCTCTAGAGCAAATTCCA
AATTATGTGAGCTTCCTGAAAGAGATATTCACAAAGAAGAGAATGGAAGAGTATGAAACTGTAGTAATGAATAAAGCTTACAGCACCATACTCACAAGCAAAATT
CCTGCAAAGATGAAAGATCCTAGGAAGGAACTAGAAGATGTCCAGTCAGAATGCATGAATGCCAATGAAGGATTTGTGAAGAGAATGTATGAGTCCTTAGACCTC
ACAAACCTGGAGCTTAGATTACTGAAGCCATCTATTGAAGAGCCGCCAGTGCTGGAGTTTAAAGCATTTCCACAGCATCTGAAATATGCTTACTTGGGTTCATCA
GAGACATTGCCAATCATCAATAGTAGCAGACTTATCCTTGGAAAAGGAGCAGATGCTGTTAAATGTACTCAAGGCACATCAGAGAGCAATTGGATGGACTATTGC
GAATATCAAGGGGATAAGCCTCAGCTTTTGTATGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGACGGCTCTTGACATTTATTCCCAAGTTACCGTGCCCCCTCCTTTTACAGTACCTGATGTTCGACCTTGCCTTTTATTCCAAGGCCTAAACTGTGCACCGGTC
CTTGCCATTTATTCCCAAGGCCTAAACTGTGCACCGTCCTTGCCGTTTATCCCCAAGTCACTACGCAAGGCTTTGAACCTCCAAGCGATTTATATGCATTTCAGT
GAATCCTGGGAGTGTTTCAAGCGATTGTTGCAGAGATGTCTACACCATGGGATCCCAAGATGCATCCAAATCGAGACATATTATACAGGTCTTGATGATGCCACA
CACTTAGTAATTGATGCGTCTGCAAATGGGGCTTTGCTAGCAAAACCCTATGCTGAAGCATTCAATATTTTAGAAAGAATATCATCAAACAATCACTCATGGTCT
GATCCTAGAGCTGTTCAAGGAAAATCAACTGGAGCATCAGCTGGTAAGGCTAATGTCAATCAAATTCAAGACATTTCTTGTTCTTTCTGCGAGGGAGACCACCAT
TATAACAACTGCCCTGGAAATCCCGAGAGGACACAATGTAGAACATCCAATACTCCAGCATTTCAGCAGAAGGTAAGTTATCCTGCTGGTTTTGCGAATCAAGGA
CAAATGGTAGCACAAAAGCAATCAGAAGGATCATTTGCATCATTGGAAAAGCTGATGAAGCAATACATGGCCAATAATGACGCCACTGTGCAAAGCCAAGCTACA
TCATTAAGGAACCTAGAGTTGCAAGTAGGCCAGTTAGCGACGGATTTGAAGAGCAGACTGGTTGGAGCATTACCCAGCGATACAAAAGTGTCAAAGAGAGACGGT
AAGGAACAATGCAAGGCCCTCATACTTCGAAGTGGGAAAGCATTACCTCCGGCACACCCAATTGCTCCAGCATCGATCAAGGAACCTATTCAAGTTGACCAAGGA
GAAACTCAGTTAGAACAGGACAGTGAGCCAACAGAAGTAGTTGTACCTACTCCACTAAAGCAGATAGCTGAACAACCAAAAAGGCCAGTGCCACCATATCCTAAA
CGGTTGCAGAAGAAAGAGCAGAATTTTCAATTCAAGAAGTTTTTAGATGTCTTGAAGCAACTGCATGTGAACATACCGTTAGTGGAAGCTCTAGAGCAAATTCCA
AATTATGTGAGCTTCCTGAAAGAGATATTCACAAAGAAGAGAATGGAAGAGTATGAAACTGTAGTAATGAATAAAGCTTACAGCACCATACTCACAAGCAAAATT
CCTGCAAAGATGAAAGATCCTAGGAAGGAACTAGAAGATGTCCAGTCAGAATGCATGAATGCCAATGAAGGATTTGTGAAGAGAATGTATGAGTCCTTAGACCTC
ACAAACCTGGAGCTTAGATTACTGAAGCCATCTATTGAAGAGCCGCCAGTGCTGGAGTTTAAAGCATTTCCACAGCATCTGAAATATGCTTACTTGGGTTCATCA
GAGACATTGCCAATCATCAATAGTAGCAGACTTATCCTTGGAAAAGGAGCAGATGCTGTTAAATGTACTCAAGGCACATCAGAGAGCAATTGGATGGACTATTGC
GAATATCAAGGGGATAAGCCTCAGCTTTTGTATGCATAA
Protein sequenceShow/hide protein sequence
MTALDIYSQVTVPPPFTVPDVRPCLLFQGLNCAPVLAIYSQGLNCAPSLPFIPKSLRKALNLQAIYMHFSESWECFKRLLQRCLHHGIPRCIQIETYYTGLDDAT
HLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSTGASAGKANVNQIQDISCSFCEGDHHYNNCPGNPERTQCRTSNTPAFQQKVSYPAGFANQG
QMVAQKQSEGSFASLEKLMKQYMANNDATVQSQATSLRNLELQVGQLATDLKSRLVGALPSDTKVSKRDGKEQCKALILRSGKALPPAHPIAPASIKEPIQVDQG
ETQLEQDSEPTEVVVPTPLKQIAEQPKRPVPPYPKRLQKKEQNFQFKKFLDVLKQLHVNIPLVEALEQIPNYVSFLKEIFTKKRMEEYETVVMNKAYSTILTSKI
PAKMKDPRKELEDVQSECMNANEGFVKRMYESLDLTNLELRLLKPSIEEPPVLEFKAFPQHLKYAYLGSSETLPIINSSRLILGKGADAVKCTQGTSESNWMDYC
EYQGDKPQLLYA