; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g16010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g16010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr1:10328209..10339628
RNA-Seq ExpressionMoc01g16010
SyntenyMoc01g16010
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0043167 - ion binding (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR019557 - Aminotransferase-like, plant mobile domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAC95126.1 gag-pol polyprotein [Populus deltoides]2.5e-7245.3Show/hide
Query:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ
        NFL+GK MW YV G  V PK+T+  D   S+DTWEA+N+KIITWINN V HSI  QLAKY+TAK+ WDHL +L+TQSNFAKQYQLE DIRAL Q NMSIQ
Subjt:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ

Query:  DFYSSMSELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDK-----------------
        +FYS+M++LWD+L LTES EL +    +   E+  +VQFL ALR DF+ LRGSIL R+PLP VDSVVSELL EEIRL+S  +K                 
Subjt:  DFYSSMSELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDK-----------------

Query:  -----NTP----------------------------------------------------------------------------------PQS-------
             N P                                                                                  PQ+       
Subjt:  -----NTP----------------------------------------------------------------------------------PQS-------

Query:  -------------------ASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFG-YLVSFSSSTCS
                           ASHHMSP  SSF S+S  SSI +MTADGTPMPL GVGS+VT  +SL +VY IP L LNL S+ Q+C  G YLV FS S C 
Subjt:  -------------------ASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFG-YLVSFSSSTCS

Query:  IQDLQSQKEIGTGHR
        +QDLQSQK IGTG R
Subjt:  IQDLQSQKEIGTGHR

PWA56951.1 gag-pol polyprotein [Artemisia annua]2.4e-6744.26Show/hide
Query:  QLKFMNFSRYSSPRSGSVNFLRGKSMWSYVIGVRVKP-KDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAK
        QL   N+S +S       NFLRGKSMW YV+G + KP       +Y + LDTWE DNSK+ITWINNSVT SI AQLAKYD+AK  WDHLA+LYTQSNFAK
Subjt:  QLKFMNFSRYSSPRSGSVNFLRGKSMWSYVIGVRVKP-KDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAK

Query:  QYQLEKDIRALQQNNMSIQDFYSSMSELWDELTLTESSELSTSE-VLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKK
        QYQLE DIR+LQQN+MSIQDFYSSMS LWD+L LTE  ELS+ +  +   +   +VQFLMALRHDF+                    ELL EE+RLKS  
Subjt:  QYQLEKDIRALQQNNMSIQDFYSSMSELWDELTLTESSELSTSE-VLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKK

Query:  DK----------------------------------------------------------NTPPQ-----------------------------------
        DK                                                          N PPQ                                   
Subjt:  DK----------------------------------------------------------NTPPQ-----------------------------------

Query:  ----------------------SASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFGYLVSFSSS
                               ASHHM    S F+SL   SS SIM A+G  MPL GVGSI TPSVSLSDVYYIPNLT+NL SV Q+C  GY V FS S
Subjt:  ----------------------SASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFGYLVSFSSS

Query:  TCSIQDLQSQKEIGTGHR
         C IQD Q+Q+ IGTG R
Subjt:  TCSIQDLQSQKEIGTGHR

TXG46279.1 hypothetical protein EZV62_028218 [Acer yangbiense]1.6e-7145.64Show/hide
Query:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ
        NFL+GK MW Y+ G  VKPK+ KA DY + LD WEA+NSKIITWINNSV HSI  QLAKYD   + WDHLA+LYTQSNFAKQYQLE DIRAL+Q +MSIQ
Subjt:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ

Query:  DFYSSMSELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDKNT---------------
        +FYS M++LWD+L LTES+EL + +  +   ++  +VQFLMALR DF+ LRGSIL R PLP VDSVVSELL EEIR K +  K T               
Subjt:  DFYSSMSELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDKNT---------------

Query:  --------------------------------------------------------------------------PPQ-----------------------
                                                                                  PPQ                       
Subjt:  --------------------------------------------------------------------------PPQ-----------------------

Query:  -----------------------SASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLC
                                ASHHMSP+ SSFVSL   SS+S+MT DGTPMPL GVGS+VTP VSLS+VY+IPNLTLNLVSVSQLC
Subjt:  -----------------------SASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLC

XP_022859302.1 uncharacterized protein LOC111380069 [Olea europaea var. sylvestris]3.0e-6244.73Show/hide
Query:  MWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQDFYSSMS
        MW ++ G  VKP++T   DYA+ +D WE++NSKIITWINNSV HSI  +LAKY+TAK+ WDHL +L+TQSNFAKQYQLE DIRAL QNNMSIQ+FYS+M+
Subjt:  MWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQDFYSSMS

Query:  ELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLK-----------------------------
        +LWD+L LTES+EL + +  + C E+  +VQ LMAL  DF+ LR SIL  +PLP VDSVVSELLVEEIRLK                             
Subjt:  ELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLK-----------------------------

Query:  --------------------------------------------SKKDKNTPPQS------------------------------ASHHMSPSFSSFVSL
                                                    S+ + + PPQS                              A    S  +SSF S+
Subjt:  --------------------------------------------SKKDKNTPPQS------------------------------ASHHMSPSFSSFVSL

Query:  SSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLC
        S  SSI++MTADG PMPL GVGS+VTP +SL +VY+IP +TLNL  V QLC
Subjt:  SSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLC

XP_042472527.1 uncharacterized protein LOC122055222 [Zingiber officinale]4.2e-6445.88Show/hide
Query:  MWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQDFYSSMS
        MW  ++GVRV+P D  A+DYA SL+ WE DN+KIITWINNSV+HSI  QL KY+T K+ WDHLA+LYTQSNFAKQYQLE +IRALQQ +M IQDFYS+MS
Subjt:  MWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQDFYSSMS

Query:  ELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDKNT----------------------
        +LWD+L LT+SSEL +    +   E   +VQFLMALR DF+ LRG+IL R+PLP VDSVV ELL EEIRLKS+ DK T                      
Subjt:  ELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDKNT----------------------

Query:  -----------------------------------------------------------------PPQSASHHMSPSFSSFVSLSSHSSISIMTADGTPM
                                                                          PQS++   + S SS +SLSS SS S+MT DGTPM
Subjt:  -----------------------------------------------------------------PPQSASHHMSPSFSSFVSLSSHSSISIMTADGTPM

Query:  PLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFGYLVSFSSSTCSIQDLQSQKEIGTGHR
        PL+GVGS+VT  +   +VY+IP+LTLNLVS                     D QSQK IG G R
Subjt:  PLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFGYLVSFSSSTCSIQDLQSQKEIGTGHR

TrEMBL top hitse value%identityAlignment
A0A2U1M6T2 Gag-pol polyprotein1.2e-6744.26Show/hide
Query:  QLKFMNFSRYSSPRSGSVNFLRGKSMWSYVIGVRVKP-KDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAK
        QL   N+S +S       NFLRGKSMW YV+G + KP       +Y + LDTWE DNSK+ITWINNSVT SI AQLAKYD+AK  WDHLA+LYTQSNFAK
Subjt:  QLKFMNFSRYSSPRSGSVNFLRGKSMWSYVIGVRVKP-KDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAK

Query:  QYQLEKDIRALQQNNMSIQDFYSSMSELWDELTLTESSELSTSE-VLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKK
        QYQLE DIR+LQQN+MSIQDFYSSMS LWD+L LTE  ELS+ +  +   +   +VQFLMALRHDF+                    ELL EE+RLKS  
Subjt:  QYQLEKDIRALQQNNMSIQDFYSSMSELWDELTLTESSELSTSE-VLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKK

Query:  DK----------------------------------------------------------NTPPQ-----------------------------------
        DK                                                          N PPQ                                   
Subjt:  DK----------------------------------------------------------NTPPQ-----------------------------------

Query:  ----------------------SASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFGYLVSFSSS
                               ASHHM    S F+SL   SS SIM A+G  MPL GVGSI TPSVSLSDVYYIPNLT+NL SV Q+C  GY V FS S
Subjt:  ----------------------SASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFGYLVSFSSS

Query:  TCSIQDLQSQKEIGTGHR
         C IQD Q+Q+ IGTG R
Subjt:  TCSIQDLQSQKEIGTGHR

A0A2U1NEB9 CCHC-type domain-containing protein2.3e-5565.76Show/hide
Query:  MWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQDFYSSMS
        MW YV GV+  P D K ++Y   L+TWE +NSK+ITWINNS+T SI  QLAKY+TAK  WDHLAKLYTQSNFAKQYQLE DIRALQQN+ SIQ+FYSSMS
Subjt:  MWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQDFYSSMS

Query:  ELWDELTLTESSELSTSE-VLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDKNTPPQSAS
         LWD+L LTE + LS+ +  +   E   +VQF+MALRH+F+ LRGSIL R PLP VDSVVSELLVEEIRLKS  D+    Q  S
Subjt:  ELWDELTLTESSELSTSE-VLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDKNTPPQSAS

A0A2Z7DAC1 Retrotran_gag_3 domain-containing protein (Fragment)3.9e-5568.93Show/hide
Query:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ
        NFLRGKSMWSYV GV VKP D  A DYA  +D WE  NS    WINNSVTHSI  QLAKY+TAK+ WDHLA+LYTQSNFAKQYQLE DIRALQQN+MSIQ
Subjt:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ

Query:  DFYSSMSELWDELTLTESSELSTSE-VLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRL
        +FYS+MS LWD+L LTES  L   E  +   E   +VQFLMALR+DF+ LRG+IL R+PLP VDSVV+ELL EE  L
Subjt:  DFYSSMSELWDELTLTESSELSTSE-VLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRL

A0A5C7GP75 Protein kinase domain-containing protein7.8e-7245.64Show/hide
Query:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ
        NFL+GK MW Y+ G  VKPK+ KA DY + LD WEA+NSKIITWINNSV HSI  QLAKYD   + WDHLA+LYTQSNFAKQYQLE DIRAL+Q +MSIQ
Subjt:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ

Query:  DFYSSMSELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDKNT---------------
        +FYS M++LWD+L LTES+EL + +  +   ++  +VQFLMALR DF+ LRGSIL R PLP VDSVVSELL EEIR K +  K T               
Subjt:  DFYSSMSELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDKNT---------------

Query:  --------------------------------------------------------------------------PPQ-----------------------
                                                                                  PPQ                       
Subjt:  --------------------------------------------------------------------------PPQ-----------------------

Query:  -----------------------SASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLC
                                ASHHMSP+ SSFVSL   SS+S+MT DGTPMPL GVGS+VTP VSLS+VY+IPNLTLNLVSVSQLC
Subjt:  -----------------------SASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLC

Q710T7 Gag-pol polyprotein1.2e-7245.3Show/hide
Query:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ
        NFL+GK MW YV G  V PK+T+  D   S+DTWEA+N+KIITWINN V HSI  QLAKY+TAK+ WDHL +L+TQSNFAKQYQLE DIRAL Q NMSIQ
Subjt:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ

Query:  DFYSSMSELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDK-----------------
        +FYS+M++LWD+L LTES EL +    +   E+  +VQFL ALR DF+ LRGSIL R+PLP VDSVVSELL EEIRL+S  +K                 
Subjt:  DFYSSMSELWDELTLTESSEL-STSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSELLVEEIRLKSKKDK-----------------

Query:  -----NTP----------------------------------------------------------------------------------PQS-------
             N P                                                                                  PQ+       
Subjt:  -----NTP----------------------------------------------------------------------------------PQS-------

Query:  -------------------ASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFG-YLVSFSSSTCS
                           ASHHMSP  SSF S+S  SSI +MTADGTPMPL GVGS+VT  +SL +VY IP L LNL S+ Q+C  G YLV FS S C 
Subjt:  -------------------ASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFG-YLVSFSSSTCS

Query:  IQDLQSQKEIGTGHR
        +QDLQSQK IGTG R
Subjt:  IQDLQSQKEIGTGHR

SwissProt top hitse value%identityAlignment
Q9LNG5 Serine/threonine-protein phosphatase 7 long form homolog2.2e-0728.87Show/hide
Query:  LGLPVDGEPIIGSLQYDCAQLCEDLLG-----------------------------------------------------------------------KV
        LGL VDG  + GS +Y+ A LCEDLLG                                                                       +V
Subjt:  LGLPVDGEPIIGSLQYDCAQLCEDLLG-----------------------------------------------------------------------KV

Query:  GRYSWDSACLSWLYRELCQASRADTLDIKGSLILLQVWAWDR
         + SW SA L+ LYRELC+AS+     I G L+LLQ+WAW+R
Subjt:  GRYSWDSACLSWLYRELCQASRADTLDIKGSLILLQVWAWDR

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.7e-0724.56Show/hide
Query:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ
        +FLR    + ++ G   KP     + ++     WE  N+ ++ W+ NS+T  +   +   +TA + W+ L +++      K YQL + +  L+Q   S++
Subjt:  NFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAKYDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQ

Query:  DFYSSMSELWDELT
        +++  +S++W EL+
Subjt:  DFYSSMSELWDELT

AT1G48120.1 hydrolases;protein serine/threonine phosphatases1.6e-0828.87Show/hide
Query:  LGLPVDGEPIIGSLQYDCAQLCEDLLG-----------------------------------------------------------------------KV
        LGL VDG  + GS +Y+ A LCEDLLG                                                                       +V
Subjt:  LGLPVDGEPIIGSLQYDCAQLCEDLLG-----------------------------------------------------------------------KV

Query:  GRYSWDSACLSWLYRELCQASRADTLDIKGSLILLQVWAWDR
         + SW SA L+ LYRELC+AS+     I G L+LLQ+WAW+R
Subjt:  GRYSWDSACLSWLYRELCQASRADTLDIKGSLILLQVWAWDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCTTTCCCTCTCATTTCACTTTCTTTTTCTTCTCCCCCTACCCGTGGTCCCTCCTCCTTCATTGTCGCCGTCCGTCCGACCACCCCACCCGTGTTCTTCGTCGTC
GCCGCTACCCCTTGCTGCTGTTGCCGCCGCCGTCACGATCGTACTTCGACTGCAGGTTATGGTAAATTTGAATCCACCACTTCCAAATGAAGAAGCACCTTCGCAGATGT
TTACTGAAATAGATTTTGAGATTGTAGACTCTATATGTGAAGAAGGATCAAAGTTGGAAGAATGGTCAATGTCTGTAGAAATGAATGTGGATAACATACTCTGCAATCAT
GTTCACTTGCCATATTTGCCTTATGTCAAGAAGTTTTACGAATTGTCACGGGAACATGGACGTCCAAAAACTTCCCGCATTCACAATGAGATGGATTGGCGAGAGGCGAA
TACGAAATTTCATTGTGGAATATGCAAAAAAGATATGGTTTACTCAGTTAGATTGACACCTTATTACTGCACTGGTAGAGCGAAGGAGTTAGGGTTACCTGTTGATGGAG
AGCCCATTATAGGATCCTTGCAATACGATTGTGCACAATTATGTGAGGATTTACTAGGAAAAGTTGGACGATATTCATGGGATAGTGCATGCCTCTCATGGTTATATCGA
GAACTCTGTCAGGCTAGTCGAGCTGACACCCTGGATATAAAAGGTTCATTGATACTTTTACAAGTGTGGGCGTGGGATAGATGGAATGGAGTTTCGATGGCTTCTGAACA
ATCTACGAACATGTTGGTTCAATATCGAATATGCTTTAATCGACTAAGCCACAAACAAGTTTATCCAAGAGTTGCTTCAAACTTCGAAATGAGACGTCCTCGTCGTAGGC
GCAATCAAGCACAACCAAATGAAGAAGTTCAAGATGAAGCACAAGAAGCTGAACAAGTTCATAAACAAAATGAAGATGCTATAGGAGATGAACAAATCATGAGTACTCCT
ATCCAATCGCATTTTCTAACAATTCATACACCAAAAGTTCATTTAGACACCGCATCTAGTTCAGCTCATCAACCTGACTCATCATCTGTTCAGCAAGAACATCGTCATGA
ACGTCGCGTAAGAAGACCACGACAATGTGGCGCGGATGACCGGCCAGAAAGGATGACAGTTGAGGTAGCAGAGGTGCGTCAAGTTGTACGAGGTCTGACGGTTGAGTGTA
CGCCCCAACAATCAGCAAGTGGTGCCTCCACCTTCACACCTGAAATCGAATGCCTGTCAGTACCATATCGTAGTGCTCGACGTGGTTTTCAGACAACGAAGGTGGCGACG
CACTCGCCTGCTCTTGCAGTGTGTAACCACCTTACTGTCGCCAGGGCGAAGGATTTTCGGATTCGGCTCCAATTTAGCGGGAGAATCGTCGAGCAGCAGCAGCTGAAATT
CATGAATTTCAGCCGTTACAGCAGCCCACGTTCCGGCAGTGTGAATTTCTTGCGTGGAAAATCTATGTGGAGTTATGTTATAGGTGTTCGAGTTAAGCCTAAAGACACCA
AAGCAAATGATTATGCATCCTCATTGGATACTTGGGAGGCTGACAATTCAAAGATTATCACGTGGATAAATAATTCTGTTACACACTCCATTAGTGCTCAATTGGCAAAA
TATGATACTGCTAAACAGGCTTGGGATCATTTGGCAAAATTGTATACTCAATCTAATTTTGCCAAACAATATCAATTGGAGAAGGATATTCGTGCACTACAGCAGAATAA
TATGAGTATTCAAGACTTTTACTCCTCTATGTCAGAATTATGGGATGAGTTGACATTAACAGAATCCTCAGAATTAAGCACATCGGAAGTTTTATATTGCCTGGAGAGAC
TCACAGTTGTCCAATTTCTTATGGCTCTTCGACATGATTTTAAGCCATTACGTGGGTCAATTTTATGTCGTACTCCTCTCCCTTTTGTTGATTCAGTTGTTAGTGAACTA
TTAGTAGAGGAGATTCGTCTTAAGTCTAAAAAAGATAAGAACACACCTCCTCAGAGTGCCTCTCATCATATGTCTCCTAGTTTTTCCTCTTTTGTTTCTTTGTCTTCTCA
TTCTTCTATATCAATCATGACTGCTGATGGAACTCCTATGCCATTAGTAGGCGTTGGCTCAATTGTTACTCCTTCTGTATCTCTCTCTGATGTTTACTATATTCCTAATC
TTACTTTAAACCTTGTCTCTGTTAGTCAATTATGTAAATTTGGATACTTAGTTTCTTTTTCGTCCTCCACTTGTTCTATACAGGACCTGCAGTCTCAGAAGGAGATTGGG
ACAGGCCACAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCCTTTCCCTCTCATTTCACTTTCTTTTTCTTCTCCCCCTACCCGTGGTCCCTCCTCCTTCATTGTCGCCGTCCGTCCGACCACCCCACCCGTGTTCTTCGTCGTC
GCCGCTACCCCTTGCTGCTGTTGCCGCCGCCGTCACGATCGTACTTCGACTGCAGGTTATGGTAAATTTGAATCCACCACTTCCAAATGAAGAAGCACCTTCGCAGATGT
TTACTGAAATAGATTTTGAGATTGTAGACTCTATATGTGAAGAAGGATCAAAGTTGGAAGAATGGTCAATGTCTGTAGAAATGAATGTGGATAACATACTCTGCAATCAT
GTTCACTTGCCATATTTGCCTTATGTCAAGAAGTTTTACGAATTGTCACGGGAACATGGACGTCCAAAAACTTCCCGCATTCACAATGAGATGGATTGGCGAGAGGCGAA
TACGAAATTTCATTGTGGAATATGCAAAAAAGATATGGTTTACTCAGTTAGATTGACACCTTATTACTGCACTGGTAGAGCGAAGGAGTTAGGGTTACCTGTTGATGGAG
AGCCCATTATAGGATCCTTGCAATACGATTGTGCACAATTATGTGAGGATTTACTAGGAAAAGTTGGACGATATTCATGGGATAGTGCATGCCTCTCATGGTTATATCGA
GAACTCTGTCAGGCTAGTCGAGCTGACACCCTGGATATAAAAGGTTCATTGATACTTTTACAAGTGTGGGCGTGGGATAGATGGAATGGAGTTTCGATGGCTTCTGAACA
ATCTACGAACATGTTGGTTCAATATCGAATATGCTTTAATCGACTAAGCCACAAACAAGTTTATCCAAGAGTTGCTTCAAACTTCGAAATGAGACGTCCTCGTCGTAGGC
GCAATCAAGCACAACCAAATGAAGAAGTTCAAGATGAAGCACAAGAAGCTGAACAAGTTCATAAACAAAATGAAGATGCTATAGGAGATGAACAAATCATGAGTACTCCT
ATCCAATCGCATTTTCTAACAATTCATACACCAAAAGTTCATTTAGACACCGCATCTAGTTCAGCTCATCAACCTGACTCATCATCTGTTCAGCAAGAACATCGTCATGA
ACGTCGCGTAAGAAGACCACGACAATGTGGCGCGGATGACCGGCCAGAAAGGATGACAGTTGAGGTAGCAGAGGTGCGTCAAGTTGTACGAGGTCTGACGGTTGAGTGTA
CGCCCCAACAATCAGCAAGTGGTGCCTCCACCTTCACACCTGAAATCGAATGCCTGTCAGTACCATATCGTAGTGCTCGACGTGGTTTTCAGACAACGAAGGTGGCGACG
CACTCGCCTGCTCTTGCAGTGTGTAACCACCTTACTGTCGCCAGGGCGAAGGATTTTCGGATTCGGCTCCAATTTAGCGGGAGAATCGTCGAGCAGCAGCAGCTGAAATT
CATGAATTTCAGCCGTTACAGCAGCCCACGTTCCGGCAGTGTGAATTTCTTGCGTGGAAAATCTATGTGGAGTTATGTTATAGGTGTTCGAGTTAAGCCTAAAGACACCA
AAGCAAATGATTATGCATCCTCATTGGATACTTGGGAGGCTGACAATTCAAAGATTATCACGTGGATAAATAATTCTGTTACACACTCCATTAGTGCTCAATTGGCAAAA
TATGATACTGCTAAACAGGCTTGGGATCATTTGGCAAAATTGTATACTCAATCTAATTTTGCCAAACAATATCAATTGGAGAAGGATATTCGTGCACTACAGCAGAATAA
TATGAGTATTCAAGACTTTTACTCCTCTATGTCAGAATTATGGGATGAGTTGACATTAACAGAATCCTCAGAATTAAGCACATCGGAAGTTTTATATTGCCTGGAGAGAC
TCACAGTTGTCCAATTTCTTATGGCTCTTCGACATGATTTTAAGCCATTACGTGGGTCAATTTTATGTCGTACTCCTCTCCCTTTTGTTGATTCAGTTGTTAGTGAACTA
TTAGTAGAGGAGATTCGTCTTAAGTCTAAAAAAGATAAGAACACACCTCCTCAGAGTGCCTCTCATCATATGTCTCCTAGTTTTTCCTCTTTTGTTTCTTTGTCTTCTCA
TTCTTCTATATCAATCATGACTGCTGATGGAACTCCTATGCCATTAGTAGGCGTTGGCTCAATTGTTACTCCTTCTGTATCTCTCTCTGATGTTTACTATATTCCTAATC
TTACTTTAAACCTTGTCTCTGTTAGTCAATTATGTAAATTTGGATACTTAGTTTCTTTTTCGTCCTCCACTTGTTCTATACAGGACCTGCAGTCTCAGAAGGAGATTGGG
ACAGGCCACAGGTAG
Protein sequenceShow/hide protein sequence
MVLSLSFHFLFLLPLPVVPPPSLSPSVRPPHPCSSSSPLPLAAVAAAVTIVLRLQVMVNLNPPLPNEEAPSQMFTEIDFEIVDSICEEGSKLEEWSMSVEMNVDNILCNH
VHLPYLPYVKKFYELSREHGRPKTSRIHNEMDWREANTKFHCGICKKDMVYSVRLTPYYCTGRAKELGLPVDGEPIIGSLQYDCAQLCEDLLGKVGRYSWDSACLSWLYR
ELCQASRADTLDIKGSLILLQVWAWDRWNGVSMASEQSTNMLVQYRICFNRLSHKQVYPRVASNFEMRRPRRRRNQAQPNEEVQDEAQEAEQVHKQNEDAIGDEQIMSTP
IQSHFLTIHTPKVHLDTASSSAHQPDSSSVQQEHRHERRVRRPRQCGADDRPERMTVEVAEVRQVVRGLTVECTPQQSASGASTFTPEIECLSVPYRSARRGFQTTKVAT
HSPALAVCNHLTVARAKDFRIRLQFSGRIVEQQQLKFMNFSRYSSPRSGSVNFLRGKSMWSYVIGVRVKPKDTKANDYASSLDTWEADNSKIITWINNSVTHSISAQLAK
YDTAKQAWDHLAKLYTQSNFAKQYQLEKDIRALQQNNMSIQDFYSSMSELWDELTLTESSELSTSEVLYCLERLTVVQFLMALRHDFKPLRGSILCRTPLPFVDSVVSEL
LVEEIRLKSKKDKNTPPQSASHHMSPSFSSFVSLSSHSSISIMTADGTPMPLVGVGSIVTPSVSLSDVYYIPNLTLNLVSVSQLCKFGYLVSFSSSTCSIQDLQSQKEIG
TGHR