; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g10600 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g10600
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:8999829..9006332
RNA-Seq ExpressionMoc09g10600
SyntenyMoc09g10600
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN63684.1 hypothetical protein VITISV_020448 [Vitis vinifera]7.6e-3446.28Show/hide
Query:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES------GKK
        LS R++ VL  T+   Q AFV GRQILDA LIANEI DER +  ++GVV K+D EKA+D + WDFLD +L+ KGF  RWR  + G  S+ S      G+ 
Subjt:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES------GKK

Query:  AIELEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKF
           ++A   +  VA  +ERL R FLW+G  E    HL+RW+ V  P + GGLG  ++   N ALL KW+ R+  E  ALW +V  +KF
Subjt:  AIELEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKF

CAN72727.1 hypothetical protein VITISV_015094 [Vitis vinifera]1.7e-3343.62Show/hide
Query:  SLSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTESGKKAIELE
        S   RL+KVL  T+F  Q AFV GR ILDA LIANE+ DE+ +  ++ VV K+D EKA+D +DW  LD +L+ KGF  +WRS + G  S+ S    +   
Subjt:  SLSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTESGKKAIELE

Query:  AHSCI----ISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFG
        A   +    +S+   +E++ R FLW+G  E    HL+RWE V  P + GGLG       N ALL KW+ RF  E+  LW KVI + +G
Subjt:  AHSCI----ISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFG

CAN78865.1 hypothetical protein VITISV_013346 [Vitis vinifera]1.9e-3241.82Show/hide
Query:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES------GKK
        LS RL+ VL  T+   Q AFV GRQILDA LIANEI DER +  ++GVV K+D EK +D + WDFLD +L+ KGF  RWR  + G  S+ S      G  
Subjt:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES------GKK

Query:  AIELEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNH
           ++A   +  VA  +ERL R FLW+G  E    HL+RW+ V  P   GGLG+ +    N A L KW+ R+  E  ALW +VI++ +G+       +  
Subjt:  AIELEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNH

Query:  SLQSSKGPWKSIFKQSESMS
           S + PWK+I +  +  S
Subjt:  SLQSSKGPWKSIFKQSESMS

CAN78865.1 hypothetical protein VITISV_013346 [Vitis vinifera]4.3e-0536.84Show/hide
Query:  GSWQKRALIKDLLKQHNPSITILVETKWCNVDRMLIKSVWSSRNLGWSSLDACGSSG----VYGPSSSKYKLDFIKELS-----DLDGLCGENWI
        GS  KR ++KD L+  NP + ++ ETK+ N DR  + SVW+ RN  W +L A G+SG    ++   + + +   I   S      LDG CG  WI
Subjt:  GSWQKRALIKDLLKQHNPSITILVETKWCNVDRMLIKSVWSSRNLGWSSLDACGSSG----VYGPSSSKYKLDFIKELS-----DLDGLCGENWI

RVX06748.1 putative ribonuclease H protein [Vitis vinifera]5.8e-4229.69Show/hide
Query:  SQTSCSKIYP-NVRGLGSWQKRALIKDLLKQHNPSITILVETKWCNVDRMLIKSVWSSRNLGWSSLDACGSSGVYGPSSSKYKLDFIKELSDLDGLCGEN
        S  S  KI   N RGLGS +K   ++  L   NP + +L ETK    DR  I  +W S          C    V       +K DF  EL +L  L    
Subjt:  SQTSCSKIYP-NVRGLGSWQKRALIKDLLKQHNPSITILVETKWCNVDRMLIKSVWSSRNLGWSSLDACGSSGVYGPSSSKYKLDFIKELSDLDGLCGEN

Query:  WILAGDFNLIRWSHENSNGKRPSRSMRAFNAFIFDAGLADSFLK---------------------WQIHMSLSE----------------RLKKVLPYTV
        W + GDFN+IR   E     R + +MR F+ FI ++GL D  L+                     W +H    E                RL KVL  T+
Subjt:  WILAGDFNLIRWSHENSNGKRPSRSMRAFNAFIFDAGLADSFLK---------------------WQIHMSLSE----------------RLKKVLPYTV

Query:  FEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES----------------------
        F  Q AFV  RQILDA LIANE+ DE+ +  ++GVV+K+D EKA+D +DW FLD +L+ KGF  +W+S + G  S+ S                      
Subjt:  FEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES----------------------

Query:  -----------------------------------------GKKAIELE-------AHSCI--------------ISVAETLERLFRKFLWAGNREKDNI
                                                 G K   L          SC+              +S+A  +E++ R FLW+G  E +  
Subjt:  -----------------------------------------GKKAIELE-------AHSCI--------------ISVAETLERLFRKFLWAGNREKDNI

Query:  HLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKSIFKQSESMS
        HL+RWE V    + GGLG       N  LL KW+ RF  E+  LW KVI + +G        +     S + PWK+I +  +  S
Subjt:  HLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKSIFKQSESMS

RVX14083.1 Histone deacetylase 15 [Vitis vinifera]1.1e-3239.6Show/hide
Query:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTESGK-------
        LS RL+ VL  T+   Q AFV GRQILDA LIANEI DE+ Q  ++GVV K+D EKA+D + WDFLD +L+ KGF  +WRS + G  S+ S         
Subjt:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTESGK-------

Query:  -------------------------KAIE---LEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCR
                                 KA E   LE       VA  +ER+ R FLW+G  E    HL+RWE V  P   GGLGI  +   N+ALL KW+ R
Subjt:  -------------------------KAIE---LEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCR

Query:  FHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKSIFKQSESMSK
        F  E  +LW +VI++ +G        +     S + PWK+I +  +  SK
Subjt:  FHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKSIFKQSESMSK

TrEMBL top hitse value%identityAlignment
A0A438JCR4 Putative ribonuclease H protein2.8e-4229.69Show/hide
Query:  SQTSCSKIYP-NVRGLGSWQKRALIKDLLKQHNPSITILVETKWCNVDRMLIKSVWSSRNLGWSSLDACGSSGVYGPSSSKYKLDFIKELSDLDGLCGEN
        S  S  KI   N RGLGS +K   ++  L   NP + +L ETK    DR  I  +W S          C    V       +K DF  EL +L  L    
Subjt:  SQTSCSKIYP-NVRGLGSWQKRALIKDLLKQHNPSITILVETKWCNVDRMLIKSVWSSRNLGWSSLDACGSSGVYGPSSSKYKLDFIKELSDLDGLCGEN

Query:  WILAGDFNLIRWSHENSNGKRPSRSMRAFNAFIFDAGLADSFLK---------------------WQIHMSLSE----------------RLKKVLPYTV
        W + GDFN+IR   E     R + +MR F+ FI ++GL D  L+                     W +H    E                RL KVL  T+
Subjt:  WILAGDFNLIRWSHENSNGKRPSRSMRAFNAFIFDAGLADSFLK---------------------WQIHMSLSE----------------RLKKVLPYTV

Query:  FEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES----------------------
        F  Q AFV  RQILDA LIANE+ DE+ +  ++GVV+K+D EKA+D +DW FLD +L+ KGF  +W+S + G  S+ S                      
Subjt:  FEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES----------------------

Query:  -----------------------------------------GKKAIELE-------AHSCI--------------ISVAETLERLFRKFLWAGNREKDNI
                                                 G K   L          SC+              +S+A  +E++ R FLW+G  E +  
Subjt:  -----------------------------------------GKKAIELE-------AHSCI--------------ISVAETLERLFRKFLWAGNREKDNI

Query:  HLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKSIFKQSESMS
        HL+RWE V    + GGLG       N  LL KW+ RF  E+  LW KVI + +G        +     S + PWK+I +  +  S
Subjt:  HLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKSIFKQSESMS

A0A438JYQ9 Histone deacetylase5.3e-3339.6Show/hide
Query:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTESGK-------
        LS RL+ VL  T+   Q AFV GRQILDA LIANEI DE+ Q  ++GVV K+D EKA+D + WDFLD +L+ KGF  +WRS + G  S+ S         
Subjt:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTESGK-------

Query:  -------------------------KAIE---LEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCR
                                 KA E   LE       VA  +ER+ R FLW+G  E    HL+RWE V  P   GGLGI  +   N+ALL KW+ R
Subjt:  -------------------------KAIE---LEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCR

Query:  FHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKSIFKQSESMSK
        F  E  +LW +VI++ +G        +     S + PWK+I +  +  SK
Subjt:  FHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKSIFKQSESMSK

A5BCE8 Reverse transcriptase domain-containing protein9.0e-3341.82Show/hide
Query:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES------GKK
        LS RL+ VL  T+   Q AFV GRQILDA LIANEI DER +  ++GVV K+D EK +D + WDFLD +L+ KGF  RWR  + G  S+ S      G  
Subjt:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES------GKK

Query:  AIELEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNH
           ++A   +  VA  +ERL R FLW+G  E    HL+RW+ V  P   GGLG+ +    N A L KW+ R+  E  ALW +VI++ +G+       +  
Subjt:  AIELEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNH

Query:  SLQSSKGPWKSIFKQSESMS
           S + PWK+I +  +  S
Subjt:  SLQSSKGPWKSIFKQSESMS

A5BCE8 Reverse transcriptase domain-containing protein2.1e-0536.84Show/hide
Query:  GSWQKRALIKDLLKQHNPSITILVETKWCNVDRMLIKSVWSSRNLGWSSLDACGSSG----VYGPSSSKYKLDFIKELS-----DLDGLCGENWI
        GS  KR ++KD L+  NP + ++ ETK+ N DR  + SVW+ RN  W +L A G+SG    ++   + + +   I   S      LDG CG  WI
Subjt:  GSWQKRALIKDLLKQHNPSITILVETKWCNVDRMLIKSVWSSRNLGWSSLDACGSSG----VYGPSSSKYKLDFIKELS-----DLDGLCGENWI

A5BNF8 Uncharacterized protein3.7e-3446.28Show/hide
Query:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES------GKK
        LS R++ VL  T+   Q AFV GRQILDA LIANEI DER +  ++GVV K+D EKA+D + WDFLD +L+ KGF  RWR  + G  S+ S      G+ 
Subjt:  LSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTES------GKK

Query:  AIELEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKF
           ++A   +  VA  +ERL R FLW+G  E    HL+RW+ V  P + GGLG  ++   N ALL KW+ R+  E  ALW +V  +KF
Subjt:  AIELEAHSCIISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKF

A5BV05 Uncharacterized protein8.2e-3443.62Show/hide
Query:  SLSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTESGKKAIELE
        S   RL+KVL  T+F  Q AFV GR ILDA LIANE+ DE+ +  ++ VV K+D EKA+D +DW  LD +L+ KGF  +WRS + G  S+ S    +   
Subjt:  SLSERLKKVLPYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTESGKKAIELE

Query:  AHSCI----ISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFG
        A   +    +S+   +E++ R FLW+G  E    HL+RWE V  P + GGLG       N ALL KW+ RF  E+  LW KVI + +G
Subjt:  AHSCI----ISVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657507.9e-1037.86Show/hide
Query:  SVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKS
        S+   L++L R FLW    EK   HL++W KV  P  EGGLG+   K  N+AL++K   R   EK++LW  V+  K+   H     D+  L   KG W S
Subjt:  SVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKS

Query:  IFK
         ++
Subjt:  IFK

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.2e-0636.71Show/hide
Query:  ERLKKVLPYTVFEQQTAFVAGRQILDASLIANE-ITDERFQKNQKG-VVLKLDVEKAFDMLDWDFLDLILKAKGFGDRW
        ERLK ++   +   Q +F+ GR   D  +   E +   R +K  KG ++LKLD+EKA+D + WD+L+  L + GF + W
Subjt:  ERLKKVLPYTVFEQQTAFVAGRQILDASLIANE-ITDERFQKNQKG-VVLKLDVEKAFDMLDWDFLDLILKAKGFGDRW

AT4G29090.1 Ribonuclease H-like superfamily protein1.0e-0427.68Show/hide
Query:  SVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKG-PWK
        +V + +  +   F W   +E   +H   W+ +     EGG+G  D++ +N ALL K + R     ++L  KV  +++  K     P N  L S     WK
Subjt:  SVAETLERLFRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKG-PWK

Query:  SIFKQSESMSKG
        SI    E + +G
Subjt:  SIFKQSESMSKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTCATGGAAGCTAGTATAAAAGTGAAAATAAATACAAGAGGTTTTATCCCGGTCTCCATCGATCTCCTGTCATCATCAGCATCTTCTTTGTGTGTGACT
GTGGATCCTTTTTTCAAGGAAGAAAACTACATAGGATACATTGTCGGAATCCATGGCGACCGAAGCAAGGAGCCGACGCGTGAGATTGGGCGAGTGGAATTCCAT
CCTCCCAAAAAGGAGCAGCAGTCGTACCATGGAGAAGACAAACCCACAGATCTCCCTGCCACATTGATTAAAGATGCTGCAAAGGGATGTGACCATACGGTTGAG
GATAGTCCATTGGATAAAAGTATGGTTACTTGCAAAGGACGAAGCACTCCTCTTCCCTCATCCTTTATACCTTCTGAACTGGATATTGATGACTATATCTCAAGC
ACCTACTCCTCCTACCCAAACTCACCCTCCTCTCTGCCAAACCCACCCACCAGCCAGACCTCTTGCTCCAAAATCTACCCGAATGTTAGAGGCCTAGGCTCTTGG
CAAAAAAGGGCCCTAATCAAGGATCTCCTCAAGCAACACAACCCCTCCATCACCATTCTAGTTGAAACTAAGTGGTGCAATGTAGACAGAATGCTTATCAAATCA
GTTTGGAGCTCTAGAAACTTGGGCTGGTCTTCTCTCGACGCGTGTGGTAGCTCCGGAGTATATGGTCCATCTTCCTCAAAATATAAATTAGATTTCATCAAGGAA
TTATCTGATCTAGATGGCCTCTGTGGTGAAAATTGGATTCTTGCGGGTGACTTCAACCTTATCAGATGGTCGCATGAAAATTCTAACGGGAAGCGCCCATCTAGA
AGCATGAGAGCCTTCAATGCCTTTATCTTTGACGCTGGACTTGCTGATAGCTTTCTCAAATGGCAGATTCACATGAGTTTGTCAGAAAGACTCAAGAAAGTGCTT
CCTTACACTGTTTTCGAACAGCAAACAGCATTTGTGGCCGGTAGACAAATCCTCGATGCCTCGCTCATTGCAAATGAGATCACTGATGAAAGGTTTCAGAAAAAT
CAAAAAGGTGTGGTCCTTAAATTAGATGTGGAAAAGGCATTTGATATGTTGGATTGGGATTTCCTAGATTTGATTTTGAAAGCTAAAGGGTTTGGAGATAGATGG
AGAAGCATCCTTTTGGGATCCCATAGTACAGAAAGTGGAAAGAAGGCTATTGAGTTGGAAGCACACTCATGCATCATAAGTGTTGCTGAAACACTAGAGAGACTC
TTCAGAAAGTTCTTATGGGCCGGTAATAGAGAGAAGGACAATATCCATCTTCTCAGGTGGGAAAAAGTTAAACTTCCTCTTGATGAAGGTGGTCTTGGCATTGTT
GATCTAAAGAAGTGGAACAAGGCTCTTCTTGCAAAATGGGTTTGTCGTTTTCATTGTGAAAAGGATGCTTTATGGAAGAAGGTGATAGTAGCAAAATTTGGAGCT
AAGCACTTCGACTTAAAACCGGACAACCACTCTTTACAAAGCTCCAAAGGCCCATGGAAATCTATTTTTAAGCAAAGTGAATCGATGTCAAAAGGGTTCGACACT
AGCAAGGACGATGCCGCCACGGGTAAGAACTCGCCGCCGCAGAAGAAACAGGGAGTCGCGCAGCCGCGTTGTTGGAGGCAGGAGAGCCCCCGCACCGCCGCAAGG
AGGAACGCGCTGTTGAGGTTCGTGAAAAGGTGGCGGTCGCGGGCGGTGGATTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCTCATGGAAGCTAGTATAAAAGTGAAAATAAATACAAGAGGTTTTATCCCGGTCTCCATCGATCTCCTGTCATCATCAGCATCTTCTTTGTGTGTGACT
GTGGATCCTTTTTTCAAGGAAGAAAACTACATAGGATACATTGTCGGAATCCATGGCGACCGAAGCAAGGAGCCGACGCGTGAGATTGGGCGAGTGGAATTCCAT
CCTCCCAAAAAGGAGCAGCAGTCGTACCATGGAGAAGACAAACCCACAGATCTCCCTGCCACATTGATTAAAGATGCTGCAAAGGGATGTGACCATACGGTTGAG
GATAGTCCATTGGATAAAAGTATGGTTACTTGCAAAGGACGAAGCACTCCTCTTCCCTCATCCTTTATACCTTCTGAACTGGATATTGATGACTATATCTCAAGC
ACCTACTCCTCCTACCCAAACTCACCCTCCTCTCTGCCAAACCCACCCACCAGCCAGACCTCTTGCTCCAAAATCTACCCGAATGTTAGAGGCCTAGGCTCTTGG
CAAAAAAGGGCCCTAATCAAGGATCTCCTCAAGCAACACAACCCCTCCATCACCATTCTAGTTGAAACTAAGTGGTGCAATGTAGACAGAATGCTTATCAAATCA
GTTTGGAGCTCTAGAAACTTGGGCTGGTCTTCTCTCGACGCGTGTGGTAGCTCCGGAGTATATGGTCCATCTTCCTCAAAATATAAATTAGATTTCATCAAGGAA
TTATCTGATCTAGATGGCCTCTGTGGTGAAAATTGGATTCTTGCGGGTGACTTCAACCTTATCAGATGGTCGCATGAAAATTCTAACGGGAAGCGCCCATCTAGA
AGCATGAGAGCCTTCAATGCCTTTATCTTTGACGCTGGACTTGCTGATAGCTTTCTCAAATGGCAGATTCACATGAGTTTGTCAGAAAGACTCAAGAAAGTGCTT
CCTTACACTGTTTTCGAACAGCAAACAGCATTTGTGGCCGGTAGACAAATCCTCGATGCCTCGCTCATTGCAAATGAGATCACTGATGAAAGGTTTCAGAAAAAT
CAAAAAGGTGTGGTCCTTAAATTAGATGTGGAAAAGGCATTTGATATGTTGGATTGGGATTTCCTAGATTTGATTTTGAAAGCTAAAGGGTTTGGAGATAGATGG
AGAAGCATCCTTTTGGGATCCCATAGTACAGAAAGTGGAAAGAAGGCTATTGAGTTGGAAGCACACTCATGCATCATAAGTGTTGCTGAAACACTAGAGAGACTC
TTCAGAAAGTTCTTATGGGCCGGTAATAGAGAGAAGGACAATATCCATCTTCTCAGGTGGGAAAAAGTTAAACTTCCTCTTGATGAAGGTGGTCTTGGCATTGTT
GATCTAAAGAAGTGGAACAAGGCTCTTCTTGCAAAATGGGTTTGTCGTTTTCATTGTGAAAAGGATGCTTTATGGAAGAAGGTGATAGTAGCAAAATTTGGAGCT
AAGCACTTCGACTTAAAACCGGACAACCACTCTTTACAAAGCTCCAAAGGCCCATGGAAATCTATTTTTAAGCAAAGTGAATCGATGTCAAAAGGGTTCGACACT
AGCAAGGACGATGCCGCCACGGGTAAGAACTCGCCGCCGCAGAAGAAACAGGGAGTCGCGCAGCCGCGTTGTTGGAGGCAGGAGAGCCCCCGCACCGCCGCAAGG
AGGAACGCGCTGTTGAGGTTCGTGAAAAGGTGGCGGTCGCGGGCGGTGGATTTATGA
Protein sequenceShow/hide protein sequence
MDLMEASIKVKINTRGFIPVSIDLLSSSASSLCVTVDPFFKEENYIGYIVGIHGDRSKEPTREIGRVEFHPPKKEQQSYHGEDKPTDLPATLIKDAAKGCDHTVE
DSPLDKSMVTCKGRSTPLPSSFIPSELDIDDYISSTYSSYPNSPSSLPNPPTSQTSCSKIYPNVRGLGSWQKRALIKDLLKQHNPSITILVETKWCNVDRMLIKS
VWSSRNLGWSSLDACGSSGVYGPSSSKYKLDFIKELSDLDGLCGENWILAGDFNLIRWSHENSNGKRPSRSMRAFNAFIFDAGLADSFLKWQIHMSLSERLKKVL
PYTVFEQQTAFVAGRQILDASLIANEITDERFQKNQKGVVLKLDVEKAFDMLDWDFLDLILKAKGFGDRWRSILLGSHSTESGKKAIELEAHSCIISVAETLERL
FRKFLWAGNREKDNIHLLRWEKVKLPLDEGGLGIVDLKKWNKALLAKWVCRFHCEKDALWKKVIVAKFGAKHFDLKPDNHSLQSSKGPWKSIFKQSESMSKGFDT
SKDDAATGKNSPPQKKQGVAQPRCWRQESPRTAARRNALLRFVKRWRSRAVDL