; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g04850 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g04850
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:4118776..4120191
RNA-Seq ExpressionMoc07g04850
SyntenyMoc07g04850
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062333.1 putative reverse transcriptase [Cucumis melo var. makuwa]3.2e-3732.32Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MIF   D+ SL FI+  L  F +L+G   N+ K+S F  G+       L A +GF + +LPVRYLG+PL S RL   D  PL++RITSRIR+W ARV SF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV------GGRFGCLSPS--------LLELYCY
        AG+LQLIR VL+S QV+W SVF+L         PA +H+   ++K    +LW+ +E    G KVAW EV      GG      PS        +L L   
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV------GGRFGCLSPS--------LLELYCY

Query:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSW----------------------------WCPFYPRFGD-----------------WIIGD
        N  +L   +     L          G  +W V +    SW                            W   + + G                  W++G 
Subjt:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSW----------------------------WCPFYPRFGD-----------------WIIGD

Query:  AASSSQAKV--------AYLRWPGVSGELWELVSEVSSVPFQWGGLMCPFGFGVVSCEVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR
            S A            +RW   +G + +    V SV   W  L     F V   +VWS +L    SSH+I +W  EL+W+CH   GK  R
Subjt:  AASSSQAKV--------AYLRWPGVSGELWELVSEVSSVPFQWGGLMCPFGFGVVSCEVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]8.0e-4135.88Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MIF   D+ S+ FI+  L  F +L+G  AN RKSS F  G+       L   +GF+  +LPVRYLG+PL + RL  +D  PL++RITS+IR+W+ARVLSF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV-----GGRFGCL-SPS--------LLELYCY
        AG+LQL+RSVL+S QVYW SVF+L         PA +H+   ++K    +LW+G+E    G KVAW +V      G FG    PS        +L L   
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV-----GGRFGCL-SPS--------LLELYCY

Query:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGE---LWELVSEVS-------------
        N  +L   +     L          G  +W V +    SW          W I      S+A   +L WP VS E   LWE V EVS             
Subjt:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGE---LWELVSEVS-------------

Query:  ------SVPFQWGGLMCPFGFGVV-------------SC----EVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR
              S+   W  +  P G  V+             SC    +VWS +L    SS++I +W  EL+W+CH   GK  R
Subjt:  ------SVPFQWGGLMCPFGFGVV-------------SC----EVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR

XP_022157473.1 uncharacterized protein LOC111024165 [Momordica charantia]5.9e-5245.26Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MI SY DQGSLSFIK    SFED    V        FG+GLPV E DRL AFLGFSV SL VRYLGV L S R+SHHD KPLLERI  R+RNWSAR+LSF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEVGGRFGCLSPSLLELYCYNEAALAFHYTSGIF
        A  L LIR V QSFQVYW SVFILP RV HD    ++HS          FLWKG E+ W G KVAWSE+    G  S S                     
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEVGGRFGCLSPSLLELYCYNEAALAFHYTSGIF

Query:  LGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGEL--------WELVSEVSSV----PFQWGGLM--CPFGFG
                  + D +  VR  P   W+         W  G+       K +++ W  +   L        W+    VS V       W  L   CPF   
Subjt:  LGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGEL--------WELVSEVSSV----PFQWGGLM--CPFGFG

Query:  VVSCEVWSDMLAWAGSSHQISYWSTEL
          S EVWS M+AWAGSS +ISYWSTEL
Subjt:  VVSCEVWSDMLAWAGSSHQISYWSTEL

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus]1.6e-4132.31Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MIF   D  S+SFIK  +  F +L+G  AN+ KSS F VG+  ++  RL A +GFS+  LPVRYLG+PL   RL   D  PL++RITSRIR+WSARVLSF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV-------------GGRFGCLSP-SLLELYCY
        AG+LQL+RSVL+S QVYW SVF+LP +V  D        +K+L      +LW+G+E    GAKVAW EV             G  +   S   +L L   
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV-------------GGRFGCLSP-SLLELYCY

Query:  NEAALAFHYTSGIFLGG--------LG--GGLYYSGDCI-------WTVRASPRFSW--WCPFYPRFGDWIIGDAASSSQAKVAYL-------RWPGVSG
           +L   +     L G        LG  G    SG  I       W +  S    W        +FG+ +I DA S   A++          RWP VS 
Subjt:  NEAALAFHYTSGIFLGG--------LG--GGLYYSGDCI-------WTVRASPRFSW--WCPFYPRFGDWIIGDAASSSQAKVAYL-------RWPGVSG

Query:  EL-----------------------------------WELVSEVSSVPFQWGGLM---------------------------------------------
        +L                                   WE +   SS    W GL+                                             
Subjt:  EL-----------------------------------WELVSEVSSVPFQWGGLM---------------------------------------------

Query:  ---------CPFGFGVVSCEVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR
                 CPFG+     E+WS +L +  SSH+I YW  EL+W+C+   GK  R
Subjt:  ---------CPFGFGVVSCEVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR

XP_031740402.1 uncharacterized protein LOC116403409 [Cucumis sativus]4.1e-3752.66Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MIF   D  S+SFIK  +  F +L+G  AN+ KS  F VG+  ++  RL A +GFS+  LPVRYLG+PL S RL   D  PL++RITSRIR+WSARVLSF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV
        AG+LQL+RSVL+S QVYW SVF+LP +V  D           ++K    +LW+G E    GAKVAW EV
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV

TrEMBL top hitse value%identityAlignment
A0A5A7SPE5 Reverse transcriptase domain-containing protein7.1e-3539.37Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MIF   D+ S+ FI+  L  F +L+G  AN RKSS F  G+       L   +GF+  +LPVRYLG+PL + RL  +D  PL++RITS+IR+W+ARVLSF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV-----GGRFGCL-SPS--------LLELYCY
        AG+LQL+RSVL+S QVYW SVF+L         PA +H+   ++K    +LW+G+E    G KVAW +V      G FG    PS        +L L   
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV-----GGRFGCL-SPS--------LLELYCY

Query:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGE---LWELVSEVS
        N  +L   +     L          G  +W V +    SW          W I      S+A   +L WP VS E   LWE V EVS
Subjt:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGE---LWELVSEVS

A0A5A7UV01 F17F8.51.6e-3448.52Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MIF   D+ S+ FI+  L  F +L+G  AN RKSS F  G+       L A +GF   +LP+RYLG+PL + RL  +DY PL++RITSRIR+W+ARVLSF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV
        AG+LQL+R VL+S QVYW SVF+LP  V H+    +I            +LW+G+E    G KVAW +V
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV

A0A5A7V2L9 Putative reverse transcriptase1.5e-3732.32Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MIF   D+ SL FI+  L  F +L+G   N+ K+S F  G+       L A +GF + +LPVRYLG+PL S RL   D  PL++RITSRIR+W ARV SF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV------GGRFGCLSPS--------LLELYCY
        AG+LQLIR VL+S QV+W SVF+L         PA +H+   ++K    +LW+ +E    G KVAW EV      GG      PS        +L L   
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV------GGRFGCLSPS--------LLELYCY

Query:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSW----------------------------WCPFYPRFGD-----------------WIIGD
        N  +L   +     L          G  +W V +    SW                            W   + + G                  W++G 
Subjt:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSW----------------------------WCPFYPRFGD-----------------WIIGD

Query:  AASSSQAKV--------AYLRWPGVSGELWELVSEVSSVPFQWGGLMCPFGFGVVSCEVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR
            S A            +RW   +G + +    V SV   W  L     F V   +VWS +L    SSH+I +W  EL+W+CH   GK  R
Subjt:  AASSSQAKV--------AYLRWPGVSGELWELVSEVSSVPFQWGGLMCPFGFGVVSCEVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR

A0A5D3DXE4 Reverse transcriptase domain-containing protein3.9e-4135.88Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MIF   D+ S+ FI+  L  F +L+G  AN RKSS F  G+       L   +GF+  +LPVRYLG+PL + RL  +D  PL++RITS+IR+W+ARVLSF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV-----GGRFGCL-SPS--------LLELYCY
        AG+LQL+RSVL+S QVYW SVF+L         PA +H+   ++K    +LW+G+E    G KVAW +V      G FG    PS        +L L   
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEV-----GGRFGCL-SPS--------LLELYCY

Query:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGE---LWELVSEVS-------------
        N  +L   +     L          G  +W V +    SW          W I      S+A   +L WP VS E   LWE V EVS             
Subjt:  NEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGE---LWELVSEVS-------------

Query:  ------SVPFQWGGLMCPFGFGVV-------------SC----EVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR
              S+   W  +  P G  V+             SC    +VWS +L    SS++I +W  EL+W+CH   GK  R
Subjt:  ------SVPFQWGGLMCPFGFGVV-------------SC----EVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR

A0A6J1DTG0 uncharacterized protein LOC1110241652.9e-5245.26Show/hide
Query:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF
        MI SY DQGSLSFIK    SFED    V        FG+GLPV E DRL AFLGFSV SL VRYLGV L S R+SHHD KPLLERI  R+RNWSAR+LSF
Subjt:  MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSF

Query:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEVGGRFGCLSPSLLELYCYNEAALAFHYTSGIF
        A  L LIR V QSFQVYW SVFILP RV HD    ++HS          FLWKG E+ W G KVAWSE+    G  S S                     
Subjt:  AGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEVGGRFGCLSPSLLELYCYNEAALAFHYTSGIF

Query:  LGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGEL--------WELVSEVSSV----PFQWGGLM--CPFGFG
                  + D +  VR  P   W+         W  G+       K +++ W  +   L        W+    VS V       W  L   CPF   
Subjt:  LGGLGGGLYYSGDCIWTVRASPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGEL--------WELVSEVSSV----PFQWGGLM--CPFGFG

Query:  VVSCEVWSDMLAWAGSSHQISYWSTEL
          S EVWS M+AWAGSS +ISYWSTEL
Subjt:  VVSCEVWSDMLAWAGSSHQISYWSTEL

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.3e-0629.13Show/hide
Query:  VPLFSHRLSHHDYKPLLERITSRIRNWSARVLSFAGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAW
        +P+   R++   +  +LER++SR+  W  + LSFAG+L L ++VL S  V+  S  +LP  +              L++    FLW     +     V W
Subjt:  VPLFSHRLSHHDYKPLLERITSRIRNWSARVLSFAGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAW

Query:  SEV
        S+V
Subjt:  SEV

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.2e-1946.09Show/hide
Query:  FSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSFAGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKG
        F+  +LPVRYLG+PL + +++  DY PL+E+I  RI  W+AR LSFAG+LQLI SV+ S   +W S F LP         A I   K ++     FLW G
Subjt:  FSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSFAGQLQLIRSVLQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKG

Query:  RESRWLGAKVAWSEV
         E     AKVAWS+V
Subjt:  RESRWLGAKVAWSEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTTCTCGTATGGGGATCAAGGATCTCTTTCTTTTATTAAGGCTGCATTGGCTTCTTTTGAAGATTTGGCGGGTTTCGTCGCGAATATTCGGAAGAGCTCGTTTTT
CGGGGTGGGCCTCCCTGTGGCGGAGGTCGACAGGTTAACGGCTTTCTTAGGGTTTTCGGTGGCGTCGCTCCCTGTGCGTTATCTCGGGGTTCCGCTTTTTTCGCATCGGC
TTTCTCATCATGATTATAAGCCTTTGCTTGAGCGGATTACCTCTCGTATTCGAAATTGGTCGGCTAGGGTGCTTTCTTTTGCTGGCCAGTTGCAGCTTATCCGATCGGTT
CTGCAGAGTTTTCAGGTCTATTGGACCAGTGTATTTATCCTTCCGGATCGTGTTGGTCATGATGCCTTTCCCGCTGTTATCCACTCCAATAAAATGCTGAACAAGCCGGG
CCATCCTTTCTTGTGGAAGGGGCGCGAGAGCAGGTGGTTGGGGGCTAAGGTGGCTTGGTCTGAGGTGGGAGGGCGGTTTGGGTGTTTGTCACCTAGCCTCTTGGAACTCT
ACTGCTATAATGAAGCTGCTCTGGCTTTTCATTACACGAGCGGGATCTTTTTGGGTGGCTTGGGTGGAGGCTTATATTATTCGGGGGATTGTATTTGGACCGTCCGTGCT
TCGCCTCGGTTCTCTTGGTGGTGTCCGTTTTATCCGAGGTTTGGTGACTGGATTATCGGTGATGCTGCTAGCTCGTCTCAGGCGAAGGTTGCTTATTTGCGTTGGCCTGG
GGTGTCGGGTGAGCTTTGGGAGCTGGTCTCTGAGGTTTCATCTGTGCCGTTTCAGTGGGGAGGGTTGATGTGCCCATTTGGATTCGGCGTCGTCAGTTGTGAGGTTTGGT
CTGACATGCTTGCTTGGGCTGGTTCTTCTCACCAGATTTCATATTGGTCTACTGAGCTTGCTTGGGTTTGTCACATTAGTGCCGGGAAGTCTGCTCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTTTCTCGTATGGGGATCAAGGATCTCTTTCTTTTATTAAGGCTGCATTGGCTTCTTTTGAAGATTTGGCGGGTTTCGTCGCGAATATTCGGAAGAGCTCGTTTTT
CGGGGTGGGCCTCCCTGTGGCGGAGGTCGACAGGTTAACGGCTTTCTTAGGGTTTTCGGTGGCGTCGCTCCCTGTGCGTTATCTCGGGGTTCCGCTTTTTTCGCATCGGC
TTTCTCATCATGATTATAAGCCTTTGCTTGAGCGGATTACCTCTCGTATTCGAAATTGGTCGGCTAGGGTGCTTTCTTTTGCTGGCCAGTTGCAGCTTATCCGATCGGTT
CTGCAGAGTTTTCAGGTCTATTGGACCAGTGTATTTATCCTTCCGGATCGTGTTGGTCATGATGCCTTTCCCGCTGTTATCCACTCCAATAAAATGCTGAACAAGCCGGG
CCATCCTTTCTTGTGGAAGGGGCGCGAGAGCAGGTGGTTGGGGGCTAAGGTGGCTTGGTCTGAGGTGGGAGGGCGGTTTGGGTGTTTGTCACCTAGCCTCTTGGAACTCT
ACTGCTATAATGAAGCTGCTCTGGCTTTTCATTACACGAGCGGGATCTTTTTGGGTGGCTTGGGTGGAGGCTTATATTATTCGGGGGATTGTATTTGGACCGTCCGTGCT
TCGCCTCGGTTCTCTTGGTGGTGTCCGTTTTATCCGAGGTTTGGTGACTGGATTATCGGTGATGCTGCTAGCTCGTCTCAGGCGAAGGTTGCTTATTTGCGTTGGCCTGG
GGTGTCGGGTGAGCTTTGGGAGCTGGTCTCTGAGGTTTCATCTGTGCCGTTTCAGTGGGGAGGGTTGATGTGCCCATTTGGATTCGGCGTCGTCAGTTGTGAGGTTTGGT
CTGACATGCTTGCTTGGGCTGGTTCTTCTCACCAGATTTCATATTGGTCTACTGAGCTTGCTTGGGTTTGTCACATTAGTGCCGGGAAGTCTGCTCGTTGA
Protein sequenceShow/hide protein sequence
MIFSYGDQGSLSFIKAALASFEDLAGFVANIRKSSFFGVGLPVAEVDRLTAFLGFSVASLPVRYLGVPLFSHRLSHHDYKPLLERITSRIRNWSARVLSFAGQLQLIRSV
LQSFQVYWTSVFILPDRVGHDAFPAVIHSNKMLNKPGHPFLWKGRESRWLGAKVAWSEVGGRFGCLSPSLLELYCYNEAALAFHYTSGIFLGGLGGGLYYSGDCIWTVRA
SPRFSWWCPFYPRFGDWIIGDAASSSQAKVAYLRWPGVSGELWELVSEVSSVPFQWGGLMCPFGFGVVSCEVWSDMLAWAGSSHQISYWSTELAWVCHISAGKSAR