; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g36650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g36650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:27505907..27511734
RNA-Seq ExpressionMoc04g36650
SyntenyMoc04g36650
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN76546.1 hypothetical protein VITISV_010420 [Vitis vinifera]4.5e-2632.81Show/hide
Query:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSS-------------SSSISPAAF----------------
        CGG++  ++Y   EYV+ FL+GLN+SY+Q+R Q+L+M+P P+IN+ F+LV QE   R +  S             S+S +PA                  
Subjt:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSS-------------SSSISPAAF----------------

Query:  ---------------LTHLTKAQTTSDVASSSTTHVAGICYSILNSIHSSTRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLN
                       L+  + A T +     S ++ AG    I N       W+IDSGA+ HVC     F S ++V ++ VTLP    +P+  +G V L+
Subjt:  ---------------LTHLTKAQTTSDVASSSTTHVAGICYSILNSIHSSTRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLN

Query:  SHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQDKSSLQTIGRG
          + L  VL++P F++NLLS SA    +S+ + F  D+C+IQ+ S  + IG+G
Subjt:  SHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQDKSSLQTIGRG

RVW92730.1 hypothetical protein CK203_042571 [Vitis vinifera]2.7e-2631.16Show/hide
Query:  SSSSPIPPCGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSSSSSISPAAFLTHLTKAQTTSDVASSSTTHV
        S+  P+  CGGVK++S ++Q EY+M FL+ LN+S++Q+R QLLL++P PSIN+ F+L++QE  Q+ I S  ++ S +A           S  +     H 
Subjt:  SSSSPIPPCGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSSSSSISPAAFLTHLTKAQTTSDVASSSTTHV

Query:  AG--IC------------YSIL--NSIHSSTRWV-IDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLL
             C            YS +  N+ H         +GA+ H+C S  AF SL    + +VTLPN+ +IPV F GD+ L+S +TLK VL++P F+FNL+
Subjt:  AG--IC------------YSIL--NSIHSSTRWV-IDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLL

Query:  STSALATHMSVMIQFVGDSCLIQDKSSLQTIGRGEYWHVAAPSDIHL----SYGMTPSGSDSLGLQHSDIISAGTDLNNTDVADVNTDGAAVNTDVTIVD
        S SAL    S+    +  S                   V+APS +++    S G++PS                             D A  N    + D
Subjt:  STSALATHMSVMIQFVGDSCLIQDKSSLQTIGRGEYWHVAAPSDIHL----SYGMTPSGSDSLGLQHSDIISAGTDLNNTDVADVNTDGAAVNTDVTIVD

Query:  TDISQVHTSGQDASDILDTALVPPPQVDTPVRKSNRLTRPPTYLKDFHCSLLT
         D+                 LV  P     +R+S R+++PP+ L+DFHC+LL+
Subjt:  TDISQVHTSGQDASDILDTALVPPPQVDTPVRKSNRLTRPPTYLKDFHCSLLT

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]2.8e-2827.81Show/hide
Query:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSSSS------------------------------------
        CGG KS+SE+ Q EY++  L+GL+E Y   RA+LLLM+P PS+N+A +LV Q+  QR+I +S++                                    
Subjt:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSSSS------------------------------------

Query:  -----------SISPAAFLTHLTKAQTTSDVASSSTTHVAGICYSILNSIHSSTRWV-------------------IDSGASAHVCFSRPAFTSLVEVSD
                   S +P A  +    A   S   S + +H   + + +L S  S+ ++V                   +D GASAH+C  R  F  + ++S 
Subjt:  -----------SISPAAFLTHLTKAQTTSDVASSSTTHVAGICYSILNSIHSSTRWV-------------------IDSGASAHVCFSRPAFTSLVEVSD

Query:  MSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHM-SVMIQFVGDSCLIQDKSSLQTIGRG------------------------
        + V LPN  R  V++ G VRL+ H+++ GVLYIP+F FNL+S + L   M S+ ++F  D+C+IQDKS  +TI +G                        
Subjt:  MSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHM-SVMIQFVGDSCLIQDKSSLQTIGRG------------------------

Query:  --------EYWH--VAAPSDIHLSYGMTPSGSDSLGLQH---------------SDI-ISAGTDLNNTDVADVNTDGAAVNTDVTIVDTDISQV------
                + WH  +  PS   L    +    D+  L+                +D+ ++   DL   +V D   D  A + D+T  D DI  V      
Subjt:  --------EYWH--VAAPSDIHLSYGMTPSGSDSLGLQH---------------SDI-ISAGTDLNNTDVADVNTDGAAVNTDVTIVDTDISQV------

Query:  ---------------HTSGQDASDILDTALV-PPPQVDTPV--------------RKSNRLTRPPTYLKDFHCSLLTASSLPSSASKYH
                          G  ++ ++   +    P V TP+              R+S R ++ P+YL+DFHCSLLT +SLPS AS  H
Subjt:  ---------------HTSGQDASDILDTALV-PPPQVDTPV--------------RKSNRLTRPPTYLKDFHCSLLTASSLPSSASKYH

XP_022856063.1 uncharacterized protein LOC111377235, partial [Olea europaea var. sylvestris]1.3e-2830.28Show/hide
Query:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNIS----------------------------------------
        CG  K +SE++Q EYVM FL+GLN++++Q R QLLLM+P PSIN+ F+LV+QE  QRNIS                                        
Subjt:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNIS----------------------------------------

Query:  --------------------------------------------------SSSSSISPAAFL----------------THLTKAQTTSDVASSSTTHVAG
                                                          S  +  +   F+                 HLT A+T +      T  V+G
Subjt:  --------------------------------------------------SSSSSISPAAFL----------------THLTKAQTTSDVASSSTTHVAG

Query:  ICYSILNS--IHSSTRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVG
         C+SI+ +  ++S   WV+DSGA++H+CFS+ AF S+  + +  VTLPN+ RIPV F+G V+ ++++ L+ VLY+P+F+FNL+S S+L    +++++F  
Subjt:  ICYSILNS--IHSSTRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVG

Query:  DSCLIQDKSSLQTIGRG
        + C IQD  + + IG+G
Subjt:  DSCLIQDKSSLQTIGRG

XP_022867559.1 uncharacterized protein LOC111387248 [Olea europaea var. sylvestris]5.2e-3031.83Show/hide
Query:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQR---NISSSSSSISPAAF--------------------------
        C G+K++ +++Q EYVM F +GL++S++Q R+Q+LLM+P P IN+ FALV+QE  QR   N++ +S+  +P AF                          
Subjt:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQR---NISSSSSSISPAAF--------------------------

Query:  -------------------------------------------------------------------LT----HLTKAQTTSDVASSSTTHVAGICYSIL
                                                                           LT    HLT A+T++++      + AGI +S+ 
Subjt:  -------------------------------------------------------------------LT----HLTKAQTTSDVASSSTTHVAGICYSIL

Query:  N--SIHSSTRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQ
        N  ++HS   W++DS A+ HVCF +  F +LV +S+   TLPN +RI V F G +R+N  + L  VLY+P FQFNLLS ++L  + S+ I F+ D CLIQ
Subjt:  N--SIHSSTRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQ

Query:  DKSSLQTIGRG
           +L+TIGRG
Subjt:  DKSSLQTIGRG

TrEMBL top hitse value%identityAlignment
A0A151U9A5 Retrovirus-related Pol polyprotein from transposon TNT 1-949.2e-2530.23Show/hide
Query:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNIS-SSSSSISPAAFL---------------------------
        CGG+K   ++ Q+EY M FL+GLNE YS +R Q+LLM+P P I + F+LV QE  Q+ +   ++S+ +P AF                            
Subjt:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNIS-SSSSSISPAAFL---------------------------

Query:  --------------THLTKAQTTSDV--------------------------------ASSSTTHVAGICYSILNSIHS--STRWVIDSGASAHVCFSRP
                      THL +  + ++V                                   S   V G+  SI  S +S  ST+W++DSGAS HV  S  
Subjt:  --------------THLTKAQTTSDV--------------------------------ASSSTTHVAGICYSILNSIHS--STRWVIDSGASAHVCFSRP

Query:  AFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQDKSSLQTIGRGEYWHVAAPSDIHLSY
         F +   + +  VTLPN S IPV  IG V L+  I LK V+Y+P FQ+NLLS + L  H S+ + F+ +  ++QD    + IG GE    A      L+ 
Subjt:  AFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQDKSSLQTIGRGEYWHVAAPSDIHLSY

Query:  GMTPSGSDSLGLQHSDIISAGTDLNNTDVADVNTDGAAVNTDVT
         + P+   S  + ++ +  +   LN      VN++ A+VN+  T
Subjt:  GMTPSGSDSLGLQHSDIISAGTDLNNTDVADVNTDGAAVNTDVT

A0A438I7P6 Retrotran_gag_3 domain-containing protein1.3e-2631.16Show/hide
Query:  SSSSPIPPCGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSSSSSISPAAFLTHLTKAQTTSDVASSSTTHV
        S+  P+  CGGVK++S ++Q EY+M FL+ LN+S++Q+R QLLL++P PSIN+ F+L++QE  Q+ I S  ++ S +A           S  +     H 
Subjt:  SSSSPIPPCGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSSSSSISPAAFLTHLTKAQTTSDVASSSTTHV

Query:  AG--IC------------YSIL--NSIHSSTRWV-IDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLL
             C            YS +  N+ H         +GA+ H+C S  AF SL    + +VTLPN+ +IPV F GD+ L+S +TLK VL++P F+FNL+
Subjt:  AG--IC------------YSIL--NSIHSSTRWV-IDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLL

Query:  STSALATHMSVMIQFVGDSCLIQDKSSLQTIGRGEYWHVAAPSDIHL----SYGMTPSGSDSLGLQHSDIISAGTDLNNTDVADVNTDGAAVNTDVTIVD
        S SAL    S+    +  S                   V+APS +++    S G++PS                             D A  N    + D
Subjt:  STSALATHMSVMIQFVGDSCLIQDKSSLQTIGRGEYWHVAAPSDIHL----SYGMTPSGSDSLGLQHSDIISAGTDLNNTDVADVNTDGAAVNTDVTIVD

Query:  TDISQVHTSGQDASDILDTALVPPPQVDTPVRKSNRLTRPPTYLKDFHCSLLT
         D+                 LV  P     +R+S R+++PP+ L+DFHC+LL+
Subjt:  TDISQVHTSGQDASDILDTALVPPPQVDTPVRKSNRLTRPPTYLKDFHCSLLT

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 87.0e-2528.48Show/hide
Query:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQR-------------------------------------------
        CGG+K   ++ ++EY+M FL+GLN+SY+ +RAQ+LLM+P PSIN  F+L+ QE  QR                                           
Subjt:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQR-------------------------------------------

Query:  ------------------------------------NISSSSSSISPAAFLT---------------HLTKAQTTSDVASSSTTHVAGICYSILNSIHSS
                                            N +S++++ SP  F +               HL  A T     +++ TH +GI     ++  S 
Subjt:  ------------------------------------NISSSSSSISPAAFLT---------------HLTKAQTTSDVASSSTTHVAGICYSILNSIHSS

Query:  TRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQDKSSLQTI
          W+IDSGAS H+C  +  F +    ++M V LPN  RI V  IGD+++N  +TLK VL++  F +NL+S S L    ++ + F    C+IQD S    I
Subjt:  TRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQDKSSLQTI

Query:  GR
        G+
Subjt:  GR

A0A6J1CR17 uncharacterized protein LOC1110134411.4e-2827.81Show/hide
Query:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSSSS------------------------------------
        CGG KS+SE+ Q EY++  L+GL+E Y   RA+LLLM+P PS+N+A +LV Q+  QR+I +S++                                    
Subjt:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSSSS------------------------------------

Query:  -----------SISPAAFLTHLTKAQTTSDVASSSTTHVAGICYSILNSIHSSTRWV-------------------IDSGASAHVCFSRPAFTSLVEVSD
                   S +P A  +    A   S   S + +H   + + +L S  S+ ++V                   +D GASAH+C  R  F  + ++S 
Subjt:  -----------SISPAAFLTHLTKAQTTSDVASSSTTHVAGICYSILNSIHSSTRWV-------------------IDSGASAHVCFSRPAFTSLVEVSD

Query:  MSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHM-SVMIQFVGDSCLIQDKSSLQTIGRG------------------------
        + V LPN  R  V++ G VRL+ H+++ GVLYIP+F FNL+S + L   M S+ ++F  D+C+IQDKS  +TI +G                        
Subjt:  MSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHM-SVMIQFVGDSCLIQDKSSLQTIGRG------------------------

Query:  --------EYWH--VAAPSDIHLSYGMTPSGSDSLGLQH---------------SDI-ISAGTDLNNTDVADVNTDGAAVNTDVTIVDTDISQV------
                + WH  +  PS   L    +    D+  L+                +D+ ++   DL   +V D   D  A + D+T  D DI  V      
Subjt:  --------EYWH--VAAPSDIHLSYGMTPSGSDSLGLQH---------------SDI-ISAGTDLNNTDVADVNTDGAAVNTDVTIVDTDISQV------

Query:  ---------------HTSGQDASDILDTALV-PPPQVDTPV--------------RKSNRLTRPPTYLKDFHCSLLTASSLPSSASKYH
                          G  ++ ++   +    P V TP+              R+S R ++ P+YL+DFHCSLLT +SLPS AS  H
Subjt:  ---------------HTSGQDASDILDTALV-PPPQVDTPV--------------RKSNRLTRPPTYLKDFHCSLLTASSLPSSASKYH

A5B7R0 Integrase catalytic domain-containing protein2.2e-2632.81Show/hide
Query:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSS-------------SSSISPAAF----------------
        CGG++  ++Y   EYV+ FL+GLN+SY+Q+R Q+L+M+P P+IN+ F+LV QE   R +  S             S+S +PA                  
Subjt:  CGGVKSMSEYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSS-------------SSSISPAAF----------------

Query:  ---------------LTHLTKAQTTSDVASSSTTHVAGICYSILNSIHSSTRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLN
                       L+  + A T +     S ++ AG    I N       W+IDSGA+ HVC     F S ++V ++ VTLP    +P+  +G V L+
Subjt:  ---------------LTHLTKAQTTSDVASSSTTHVAGICYSILNSIHSSTRWVIDSGASAHVCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLN

Query:  SHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQDKSSLQTIGRG
          + L  VL++P F++NLLS SA    +S+ + F  D+C+IQ+ S  + IG+G
Subjt:  SHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQDKSSLQTIGRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAATTTTCTTGGCCATCTTTGGAAACCCACTTTCTCCTCTGAATTCCCTTGCTTCCACCAGCATCGCCGGATGGGTCACGCAGTTACCGAAATAGTTGCTCGAAA
CCGGAGGATCCAACAGCCCCCGCACTATCAATATGGTATCAGTGACTATTTCACTTTTCCACTATTTTTTCCCGTCAAGATCCTTATGGCGGTTACTGATCCTCCATTGG
AAGAACATCCTCCAGCATCGGAGAATCCTTCGTCATCGACCACTATACCGACTCCGATTTCAAGTTCATCTTCACCGATTCCTCCCTGTGGTGGTGTTAAATCCATGTCT
GAATATTTTCAAAATGAATACGTCATGTGCTTCCTCGTGGGTTTGAATGAGTCCTATAGCCAATTGCGTGCTCAATTGTTGCTTATGGAACCTGCTCCATCTATCAATCG
TGCATTTGCCCTAGTTGCGCAGGAGGCTGCTCAACGAAATATCTCTTCTTCATCTTCTTCTATTTCTCCTGCGGCGTTTTTGACTCATCTTACCAAAGCTCAAACCACTT
CAGACGTTGCTTCTTCTTCTACAACACATGTTGCAGGTATTTGTTACTCCATCTTGAATTCTATCCATTCCAGTACTCGATGGGTTATTGATTCTGGCGCTTCCGCTCAT
GTATGTTTCTCTCGGCCTGCTTTTACGTCCTTGGTTGAAGTATCTGATATGTCTGTCACTCTGCCTAACAATTCTCGCATACCTGTGAAATTTATTGGTGATGTTCGATT
GAATTCTCATATTACACTTAAAGGGGTTCTCTACATTCCGGATTTTCAGTTCAACTTGTTGTCTACCAGTGCTTTGGCCACACATATGTCAGTAATGATTCAGTTTGTTG
GTGATTCGTGTCTTATACAGGACAAGTCTTCTTTACAGACGATTGGCAGGGGTGAATATTGGCATGTTGCTGCACCCAGTGATATTCATTTGTCTTATGGTATGACTCCT
TCTGGTAGTGACAGCTTGGGTTTACAACATTCTGATATCATTTCTGCTGGTACTGACTTAAACAACACTGATGTTGCTGATGTAAATACTGATGGTGCTGCTGTCAATAC
TGATGTTACTATTGTAGATACTGATATTAGTCAAGTTCATACTTCAGGACAAGATGCTTCTGATATTTTAGATACTGCTCTTGTTCCTCCTCCACAGGTTGATACACCTG
TTCGAAAATCTAATCGATTGACTAGACCTCCTACTTATCTTAAGGATTTTCATTGCAGCTTGCTCACGGCTTCTTCCTTGCCTTCATCTGCATCCAAATATCATTTACAG
AACAATTCTTCTTCCTGTTGCTCTGGTTCCTCCAATGCTCTGATCTTGAGTTTGTGTATGTGTGATGGGGTGAGTTCGAACGTGGCTCGGACCCAGTTGGATGGAGGAAA
ATGGGACGAGTCGAAGCTTAGTGATGTTGTTGTCTTGGGGACGAATGGGTTTTGGAGGACATTGGCGGCTCTTTCGAAGGAAGGGGTGAGTTCCGGTGGCAGAGTAGCCT
CCAATTCCAAATGGGTTGCTCCAATTGTGAGGCATGGTGGAATTCGGCACCGGAGTCTGCCTCGACGCGGTGAAAGAAACGGCGTCGTTTGGGGAATAGAGGATGAAGGG
GAGAGGAGAATTCTCCGGCCAAAGGAGGTTGCCGGCGAGAGGAAAGAAATGGTTGAGAGTGAGAGAGAGGGAGTGTTTGAGATTTGGGATCAGGGTGGAGTGGAAGAACA
ATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAATTTTCTTGGCCATCTTTGGAAACCCACTTTCTCCTCTGAATTCCCTTGCTTCCACCAGCATCGCCGGATGGGTCACGCAGTTACCGAAATAGTTGCTCGAAA
CCGGAGGATCCAACAGCCCCCGCACTATCAATATGGTATCAGTGACTATTTCACTTTTCCACTATTTTTTCCCGTCAAGATCCTTATGGCGGTTACTGATCCTCCATTGG
AAGAACATCCTCCAGCATCGGAGAATCCTTCGTCATCGACCACTATACCGACTCCGATTTCAAGTTCATCTTCACCGATTCCTCCCTGTGGTGGTGTTAAATCCATGTCT
GAATATTTTCAAAATGAATACGTCATGTGCTTCCTCGTGGGTTTGAATGAGTCCTATAGCCAATTGCGTGCTCAATTGTTGCTTATGGAACCTGCTCCATCTATCAATCG
TGCATTTGCCCTAGTTGCGCAGGAGGCTGCTCAACGAAATATCTCTTCTTCATCTTCTTCTATTTCTCCTGCGGCGTTTTTGACTCATCTTACCAAAGCTCAAACCACTT
CAGACGTTGCTTCTTCTTCTACAACACATGTTGCAGGTATTTGTTACTCCATCTTGAATTCTATCCATTCCAGTACTCGATGGGTTATTGATTCTGGCGCTTCCGCTCAT
GTATGTTTCTCTCGGCCTGCTTTTACGTCCTTGGTTGAAGTATCTGATATGTCTGTCACTCTGCCTAACAATTCTCGCATACCTGTGAAATTTATTGGTGATGTTCGATT
GAATTCTCATATTACACTTAAAGGGGTTCTCTACATTCCGGATTTTCAGTTCAACTTGTTGTCTACCAGTGCTTTGGCCACACATATGTCAGTAATGATTCAGTTTGTTG
GTGATTCGTGTCTTATACAGGACAAGTCTTCTTTACAGACGATTGGCAGGGGTGAATATTGGCATGTTGCTGCACCCAGTGATATTCATTTGTCTTATGGTATGACTCCT
TCTGGTAGTGACAGCTTGGGTTTACAACATTCTGATATCATTTCTGCTGGTACTGACTTAAACAACACTGATGTTGCTGATGTAAATACTGATGGTGCTGCTGTCAATAC
TGATGTTACTATTGTAGATACTGATATTAGTCAAGTTCATACTTCAGGACAAGATGCTTCTGATATTTTAGATACTGCTCTTGTTCCTCCTCCACAGGTTGATACACCTG
TTCGAAAATCTAATCGATTGACTAGACCTCCTACTTATCTTAAGGATTTTCATTGCAGCTTGCTCACGGCTTCTTCCTTGCCTTCATCTGCATCCAAATATCATTTACAG
AACAATTCTTCTTCCTGTTGCTCTGGTTCCTCCAATGCTCTGATCTTGAGTTTGTGTATGTGTGATGGGGTGAGTTCGAACGTGGCTCGGACCCAGTTGGATGGAGGAAA
ATGGGACGAGTCGAAGCTTAGTGATGTTGTTGTCTTGGGGACGAATGGGTTTTGGAGGACATTGGCGGCTCTTTCGAAGGAAGGGGTGAGTTCCGGTGGCAGAGTAGCCT
CCAATTCCAAATGGGTTGCTCCAATTGTGAGGCATGGTGGAATTCGGCACCGGAGTCTGCCTCGACGCGGTGAAAGAAACGGCGTCGTTTGGGGAATAGAGGATGAAGGG
GAGAGGAGAATTCTCCGGCCAAAGGAGGTTGCCGGCGAGAGGAAAGAAATGGTTGAGAGTGAGAGAGAGGGAGTGTTTGAGATTTGGGATCAGGGTGGAGTGGAAGAACA
ATGA
Protein sequenceShow/hide protein sequence
MANFLGHLWKPTFSSEFPCFHQHRRMGHAVTEIVARNRRIQQPPHYQYGISDYFTFPLFFPVKILMAVTDPPLEEHPPASENPSSSTTIPTPISSSSSPIPPCGGVKSMS
EYFQNEYVMCFLVGLNESYSQLRAQLLLMEPAPSINRAFALVAQEAAQRNISSSSSSISPAAFLTHLTKAQTTSDVASSSTTHVAGICYSILNSIHSSTRWVIDSGASAH
VCFSRPAFTSLVEVSDMSVTLPNNSRIPVKFIGDVRLNSHITLKGVLYIPDFQFNLLSTSALATHMSVMIQFVGDSCLIQDKSSLQTIGRGEYWHVAAPSDIHLSYGMTP
SGSDSLGLQHSDIISAGTDLNNTDVADVNTDGAAVNTDVTIVDTDISQVHTSGQDASDILDTALVPPPQVDTPVRKSNRLTRPPTYLKDFHCSLLTASSLPSSASKYHLQ
NNSSSCCSGSSNALILSLCMCDGVSSNVARTQLDGGKWDESKLSDVVVLGTNGFWRTLAALSKEGVSSGGRVASNSKWVAPIVRHGGIRHRSLPRRGERNGVVWGIEDEG
ERRILRPKEVAGERKEMVESEREGVFEIWDQGGVEEQ