; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g03440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g03440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:2585456..2589048
RNA-Seq ExpressionMoc03g03440
SyntenyMoc03g03440
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]2.5e-8272.38Show/hide
Query:  LKGPTSIKGWVRKWFYASGEWLAKDESGRSFIDVPTSFNPTSPRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSEL-
        +KGPTSIKGWVRKWFYASGEWLAKDES         +  P  P +TQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRP+ESSRPNSEL 
Subjt:  LKGPTSIKGWVRKWFYASGEWLAKDESGRSFIDVPTSFNPTSPRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSEL-

Query:  ----------------------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKMKKTTSPLEVGARG
                                          GPAS+DPAPVIELESSRGPSREKRPR QTEA DVS LGEEVREE PLKRRRK KKTTSPLEVGARG
Subjt:  ----------------------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKMKKTTSPLEVGARG

Query:  ALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDE
         LPASFADRVDDPEARMGGT DVT RFRVE SSS V+D+
Subjt:  ALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDE

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]4.3e-8250.27Show/hide
Query:  PRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSELGPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVRE
        P + QA+FDTLK+YK++FPRGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSEL       A V    S    S +++ +G+  A  +    + V  
Subjt:  PRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSELGPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVRE

Query:  EAPLKRRRKMKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAEAE
                 + +  +  + G   A P                    T    ++S+  R +++ SR  + +LD    R  +                    
Subjt:  EAPLKRRRKMKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAEAE

Query:  VEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAELEMVKERFSNGALLEESFRQHPDFDGFAKDFSDVGFRFLM
         E +AELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE  +  +    AEL+  KER +NGALLE +FRQHPDFDGFAKDFSD GF+FLM
Subjt:  VEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAELEMVKERFSNGALLEESFRQHPDFDGFAKDFSDVGFRFLM

Query:  KGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEED--------QVGTTQEGAP
        KGIA+D+P L++DLG LKKRYAE+WASGP+GT GP +LVDKYVRDL SDYSDL+ED        +VGTTQEG P
Subjt:  KGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEED--------QVGTTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]5.2e-8865.96Show/hide
Query:  GTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAE-------------------------------------------
        G   + A+ R+E SSS V+D+VSRISAASLDRCLRRASKFVS PG VLQRTIDYAAE                                           
Subjt:  GTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAE-------------------------------------------

Query:  -------------AEVEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAELEMVKERFSNGALLEESFRQHPDFD
                     AEVE +AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEA ++EL+HATAELE  KER SNG LLEE+FRQHPDFD
Subjt:  -------------AEVEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAELEMVKERFSNGALLEESFRQHPDFD

Query:  GFAKDFSDVGFRFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEEDQVGTTQEGAPQAGS
        GFAKDFSD GF+FLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDL SDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDVGFRFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEEDQVGTTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]7.9e-12972.11Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGER DNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLG-------------------------------LKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL                                 +KGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLG-------------------------------LKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFIDVPTSF-NPTS----PRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSEL-------------
        SGEWLAKDESGRSF DVPT F N  S    P +TQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRP+ESSRPNSEL             
Subjt:  SGEWLAKDESGRSFIDVPTSF-NPTS----PRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSEL-------------

Query:  ----------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAAD
                              GPAS+DPA VIELESS GPSREKRPR QTEA D
Subjt:  ----------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAAD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.2e-13956.57Show/hide
Query:  LKGPTSIKGWVRKWFYASGEWLAKDESGRSFIDVPTSFN-----PTSPRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRP
        +KGPTSIKGWV KWF+ASGEWLAKDESGR+F DVPT F         P + QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR +E+SRP
Subjt:  LKGPTSIKGWVRKWFYASGEWLAKDESGRSFIDVPTSFN-----PTSPRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRP

Query:  NSEL-------------------------------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKM
        NSEL                                           GP+S  P PVIEL+ S G S EKR R ++EA DVS L  EVR E+PL+RRRK 
Subjt:  NSEL-------------------------------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKM

Query:  KKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAE------------
        KKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +E SSS VKD+VSRISA  LDR LRRASKFVSDPG VLQRTID  AE            
Subjt:  KKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAE------------

Query:  -------------------------------------------AEVEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELK
                                                   AEV+ + +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE  +  + 
Subjt:  -------------------------------------------AEVEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELK

Query:  HATAELEMVKERFSNGALLEESFRQHPDFDGFAKDFSDVGFRFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEE
          T EL+ +KER +NG LLEESFRQHPDFDGFAKDFSD GF+FLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+L SDYSD+EE
Subjt:  HATAELEMVKERFSNGALLEESFRQHPDFDGFAKDFSDVGFRFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEE

Query:  D--------QVGTTQEGAP--QAGS
        +        +VGTTQE  P  Q GS
Subjt:  D--------QVGTTQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.2e-8272.38Show/hide
Query:  LKGPTSIKGWVRKWFYASGEWLAKDESGRSFIDVPTSFNPTSPRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSEL-
        +KGPTSIKGWVRKWFYASGEWLAKDES         +  P  P +TQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRP+ESSRPNSEL 
Subjt:  LKGPTSIKGWVRKWFYASGEWLAKDESGRSFIDVPTSFNPTSPRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSEL-

Query:  ----------------------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKMKKTTSPLEVGARG
                                          GPAS+DPAPVIELESSRGPSREKRPR QTEA DVS LGEEVREE PLKRRRK KKTTSPLEVGARG
Subjt:  ----------------------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKMKKTTSPLEVGARG

Query:  ALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDE
         LPASFADRVDDPEARMGGT DVT RFRVE SSS V+D+
Subjt:  ALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDE

A0A6J1CLV1 uncharacterized protein LOC1110124672.1e-8250.27Show/hide
Query:  PRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSELGPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVRE
        P + QA+FDTLK+YK++FPRGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSEL       A V    S    S +++ +G+  A  +    + V  
Subjt:  PRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSELGPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVRE

Query:  EAPLKRRRKMKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAEAE
                 + +  +  + G   A P                    T    ++S+  R +++ SR  + +LD    R  +                    
Subjt:  EAPLKRRRKMKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAEAE

Query:  VEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAELEMVKERFSNGALLEESFRQHPDFDGFAKDFSDVGFRFLM
         E +AELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE  +  +    AEL+  KER +NGALLE +FRQHPDFDGFAKDFSD GF+FLM
Subjt:  VEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAELEMVKERFSNGALLEESFRQHPDFDGFAKDFSDVGFRFLM

Query:  KGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEED--------QVGTTQEGAP
        KGIA+D+P L++DLG LKKRYAE+WASGP+GT GP +LVDKYVRDL SDYSDL+ED        +VGTTQEG P
Subjt:  KGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEED--------QVGTTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185382.5e-8865.96Show/hide
Query:  GTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAE-------------------------------------------
        G   + A+ R+E SSS V+D+VSRISAASLDRCLRRASKFVS PG VLQRTIDYAAE                                           
Subjt:  GTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAE-------------------------------------------

Query:  -------------AEVEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAELEMVKERFSNGALLEESFRQHPDFD
                     AEVE +AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEA ++EL+HATAELE  KER SNG LLEE+FRQHPDFD
Subjt:  -------------AEVEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAELEMVKERFSNGALLEESFRQHPDFD

Query:  GFAKDFSDVGFRFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEEDQVGTTQEGAPQAGS
        GFAKDFSD GF+FLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDL SDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDVGFRFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEEDQVGTTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC1110255023.8e-12972.11Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGER DNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLG-------------------------------LKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL                                 +KGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLG-------------------------------LKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFIDVPTSF-NPTS----PRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSEL-------------
        SGEWLAKDESGRSF DVPT F N  S    P +TQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRP+ESSRPNSEL             
Subjt:  SGEWLAKDESGRSFIDVPTSF-NPTS----PRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSEL-------------

Query:  ----------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAAD
                              GPAS+DPA VIELESS GPSREKRPR QTEA D
Subjt:  ----------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAAD

A0A6J1DZB3 uncharacterized protein LOC1110256651.1e-13956.57Show/hide
Query:  LKGPTSIKGWVRKWFYASGEWLAKDESGRSFIDVPTSFN-----PTSPRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRP
        +KGPTSIKGWV KWF+ASGEWLAKDESGR+F DVPT F         P + QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR +E+SRP
Subjt:  LKGPTSIKGWVRKWFYASGEWLAKDESGRSFIDVPTSFN-----PTSPRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRP

Query:  NSEL-------------------------------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKM
        NSEL                                           GP+S  P PVIEL+ S G S EKR R ++EA DVS L  EVR E+PL+RRRK 
Subjt:  NSEL-------------------------------------------GPASDDPAPVIELESSRGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKM

Query:  KKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAE------------
        KKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +E SSS VKD+VSRISA  LDR LRRASKFVSDPG VLQRTID  AE            
Subjt:  KKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGFVLQRTIDYAAE------------

Query:  -------------------------------------------AEVEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELK
                                                   AEV+ + +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE  +  + 
Subjt:  -------------------------------------------AEVEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELK

Query:  HATAELEMVKERFSNGALLEESFRQHPDFDGFAKDFSDVGFRFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEE
          T EL+ +KER +NG LLEESFRQHPDFDGFAKDFSD GF+FLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+L SDYSD+EE
Subjt:  HATAELEMVKERFSNGALLEESFRQHPDFDGFAKDFSDVGFRFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEE

Query:  D--------QVGTTQEGAP--QAGS
        +        +VGTTQE  P  Q GS
Subjt:  D--------QVGTTQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGATCTAGTTTGGGACGTGGTGCATCAAGCAGATATGGTACGTGTTGTAGTCAGCGGGGAGGGTGGTGGCGGTTGGGTGCGGCGACGACGGGTCTGGTTTTGGTT
CGTGGACGGCGGTGTGAAGACTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAG
GTCGGATTCCCAATTTAGTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACG
CGTGGCGACTTCCTATTCGTGGGAAAATATAACCGTTGCGGTGGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTCGGGAGGATCCTAGCCGCTCCTT
GATTACACGTCTCGAACCCTTGGTAGGTCAGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATT
TAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCT
AGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGTTGACAATCCTCCAGAGGG
ATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGAACTGGGTTGGCTCCGGCTCAAGTGGCCC
CCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGAGATAGTGAAGAGGCCGAGCTGTTGGGACTTAAGGGGCCGACCTCCATCAAAGGA
TGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCATTGACGTTCCCACTAGTTTCAATCCGACCAGTCCCCGAGTTAC
GCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCACTTCCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATT
ACAACCCTGCTGTTCGTCCCCTTGAATCCTCAAGGCCGAACTCCGAACTTGGGCCAGCCTCAGATGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGGTCCTTCG
AGGGAGAAGCGCCCCAGGGGTCAGACCGAGGCGGCAGACGTCTCGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTGAAGCGAAGGAGGAAGATGAAGAAGACCAC
CTCTCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCA
GAGTTGAGTCGTCAAGTTCTAGGGTGAAAGACGAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTTC
GTCCTACAGAGGACCATCGACTACGCCGCTGAGGCTGAGGTGGAGGTCAGGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGC
TATCACCAAGGGTTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAACGAGGAGGAGCTGAAGCACGCGACCGCTGAGC
TGGAGATGGTGAAGGAGCGTTTCAGCAATGGAGCCCTATTGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGTGGGCTTCAGG
TTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGG
CCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGCACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAAGCAGGCTCTT
AG
mRNA sequenceShow/hide mRNA sequence
ATGGTAGATCTAGTTTGGGACGTGGTGCATCAAGCAGATATGGTACGTGTTGTAGTCAGCGGGGAGGGTGGTGGCGGTTGGGTGCGGCGACGACGGGTCTGGTTTTGGTT
CGTGGACGGCGGTGTGAAGACTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAG
GTCGGATTCCCAATTTAGTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACG
CGTGGCGACTTCCTATTCGTGGGAAAATATAACCGTTGCGGTGGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTCGGGAGGATCCTAGCCGCTCCTT
GATTACACGTCTCGAACCCTTGGTAGGTCAGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATT
TAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCT
AGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGTTGACAATCCTCCAGAGGG
ATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGAACTGGGTTGGCTCCGGCTCAAGTGGCCC
CCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGAGATAGTGAAGAGGCCGAGCTGTTGGGACTTAAGGGGCCGACCTCCATCAAAGGA
TGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCATTGACGTTCCCACTAGTTTCAATCCGACCAGTCCCCGAGTTAC
GCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCACTTCCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATT
ACAACCCTGCTGTTCGTCCCCTTGAATCCTCAAGGCCGAACTCCGAACTTGGGCCAGCCTCAGATGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGGTCCTTCG
AGGGAGAAGCGCCCCAGGGGTCAGACCGAGGCGGCAGACGTCTCGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTGAAGCGAAGGAGGAAGATGAAGAAGACCAC
CTCTCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCA
GAGTTGAGTCGTCAAGTTCTAGGGTGAAAGACGAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTTC
GTCCTACAGAGGACCATCGACTACGCCGCTGAGGCTGAGGTGGAGGTCAGGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGC
TATCACCAAGGGTTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAACGAGGAGGAGCTGAAGCACGCGACCGCTGAGC
TGGAGATGGTGAAGGAGCGTTTCAGCAATGGAGCCCTATTGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGTGGGCTTCAGG
TTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGG
CCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGCACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAAGCAGGCTCTT
AG
Protein sequenceShow/hide protein sequence
MVDLVWDVVHQADMVRVVVSGEGGGGWVRRRRVWFWFVDGGVKTAARTRPPDRSEYLGGPAQKGEHSDDQVSIGRIPNLVRGYSLPQTLAPSLSGPISTWQRSSFDLLWT
RGDFLFVGKYNRCGGFIVGIFKYSDASDLREDPSRSLITRLEPLVGQSLPSLSLSNVVAMSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPS
RIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLGLKGPTSIKG
WVRKWFYASGEWLAKDESGRSFIDVPTSFNPTSPRVTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPLESSRPNSELGPASDDPAPVIELESSRGPS
REKRPRGQTEAADVSSLGEEVREEAPLKRRRKMKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVESSSSRVKDEVSRISAASLDRCLRRASKFVSDPGF
VLQRTIDYAAEAEVEVRAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAELEMVKERFSNGALLEESFRQHPDFDGFAKDFSDVGFR
FLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLHSDYSDLEEDQVGTTQEGAPQAGS