; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g10810 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g10810
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:8108507..8110297
RNA-Seq ExpressionMoc04g10810
SyntenyMoc04g10810
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.1e-10653.88Show/hide
Query:  VSIRPIPELTQASFDTLKFYKDHFPKGRKIGTLVTDRLLLDFGLLDYYSLVRPIEASRPNFELAMVCGFTSNVKRKSKGRAHALKTVQSTGPTTPIAARL
        +SI+PIPEL QA+FDTLKFYKD+FP+GRKIGTLVTD+LLL+ GLLDY  LVRPIEASRPN ELAMVCGFTS+VKRKSKGRAHALK VQS+ P TP   + 
Subjt:  VSIRPIPELTQASFDTLKFYKDHFPKGRKIGTLVTDRLLLDFGLLDYYSLVRPIEASRPNFELAMVCGFTSNVKRKSKGRAHALKTVQSTGPTTPIAARL

Query:  AAQGNAGPSSEVPTPVIELELTGDHSGEKRPRDESEALDVSPLNEVRGESPLKRRRKKKKTTSSSEAGPRGTLHTSHANLVDDPEARMEGTSDVPMRFRV
        AAQ  AGPSS  PTPVIEL+ TG+ S EKR R ESEALDVSPL EVR                                                     
Subjt:  AAQGNAGPSSEVPTPVIELELTGDHSGEKRPRDESEALDVSPLNEVRGESPLKRRRKKKKTTSSSEAGPRGTLHTSHANLVDDPEARMEGTSDVPMRFRV

Query:  EPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVGEGKFLCCLEAATSMKGELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVA
                                                                                       EAKA LLK+E+E+HKAHLR A
Subjt:  EPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVGEGKFLCCLEAATSMKGELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVA

Query:  HAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYA
        HAITKGLEKEKFQLLKE          KD +IGRL  ELK  KERLTNG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIAAD+PHL+VDLG LKKRYA
Subjt:  HAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYA

Query:  EKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        EKWAS PN T GP SLV+KYVR+LDSDYSDL+E+E  S
Subjt:  EKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]2.2e-8970.15Show/hide
Query:  MRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG---------------EGKFL----------CCLEAATSMKGELLKARS
        MRFR+E SSS  KDQVSRISA+CLDRCLRRAS+FVSDPGSVLQRTID+AA                 +G+              LEAAT++KGELLKA+ 
Subjt:  MRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG---------------EGKFL----------CCLEAATSMKGELLKARS

Query:  EVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFS
        EV+IL+AEV+AK  LLKKE EKHKAHLR AHAITKGLEKEKFQLLKE          KD SIGRL TELK++KERLT+G LLEE+FRQHP+FDGFAKDFS
Subjt:  EVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        DAGFKFLMKGIAADMPHLQ+DL  LKKRY+E WAS PN TPGPQSLV+KYVRELDSDYSD+EEE+A S
Subjt:  DAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.5e-7762.18Show/hide
Query:  GTSDVPMRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG-------------------------EGKFLCCLEAATS-MKG
        G   +  + R+EPSSS  +DQVSRISA+ LDRCLRRASKFVS PGSVLQRTID+AA                           + +F   LE A+S MK 
Subjt:  GTSDVPMRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG-------------------------EGKFLCCLEAATS-MKG

Query:  ELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKEKDTSIGRL----------ATELKEVKERLTNGVLLEEAFRQHPDFD
        ELLKA SEVE LKAEVE++A LLKKEE++ +A LR AHAIT+GLE+EKFQLLKEKD  +  L            EL+  KERL+NGVLLEEAFRQHPDFD
Subjt:  ELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKEKDTSIGRL----------ATELKEVKERLTNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        GFAKDFSDAGFKFLMKGIA+DMP LQ+DL GLK+RYAEKWAS P  TPGPQ+LV++YVR+LDSDYSD EE++  S
Subjt:  GFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]8.0e-9270.65Show/hide
Query:  MEGTSDVPMRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG---------------EGK----------FLCCLEAATSMK
        M GT DV  RFR+EPSSS  KDQVSRISA+CLDRCL+RASKFVSDPGSVLQRTID+AA                 +G+              LEAAT++K
Subjt:  MEGTSDVPMRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG---------------EGK----------FLCCLEAATSMK

Query:  GELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDF
        GELLKA+ EV IL+AEV+AKA LLKKE EKHKAHLR AHAITKGLEKEKFQLLKE          KDTSIGRL  ELK++KERLTNG LLEE+FRQH DF
Subjt:  GELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        DGFAKDFSDAGFKFLMKGIAADMPHLQ+DL  LKK+Y+EKWAS PN TPGPQSLV KYVRELDSDYSD+EEE+A S
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.8e-17773.98Show/hide
Query:  NLVSIRPIPELTQASFDTLKFYKDHFPKGRKIGTLVTDRLLLDFGLLDYYSLVRPIEASRPNFELAMVCGFTSNVKRKSKGRAHALKTVQSTGPTTPIAA
        NLVSI+ IPEL QA+FDTLK YKDHFP+ RKI TLVTD+LLL+ GLLDY  LVR IEASRPN ELAMVCGFT +VKRKSKGRAHALKTV  T P TP   
Subjt:  NLVSIRPIPELTQASFDTLKFYKDHFPKGRKIGTLVTDRLLLDFGLLDYYSLVRPIEASRPNFELAMVCGFTSNVKRKSKGRAHALKTVQSTGPTTPIAA

Query:  RLAAQGNAGPSSEVPTPVIELELTGDHSGEKRPRDESEALDVSPLNEVRGESPLKRRRKKKKTTSSSEAGPRGTLHTSHANLVDDPEARMEGTSDVPMRF
        R  AQGN+GPSS VPTPVIEL+L+G  SGEKR R+ESEALDVSPLNEVRGESPL+RRRKKKKT+SSSEAG RGTL TSHA+LVDDPEARM GTS+V MRF
Subjt:  RLAAQGNAGPSSEVPTPVIELELTGDHSGEKRPRDESEALDVSPLNEVRGESPLKRRRKKKKTTSSSEAGPRGTLHTSHANLVDDPEARMEGTSDVPMRF

Query:  RVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDH-------------------------AAVGEGKFLCCLEAATSMKGELLKARSEVE
         +EPSSS  KDQVSRISA+CLDR LRRASKFVSDPGSVLQRTID+                         AA         LEAAT++KGELLKA+ EV+
Subjt:  RVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDH-------------------------AAVGEGKFLCCLEAATSMKGELLKARSEVE

Query:  ILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLK----------EKDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFSDAG
        IL+AEV+AK  LLKKE EKHKAHLR AHAITKGLEKEKFQLLK          EKD SIGRL TELK++KERLTNG LLEE+FRQHPDFDGFAKDFSDAG
Subjt:  ILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLK----------EKDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFSDAG

Query:  FKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        FKFLMKGIAADMPHLQ+DL GLKK+Y+EKWAS PN TP PQSLV+KYVRELDSDYSD+EEE+A S
Subjt:  FKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124675.6e-10753.88Show/hide
Query:  VSIRPIPELTQASFDTLKFYKDHFPKGRKIGTLVTDRLLLDFGLLDYYSLVRPIEASRPNFELAMVCGFTSNVKRKSKGRAHALKTVQSTGPTTPIAARL
        +SI+PIPEL QA+FDTLKFYKD+FP+GRKIGTLVTD+LLL+ GLLDY  LVRPIEASRPN ELAMVCGFTS+VKRKSKGRAHALK VQS+ P TP   + 
Subjt:  VSIRPIPELTQASFDTLKFYKDHFPKGRKIGTLVTDRLLLDFGLLDYYSLVRPIEASRPNFELAMVCGFTSNVKRKSKGRAHALKTVQSTGPTTPIAARL

Query:  AAQGNAGPSSEVPTPVIELELTGDHSGEKRPRDESEALDVSPLNEVRGESPLKRRRKKKKTTSSSEAGPRGTLHTSHANLVDDPEARMEGTSDVPMRFRV
        AAQ  AGPSS  PTPVIEL+ TG+ S EKR R ESEALDVSPL EVR                                                     
Subjt:  AAQGNAGPSSEVPTPVIELELTGDHSGEKRPRDESEALDVSPLNEVRGESPLKRRRKKKKTTSSSEAGPRGTLHTSHANLVDDPEARMEGTSDVPMRFRV

Query:  EPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVGEGKFLCCLEAATSMKGELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVA
                                                                                       EAKA LLK+E+E+HKAHLR A
Subjt:  EPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVGEGKFLCCLEAATSMKGELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVA

Query:  HAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYA
        HAITKGLEKEKFQLLKE          KD +IGRL  ELK  KERLTNG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIAAD+PHL+VDLG LKKRYA
Subjt:  HAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYA

Query:  EKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        EKWAS PN T GP SLV+KYVR+LDSDYSDL+E+E  S
Subjt:  EKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

A0A6J1D1N9 uncharacterized protein LOC1110161931.1e-8970.15Show/hide
Query:  MRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG---------------EGKFL----------CCLEAATSMKGELLKARS
        MRFR+E SSS  KDQVSRISA+CLDRCLRRAS+FVSDPGSVLQRTID+AA                 +G+              LEAAT++KGELLKA+ 
Subjt:  MRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG---------------EGKFL----------CCLEAATSMKGELLKARS

Query:  EVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFS
        EV+IL+AEV+AK  LLKKE EKHKAHLR AHAITKGLEKEKFQLLKE          KD SIGRL TELK++KERLT+G LLEE+FRQHP+FDGFAKDFS
Subjt:  EVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        DAGFKFLMKGIAADMPHLQ+DL  LKKRY+E WAS PN TPGPQSLV+KYVRELDSDYSD+EEE+A S
Subjt:  DAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

A0A6J1D971 uncharacterized protein LOC1110185387.1e-7862.18Show/hide
Query:  GTSDVPMRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG-------------------------EGKFLCCLEAATS-MKG
        G   +  + R+EPSSS  +DQVSRISA+ LDRCLRRASKFVS PGSVLQRTID+AA                           + +F   LE A+S MK 
Subjt:  GTSDVPMRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG-------------------------EGKFLCCLEAATS-MKG

Query:  ELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKEKDTSIGRL----------ATELKEVKERLTNGVLLEEAFRQHPDFD
        ELLKA SEVE LKAEVE++A LLKKEE++ +A LR AHAIT+GLE+EKFQLLKEKD  +  L            EL+  KERL+NGVLLEEAFRQHPDFD
Subjt:  ELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKEKDTSIGRL----------ATELKEVKERLTNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        GFAKDFSDAGFKFLMKGIA+DMP LQ+DL GLK+RYAEKWAS P  TPGPQ+LV++YVR+LDSDYSD EE++  S
Subjt:  GFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

A0A6J1DF31 uncharacterized protein LOC1110199093.9e-9270.65Show/hide
Query:  MEGTSDVPMRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG---------------EGK----------FLCCLEAATSMK
        M GT DV  RFR+EPSSS  KDQVSRISA+CLDRCL+RASKFVSDPGSVLQRTID+AA                 +G+              LEAAT++K
Subjt:  MEGTSDVPMRFRVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVG---------------EGK----------FLCCLEAATSMK

Query:  GELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDF
        GELLKA+ EV IL+AEV+AKA LLKKE EKHKAHLR AHAITKGLEKEKFQLLKE          KDTSIGRL  ELK++KERLTNG LLEE+FRQH DF
Subjt:  GELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLKE----------KDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        DGFAKDFSDAGFKFLMKGIAADMPHLQ+DL  LKK+Y+EKWAS PN TPGPQSLV KYVRELDSDYSD+EEE+A S
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

A0A6J1DZB3 uncharacterized protein LOC1110256651.3e-17773.98Show/hide
Query:  NLVSIRPIPELTQASFDTLKFYKDHFPKGRKIGTLVTDRLLLDFGLLDYYSLVRPIEASRPNFELAMVCGFTSNVKRKSKGRAHALKTVQSTGPTTPIAA
        NLVSI+ IPEL QA+FDTLK YKDHFP+ RKI TLVTD+LLL+ GLLDY  LVR IEASRPN ELAMVCGFT +VKRKSKGRAHALKTV  T P TP   
Subjt:  NLVSIRPIPELTQASFDTLKFYKDHFPKGRKIGTLVTDRLLLDFGLLDYYSLVRPIEASRPNFELAMVCGFTSNVKRKSKGRAHALKTVQSTGPTTPIAA

Query:  RLAAQGNAGPSSEVPTPVIELELTGDHSGEKRPRDESEALDVSPLNEVRGESPLKRRRKKKKTTSSSEAGPRGTLHTSHANLVDDPEARMEGTSDVPMRF
        R  AQGN+GPSS VPTPVIEL+L+G  SGEKR R+ESEALDVSPLNEVRGESPL+RRRKKKKT+SSSEAG RGTL TSHA+LVDDPEARM GTS+V MRF
Subjt:  RLAAQGNAGPSSEVPTPVIELELTGDHSGEKRPRDESEALDVSPLNEVRGESPLKRRRKKKKTTSSSEAGPRGTLHTSHANLVDDPEARMEGTSDVPMRF

Query:  RVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDH-------------------------AAVGEGKFLCCLEAATSMKGELLKARSEVE
         +EPSSS  KDQVSRISA+CLDR LRRASKFVSDPGSVLQRTID+                         AA         LEAAT++KGELLKA+ EV+
Subjt:  RVEPSSSKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDH-------------------------AAVGEGKFLCCLEAATSMKGELLKARSEVE

Query:  ILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLK----------EKDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFSDAG
        IL+AEV+AK  LLKKE EKHKAHLR AHAITKGLEKEKFQLLK          EKD SIGRL TELK++KERLTNG LLEE+FRQHPDFDGFAKDFSDAG
Subjt:  ILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQLLK----------EKDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFSDAG

Query:  FKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS
        FKFLMKGIAADMPHLQ+DL GLKK+Y+EKWAS PN TP PQSLV+KYVRELDSDYSD+EEE+A S
Subjt:  FKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEEEAAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATAAGGAAGAGCGAAATAACCCTAATCGAGGCATTGAGAACTTAGTATCAATCAGGCCAATCCCCGAACTCACTCAAGCATCCTTTGATACACTTAAGTTTTACAA
GGATCACTTTCCAAAGGGCAGGAAGATCGGAACTCTGGTAACCGACAGGCTACTCCTTGACTTTGGGTTGTTGGATTACTACTCTTTAGTTCGTCCAATCGAAGCTTCAA
GGCCGAACTTCGAGCTCGCCATGGTGTGCGGTTTCACAAGTAATGTGAAGCGCAAGTCTAAGGGTCGTGCTCACGCCCTTAAGACCGTTCAGAGCACGGGGCCAACAACT
CCTATTGCGGCTCGACTTGCGGCACAAGGCAATGCTGGTCCATCTTCTGAAGTCCCCACTCCCGTGATCGAACTAGAGTTAACTGGGGATCACTCCGGGGAAAAGCGCCC
AAGGGATGAATCTGAGGCGTTGGACGTATCTCCCCTGAACGAGGTGAGGGGAGAGTCTCCTTTGAAGAGGAGAAGGAAGAAGAAGAAGACCACCTCCTCCTCGGAGGCCG
GACCTCGTGGGACCCTGCACACGAGCCATGCTAATCTAGTGGACGACCCTGAAGCCAGGATGGAGGGGACGTCCGATGTGCCAATGCGGTTCCGGGTCGAACCATCAAGC
TCTAAGGCGAAGGACCAGGTGTCCCGCATCTCGGCATCATGCTTGGATCGCTGCCTTCGCAGGGCATCCAAGTTTGTAAGCGATCCTGGGTCCGTTCTGCAAAGGACCAT
TGATCACGCTGCTGTGGGAGAAGGAAAATTCCTCTGCTGCCTGGAAGCTGCTACCTCAATGAAGGGCGAGTTATTAAAGGCTCGCTCCGAAGTGGAGATTCTGAAGGCCG
AGGTGGAGGCCAAGGCTCTGCTGCTAAAGAAAGAGGAAGAAAAGCACAAGGCCCACCTCCGCGTTGCTCATGCCATCACCAAGGGGCTGGAGAAGGAAAAGTTCCAGCTC
CTTAAGGAAAAGGACACTTCGATAGGGCGCCTTGCTACCGAGCTCAAGGAGGTGAAGGAACGCCTCACCAACGGAGTCCTCTTGGAGGAAGCGTTCAGGCAGCACCCAGA
CTTTGATGGGTTTGCCAAGGACTTCAGCGATGCGGGCTTCAAGTTTCTGATGAAAGGCATTGCTGCTGACATGCCCCACCTCCAGGTCGACTTAGGCGGTCTGAAGAAAA
GGTATGCTGAGAAATGGGCTTCTAGGCCTAATAATACCCCAGGCCCTCAGTCGCTGGTGGAGAAGTACGTCAGGGAGTTGGACTCTGACTACTCCGACCTTGAAGAGGAA
GAAGCTGCTAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATAAGGAAGAGCGAAATAACCCTAATCGAGGCATTGAGAACTTAGTATCAATCAGGCCAATCCCCGAACTCACTCAAGCATCCTTTGATACACTTAAGTTTTACAA
GGATCACTTTCCAAAGGGCAGGAAGATCGGAACTCTGGTAACCGACAGGCTACTCCTTGACTTTGGGTTGTTGGATTACTACTCTTTAGTTCGTCCAATCGAAGCTTCAA
GGCCGAACTTCGAGCTCGCCATGGTGTGCGGTTTCACAAGTAATGTGAAGCGCAAGTCTAAGGGTCGTGCTCACGCCCTTAAGACCGTTCAGAGCACGGGGCCAACAACT
CCTATTGCGGCTCGACTTGCGGCACAAGGCAATGCTGGTCCATCTTCTGAAGTCCCCACTCCCGTGATCGAACTAGAGTTAACTGGGGATCACTCCGGGGAAAAGCGCCC
AAGGGATGAATCTGAGGCGTTGGACGTATCTCCCCTGAACGAGGTGAGGGGAGAGTCTCCTTTGAAGAGGAGAAGGAAGAAGAAGAAGACCACCTCCTCCTCGGAGGCCG
GACCTCGTGGGACCCTGCACACGAGCCATGCTAATCTAGTGGACGACCCTGAAGCCAGGATGGAGGGGACGTCCGATGTGCCAATGCGGTTCCGGGTCGAACCATCAAGC
TCTAAGGCGAAGGACCAGGTGTCCCGCATCTCGGCATCATGCTTGGATCGCTGCCTTCGCAGGGCATCCAAGTTTGTAAGCGATCCTGGGTCCGTTCTGCAAAGGACCAT
TGATCACGCTGCTGTGGGAGAAGGAAAATTCCTCTGCTGCCTGGAAGCTGCTACCTCAATGAAGGGCGAGTTATTAAAGGCTCGCTCCGAAGTGGAGATTCTGAAGGCCG
AGGTGGAGGCCAAGGCTCTGCTGCTAAAGAAAGAGGAAGAAAAGCACAAGGCCCACCTCCGCGTTGCTCATGCCATCACCAAGGGGCTGGAGAAGGAAAAGTTCCAGCTC
CTTAAGGAAAAGGACACTTCGATAGGGCGCCTTGCTACCGAGCTCAAGGAGGTGAAGGAACGCCTCACCAACGGAGTCCTCTTGGAGGAAGCGTTCAGGCAGCACCCAGA
CTTTGATGGGTTTGCCAAGGACTTCAGCGATGCGGGCTTCAAGTTTCTGATGAAAGGCATTGCTGCTGACATGCCCCACCTCCAGGTCGACTTAGGCGGTCTGAAGAAAA
GGTATGCTGAGAAATGGGCTTCTAGGCCTAATAATACCCCAGGCCCTCAGTCGCTGGTGGAGAAGTACGTCAGGGAGTTGGACTCTGACTACTCCGACCTTGAAGAGGAA
GAAGCTGCTAGCTAG
Protein sequenceShow/hide protein sequence
MYKEERNNPNRGIENLVSIRPIPELTQASFDTLKFYKDHFPKGRKIGTLVTDRLLLDFGLLDYYSLVRPIEASRPNFELAMVCGFTSNVKRKSKGRAHALKTVQSTGPTT
PIAARLAAQGNAGPSSEVPTPVIELELTGDHSGEKRPRDESEALDVSPLNEVRGESPLKRRRKKKKTTSSSEAGPRGTLHTSHANLVDDPEARMEGTSDVPMRFRVEPSS
SKAKDQVSRISASCLDRCLRRASKFVSDPGSVLQRTIDHAAVGEGKFLCCLEAATSMKGELLKARSEVEILKAEVEAKALLLKKEEEKHKAHLRVAHAITKGLEKEKFQL
LKEKDTSIGRLATELKEVKERLTNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQVDLGGLKKRYAEKWASRPNNTPGPQSLVEKYVRELDSDYSDLEEE
EAAS