; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g26720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g26720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:20151140..20153431
RNA-Seq ExpressionMoc06g26720
SyntenyMoc06g26720
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.1e-11559.37Show/hide
Query:  NPLTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRV
        NP TP GVITREEFD ++ +  A VEALKAKCE KE P  DGD+ ESPFTS++LEAPIPPKFK   +KPYD SKDPKDYVEVFE L+DFQ A D ++CR 
Subjt:  NPLTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRV

Query:  FHIGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGE
        F I +T S +LWYRRL A SIST SQ+R+EF++ FSS++YD+K ATHLATIRQKEGETLRE+VTRFQEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGE
Subjt:  FHIGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGE

Query:  ESPSTFAEVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEK
        E+P+TFAEVLQKAKKV D                                                  GP R+                  ++   G+EK
Subjt:  ESPSTFAEVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEK

Query:  LLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        LLKRPEKLRG P++ +KDKYCRFHR+HGH+TS  WELKRQI++LIQDGYF K+V +P +SSAE K+ R RSR PP R D
Subjt:  LLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]1.8e-11158.6Show/hide
Query:  VITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFHIGMTW
        +ITREEFD ++ +  A  EALKAKCE KE P  DGD+ ESPFTS++LEAPIPPKFK   +KPYD SKDPKDYVEVFEGL+DFQ   D ++CR F I +T 
Subjt:  VITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFHIGMTW

Query:  SVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEESPSTFA
        S +LWYRRL ARSIST SQ+R+EF++ FSS++YD+K ATHLATIRQKEGETLRE+VTRFQEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEE+PSTF 
Subjt:  SVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEESPSTFA

Query:  EVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEKLLKRPEK
        EVLQK KKV D                                                  GP R+                   +   G+EKLLKRPEK
Subjt:  EVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEKLLKRPEK

Query:  LRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        LR   ++ +KDKYCRFHR+HGH+TS CWELKRQI+DLIQDGYF K+V +P +SSAE K+ R RSR PP R D
Subjt:  LRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]8.7e-11459.58Show/hide
Query:  YNPLTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECR
        YNP+TP GVITREEFD +K+KF A VEALKA+CE KE+ F+DGD+ E  F+S+ILEA IPPKFK   MKPYD SKDPKDYVEVFE L+DFQ A D ++C 
Subjt:  YNPLTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECR

Query:  VFHIGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLG
         F I +T S +LWYRRL AR IST SQ+RKEFIS FSS++YDRK  THLATIRQKEGETLRE+VTRF EEQLKVAHCSDDSAMCYFLTGLADETLTVKL 
Subjt:  VFHIGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLG

Query:  EESPSTFAEVLQKAKKVNDG---------------------------------------------------------------------PLRADLIIEGL
        EE+P+TFAEVLQK KKV DG                                                                      +  ++   G+
Subjt:  EESPSTFAEVLQKAKKVNDG---------------------------------------------------------------------PLRADLIIEGL

Query:  EKLLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        EKLLKRPEKLRGDP+K N DKYCRFHRDHGH+TS+ WELKRQI+DLIQDGYF K+V +P S+S E K+ R R R PP RDD
Subjt:  EKLLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]1.6e-11258.71Show/hide
Query:  GVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFHIGMT
        G+ITREEFD ++ +  A VEALKAKCE K++   DGD+ ESPFTS++LEAPIPPKFK   +KPYD +KDPKDYVEVFEGL+DFQ A D ++CR F I +T
Subjt:  GVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFHIGMT

Query:  WSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEESPSTF
         S +LWYRRL  RSIST SQ+R+EF++ FSS++YD+K ATHLATIRQKEGETLRE+VTRFQEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEE+P+TF
Subjt:  WSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEESPSTF

Query:  AEVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEKLLKRPE
        AEVLQKAKKV D                                                  GP R+                  ++   G+EKLLKRPE
Subjt:  AEVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEKLLKRPE

Query:  KLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        KLRG P++ +KDKYCRFHR+HGH+TS  WELKRQI+DLIQDGYF K+V +P +SSAE K+ R RSR PP R D
Subjt:  KLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

XP_022159160.1 uncharacterized protein LOC111025585 [Momordica charantia]3.5e-10763.86Show/hide
Query:  LTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFH
        +TPE VITREEFDLMK++F   VE LK KCE  E+PF DG+M ESPFTS++LE PIPPKFKM AMKPYD SKDPKDYVEVFE L+DFQ A D ++CR F 
Subjt:  LTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFH

Query:  IGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEES
        I +T ++                          S     RKIATHLATIRQKEGETLRE+VTRFQEEQLKV HCSDDSAMCYFLT LADETLTVKLGEE+
Subjt:  IGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEES

Query:  PSTFAEVLQKAKKVNDGPLRAD-----------------------LIIEGLEKLLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQD
        PSTFAEVLQKAKKV DG   +D                       +   GLEKLLKRP+KLRGDP+K NKD+YCRFHRDHG+DTS+CWELKRQI+DLIQD
Subjt:  PSTFAEVLQKAKKVNDGPLRAD-----------------------LIIEGLEKLLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQD

Query:  GYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        GY  KYV +PGSSS   K+ R RSR PP RDD
Subjt:  GYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.0e-11559.37Show/hide
Query:  NPLTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRV
        NP TP GVITREEFD ++ +  A VEALKAKCE KE P  DGD+ ESPFTS++LEAPIPPKFK   +KPYD SKDPKDYVEVFE L+DFQ A D ++CR 
Subjt:  NPLTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRV

Query:  FHIGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGE
        F I +T S +LWYRRL A SIST SQ+R+EF++ FSS++YD+K ATHLATIRQKEGETLRE+VTRFQEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGE
Subjt:  FHIGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGE

Query:  ESPSTFAEVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEK
        E+P+TFAEVLQKAKKV D                                                  GP R+                  ++   G+EK
Subjt:  ESPSTFAEVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEK

Query:  LLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        LLKRPEKLRG P++ +KDKYCRFHR+HGH+TS  WELKRQI++LIQDGYF K+V +P +SSAE K+ R RSR PP R D
Subjt:  LLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

A0A6J1CKB3 uncharacterized protein LOC1110120818.8e-11258.6Show/hide
Query:  VITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFHIGMTW
        +ITREEFD ++ +  A  EALKAKCE KE P  DGD+ ESPFTS++LEAPIPPKFK   +KPYD SKDPKDYVEVFEGL+DFQ   D ++CR F I +T 
Subjt:  VITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFHIGMTW

Query:  SVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEESPSTFA
        S +LWYRRL ARSIST SQ+R+EF++ FSS++YD+K ATHLATIRQKEGETLRE+VTRFQEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEE+PSTF 
Subjt:  SVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEESPSTFA

Query:  EVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEKLLKRPEK
        EVLQK KKV D                                                  GP R+                   +   G+EKLLKRPEK
Subjt:  EVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEKLLKRPEK

Query:  LRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        LR   ++ +KDKYCRFHR+HGH+TS CWELKRQI+DLIQDGYF K+V +P +SSAE K+ R RSR PP R D
Subjt:  LRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

A0A6J1DHB3 uncharacterized protein LOC1110204794.2e-11459.58Show/hide
Query:  YNPLTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECR
        YNP+TP GVITREEFD +K+KF A VEALKA+CE KE+ F+DGD+ E  F+S+ILEA IPPKFK   MKPYD SKDPKDYVEVFE L+DFQ A D ++C 
Subjt:  YNPLTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECR

Query:  VFHIGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLG
         F I +T S +LWYRRL AR IST SQ+RKEFIS FSS++YDRK  THLATIRQKEGETLRE+VTRF EEQLKVAHCSDDSAMCYFLTGLADETLTVKL 
Subjt:  VFHIGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLG

Query:  EESPSTFAEVLQKAKKVNDG---------------------------------------------------------------------PLRADLIIEGL
        EE+P+TFAEVLQK KKV DG                                                                      +  ++   G+
Subjt:  EESPSTFAEVLQKAKKVNDG---------------------------------------------------------------------PLRADLIIEGL

Query:  EKLLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        EKLLKRPEKLRGDP+K N DKYCRFHRDHGH+TS+ WELKRQI+DLIQDGYF K+V +P S+S E K+ R R R PP RDD
Subjt:  EKLLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

A0A6J1DS95 uncharacterized protein LOC1110234217.9e-11358.71Show/hide
Query:  GVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFHIGMT
        G+ITREEFD ++ +  A VEALKAKCE K++   DGD+ ESPFTS++LEAPIPPKFK   +KPYD +KDPKDYVEVFEGL+DFQ A D ++CR F I +T
Subjt:  GVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFHIGMT

Query:  WSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEESPSTF
         S +LWYRRL  RSIST SQ+R+EF++ FSS++YD+K ATHLATIRQKEGETLRE+VTRFQEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEE+P+TF
Subjt:  WSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEESPSTF

Query:  AEVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEKLLKRPE
        AEVLQKAKKV D                                                  GP R+                  ++   G+EKLLKRPE
Subjt:  AEVLQKAKKVND--------------------------------------------------GPLRA------------------DLIIEGLEKLLKRPE

Query:  KLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        KLRG P++ +KDKYCRFHR+HGH+TS  WELKRQI+DLIQDGYF K+V +P +SSAE K+ R RSR PP R D
Subjt:  KLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

A0A6J1DXW4 uncharacterized protein LOC1110255851.7e-10763.86Show/hide
Query:  LTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFH
        +TPE VITREEFDLMK++F   VE LK KCE  E+PF DG+M ESPFTS++LE PIPPKFKM AMKPYD SKDPKDYVEVFE L+DFQ A D ++CR F 
Subjt:  LTPEGVITREEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFH

Query:  IGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEES
        I +T ++                          S     RKIATHLATIRQKEGETLRE+VTRFQEEQLKV HCSDDSAMCYFLT LADETLTVKLGEE+
Subjt:  IGMTWSVQLWYRRLSARSISTNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEES

Query:  PSTFAEVLQKAKKVNDGPLRAD-----------------------LIIEGLEKLLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQD
        PSTFAEVLQKAKKV DG   +D                       +   GLEKLLKRP+KLRGDP+K NKD+YCRFHRDHG+DTS+CWELKRQI+DLIQD
Subjt:  PSTFAEVLQKAKKVNDGPLRAD-----------------------LIIEGLEKLLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQD

Query:  GYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD
        GY  KYV +PGSSS   K+ R RSR PP RDD
Subjt:  GYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGAAAAAAGATTAAAGGCGATTGAAGGAACTGACATCCTCGTTGTTTGGGGAAGGAAAACAGATTGCCATGTGGAGGACAACGATTCAACAAAGGGAATATCCCC
AGACAAATCTTTATGCAGGTCCGACCTCAACTCTGACCAAGATCGAAAAGGTCTGGATGGCAATAACGACTTTCAGCATGAGGTCGATGCCAAGGTGGTTGAGGACCAGG
ATCGAGGAGGTCAAGAGATAGATCCACCTCGAAGGTCAGCCCGCAATGCGAATCTGACCCTACCTCATGCGTATCTGAGGCCATCTAAGACCCATCGGGGTCGAGGTGGG
GTATCAAAGAAACCCACTACTCGAAGGATCGCACCAGCTGTAGATCCTGAGGTTAACGCTACAGTCCAGCGGGAGCTTGATGATATGCGCATTCGAGTGCGTACCATGGA
AGAGACTTACAACGAATTAATGTGGGCAAATCGAACTAGGTCCCAATTCAGGACCCAGGGCGGAGGTGATGACCTGCATTATGACGGCAACGACCAAGATCCACGTCTCC
ATCCTGATGACGATGTGCGCGTTGTCGATAATGGAGGGGTCGATTACAGTCGTCAAGAGGAAGACATAAGAAAGTATAACCCACTGACGCCCGAAGGGGTGATCACAAGG
GAAGAGTTCGACCTGATGAAGAATAAGTTTGGTGCACATGTCGAGGCACTTAAAGCTAAGTGCGAGATGAAGGAAAATCCATTCGAGGATGGTGATATGGTCGAATCTCC
ATTCACCTCAAACATTTTGGAGGCACCTATTCCTCCTAAGTTCAAGATGGTCGCTATGAAGCCCTATGATAGGTCCAAGGACCCGAAGGACTATGTCGAGGTATTCGAGG
GCCTCATAGATTTTCAGATGGCATTTGACGTCGTTGAATGCCGAGTATTCCATATTGGTATGACATGGAGTGTTCAACTCTGGTATAGAAGGCTGTCAGCAAGGTCGATC
TCGACCAACAGCCAGATGAGGAAAGAGTTTATCAGCACATTCTCTTCTCAGTATTATGATAGAAAGATAGCCACCCACCTCGCCACCATTCGACAGAAGGAGGGCGAGAC
ACTCCGAGAACATGTGACCAGATTTCAAGAAGAGCAGCTTAAGGTCGCACATTGCTCAGATGATTCAGCCATGTGCTATTTCCTCACCGGCCTGGCCGACGAGACACTGA
CGGTCAAATTGGGAGAGGAATCACCATCCACCTTCGCCGAGGTCTTGCAAAAGGCCAAAAAGGTTAACGATGGCCCTCTTAGGGCCGACCTGATTATAGAAGGGCTGGAG
AAACTCCTCAAGCGCCCTGAGAAGCTTAGAGGAGATCCCAAAAAATGCAACAAGGACAAGTATTGTCGTTTCCACCGCGACCATGGCCACGATACCTCGAGTTGCTGGGA
ATTGAAACGCCAAATTGATGATTTAATTCAAGATGGCTACTTCAACAAGTATGTCGTTAGGCCAGGATCGAGCTCAGCAGAAAATAAAGATGTGAGGAACCGTTCAAGGA
TGCCACCTCATAGGGATGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCGAAAAAAGATTAAAGGCGATTGAAGGAACTGACATCCTCGTTGTTTGGGGAAGGAAAACAGATTGCCATGTGGAGGACAACGATTCAACAAAGGGAATATCCCC
AGACAAATCTTTATGCAGGTCCGACCTCAACTCTGACCAAGATCGAAAAGGTCTGGATGGCAATAACGACTTTCAGCATGAGGTCGATGCCAAGGTGGTTGAGGACCAGG
ATCGAGGAGGTCAAGAGATAGATCCACCTCGAAGGTCAGCCCGCAATGCGAATCTGACCCTACCTCATGCGTATCTGAGGCCATCTAAGACCCATCGGGGTCGAGGTGGG
GTATCAAAGAAACCCACTACTCGAAGGATCGCACCAGCTGTAGATCCTGAGGTTAACGCTACAGTCCAGCGGGAGCTTGATGATATGCGCATTCGAGTGCGTACCATGGA
AGAGACTTACAACGAATTAATGTGGGCAAATCGAACTAGGTCCCAATTCAGGACCCAGGGCGGAGGTGATGACCTGCATTATGACGGCAACGACCAAGATCCACGTCTCC
ATCCTGATGACGATGTGCGCGTTGTCGATAATGGAGGGGTCGATTACAGTCGTCAAGAGGAAGACATAAGAAAGTATAACCCACTGACGCCCGAAGGGGTGATCACAAGG
GAAGAGTTCGACCTGATGAAGAATAAGTTTGGTGCACATGTCGAGGCACTTAAAGCTAAGTGCGAGATGAAGGAAAATCCATTCGAGGATGGTGATATGGTCGAATCTCC
ATTCACCTCAAACATTTTGGAGGCACCTATTCCTCCTAAGTTCAAGATGGTCGCTATGAAGCCCTATGATAGGTCCAAGGACCCGAAGGACTATGTCGAGGTATTCGAGG
GCCTCATAGATTTTCAGATGGCATTTGACGTCGTTGAATGCCGAGTATTCCATATTGGTATGACATGGAGTGTTCAACTCTGGTATAGAAGGCTGTCAGCAAGGTCGATC
TCGACCAACAGCCAGATGAGGAAAGAGTTTATCAGCACATTCTCTTCTCAGTATTATGATAGAAAGATAGCCACCCACCTCGCCACCATTCGACAGAAGGAGGGCGAGAC
ACTCCGAGAACATGTGACCAGATTTCAAGAAGAGCAGCTTAAGGTCGCACATTGCTCAGATGATTCAGCCATGTGCTATTTCCTCACCGGCCTGGCCGACGAGACACTGA
CGGTCAAATTGGGAGAGGAATCACCATCCACCTTCGCCGAGGTCTTGCAAAAGGCCAAAAAGGTTAACGATGGCCCTCTTAGGGCCGACCTGATTATAGAAGGGCTGGAG
AAACTCCTCAAGCGCCCTGAGAAGCTTAGAGGAGATCCCAAAAAATGCAACAAGGACAAGTATTGTCGTTTCCACCGCGACCATGGCCACGATACCTCGAGTTGCTGGGA
ATTGAAACGCCAAATTGATGATTTAATTCAAGATGGCTACTTCAACAAGTATGTCGTTAGGCCAGGATCGAGCTCAGCAGAAAATAAAGATGTGAGGAACCGTTCAAGGA
TGCCACCTCATAGGGATGACTAG
Protein sequenceShow/hide protein sequence
MLEKRLKAIEGTDILVVWGRKTDCHVEDNDSTKGISPDKSLCRSDLNSDQDRKGLDGNNDFQHEVDAKVVEDQDRGGQEIDPPRRSARNANLTLPHAYLRPSKTHRGRGG
VSKKPTTRRIAPAVDPEVNATVQRELDDMRIRVRTMEETYNELMWANRTRSQFRTQGGGDDLHYDGNDQDPRLHPDDDVRVVDNGGVDYSRQEEDIRKYNPLTPEGVITR
EEFDLMKNKFGAHVEALKAKCEMKENPFEDGDMVESPFTSNILEAPIPPKFKMVAMKPYDRSKDPKDYVEVFEGLIDFQMAFDVVECRVFHIGMTWSVQLWYRRLSARSI
STNSQMRKEFISTFSSQYYDRKIATHLATIRQKEGETLREHVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEESPSTFAEVLQKAKKVNDGPLRADLIIEGLE
KLLKRPEKLRGDPKKCNKDKYCRFHRDHGHDTSSCWELKRQIDDLIQDGYFNKYVVRPGSSSAENKDVRNRSRMPPHRDD