CuGenDBv2

Moc11g16430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g16430
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr11:12563769..12566784
RNA-Seq Expression: Moc11g16430
Synteny: Moc11g16430
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
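
The genome location above uses a seqid:start..end form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention), the span can be parsed and its length computed as follows. The names Location and parse_location are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr11:12563769..12566784'."""
    match = re.fullmatch(r"\s*(\S+?):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(seqid, start, end)


loc = parse_location("chr11:12563769..12566784")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive assumption the gene spans 3016 bp on chr11; with a 0-based, half-open convention the same numbers would give 3015.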


Homology
GenBank top hits | e value | % identity | Alignment
XP_022131660.1 uncharacterized protein LOC111004785 [Momordica charantia] | e value: 1.1e-155 | % identity: 95.1
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRF--
        MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQ +     +  
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRF--

Query:  GPPVFNVRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQYIPTEQLKIDKF
            F VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQYIPTEQLKIDKF
Subjt:  GPPVFNVRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQYIPTEQLKIDKF

Query:  VDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAGLEINDKHPVHFRS
        VDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAGLEINDKHPVHFRS
Subjt:  VDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAGLEINDKHPVHFRS

Query:  VVLVII
        VVLVII
Subjt:  VVLVII

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia] | e value: 2.2e-114 | % identity: 72.7
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNT+A NYEDPN RGE AAD NVP  VP                    +V LLAE LQVLL+NANGAGGAQ QQP R QI QEEVQFIRDFKRFGP
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                              VRGAVFML+G+AVNWWESVAAAEDHAN PVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SL V
Subjt:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ
        AQYERKFTELSRF MQYIPTEQLKIDKF+DGLRREIKGLLVLKEPTTYAAAVRC LVMDKCLEEP SQQV GSSSGVKRKFASFSS+Q SRGHQ   QRQ
Subjt:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ

Query:  TASPVCPTCKRSHAG
        T  P CP+CK++HAG
Subjt:  TASPVCPTCKRSHAG

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] | e value: 1.1e-129 | % identity: 78.98
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRA NYEDPN RGE AADPNV   VP GV PPVPQAAPQGVPQVNPQVALLAE LQVLL+NANGAGGAQ QQPRRAQI Q+EVQFIRDFK FGP
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                              VRGAVFMLRG+AVNWWESVAAAEDHAN PVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ
        AQYERKFTELSRF  QY+PTEQLKIDKF+DGLRREIKGLLVLKEPTTYAAAVRC LVMDKCLEEP SQQV GS+SGVKRKFASFS++QSSRGHQ   QRQ
Subjt:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ

Query:  TASPVCPTCKRSHA
        TA PVCP+CK++HA
Subjt:  TASPVCPTCKRSHA

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia] | e value: 1.9e-134 | % identity: 80.95
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRA NY+DPNPRGE AADPNVPL VP  VAPPVPQAAPQGVPQVNPQVALLAE LQVLL+NANGAGGAQ QQPRRAQI Q+EVQFIRDFKRFGP
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                              VRGAVFMLRG+AVNWWESVAAAEDH N PVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ
        AQYERKFTELSRF MQYIPTEQLKIDKF+DGLR EIKGLLV+KEPTTYAAA+RC LVMDKCLEEP SQQV GSSSGVKRKFA FSS+QSSRGHQ  VQRQ
Subjt:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ

Query:  TASPVCPTCKRSHAG
        TA PVCP+CK++HAG
Subjt:  TASPVCPTCKRSHAG

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia] | e value: 1.2e-125 | % identity: 81.54
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEV--------QFI
        MAFRRNTRA NYEDPNPRGE AADPNVP AVP GVAPP PQAA QGVPQVNPQVALLAE LQVLL+NANGAGGAQ QQPR AQI QEEV        +++
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEV--------QFI

Query:  RDFKRFGPPV-----FNVRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQY
        R+ +     +     F VRGAVFMLRG+AVNWWESVAAAEDHAN PVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTVA+YERKFTELSRF MQY
Subjt:  RDFKRFGPPV-----FNVRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQY

Query:  IPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAG
        IPT+QLKIDKF+DGLRREIKGLLVLKEPTTYAAAVRC LVMDKCLEEP SQQV GSSSGVKRKFASFSS+Q SR HQ  VQRQTA PVCP+CK+SHAG
Subjt:  IPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAG

TrEMBL top hits | e value | % identity | Alignment
A0A6J1BQB2 uncharacterized protein LOC111004785 | e value: 5.6e-156 | % identity: 95.1
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRF--
        MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQ +     +  
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRF--

Query:  GPPVFNVRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQYIPTEQLKIDKF
            F VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQYIPTEQLKIDKF
Subjt:  GPPVFNVRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQYIPTEQLKIDKF

Query:  VDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAGLEINDKHPVHFRS
        VDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAGLEINDKHPVHFRS
Subjt:  VDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAGLEINDKHPVHFRS

Query:  VVLVII
        VVLVII
Subjt:  VVLVII

A0A6J1DNV8 uncharacterized protein LOC111022925 | e value: 1.1e-114 | % identity: 72.7
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNT+A NYEDPN RGE AAD NVP  VP                    +V LLAE LQVLL+NANGAGGAQ QQP R QI QEEVQFIRDFKRFGP
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                              VRGAVFML+G+AVNWWESVAAAEDHAN PVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SL V
Subjt:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ
        AQYERKFTELSRF MQYIPTEQLKIDKF+DGLRREIKGLLVLKEPTTYAAAVRC LVMDKCLEEP SQQV GSSSGVKRKFASFSS+Q SRGHQ   QRQ
Subjt:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ

Query:  TASPVCPTCKRSHAG
        T  P CP+CK++HAG
Subjt:  TASPVCPTCKRSHAG

A0A6J1DQB9 Reverse transcriptase | e value: 5.2e-130 | % identity: 78.98
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRA NYEDPN RGE AADPNV   VP GV PPVPQAAPQGVPQVNPQVALLAE LQVLL+NANGAGGAQ QQPRRAQI Q+EVQFIRDFK FGP
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                              VRGAVFMLRG+AVNWWESVAAAEDHAN PVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ
        AQYERKFTELSRF  QY+PTEQLKIDKF+DGLRREIKGLLVLKEPTTYAAAVRC LVMDKCLEEP SQQV GS+SGVKRKFASFS++QSSRGHQ   QRQ
Subjt:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ

Query:  TASPVCPTCKRSHA
        TA PVCP+CK++HA
Subjt:  TASPVCPTCKRSHA

A0A6J1DTA8 uncharacterized protein LOC111024114 | e value: 9.2e-135 | % identity: 80.95
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRA NY+DPNPRGE AADPNVPL VP  VAPPVPQAAPQGVPQVNPQVALLAE LQVLL+NANGAGGAQ QQPRRAQI Q+EVQFIRDFKRFGP
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFN                              VRGAVFMLRG+AVNWWESVAAAEDH N PVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFN------------------------------VRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ
        AQYERKFTELSRF MQYIPTEQLKIDKF+DGLR EIKGLLV+KEPTTYAAA+RC LVMDKCLEEP SQQV GSSSGVKRKFA FSS+QSSRGHQ  VQRQ
Subjt:  AQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQ

Query:  TASPVCPTCKRSHAG
        TA PVCP+CK++HAG
Subjt:  TASPVCPTCKRSHAG

A0A6J1DWP4 uncharacterized protein LOC111025215 | e value: 6.0e-126 | % identity: 81.54
Query:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEV--------QFI
        MAFRRNTRA NYEDPNPRGE AADPNVP AVP GVAPP PQAA QGVPQVNPQVALLAE LQVLL+NANGAGGAQ QQPR AQI QEEV        +++
Subjt:  MAFRRNTRARNYEDPNPRGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEV--------QFI

Query:  RDFKRFGPPV-----FNVRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQY
        R+ +     +     F VRGAVFMLRG+AVNWWESVAAAEDHAN PVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTVA+YERKFTELSRF MQY
Subjt:  RDFKRFGPPV-----FNVRGAVFMLRGKAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQY

Query:  IPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAG
        IPT+QLKIDKF+DGLRREIKGLLVLKEPTTYAAAVRC LVMDKCLEEP SQQV GSSSGVKRKFASFSS+Q SR HQ  VQRQTA PVCP+CK+SHAG
Subjt:  IPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLVMDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAG

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences

CDS sequence
ATGAAGTGTTTGACAAGCTTAGGAGCTTTAGGACTTTTGGTAGTAAGTAGCACGGGGGCGGGTGCGTGCGTTGATGCTTGTTATTTGGTTGCCGATGTGCTTTTT
ATGATGCTTAATATTGGTTCTGATACGTGGTTTTGTAGTGTAACTCTTGTTTATGTTGTTTTGTATGCCTGGAAAGTTATGAAGCGAATGGGTGAGTGTCAGGGC
CTCGGGTATAAATGGTCGGGGGCTGATACGTCACTAATTGGGTATCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATGCGATGAGTCTTTTGAAAGGAGAACTA
TTGGGGCCTTGGGTCAGGGGCCGACACTACCAAGAGAATGGTTTTCAAAGAGCTCGCATATCTGAGGTGAGCATAGCTGTATTTGAGGTGTTAGGGAACCAGGTG
AACGAGCATGTCTATGGTTGTTCGCATACTGTAATGATAGAGCATGACGATGTATGTTTCCTTTATGGTTATGAGCATGATGTTGGTGTGTGCTTGATTGTCCTA
GAATCTAGGTCGTTAGTGTACGGTCCTCGTCAGTCTCCTCGTCATCACCAGACAATGGCTTTTCGACGGAACACGAGGGCTCGCAACTACGAGGATCCGAACCCT
AGGGGTGAGGAGGCAGCGGATCCAAATGTTCCCCTGGCAGTTCCTGAAGGGGTAGCACCCCCGGTTCCGCAGGCAGCACCTCAGGGAGTTCCCCAGGTGAATCCC
CAGGTGGCGTTACTAGCTGAGGTATTGCAAGTATTGCTGAATAATGCGAATGGAGCTGGTGGGGCTCAGGCGCAGCAGCCACGCCGGGCACAGATTCAACAAGAG
GAGGTCCAGTTCATCAGAGATTTCAAACGCTTCGGACCACCCGTTTTCAACGTCCGGGGAGCAGTGTTTATGCTTCGGGGAAAAGCAGTAAACTGGTGGGAGTCG
GTGGCGGCAGCGGAGGATCACGCCAACGCACCCGTCACATGGGCGAGGTTTAAGGACCTACTCTATGAGTATTATTTCCCCGTGACTGTCAGGAACGAAAAACGG
GCAGAGTTTCTCCGTCTCACTCAAGGGAGCCTAACTGTGGCCCAATACGAGAGGAAGTTCACTGAGCTGTCCCGTTTTAGAATGCAATATATTCCTACTGAACAA
TTAAAGATTGACAAGTTCGTTGACGGTTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAACCAACTACTTATGCAGCAGCAGTCAGGTGCACGTTGGTT
ATGGACAAATGTCTCGAGGAACCTCACTCTCAACAGGTGACGGGCTCCAGCTCGGGGGTCAAGAGAAAATTTGCATCGTTCTCCTCCAATCAATCCTCGAGGGGA
CACCAGCAGCTTGTGCAAAGGCAGACTGCTTCTCCGGTGTGCCCCACTTGTAAGAGGAGCCATGCTGGGCTTGAGATTAATGATAAGCATCCTGTACATTTTAGA
TCGGTAGTCCTAGTCATCATTTGA

mRNA sequence
ATGAAGTGTTTGACAAGCTTAGGAGCTTTAGGACTTTTGGTAGTAAGTAGCACGGGGGCGGGTGCGTGCGTTGATGCTTGTTATTTGGTTGCCGATGTGCTTTTT
ATGATGCTTAATATTGGTTCTGATACGTGGTTTTGTAGTGTAACTCTTGTTTATGTTGTTTTGTATGCCTGGAAAGTTATGAAGCGAATGGGTGAGTGTCAGGGC
CTCGGGTATAAATGGTCGGGGGCTGATACGTCACTAATTGGGTATCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATGCGATGAGTCTTTTGAAAGGAGAACTA
TTGGGGCCTTGGGTCAGGGGCCGACACTACCAAGAGAATGGTTTTCAAAGAGCTCGCATATCTGAGGTGAGCATAGCTGTATTTGAGGTGTTAGGGAACCAGGTG
AACGAGCATGTCTATGGTTGTTCGCATACTGTAATGATAGAGCATGACGATGTATGTTTCCTTTATGGTTATGAGCATGATGTTGGTGTGTGCTTGATTGTCCTA
GAATCTAGGTCGTTAGTGTACGGTCCTCGTCAGTCTCCTCGTCATCACCAGACAATGGCTTTTCGACGGAACACGAGGGCTCGCAACTACGAGGATCCGAACCCT
AGGGGTGAGGAGGCAGCGGATCCAAATGTTCCCCTGGCAGTTCCTGAAGGGGTAGCACCCCCGGTTCCGCAGGCAGCACCTCAGGGAGTTCCCCAGGTGAATCCC
CAGGTGGCGTTACTAGCTGAGGTATTGCAAGTATTGCTGAATAATGCGAATGGAGCTGGTGGGGCTCAGGCGCAGCAGCCACGCCGGGCACAGATTCAACAAGAG
GAGGTCCAGTTCATCAGAGATTTCAAACGCTTCGGACCACCCGTTTTCAACGTCCGGGGAGCAGTGTTTATGCTTCGGGGAAAAGCAGTAAACTGGTGGGAGTCG
GTGGCGGCAGCGGAGGATCACGCCAACGCACCCGTCACATGGGCGAGGTTTAAGGACCTACTCTATGAGTATTATTTCCCCGTGACTGTCAGGAACGAAAAACGG
GCAGAGTTTCTCCGTCTCACTCAAGGGAGCCTAACTGTGGCCCAATACGAGAGGAAGTTCACTGAGCTGTCCCGTTTTAGAATGCAATATATTCCTACTGAACAA
TTAAAGATTGACAAGTTCGTTGACGGTTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAACCAACTACTTATGCAGCAGCAGTCAGGTGCACGTTGGTT
ATGGACAAATGTCTCGAGGAACCTCACTCTCAACAGGTGACGGGCTCCAGCTCGGGGGTCAAGAGAAAATTTGCATCGTTCTCCTCCAATCAATCCTCGAGGGGA
CACCAGCAGCTTGTGCAAAGGCAGACTGCTTCTCCGGTGTGCCCCACTTGTAAGAGGAGCCATGCTGGGCTTGAGATTAATGATAAGCATCCTGTACATTTTAGA
TCGGTAGTCCTAGTCATCATTTGA

Protein sequence
MKCLTSLGALGLLVVSSTGAGACVDACYLVADVLFMMLNIGSDTWFCSVTLVYVVLYAWKVMKRMGECQGLGYKWSGADTSLIGYRGLGYKWSGVDAMSLLKGEL
LGPWVRGRHYQENGFQRARISEVSIAVFEVLGNQVNEHVYGCSHTVMIEHDDVCFLYGYEHDVGVCLIVLESRSLVYGPRQSPRHHQTMAFRRNTRARNYEDPNP
RGEEAADPNVPLAVPEGVAPPVPQAAPQGVPQVNPQVALLAEVLQVLLNNANGAGGAQAQQPRRAQIQQEEVQFIRDFKRFGPPVFNVRGAVFMLRGKAVNWWES
VAAAEDHANAPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFRMQYIPTEQLKIDKFVDGLRREIKGLLVLKEPTTYAAAVRCTLV
MDKCLEEPHSQQVTGSSSGVKRKFASFSSNQSSRGHQQLVQRQTASPVCPTCKRSHAGLEINDKHPVHFRSVVLVII
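
The protein sequence above corresponds to a translation of the CDS sequence. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 0; the record itself does not state which table was used, and the translate function is a hypothetical helper written for illustration.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS in frame 0; unknown codons become 'X'.

    Whitespace and line breaks are removed first, so a sequence pasted
    as several lines can be passed in directly.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First and last codons of the CDS listed in this record.
print(translate("ATGAAGTGTTTGACAAGC"))
print(translate("TCGGTAGTCCTAGTCATCATTTGA"))
```

Passing the full CDS text to translate and comparing the result with the listed protein sequence is a quick consistency check on the record.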