CuGenDBv2

Moc09g18340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g18340
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr9:14396453..14397963
RNA-Seq Expression: Moc09g18340
Synteny: Moc09g18340
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
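
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split and the span length computed as follows. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr9:14396453..14397963")

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 1511 bp on chr9.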


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia], e value 1.3e-121, % identity 82.73
Query:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF
        +KPYDGSK+PKDYVEVFE LMDFQAA  AIKCRAF+IALTGSARLWYRRLPA SI TYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGETLREYVTRF
Subjt:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF

Query:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR
        QEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPATF EVLQKAKKVIDGQEL RTKTGRPE+ I + +  ++   AD KSKDKGS SS  RAEYRR
Subjt:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR

Query:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQICN
        +++GP RSRPYER+TPTTIPISEILTNIEE+GMEK LKRPEKLRG PE+R+KDKYCRFHR+HGHNTS  WELK QI N
Subjt:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQICN

XP_022149377.1 uncharacterized protein LOC111017807 [Momordica charantia], e value 1.3e-124, % identity 88.1
Query:  KNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKV
        ++PKDYVEVFEGLMDFQAA  AIKCRAFQIALTG ARLWYRRLPA+SI TYSQLRKEFISQF SRHYDRKTATHLATIRQKE ETLREYVTRFQEEQLKV
Subjt:  KNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRRSDSGPNR
         HCSDDSAMCYFLTGLADETLTVKLGEEAPATF EVLQKAKKVIDGQEL RTKTGRPEK IDQKK  QEKRK DSKS+DKGSSSS +RAE+RR +SGP+R
Subjt:  AHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRRSDSGPNR

Query:  SRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI
        SRPYERYTPTTI ISEILTNIEE+GMEK LK PEKLRGDPEKR+KDK CRFHRDH HNT+SCWELK QI
Subjt:  SRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia], e value 1.5e-125, % identity 85.87
Query:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF
        MKPYDGSK+PKDYVEVFE LMDFQAA  AIKC AFQIALTGSARLWYRRLPA+ I TYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTRF
Subjt:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF

Query:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR
         EEQLKVAHCSDDSAMCYFLTGLADETLTVKL EEAPATF EVLQK KKVIDGQEL RTKTGRPEK+IDQ +  ++K KADSKS+DKG SSS++R +YRR
Subjt:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR

Query:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI
        S+S  N+SRPYE YTPTTIPI EILTNIEETGMEK LKRPEKLRGDPEKRN DKYCRFHRDHGHNTS+ WELK QI
Subjt:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI

XP_022153957.1 uncharacterized protein LOC111021344 [Momordica charantia], e value 2.6e-122, % identity 91.02
Query:  MDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA  AIKCRAFQIALT SARLWYRRLPA+SI TYSQLRKE ISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRRSDSGPNRSRPYERYTPTTIP
        TGLADETLTVKLGEEAPATF EVL+KAKKVIDGQEL RTKTGRPE+ IDQKK NQEKRKADSKSKDKGSSSS +R EYRRS+SGP+RSRPYERYT TTIP
Subjt:  TGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRRSDSGPNRSRPYERYTPTTIP

Query:  ISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI
        ISEILTNIEE+GMEK LKRPEKLRGD EKRNKDKYCRFHRDHGHNT+SCWELK QI
Subjt:  ISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia], e value 2.6e-122, % identity 82.97
Query:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF
        +KPYDG+K+PKDYVEVFEGLMDFQAA  AIKCRAFQIALTGSARLWYRRLP +SI TYSQLR+EF++QFSSRHYD+KTATHLATIRQKEGETLREYVTRF
Subjt:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF

Query:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR
        QEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPATF EVLQKAKKVIDGQEL RTKTGRPE+ I + +  ++  +AD KSKDKGS SS  RAEYRR
Subjt:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR

Query:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI
        +++GP RSRPYER+TPTTIPI EILTNIEE+GMEK LKRPEKLRG PE+R+KDKYCRFHR+HGHNTS  WELK QI
Subjt:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813, e value 6.4e-122, % identity 82.73
Query:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF
        +KPYDGSK+PKDYVEVFE LMDFQAA  AIKCRAF+IALTGSARLWYRRLPA SI TYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGETLREYVTRF
Subjt:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF

Query:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR
        QEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPATF EVLQKAKKVIDGQEL RTKTGRPE+ I + +  ++   AD KSKDKGS SS  RAEYRR
Subjt:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR

Query:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQICN
        +++GP RSRPYER+TPTTIPISEILTNIEE+GMEK LKRPEKLRG PE+R+KDKYCRFHR+HGHNTS  WELK QI N
Subjt:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQICN

A0A6J1D7S8 uncharacterized protein LOC111017807, e value 6.2e-125, % identity 88.1
Query:  KNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKV
        ++PKDYVEVFEGLMDFQAA  AIKCRAFQIALTG ARLWYRRLPA+SI TYSQLRKEFISQF SRHYDRKTATHLATIRQKE ETLREYVTRFQEEQLKV
Subjt:  KNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRRSDSGPNR
         HCSDDSAMCYFLTGLADETLTVKLGEEAPATF EVLQKAKKVIDGQEL RTKTGRPEK IDQKK  QEKRK DSKS+DKGSSSS +RAE+RR +SGP+R
Subjt:  AHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRRSDSGPNR

Query:  SRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI
        SRPYERYTPTTI ISEILTNIEE+GMEK LK PEKLRGDPEKR+KDK CRFHRDH HNT+SCWELK QI
Subjt:  SRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI

A0A6J1DHB3 uncharacterized protein LOC111020479, e value 7.3e-126, % identity 85.87
Query:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF
        MKPYDGSK+PKDYVEVFE LMDFQAA  AIKC AFQIALTGSARLWYRRLPA+ I TYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTRF
Subjt:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF

Query:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR
         EEQLKVAHCSDDSAMCYFLTGLADETLTVKL EEAPATF EVLQK KKVIDGQEL RTKTGRPEK+IDQ +  ++K KADSKS+DKG SSS++R +YRR
Subjt:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR

Query:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI
        S+S  N+SRPYE YTPTTIPI EILTNIEETGMEK LKRPEKLRGDPEKRN DKYCRFHRDHGHNTS+ WELK QI
Subjt:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI

A0A6J1DKD3 uncharacterized protein LOC111021344, e value 1.3e-122, % identity 91.02
Query:  MDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA  AIKCRAFQIALT SARLWYRRLPA+SI TYSQLRKE ISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRRSDSGPNRSRPYERYTPTTIP
        TGLADETLTVKLGEEAPATF EVL+KAKKVIDGQEL RTKTGRPE+ IDQKK NQEKRKADSKSKDKGSSSS +R EYRRS+SGP+RSRPYERYT TTIP
Subjt:  TGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRRSDSGPNRSRPYERYTPTTIP

Query:  ISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI
        ISEILTNIEE+GMEK LKRPEKLRGD EKRNKDKYCRFHRDHGHNT+SCWELK QI
Subjt:  ISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI

A0A6J1DS95 uncharacterized protein LOC111023421, e value 1.3e-122, % identity 82.97
Query:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF
        +KPYDG+K+PKDYVEVFEGLMDFQAA  AIKCRAFQIALTGSARLWYRRLP +SI TYSQLR+EF++QFSSRHYD+KTATHLATIRQKEGETLREYVTRF
Subjt:  MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF

Query:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR
        QEEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPATF EVLQKAKKVIDGQEL RTKTGRPE+ I + +  ++  +AD KSKDKGS SS  RAEYRR
Subjt:  QEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRR

Query:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI
        +++GP RSRPYER+TPTTIPI EILTNIEE+GMEK LKRPEKLRG PE+R+KDKYCRFHR+HGHNTS  WELK QI
Subjt:  SDSGPNRSRPYERYTPTTIPISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
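
The alignments in this section use the BLAST-style three-line layout, in which the middle line carries a letter where query and subject residues are identical, a `+` for a conservative substitution, and a space otherwise. As a rough sketch of how a % identity figure relates to that middle line, the snippet below counts identities for the final segment of the LOC111021344 alignment only. The function name `midline_stats` is hypothetical, and the result describes this one segment, not the 91.02 reported for the full alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment segment."""
    # Trailing spaces in the midline are easily lost in plain text, so pad it
    # back to the query length before counting.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identities + midline.count("+")       # '+' marks conservative changes
    return identities, positives, len(midline)


query = "ISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQI"
midline = "ISEILTNIEE+GMEK LKRPEKLRGD EKRNKDKYCRFHRDHGHNT+SCWELK QI"

identities, positives, length = midline_stats(query, midline)
percent_identity = round(100.0 * identities / length, 2)
print(identities, positives, length, percent_identity)
```

For this segment the count is 51 identities and 53 positives over 56 aligned columns.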

Sequences
CDS sequence
ATGAAACCATATGATGGGTCTAAGAACCCAAAAGACTATGTTGAAGTCTTTGAAGGTCTTATGGACTTTCAAGCGGCGATAGGTGCCATCAAATGCCGCGCCTTTCAAAT
CGCACTTACTGGTAGCGCGCGCTTGTGGTATAGAAGATTGCCAGCCAAGTCGATTTTGACCTACTCTCAGTTGAGGAAAGAGTTCATCAGTCAATTCTCTTCTCGGCACT
ATGACAGAAAGACAGCAACTCACCTCGCCACCATCAGACAGAAGGAGGGCGAGACGCTTAGAGAGTACGTCACAAGGTTCCAAGAGGAGCAGCTGAAGGTCGCGCACTGC
TCCGATGACTCAGCCATGTGTTACTTCCTCACCGGCCTAGCCGATGAGACTCTCACTGTGAAGCTTGGAGAGGAAGCTCCAGCCACCTTCATTGAAGTTTTGCAAAAGGC
GAAGAAAGTCATCGATGGACAAGAGCTCTTCCGGACCAAGACTGGCCGACCTGAAAAGCATATCGACCAGAAGAAGCCCAACCAAGAGAAGAGGAAGGCTGATTCCAAGT
CTAAGGACAAGGGGTCGTCCTCTTCCAACAACCGAGCTGAGTACCGCAGGTCGGACAGCGGCCCCAACCGAAGCCGACCTTACGAGCGTTATACTCCAACCACCATCCCC
ATCTCTGAAATACTTACCAATATTGAGGAGACTGGGATGGAAAAGTTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAAAAGCGCAATAAGGATAAATATTGCCG
TTTTCACCGTGATCACGGCCATAATACATCAAGTTGCTGGGAACTCAAGCACCAGATTTGTAACGCCCCGAGTCTCTCAGTGAGTCTTGTTTTTAAGATGGAATGGATTT
CTTTTCTCTTGGGCTTGGACATGTTGGGCCTTTATTGGGTTAAGTTGGTTTTAAAACAAAGGAAAAGAGGAAAAAGAAAACAAAAAAAAACTTCATCTTCTTCTTCTCTC
CTCCCTTGTTGCCCTAGCCGTCACCCACCCCTCTTCTTCTTCTTGTTCACCGGCGACCACACCTCACGACGACGATGGAAGCGGCGGCGCCTCCTCGAAACAGCAACAGC
GGCGGCCGGCGACCACGCTCGTAGCAGCAGCGGCGGCGGCGCGACTTCGACCCGAACAGCAGCGGCGGATCGATTTCCTCCGGTGACGTGTTCTCCGATGAACGGCCCAG
TGTCCTCCACGAACCCTGGTGATCTGCAACGAACCAGCACCGCTCCCTGCAGCGTTCCTCACGGCGGCGCACGGCGAACGGGTCAGATTCGCGGCGAACTAGCGGCGCAT
GGCGAACGGGTCAGATCCGCGGCGAACCTTGACAACCTGCTGCTACGATCCGACGAACAGCAACTCTAG
mRNA sequence
ATGAAACCATATGATGGGTCTAAGAACCCAAAAGACTATGTTGAAGTCTTTGAAGGTCTTATGGACTTTCAAGCGGCGATAGGTGCCATCAAATGCCGCGCCTTTCAAAT
CGCACTTACTGGTAGCGCGCGCTTGTGGTATAGAAGATTGCCAGCCAAGTCGATTTTGACCTACTCTCAGTTGAGGAAAGAGTTCATCAGTCAATTCTCTTCTCGGCACT
ATGACAGAAAGACAGCAACTCACCTCGCCACCATCAGACAGAAGGAGGGCGAGACGCTTAGAGAGTACGTCACAAGGTTCCAAGAGGAGCAGCTGAAGGTCGCGCACTGC
TCCGATGACTCAGCCATGTGTTACTTCCTCACCGGCCTAGCCGATGAGACTCTCACTGTGAAGCTTGGAGAGGAAGCTCCAGCCACCTTCATTGAAGTTTTGCAAAAGGC
GAAGAAAGTCATCGATGGACAAGAGCTCTTCCGGACCAAGACTGGCCGACCTGAAAAGCATATCGACCAGAAGAAGCCCAACCAAGAGAAGAGGAAGGCTGATTCCAAGT
CTAAGGACAAGGGGTCGTCCTCTTCCAACAACCGAGCTGAGTACCGCAGGTCGGACAGCGGCCCCAACCGAAGCCGACCTTACGAGCGTTATACTCCAACCACCATCCCC
ATCTCTGAAATACTTACCAATATTGAGGAGACTGGGATGGAAAAGTTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAAAAGCGCAATAAGGATAAATATTGCCG
TTTTCACCGTGATCACGGCCATAATACATCAAGTTGCTGGGAACTCAAGCACCAGATTTGTAACGCCCCGAGTCTCTCAGTGAGTCTTGTTTTTAAGATGGAATGGATTT
CTTTTCTCTTGGGCTTGGACATGTTGGGCCTTTATTGGGTTAAGTTGGTTTTAAAACAAAGGAAAAGAGGAAAAAGAAAACAAAAAAAAACTTCATCTTCTTCTTCTCTC
CTCCCTTGTTGCCCTAGCCGTCACCCACCCCTCTTCTTCTTCTTGTTCACCGGCGACCACACCTCACGACGACGATGGAAGCGGCGGCGCCTCCTCGAAACAGCAACAGC
GGCGGCCGGCGACCACGCTCGTAGCAGCAGCGGCGGCGGCGCGACTTCGACCCGAACAGCAGCGGCGGATCGATTTCCTCCGGTGACGTGTTCTCCGATGAACGGCCCAG
TGTCCTCCACGAACCCTGGTGATCTGCAACGAACCAGCACCGCTCCCTGCAGCGTTCCTCACGGCGGCGCACGGCGAACGGGTCAGATTCGCGGCGAACTAGCGGCGCAT
GGCGAACGGGTCAGATCCGCGGCGAACCTTGACAACCTGCTGCTACGATCCGACGAACAGCAACTCTAG
Protein sequence
MKPYDGSKNPKDYVEVFEGLMDFQAAIGAIKCRAFQIALTGSARLWYRRLPAKSILTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVAHC
SDDSAMCYFLTGLADETLTVKLGEEAPATFIEVLQKAKKVIDGQELFRTKTGRPEKHIDQKKPNQEKRKADSKSKDKGSSSSNNRAEYRRSDSGPNRSRPYERYTPTTIP
ISEILTNIEETGMEKFLKRPEKLRGDPEKRNKDKYCRFHRDHGHNTSSCWELKHQICNAPSLSVSLVFKMEWISFLLGLDMLGLYWVKLVLKQRKRGKRKQKKTSSSSSL
LPCCPSRHPPLFFFLFTGDHTSRRRWKRRRLLETATAAAGDHARSSSGGGATSTRTAAADRFPPVTCSPMNGPVSSTNPGDLQRTSTAPCSVPHGGARRTGQIRGELAAH
GERVRSAANLDNLLLRSDEQQL
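
The protein sequence corresponds to the CDS read codon by codon. The sketch below illustrates that relationship on the first and last few codons of the CDS, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the `translate` helper is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAAACCATATGATGGGTCTAAGAACCCA"  # first 30 nt of the CDS
cds_end = "GAACAGCAACTCTAG"                   # last 15 nt of the CDS
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to `MKPYDGSKNP` and `EQQL`, matching the start and end of the protein sequence listed above. The full CDS is 1389 nt, that is 462 codons plus the TAG stop.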