; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g10190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g10190
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:7204774..7206510
RNA-Seq ExpressionMoc02g10190
SyntenyMoc02g10190
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.2e-5642.55Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQCRLVD-PHRVAEQEEPPFSQAILDAPIPPRFKAPVMNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSAR
        R E D LR Q   ++E LK +C   + P    +  E PF+  +L+APIPP+FKAP +  YDGS D   YVEVFE  MDF AASD +KCRAF+IAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCRLVD-PHRVAEQEEPPFSQAILDAPIPPRFKAPVMNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEML
        LWYR+L   SI +Y QLRR F+  FS+R   K   +HL T++Q++ E+L EY+ RF +E +KV  C+DD  M YF TGL D+ LT++     PA+  E+L
Subjt:  LWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEML

Query:  VRARQYIDDLELWKANGARWSDR------GKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEA
         +A++ ID  EL +    R   +      GKD     P   K +  G  SS RA+  ++     E  P+  R   +++FTP    ++EI    E++ +E 
Subjt:  VRARQYIDDLELWKANGARWSDR------GKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEA

Query:  LFAAPEKLRRPPGKRDKRLYKR
        L   PEKLR  P +R K  Y R
Subjt:  LFAAPEKLRRPPGKRDKRLYKR

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]4.2e-10181.25Show/hide
Query:  MDFLAASDPMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFT
        MDFLAASD +KCRAFQIALEGS RLWY+QLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHL TVKQRD ESLTEYIAR MDEHVKVVSCTDDI MMYFT
Subjt:  MDFLAASDPMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFT

Query:  TGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPL
        TGLND+NLTIEF SRPPASLN+ML RARQYID LELWKA GAR S RGKDR+Q+S PPKK+ S   SSSR+A D +SRG  DE+  SD  GPKFDKFTPL
Subjt:  TGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPL

Query:  NASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKR
        NASVAEIYA  E+TD++ALF AP+KL RP GKRDKRLY R
Subjt:  NASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKR

XP_022152851.1 uncharacterized protein LOC111020475 [Momordica charantia]5.6e-6959.26Show/hide
Query:  MNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARF
        M+SYDGSGD ISYVEVFEGKMDFLA SD MKC AFQI LEGS RLWYRQLK RSIDSYQQLRRLFINQFS RQ LKLP SHLGTVKQRD ES T YIARF
Subjt:  MNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARF

Query:  MDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGH
        MDEHVKVVSCTDDI MMYFTTGLND+NLTIEF S  PA LNEM  RARQYID LELW A+GA      +       PP+                     
Subjt:  MDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGH

Query:  WDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKRIMATTLHAVS
                H G        +          A  T         EKLRRP GKRDKRLY R      H  S
Subjt:  WDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKRIMATTLHAVS

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]1.3e-5743.17Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQCRLVDPH-RVAEQEEPPFSQAILDAPIPPRFKAPVMNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSAR
        R E D LR +   ++E LK +C   D      +  E PF+  +L+APIPP+FKAP +  YDG+ D   YVEVFEG MDF AASD +KCRAFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCRLVDPH-RVAEQEEPPFSQAILDAPIPPRFKAPVMNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEML
        LWYR+L  RSI +Y QLRR F+ QFS+R   K   +HL T++Q++ E+L EY+ RF +E +KV  C+DD  M YF TGL D+ LT++     PA+  E+L
Subjt:  LWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEML

Query:  VRARQYIDDLELWKANGARWSDR------GKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEA
         +A++ ID  EL +    R   +      GKD  +  P   K +  G  SS RA+  ++     E  P+  R   +++FTP    + EI    E++ +E 
Subjt:  VRARQYIDDLELWKANGARWSDR------GKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEA

Query:  LFAAPEKLRRPPGKRDKRLYKR
        L   PEKLR  P +R K  Y R
Subjt:  LFAAPEKLRRPPGKRDKRLYKR

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]4.7e-6077.5Show/hide
Query:  MDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGH
        MDEHVKVVSCTDDI MMYFTTGLND+NLTIEF SRPPASLNEM  RARQYID LELWKANGAR S RG+DR+ KSPP KK+     SSSRRADD KSR  
Subjt:  MDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGH

Query:  WDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKR
         DE+  S+ RGPKFDKFTPLNAS+AEIYA  EDTD+E LFA+PEKLRRP GKR+KRLY R
Subjt:  WDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKR

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.5e-5642.55Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQCRLVD-PHRVAEQEEPPFSQAILDAPIPPRFKAPVMNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSAR
        R E D LR Q   ++E LK +C   + P    +  E PF+  +L+APIPP+FKAP +  YDGS D   YVEVFE  MDF AASD +KCRAF+IAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCRLVD-PHRVAEQEEPPFSQAILDAPIPPRFKAPVMNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEML
        LWYR+L   SI +Y QLRR F+  FS+R   K   +HL T++Q++ E+L EY+ RF +E +KV  C+DD  M YF TGL D+ LT++     PA+  E+L
Subjt:  LWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEML

Query:  VRARQYIDDLELWKANGARWSDR------GKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEA
         +A++ ID  EL +    R   +      GKD     P   K +  G  SS RA+  ++     E  P+  R   +++FTP    ++EI    E++ +E 
Subjt:  VRARQYIDDLELWKANGARWSDR------GKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEA

Query:  LFAAPEKLRRPPGKRDKRLYKR
        L   PEKLR  P +R K  Y R
Subjt:  LFAAPEKLRRPPGKRDKRLYKR

A0A6J1D5T3 uncharacterized protein LOC1110175482.0e-10181.25Show/hide
Query:  MDFLAASDPMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFT
        MDFLAASD +KCRAFQIALEGS RLWY+QLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHL TVKQRD ESLTEYIAR MDEHVKVVSCTDDI MMYFT
Subjt:  MDFLAASDPMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFT

Query:  TGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPL
        TGLND+NLTIEF SRPPASLN+ML RARQYID LELWKA GAR S RGKDR+Q+S PPKK+ S   SSSR+A D +SRG  DE+  SD  GPKFDKFTPL
Subjt:  TGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPL

Query:  NASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKR
        NASVAEIYA  E+TD++ALF AP+KL RP GKRDKRLY R
Subjt:  NASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKR

A0A6J1DIZ8 uncharacterized protein LOC1110204752.7e-6959.26Show/hide
Query:  MNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARF
        M+SYDGSGD ISYVEVFEGKMDFLA SD MKC AFQI LEGS RLWYRQLK RSIDSYQQLRRLFINQFS RQ LKLP SHLGTVKQRD ES T YIARF
Subjt:  MNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARF

Query:  MDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGH
        MDEHVKVVSCTDDI MMYFTTGLND+NLTIEF S  PA LNEM  RARQYID LELW A+GA      +       PP+                     
Subjt:  MDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGH

Query:  WDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKRIMATTLHAVS
                H G        +          A  T         EKLRRP GKRDKRLY R      H  S
Subjt:  WDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKRIMATTLHAVS

A0A6J1DS95 uncharacterized protein LOC1110234216.2e-5843.17Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQCRLVDPH-RVAEQEEPPFSQAILDAPIPPRFKAPVMNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSAR
        R E D LR +   ++E LK +C   D      +  E PF+  +L+APIPP+FKAP +  YDG+ D   YVEVFEG MDF AASD +KCRAFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCRLVDPH-RVAEQEEPPFSQAILDAPIPPRFKAPVMNSYDGSGDLISYVEVFEGKMDFLAASDPMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEML
        LWYR+L  RSI +Y QLRR F+ QFS+R   K   +HL T++Q++ E+L EY+ RF +E +KV  C+DD  M YF TGL D+ LT++     PA+  E+L
Subjt:  LWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEML

Query:  VRARQYIDDLELWKANGARWSDR------GKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEA
         +A++ ID  EL +    R   +      GKD  +  P   K +  G  SS RA+  ++     E  P+  R   +++FTP    + EI    E++ +E 
Subjt:  VRARQYIDDLELWKANGARWSDR------GKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEA

Query:  LFAAPEKLRRPPGKRDKRLYKR
        L   PEKLR  P +R K  Y R
Subjt:  LFAAPEKLRRPPGKRDKRLYKR

A0A6J1E0L8 uncharacterized protein LOC1110253102.3e-6077.5Show/hide
Query:  MDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGH
        MDEHVKVVSCTDDI MMYFTTGLND+NLTIEF SRPPASLNEM  RARQYID LELWKANGAR S RG+DR+ KSPP KK+     SSSRRADD KSR  
Subjt:  MDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFESRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGH

Query:  WDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKR
         DE+  S+ RGPKFDKFTPLNAS+AEIYA  EDTD+E LFA+PEKLRRP GKR+KRLY R
Subjt:  WDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGACGCGGTCACTTCCCGAGCCCATCCACGACTCACGTATTCTCAGGTGGCAGGAACTCCCGTCGTCAAACAACGACCCCACGCGGGGGCGGTCAAGGAGAATGG
AGGTCGGCCTGCCACATCTGATCCAGTAGCAGTCCGGGATTTCCACCTCGCCTCAGATCAATTCCCGCCACTGCAACCTCAGAGAAACGGGTTGCCACCCCGCGCACCTC
GCCTCCGCGGTTGGGGAAACACGGGCGCACGTTCCGGGGCAAGTGCTGATGCGGGCGTAGACCCCGTCATGGTAGCCGACGTGATCGTCGAGCTCAAAGAAGTCAAAGCA
AGGCTTGAGGCGGTCGAGAGAGGCAGCGAATTGTCCGGCTCTTTCGTCTCCAGGGACCCCATTCGAGGAAAGGGACCGATGCATCCTACCCAAAGAACGGAGTATCATTT
CCGACCCTGCAGGGAGGCCCGAGCTGGAGCACCCTCGCGAAAGCCACAACGAGTGACGACAGGCGGTGCTCTGGAAGCACAGATTCACGACCACCCTCGGCAGGACGATC
GGGTCGAGGGCCGGCGCCCGAGGATCCGACCAATTCGGACCCCCTTTGCATCTTTCGATAATTCCAATGCTCATCAGGGCCGAGGTGCCGAGACGCCAAGACGACGAGTA
GTGGCTCCCGAAGATCGGGAGTATTTGGTTGACGATGAGGAGGAAAGCCCAGTGGTCGACGTTCAAGAAAGGTCCTCCCACGCTGACCATTCGTTCCGGTCTGAGGTGGA
CCTCCTCCGGGATCAGTTTCAGAGGGAGATAGAAGATCTCAAGCGACAGTGCAGGCTTGTGGATCCGCATCGCGTGGCCGAGCAAGAGGAACCGCCTTTCTCCCAAGCGA
TCCTGGATGCACCTATCCCACCGAGGTTCAAGGCTCCGGTCATGAATTCTTACGACGGATCTGGAGATCTGATCTCCTATGTAGAGGTGTTCGAGGGGAAGATGGATTTC
CTGGCCGCAAGCGACCCTATGAAGTGCCGAGCATTTCAAATAGCCTTGGAAGGATCGGCAAGATTGTGGTACCGACAGTTGAAGCCCCGGTCCATCGATAGTTACCAACA
GCTAAGGAGGTTGTTCATCAACCAGTTCTCGGCTCGGCAGTTGTTGAAATTGCCACCCTCTCACCTCGGAACAGTAAAGCAACGGGACAGAGAGTCCCTGACAGAGTACA
TCGCTCGATTTATGGACGAGCATGTCAAAGTGGTAAGTTGCACCGATGACATCGTCATGATGTACTTCACCACGGGCTTGAACGACAAGAACCTAACGATAGAGTTCGAA
AGCCGACCACCGGCCTCCCTGAACGAGATGCTCGTTAGAGCTCGCCAGTACATTGACGACTTGGAGTTGTGGAAAGCCAATGGAGCACGGTGGAGCGACCGTGGTAAAGA
TCGGAACCAAAAGTCCCCTCCTCCCAAGAAGCAGCGCAGCCACGGCTGGAGCTCGTCTCGACGGGCCGACGACAGTAAGAGTAGAGGCCATTGGGATGAGAAAGCCCCTT
CAGACCATCGGGGGCCAAAATTCGACAAGTTCACTCCGTTGAATGCCTCAGTCGCGGAGATCTACGCGGCAGCCGAAGACACCGACCTGGAGGCACTCTTCGCGGCCCCA
GAAAAGCTTCGTCGACCTCCAGGGAAACGAGACAAGCGGCTCTACAAGCGGATCATGGCCACGACACTTCACGCTGTTTCCACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGACGCGGTCACTTCCCGAGCCCATCCACGACTCACGTATTCTCAGGTGGCAGGAACTCCCGTCGTCAAACAACGACCCCACGCGGGGGCGGTCAAGGAGAATGG
AGGTCGGCCTGCCACATCTGATCCAGTAGCAGTCCGGGATTTCCACCTCGCCTCAGATCAATTCCCGCCACTGCAACCTCAGAGAAACGGGTTGCCACCCCGCGCACCTC
GCCTCCGCGGTTGGGGAAACACGGGCGCACGTTCCGGGGCAAGTGCTGATGCGGGCGTAGACCCCGTCATGGTAGCCGACGTGATCGTCGAGCTCAAAGAAGTCAAAGCA
AGGCTTGAGGCGGTCGAGAGAGGCAGCGAATTGTCCGGCTCTTTCGTCTCCAGGGACCCCATTCGAGGAAAGGGACCGATGCATCCTACCCAAAGAACGGAGTATCATTT
CCGACCCTGCAGGGAGGCCCGAGCTGGAGCACCCTCGCGAAAGCCACAACGAGTGACGACAGGCGGTGCTCTGGAAGCACAGATTCACGACCACCCTCGGCAGGACGATC
GGGTCGAGGGCCGGCGCCCGAGGATCCGACCAATTCGGACCCCCTTTGCATCTTTCGATAATTCCAATGCTCATCAGGGCCGAGGTGCCGAGACGCCAAGACGACGAGTA
GTGGCTCCCGAAGATCGGGAGTATTTGGTTGACGATGAGGAGGAAAGCCCAGTGGTCGACGTTCAAGAAAGGTCCTCCCACGCTGACCATTCGTTCCGGTCTGAGGTGGA
CCTCCTCCGGGATCAGTTTCAGAGGGAGATAGAAGATCTCAAGCGACAGTGCAGGCTTGTGGATCCGCATCGCGTGGCCGAGCAAGAGGAACCGCCTTTCTCCCAAGCGA
TCCTGGATGCACCTATCCCACCGAGGTTCAAGGCTCCGGTCATGAATTCTTACGACGGATCTGGAGATCTGATCTCCTATGTAGAGGTGTTCGAGGGGAAGATGGATTTC
CTGGCCGCAAGCGACCCTATGAAGTGCCGAGCATTTCAAATAGCCTTGGAAGGATCGGCAAGATTGTGGTACCGACAGTTGAAGCCCCGGTCCATCGATAGTTACCAACA
GCTAAGGAGGTTGTTCATCAACCAGTTCTCGGCTCGGCAGTTGTTGAAATTGCCACCCTCTCACCTCGGAACAGTAAAGCAACGGGACAGAGAGTCCCTGACAGAGTACA
TCGCTCGATTTATGGACGAGCATGTCAAAGTGGTAAGTTGCACCGATGACATCGTCATGATGTACTTCACCACGGGCTTGAACGACAAGAACCTAACGATAGAGTTCGAA
AGCCGACCACCGGCCTCCCTGAACGAGATGCTCGTTAGAGCTCGCCAGTACATTGACGACTTGGAGTTGTGGAAAGCCAATGGAGCACGGTGGAGCGACCGTGGTAAAGA
TCGGAACCAAAAGTCCCCTCCTCCCAAGAAGCAGCGCAGCCACGGCTGGAGCTCGTCTCGACGGGCCGACGACAGTAAGAGTAGAGGCCATTGGGATGAGAAAGCCCCTT
CAGACCATCGGGGGCCAAAATTCGACAAGTTCACTCCGTTGAATGCCTCAGTCGCGGAGATCTACGCGGCAGCCGAAGACACCGACCTGGAGGCACTCTTCGCGGCCCCA
GAAAAGCTTCGTCGACCTCCAGGGAAACGAGACAAGCGGCTCTACAAGCGGATCATGGCCACGACACTTCACGCTGTTTCCACTTGA
Protein sequenceShow/hide protein sequence
MRDAVTSRAHPRLTYSQVAGTPVVKQRPHAGAVKENGGRPATSDPVAVRDFHLASDQFPPLQPQRNGLPPRAPRLRGWGNTGARSGASADAGVDPVMVADVIVELKEVKA
RLEAVERGSELSGSFVSRDPIRGKGPMHPTQRTEYHFRPCREARAGAPSRKPQRVTTGGALEAQIHDHPRQDDRVEGRRPRIRPIRTPFASFDNSNAHQGRGAETPRRRV
VAPEDREYLVDDEEESPVVDVQERSSHADHSFRSEVDLLRDQFQREIEDLKRQCRLVDPHRVAEQEEPPFSQAILDAPIPPRFKAPVMNSYDGSGDLISYVEVFEGKMDF
LAASDPMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPPSHLGTVKQRDRESLTEYIARFMDEHVKVVSCTDDIVMMYFTTGLNDKNLTIEFE
SRPPASLNEMLVRARQYIDDLELWKANGARWSDRGKDRNQKSPPPKKQRSHGWSSSRRADDSKSRGHWDEKAPSDHRGPKFDKFTPLNASVAEIYAAAEDTDLEALFAAP
EKLRRPPGKRDKRLYKRIMATTLHAVST