; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g05960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g05960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:4952865..4960074
RNA-Seq ExpressionMoc07g05960
SyntenyMoc07g05960
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]2.6e-5841.16Show/hide
Query:  RSEVDLLRDQFQKEIEDLKWQCRPVD-PHQPVGQEEPPFSQAILDAPILPRFKAPTMGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLAR
        R E D LR Q   + E LK +C   + P       E PF+  +L+API P+FKAPT+  YDGS D   Y EVFEG MDF A S+A+KCRAFQIAL G AR
Subjt:  RSEVDLLRDQFQKEIEDLKWQCRPVD-PHQPVGQEEPPFSQAILDAPILPRFKAPTMGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLAR

Query:  LWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKANGARRS-----
        LWYR+L  RSI +Y QLRR F+ QFS++   K    HL T+RQ++ E+L EY+ RF EE +KV  C++D AM YF TGL D   T+     A  +     
Subjt:  LWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKANGARRS-----

Query:  --------SHGKDRDQKSPPPKK----------QRGDDRSSSGWAYDNKNRGRRDERVSSD--HRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEK
                 H   R +   P +K          ++ D +S    ++ +   GR + R + +   R   ++RFTP    I +I    E++ +E+L   PEK
Subjt:  --------SHGKDRDQKSPPPKK----------QRGDDRSSSGWAYDNKNRGRRDERVSSD--HRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEK

Query:  LCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC--------TSATEKKLSQK
        L     +R K  YCRFH++HGH+TS C+ LK Q+EDLI+ GY         TS+ EKK  +K
Subjt:  LCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC--------TSATEKKLSQK

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]3.1e-9664.65Show/hide
Query:  MDFLAASNAMKCRAFQIALKGLARLWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFT
        MDFLAAS+A+KCRAFQIAL+G  RLWY+QL+PRSIDSYQQLRRLFINQFSA+QLLKLPP HL T++Q+DNESLTEYIAR M+E+VKVVSCT+DIAMMYFT
Subjt:  MDFLAASNAMKCRAFQIALKGLARLWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFT

Query:  TGLDDRNFT--------------------------LWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGRRDERVSSDHRGPKFDRFTPL
        TGL+DRN T                          LWKA GARRSS GKDRDQ+S PPKK+  DD+SSS  A D+++RG+ DER SSD  GPKFD+FTPL
Subjt:  TGLDDRNFT--------------------------LWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGRRDERVSSDHRGPKFDRFTPL

Query:  NASITDIYAVAEDTDLEELFTAPEKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYCTSATEKKLSQKDQLGRRREKDHSCLGEGR
        NAS+ +IYA  E+TD++ LFTAP+KL R SGKRDK LYCRFHKDHGH++SRCFHLKEQV+DLIRRGY       +   K +   R EK       GR
Subjt:  NASITDIYAVAEDTDLEELFTAPEKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYCTSATEKKLSQKDQLGRRREKDHSCLGEGR

XP_022152851.1 uncharacterized protein LOC111020475 [Momordica charantia]5.4e-6452.48Show/hide
Query:  MGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLARLWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARF
        M SYDGSGD +SY EVFEGKMDFLA S+AMKC AFQI L+G  RLWYRQL+ RSIDSYQQLRRLFINQFS +Q LKLP  HLGT++Q+DNES T YIARF
Subjt:  MGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLARLWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARF

Query:  MEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGRRDERVSSDHRGPKFDRF---TPLNAS
        M+E+VKVVSCT+DIAMMYFTTGL+DRN T+    G+ + ++  +   ++    +Q  D      W  D  +     E  ++    P+         L   
Subjt:  MEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGRRDERVSSDHRGPKFDRF---TPLNAS

Query:  ITDIYAVAEDTDLEELFTAP--EKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC-------------TSATEKKLSQKDQLGRRREK
        +T    V       ++      EKL R SGKRDK LYCRFHKD GHDTSRCFHLKEQVEDLIRRGY               SA E+K  +     RR ++
Subjt:  ITDIYAVAEDTDLEELFTAP--EKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC-------------TSATEKKLSQKDQLGRRREK

Query:  DHS
         HS
Subjt:  DHS

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]9.9e-5841.16Show/hide
Query:  RSEVDLLRDQFQKEIEDLKWQCRPVDPHQPVGQ-EEPPFSQAILDAPILPRFKAPTMGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLAR
        R E D LR +   ++E LK +C   D     G   E PF+  +L+API P+FKAPT+  YDG+ D   Y EVFEG MDF AAS+A+KCRAFQIAL G AR
Subjt:  RSEVDLLRDQFQKEIEDLKWQCRPVDPHQPVGQ-EEPPFSQAILDAPILPRFKAPTMGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLAR

Query:  LWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKA-----------
        LWYR+L  RSI +Y QLRR F+ QFS++   K    HL T+RQ++ E+L EY+ RF EE +KV  C++D AM YF TGL D   T+              
Subjt:  LWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKA-----------

Query:  NGARRSSHGKD--RDQKSPPPKK----------QRGDDRSSSGWAYDNKNRGRRDERVSSD--HRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEK
          A++   G++  R +   P +K          +R D +S    ++ +   GR + R + +   R   ++RFTP    I +I    E++ +E+L   PEK
Subjt:  NGARRSSHGKD--RDQKSPPPKK----------QRGDDRSSSGWAYDNKNRGRRDERVSSD--HRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEK

Query:  LCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC--------TSATEKKLSQK
        L     +R K  YCRFH++HGH+TS  + LK Q+EDLI+ GY         TS+ EKK  +K
Subjt:  LCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC--------TSATEKKLSQK

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]5.3e-5962.5Show/hide
Query:  MEENVKVVSCTNDIAMMYFTTGLDDRNFT--------------------------LWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGR
        M+E+VKVVSCT+DIAMMYFTTGL+DRN T                          LWKANGARRSS G+DRD KSPP KK+  DDRSSS  A D+K+R R
Subjt:  MEENVKVVSCTNDIAMMYFTTGLDDRNFT--------------------------LWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGR

Query:  RDERVSSDHRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYCTSATEKKLSQKD
        RDERV+S+ RGPKFD+FTPLNASI +IYAV EDTD+E LF +PEKL R SGKR+K LYCRFHKDHGHDTSRCFHLKEQVEDLIR GY       +   + 
Subjt:  RDERVSSDHRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYCTSATEKKLSQKD

Query:  QLGRRREK
        +   R EK
Subjt:  QLGRRREK

TrEMBL top hitse value%identityAlignment
A0A6J1CKB3 uncharacterized protein LOC1110120811.3e-5841.16Show/hide
Query:  RSEVDLLRDQFQKEIEDLKWQCRPVD-PHQPVGQEEPPFSQAILDAPILPRFKAPTMGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLAR
        R E D LR Q   + E LK +C   + P       E PF+  +L+API P+FKAPT+  YDGS D   Y EVFEG MDF A S+A+KCRAFQIAL G AR
Subjt:  RSEVDLLRDQFQKEIEDLKWQCRPVD-PHQPVGQEEPPFSQAILDAPILPRFKAPTMGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLAR

Query:  LWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKANGARRS-----
        LWYR+L  RSI +Y QLRR F+ QFS++   K    HL T+RQ++ E+L EY+ RF EE +KV  C++D AM YF TGL D   T+     A  +     
Subjt:  LWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKANGARRS-----

Query:  --------SHGKDRDQKSPPPKK----------QRGDDRSSSGWAYDNKNRGRRDERVSSD--HRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEK
                 H   R +   P +K          ++ D +S    ++ +   GR + R + +   R   ++RFTP    I +I    E++ +E+L   PEK
Subjt:  --------SHGKDRDQKSPPPKK----------QRGDDRSSSGWAYDNKNRGRRDERVSSD--HRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEK

Query:  LCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC--------TSATEKKLSQK
        L     +R K  YCRFH++HGH+TS C+ LK Q+EDLI+ GY         TS+ EKK  +K
Subjt:  LCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC--------TSATEKKLSQK

A0A6J1D5T3 uncharacterized protein LOC1110175481.5e-9664.65Show/hide
Query:  MDFLAASNAMKCRAFQIALKGLARLWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFT
        MDFLAAS+A+KCRAFQIAL+G  RLWY+QL+PRSIDSYQQLRRLFINQFSA+QLLKLPP HL T++Q+DNESLTEYIAR M+E+VKVVSCT+DIAMMYFT
Subjt:  MDFLAASNAMKCRAFQIALKGLARLWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFT

Query:  TGLDDRNFT--------------------------LWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGRRDERVSSDHRGPKFDRFTPL
        TGL+DRN T                          LWKA GARRSS GKDRDQ+S PPKK+  DD+SSS  A D+++RG+ DER SSD  GPKFD+FTPL
Subjt:  TGLDDRNFT--------------------------LWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGRRDERVSSDHRGPKFDRFTPL

Query:  NASITDIYAVAEDTDLEELFTAPEKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYCTSATEKKLSQKDQLGRRREKDHSCLGEGR
        NAS+ +IYA  E+TD++ LFTAP+KL R SGKRDK LYCRFHKDHGH++SRCFHLKEQV+DLIRRGY       +   K +   R EK       GR
Subjt:  NASITDIYAVAEDTDLEELFTAPEKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYCTSATEKKLSQKDQLGRRREKDHSCLGEGR

A0A6J1DIZ8 uncharacterized protein LOC1110204752.6e-6452.48Show/hide
Query:  MGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLARLWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARF
        M SYDGSGD +SY EVFEGKMDFLA S+AMKC AFQI L+G  RLWYRQL+ RSIDSYQQLRRLFINQFS +Q LKLP  HLGT++Q+DNES T YIARF
Subjt:  MGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLARLWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARF

Query:  MEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGRRDERVSSDHRGPKFDRF---TPLNAS
        M+E+VKVVSCT+DIAMMYFTTGL+DRN T+    G+ + ++  +   ++    +Q  D      W  D  +     E  ++    P+         L   
Subjt:  MEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGRRDERVSSDHRGPKFDRF---TPLNAS

Query:  ITDIYAVAEDTDLEELFTAP--EKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC-------------TSATEKKLSQKDQLGRRREK
        +T    V       ++      EKL R SGKRDK LYCRFHKD GHDTSRCFHLKEQVEDLIRRGY               SA E+K  +     RR ++
Subjt:  ITDIYAVAEDTDLEELFTAP--EKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC-------------TSATEKKLSQKDQLGRRREK

Query:  DHS
         HS
Subjt:  DHS

A0A6J1DS95 uncharacterized protein LOC1110234214.8e-5841.16Show/hide
Query:  RSEVDLLRDQFQKEIEDLKWQCRPVDPHQPVGQ-EEPPFSQAILDAPILPRFKAPTMGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLAR
        R E D LR +   ++E LK +C   D     G   E PF+  +L+API P+FKAPT+  YDG+ D   Y EVFEG MDF AAS+A+KCRAFQIAL G AR
Subjt:  RSEVDLLRDQFQKEIEDLKWQCRPVDPHQPVGQ-EEPPFSQAILDAPILPRFKAPTMGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLAR

Query:  LWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKA-----------
        LWYR+L  RSI +Y QLRR F+ QFS++   K    HL T+RQ++ E+L EY+ RF EE +KV  C++D AM YF TGL D   T+              
Subjt:  LWYRQLRPRSIDSYQQLRRLFINQFSAQQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKA-----------

Query:  NGARRSSHGKD--RDQKSPPPKK----------QRGDDRSSSGWAYDNKNRGRRDERVSSD--HRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEK
          A++   G++  R +   P +K          +R D +S    ++ +   GR + R + +   R   ++RFTP    I +I    E++ +E+L   PEK
Subjt:  NGARRSSHGKD--RDQKSPPPKK----------QRGDDRSSSGWAYDNKNRGRRDERVSSD--HRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEK

Query:  LCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC--------TSATEKKLSQK
        L     +R K  YCRFH++HGH+TS  + LK Q+EDLI+ GY         TS+ EKK  +K
Subjt:  LCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYC--------TSATEKKLSQK

A0A6J1E0L8 uncharacterized protein LOC1110253102.5e-5962.5Show/hide
Query:  MEENVKVVSCTNDIAMMYFTTGLDDRNFT--------------------------LWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGR
        M+E+VKVVSCT+DIAMMYFTTGL+DRN T                          LWKANGARRSS G+DRD KSPP KK+  DDRSSS  A D+K+R R
Subjt:  MEENVKVVSCTNDIAMMYFTTGLDDRNFT--------------------------LWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGR

Query:  RDERVSSDHRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYCTSATEKKLSQKD
        RDERV+S+ RGPKFD+FTPLNASI +IYAV EDTD+E LF +PEKL R SGKR+K LYCRFHKDHGHDTSRCFHLKEQVEDLIR GY       +   + 
Subjt:  RDERVSSDHRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYCTSATEKKLSQKD

Query:  QLGRRREK
        +   R EK
Subjt:  QLGRRREK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCCCGACAAGGCCTCAACGGCTCTTTAAGGTATCCGTGAAATGTGGCATGATGTCGTGGAATATGTCGGAGGATGAGCCGAGTCGAGATAGGAGGAATCGATTTAT
CCAAAGTAGGGGAACAAGATTCCAAGTAGCTTGGTCGGCTCGGGGCCGAGGTGAGGGGCCATCAAGACGACGACTGGTGGCACCCGAGGATCGGGAGTACCGGATCGACG
ACCAGGAGGGAAGCCCGGAGGTCAACAATCGAGAGAGGTCCTCCCAGGGTGATCACTCGTTTCGGTCCGAGGTGGACCTCCTCCGGGACCAATTTCAGAAGGAGATAGAA
GATCTCAAGTGGCAGTGCAGACCTGTAGACCCACATCAGCCGGTCGGGCAGGAAGAGCCGCCTTTCTCCCAAGCTATCCTGGACGCGCCCATCCTGCCAAGGTTCAAAGC
TCCGACCATGGGTTCCTATGACGGGTCTGGAGATCTAGTCTCTTACGCGGAGGTGTTCGAGGGAAAGATGGACTTCTTGGCCGCAAGTAATGCTATGAAGTGCCGAGCAT
TTCAAATAGCCTTGAAAGGCTTGGCCAGGTTATGGTACCGACAGTTGAGGCCCCGATCCATAGACAGTTATCAACAGCTGAGAAGGTTGTTCATCAACCAGTTCTCAGCT
CAGCAGTTGTTGAAGTTGCCGCCCTTGCACCTCGGAACAATGAGGCAACAGGATAATGAGTCCCTGACGGAGTACATCGCTCGATTCATGGAAGAGAATGTCAAGGTGGT
GAGTTGTACCAACGACATCGCCATGATGTACTTTACCACGGGTTTAGACGACAGGAATTTTACGCTATGGAAGGCCAATGGAGCCAGGCGGAGCAGCCACGGTAAAGATC
GGGACCAGAAGTCCCCTCCTCCCAAGAAGCAACGTGGTGATGATCGGAGCTCGTCTGGGTGGGCCTACGATAACAAGAATAGAGGTCGTCGCGACGAGAGAGTCTCTTCA
GACCATCGAGGGCCGAAGTTCGACAGGTTCACTCCGCTGAACGCCTCAATTACAGATATCTACGCGGTAGCTGAAGATACCGACTTGGAAGAGCTGTTCACAGCCCCAGA
AAAGCTCTGCCGACGTTCTGGGAAGCGAGACAAGTGCCTCTACTGCCGATTCCATAAAGATCACGGCCACGACACTTCTCGTTGCTTCCACTTAAAGGAGCAAGTCGAGG
ATTTGATCCGGAGAGGTTATTGTACGTCGGCAACAGAGAAAAAGTTGAGCCAGAAGGATCAGCTCGGGAGGAGAAGAGAGAAAGATCATAGCTGCCTAGGCGAAGGGAGG
AACGTCCTTGAAGGGATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCCCGACAAGGCCTCAACGGCTCTTTAAGGTATCCGTGAAATGTGGCATGATGTCGTGGAATATGTCGGAGGATGAGCCGAGTCGAGATAGGAGGAATCGATTTAT
CCAAAGTAGGGGAACAAGATTCCAAGTAGCTTGGTCGGCTCGGGGCCGAGGTGAGGGGCCATCAAGACGACGACTGGTGGCACCCGAGGATCGGGAGTACCGGATCGACG
ACCAGGAGGGAAGCCCGGAGGTCAACAATCGAGAGAGGTCCTCCCAGGGTGATCACTCGTTTCGGTCCGAGGTGGACCTCCTCCGGGACCAATTTCAGAAGGAGATAGAA
GATCTCAAGTGGCAGTGCAGACCTGTAGACCCACATCAGCCGGTCGGGCAGGAAGAGCCGCCTTTCTCCCAAGCTATCCTGGACGCGCCCATCCTGCCAAGGTTCAAAGC
TCCGACCATGGGTTCCTATGACGGGTCTGGAGATCTAGTCTCTTACGCGGAGGTGTTCGAGGGAAAGATGGACTTCTTGGCCGCAAGTAATGCTATGAAGTGCCGAGCAT
TTCAAATAGCCTTGAAAGGCTTGGCCAGGTTATGGTACCGACAGTTGAGGCCCCGATCCATAGACAGTTATCAACAGCTGAGAAGGTTGTTCATCAACCAGTTCTCAGCT
CAGCAGTTGTTGAAGTTGCCGCCCTTGCACCTCGGAACAATGAGGCAACAGGATAATGAGTCCCTGACGGAGTACATCGCTCGATTCATGGAAGAGAATGTCAAGGTGGT
GAGTTGTACCAACGACATCGCCATGATGTACTTTACCACGGGTTTAGACGACAGGAATTTTACGCTATGGAAGGCCAATGGAGCCAGGCGGAGCAGCCACGGTAAAGATC
GGGACCAGAAGTCCCCTCCTCCCAAGAAGCAACGTGGTGATGATCGGAGCTCGTCTGGGTGGGCCTACGATAACAAGAATAGAGGTCGTCGCGACGAGAGAGTCTCTTCA
GACCATCGAGGGCCGAAGTTCGACAGGTTCACTCCGCTGAACGCCTCAATTACAGATATCTACGCGGTAGCTGAAGATACCGACTTGGAAGAGCTGTTCACAGCCCCAGA
AAAGCTCTGCCGACGTTCTGGGAAGCGAGACAAGTGCCTCTACTGCCGATTCCATAAAGATCACGGCCACGACACTTCTCGTTGCTTCCACTTAAAGGAGCAAGTCGAGG
ATTTGATCCGGAGAGGTTATTGTACGTCGGCAACAGAGAAAAAGTTGAGCCAGAAGGATCAGCTCGGGAGGAGAAGAGAGAAAGATCATAGCTGCCTAGGCGAAGGGAGG
AACGTCCTTGAAGGGATGTGA
Protein sequenceShow/hide protein sequence
MFPTRPQRLFKVSVKCGMMSWNMSEDEPSRDRRNRFIQSRGTRFQVAWSARGRGEGPSRRRLVAPEDREYRIDDQEGSPEVNNRERSSQGDHSFRSEVDLLRDQFQKEIE
DLKWQCRPVDPHQPVGQEEPPFSQAILDAPILPRFKAPTMGSYDGSGDLVSYAEVFEGKMDFLAASNAMKCRAFQIALKGLARLWYRQLRPRSIDSYQQLRRLFINQFSA
QQLLKLPPLHLGTMRQQDNESLTEYIARFMEENVKVVSCTNDIAMMYFTTGLDDRNFTLWKANGARRSSHGKDRDQKSPPPKKQRGDDRSSSGWAYDNKNRGRRDERVSS
DHRGPKFDRFTPLNASITDIYAVAEDTDLEELFTAPEKLCRRSGKRDKCLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYCTSATEKKLSQKDQLGRRREKDHSCLGEGR
NVLEGM