CuGenDBv2

Moc06g18540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g18540
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:14466121..14468594
RNA-Seq Expression: Moc06g18540
Synteny: Moc06g18540
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] (e value: 9.5e-60; % identity: 48.94)
Query:  KIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLE-AAAALEGELKEARAEA
        +IE SS+GVR++  +IS    DRC RRASKFVSAPGS +QR +DY  EA   + Q+AL VKAEL+GR++L  +E+E FSA+LE A++ ++ EL +A +E 
Subjt:  KIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLE-AAAALEGELKEARAEA

Query:  QAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDF
        +  K+  ++    LK    E       LR  H +   LE+EKF L+K+ DD+ +        L+A+D E+    A+LE ++ +LSN VLLEEAFRQHPDF
Subjt:  QAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDF

Query:  DWFAKDFSDAGFKFLMKGVQEVAPEL--DLTPIKVRYVEKWASGPNGTPGPQYCINQCLKELDSDV-ELDEDPSSQNAIGAIPS
        D FAKDFSDAGFKFLMKG+    P+L  DL+ +K RY EKWASGP GTPGPQ  ++Q +++LDSD  + +ED       GA P+
Subjt:  DWFAKDFSDAGFKFLMKGVQEVAPEL--DLTPIKVRYVEKWASGPNGTPGPQYCINQCLKELDSDV-ELDEDPSSQNAIGAIPS

XP_022150867.1 uncharacterized protein LOC111018913 [Momordica charantia] (e value: 5.2e-66; % identity: 47.94)
Query:  KILEVSPLREIKKKASPKKSEKKRRKTHHSRDEVREMGASRRVSPFEDLVDDPKARMGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVS
        ++ +VSPL+E+++K+   KS+  +RKT  S D V E+    RV     L +DPKAR+G T D+ +RFKIE SSAG++E+  K S  CFDR  ++ASKFV 
Subjt:  KILEVSPLREIKKKASPKKSEKKRRKTHHSRDEVREMGASRRVSPFEDLVDDPKARMGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVS

Query:  APGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALEGELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVV
         P S I++++DYT + HA++C  A+++K++L+ RDL+ V EREAFS +LE A  LE ELKEAR E +  KS  +   A+ KS + E     E  +  +V+
Subjt:  APGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALEGELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVV

Query:  ANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAPELDLTPIKVR
           LE EKF LM++ND L R         K   +E+ EL+ ++EL ++KLSN VLLEEAF+ H DFD F  DFSD  FKFLMKG+ EVA +LDL P+K  
Subjt:  ANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAPELDLTPIKVR

Query:  YVEKWASGPNGTPGP
        Y +KWASGP  T GP
Subjt:  YVEKWASGPNGTPGP

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia] (e value: 6.6e-69; % identity: 49.39)
Query:  MGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALE
        MGGT D+  RF++E SS+GV+++  +IS  C DRC +RASKFVS PGS +QR +D   EA   +  +A+MVKAEL+GR+ L  KERE  SA+LEAA  L+
Subjt:  MGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALE

Query:  GELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLL
        GEL +A+ E    ++  DA KAEL   + E   H  +LR  H +   LEKEKF L+K+ DDL ++   LEGK    D  +  L A+L+  + +L+N  LL
Subjt:  GELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLL

Query:  EEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAP--ELDLTPIKVRYVEKWASGPNGTPGPQYCINQCLKELDSDVELDEDPSSQNAIGAIPSTVSVNSF
        EE+FRQH DFD FAKDFSDAGFKFLMKG+    P  ++DL+ +K +Y EKWASGPNGTPGPQ  + + ++ELDSD     D   ++A    P+ +     
Subjt:  EEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAP--ELDLTPIKVRYVEKWASGPNGTPGPQYCINQCLKELDSDVELDEDPSSQNAIGAIPSTVSVNSF

Query:  QEPGGTD-SQEVDILGSQGELRSHLESS
        + P   D SQEV++LGS+GEL SHL SS
Subjt:  QEPGGTD-SQEVDILGSQGELRSHLESS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 1.5e-60; % identity: 44.65)
Query:  SSPKKSDSVEEFSSRLDSELEEEEIENFRFSDDDGDDSDTSTSGQGLEYPFQIPENYLGPLRRRYNIPDDITLRLPKGRERADNPQDG------------
        SS   S+   + + RL+S+L  EEIEN R S DDG+DSD STSGQGLEYP +IPE+YLG LRR + IP++I LRLP+  ERADNP +G            
Subjt:  SSPKKSDSVEEFSSRLDSELEEEEIENFRFSDDDGDDSDTSTSGQGLEYPFQIPENYLGPLRRRYNIPDDITLRLPKGRERADNPQDG------------

Query:  -----------------------------------------CREVDDLDLLGVDQLLAFFEVKRISRKLGTYYLCARKGAEGVLKGPTSIKKWVGKWFFS
                                                  R+ ++ +L  VDQLLA FE KRI++K G +Y+CARKGA G++KGPTSIK WV KWF++
Subjt:  -----------------------------------------CREVDDLDLLGVDQLLAFFEVKRISRKLGTYYLCARKGAEGVLKGPTSIKKWVGKWFFS

Query:  SGSWLAKNEFDLPFHSVPCRFRNLVAIRPIPQLSELIFNALKFFKDKFKSGKQISTLITDKLLLASRLLDYNPR-------------AMVCGFSQSVKRK
        SG WLAK+E    F  VP RF NLV+IRP+P+L++  F+ LK++K++F  G+++ TL+TD+LLL S LLDYNP              AMVCGF+  VKRK
Subjt:  SGSWLAKNEFDLPFHSVPCRFRNLVAIRPIPQLSELIFNALKFFKDKFKSGKQISTLITDKLLLASRLLDYNPR-------------AMVCGFSQSVKRK

Query:  ---RLSAKTAAKSIEAPSPMVADLPAE
           R  A  AA+S +  +P V    +E
Subjt:  ---RLSAKTAAKSIEAPSPMVADLPAE

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 2.3e-106; % identity: 45.2)
Query:  LCARKGAEGVLKGPTSIKKWVGKWFFSSGSWLAKNEFDLPFHSVPCRFRNLVAIRPIPQLSELIFNALKFFKDKFKSGKQISTLITDKLLLASRLLDYNP
        +CARKG  G++KGPTSIK WVGKWFF+SG WLAK+E    F  VP RF NLV+I+ IP+L++  F+ LK +KD F   ++I TL+TDKLLL S LLDYNP
Subjt:  LCARKGAEGVLKGPTSIKKWVGKWFFSSGSWLAKNEFDLPFHSVPCRFRNLVAIRPIPQLSELIFNALKFFKDKFKSGKQISTLITDKLLLASRLLDYNP

Query:  -------------RAMVCGFSQSVKRKRLSAKTAAKSIEAPSPMVADLPAEVEVVGADLAAASSQGVVPIGTS----------QDQKILEVSPLREIKKK
                      AMVCGF+ SVKRK      A K++    P+   +P       +  ++A    V+ +  S          ++ + L+VSPL E++ +
Subjt:  -------------RAMVCGFSQSVKRKRLSAKTAAKSIEAPSPMVADLPAEVEVVGADLAAASSQGVVPIGTS----------QDQKILEVSPLREIKKK

Query:  ASPKKSEKKRRKTHHSRDEVREMGASRRV-SPFEDLVDDPKARMGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYT
         SP +  +K++KT  S     E GA   + +   DLVDDP+ARM GTS++ +RF +E SS+GV+++  +IS  C DR  RRASKFVS PGS +QR +D  
Subjt:  ASPKKSEKKRRKTHHSRDEVREMGASRRV-SPFEDLVDDPKARMGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYT

Query:  TEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALEGELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMK
         EA   +   A+MVKAEL+GR+ L  KERE   A+LEAA  L+GEL +A+ E    ++  DA    LK    E   H  +LR  H +   LEKEKF L+K
Subjt:  TEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALEGELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMK

Query:  QNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAP--ELDLTPIKVRYVEKWASGPNG
        + DDL ++       L+ +DA +  L  +L+  + +L+N  LLEE+FRQHPDFD FAKDFSDAGFKFLMKG+    P  ++DL  +K +Y EKWASGPNG
Subjt:  QNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAP--ELDLTPIKVRYVEKWASGPNG

Query:  TPGPQYCINQCLKELD---SDVELDEDPSSQ
        TP PQ  +++ ++ELD   SD+E ++ PS +
Subjt:  TPGPQYCINQCLKELD---SDVELDEDPSSQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D971 uncharacterized protein LOC111018538 (e value: 4.6e-60; % identity: 48.94)
Query:  KIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLE-AAAALEGELKEARAEA
        +IE SS+GVR++  +IS    DRC RRASKFVSAPGS +QR +DY  EA   + Q+AL VKAEL+GR++L  +E+E FSA+LE A++ ++ EL +A +E 
Subjt:  KIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLE-AAAALEGELKEARAEA

Query:  QAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDF
        +  K+  ++    LK    E       LR  H +   LE+EKF L+K+ DD+ +        L+A+D E+    A+LE ++ +LSN VLLEEAFRQHPDF
Subjt:  QAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDF

Query:  DWFAKDFSDAGFKFLMKGVQEVAPEL--DLTPIKVRYVEKWASGPNGTPGPQYCINQCLKELDSDV-ELDEDPSSQNAIGAIPS
        D FAKDFSDAGFKFLMKG+    P+L  DL+ +K RY EKWASGP GTPGPQ  ++Q +++LDSD  + +ED       GA P+
Subjt:  DWFAKDFSDAGFKFLMKGVQEVAPEL--DLTPIKVRYVEKWASGPNGTPGPQYCINQCLKELDSDV-ELDEDPSSQNAIGAIPS

A0A6J1DBX9 uncharacterized protein LOC111018913 (e value: 2.5e-66; % identity: 47.94)
Query:  KILEVSPLREIKKKASPKKSEKKRRKTHHSRDEVREMGASRRVSPFEDLVDDPKARMGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVS
        ++ +VSPL+E+++K+   KS+  +RKT  S D V E+    RV     L +DPKAR+G T D+ +RFKIE SSAG++E+  K S  CFDR  ++ASKFV 
Subjt:  KILEVSPLREIKKKASPKKSEKKRRKTHHSRDEVREMGASRRVSPFEDLVDDPKARMGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVS

Query:  APGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALEGELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVV
         P S I++++DYT + HA++C  A+++K++L+ RDL+ V EREAFS +LE A  LE ELKEAR E +  KS  +   A+ KS + E     E  +  +V+
Subjt:  APGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALEGELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVV

Query:  ANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAPELDLTPIKVR
           LE EKF LM++ND L R         K   +E+ EL+ ++EL ++KLSN VLLEEAF+ H DFD F  DFSD  FKFLMKG+ EVA +LDL P+K  
Subjt:  ANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAPELDLTPIKVR

Query:  YVEKWASGPNGTPGP
        Y +KWASGP  T GP
Subjt:  YVEKWASGPNGTPGP

A0A6J1DF31 uncharacterized protein LOC111019909 (e value: 3.2e-69; % identity: 49.39)
Query:  MGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALE
        MGGT D+  RF++E SS+GV+++  +IS  C DRC +RASKFVS PGS +QR +D   EA   +  +A+MVKAEL+GR+ L  KERE  SA+LEAA  L+
Subjt:  MGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYTTEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALE

Query:  GELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLL
        GEL +A+ E    ++  DA KAEL   + E   H  +LR  H +   LEKEKF L+K+ DDL ++   LEGK    D  +  L A+L+  + +L+N  LL
Subjt:  GELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLL

Query:  EEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAP--ELDLTPIKVRYVEKWASGPNGTPGPQYCINQCLKELDSDVELDEDPSSQNAIGAIPSTVSVNSF
        EE+FRQH DFD FAKDFSDAGFKFLMKG+    P  ++DL+ +K +Y EKWASGPNGTPGPQ  + + ++ELDSD     D   ++A    P+ +     
Subjt:  EEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAP--ELDLTPIKVRYVEKWASGPNGTPGPQYCINQCLKELDSDVELDEDPSSQNAIGAIPSTVSVNSF

Query:  QEPGGTD-SQEVDILGSQGELRSHLESS
        + P   D SQEV++LGS+GEL SHL SS
Subjt:  QEPGGTD-SQEVDILGSQGELRSHLESS

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 7.1e-61; % identity: 44.65)
Query:  SSPKKSDSVEEFSSRLDSELEEEEIENFRFSDDDGDDSDTSTSGQGLEYPFQIPENYLGPLRRRYNIPDDITLRLPKGRERADNPQDG------------
        SS   S+   + + RL+S+L  EEIEN R S DDG+DSD STSGQGLEYP +IPE+YLG LRR + IP++I LRLP+  ERADNP +G            
Subjt:  SSPKKSDSVEEFSSRLDSELEEEEIENFRFSDDDGDDSDTSTSGQGLEYPFQIPENYLGPLRRRYNIPDDITLRLPKGRERADNPQDG------------

Query:  -----------------------------------------CREVDDLDLLGVDQLLAFFEVKRISRKLGTYYLCARKGAEGVLKGPTSIKKWVGKWFFS
                                                  R+ ++ +L  VDQLLA FE KRI++K G +Y+CARKGA G++KGPTSIK WV KWF++
Subjt:  -----------------------------------------CREVDDLDLLGVDQLLAFFEVKRISRKLGTYYLCARKGAEGVLKGPTSIKKWVGKWFFS

Query:  SGSWLAKNEFDLPFHSVPCRFRNLVAIRPIPQLSELIFNALKFFKDKFKSGKQISTLITDKLLLASRLLDYNPR-------------AMVCGFSQSVKRK
        SG WLAK+E    F  VP RF NLV+IRP+P+L++  F+ LK++K++F  G+++ TL+TD+LLL S LLDYNP              AMVCGF+  VKRK
Subjt:  SGSWLAKNEFDLPFHSVPCRFRNLVAIRPIPQLSELIFNALKFFKDKFKSGKQISTLITDKLLLASRLLDYNPR-------------AMVCGFSQSVKRK

Query:  ---RLSAKTAAKSIEAPSPMVADLPAE
           R  A  AA+S +  +P V    +E
Subjt:  ---RLSAKTAAKSIEAPSPMVADLPAE

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 1.1e-106; % identity: 45.2)
Query:  LCARKGAEGVLKGPTSIKKWVGKWFFSSGSWLAKNEFDLPFHSVPCRFRNLVAIRPIPQLSELIFNALKFFKDKFKSGKQISTLITDKLLLASRLLDYNP
        +CARKG  G++KGPTSIK WVGKWFF+SG WLAK+E    F  VP RF NLV+I+ IP+L++  F+ LK +KD F   ++I TL+TDKLLL S LLDYNP
Subjt:  LCARKGAEGVLKGPTSIKKWVGKWFFSSGSWLAKNEFDLPFHSVPCRFRNLVAIRPIPQLSELIFNALKFFKDKFKSGKQISTLITDKLLLASRLLDYNP

Query:  -------------RAMVCGFSQSVKRKRLSAKTAAKSIEAPSPMVADLPAEVEVVGADLAAASSQGVVPIGTS----------QDQKILEVSPLREIKKK
                      AMVCGF+ SVKRK      A K++    P+   +P       +  ++A    V+ +  S          ++ + L+VSPL E++ +
Subjt:  -------------RAMVCGFSQSVKRKRLSAKTAAKSIEAPSPMVADLPAEVEVVGADLAAASSQGVVPIGTS----------QDQKILEVSPLREIKKK

Query:  ASPKKSEKKRRKTHHSRDEVREMGASRRV-SPFEDLVDDPKARMGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYT
         SP +  +K++KT  S     E GA   + +   DLVDDP+ARM GTS++ +RF +E SS+GV+++  +IS  C DR  RRASKFVS PGS +QR +D  
Subjt:  ASPKKSEKKRRKTHHSRDEVREMGASRRV-SPFEDLVDDPKARMGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYT

Query:  TEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALEGELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMK
         EA   +   A+MVKAEL+GR+ L  KERE   A+LEAA  L+GEL +A+ E    ++  DA    LK    E   H  +LR  H +   LEKEKF L+K
Subjt:  TEAHAIACQTALMVKAELEGRDLLTVKEREAFSASLEAAAALEGELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMK

Query:  QNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAP--ELDLTPIKVRYVEKWASGPNG
        + DDL ++       L+ +DA +  L  +L+  + +L+N  LLEE+FRQHPDFD FAKDFSDAGFKFLMKG+    P  ++DL  +K +Y EKWASGPNG
Subjt:  QNDDLERLRDDLEGKLKARDAEMAELRAKLELSESKLSNRVLLEEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAP--ELDLTPIKVRYVEKWASGPNG

Query:  TPGPQYCINQCLKELD---SDVELDEDPSSQ
        TP PQ  +++ ++ELD   SD+E ++ PS +
Subjt:  TPGPQYCINQCLKELD---SDVELDEDPSSQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCGGAGTTCTCTAGTTCATCCTCCGTTAGTAGCTCATCTAGCTACATAGTTCGGAATCGGGACTCTTCGCCTAAGAAATCTGATTCCGTAGAGGAGTTCTCCAGTAG
GTTAGATTCCGAACTAGAGGAGGAAGAGATAGAAAACTTTAGGTTTTCTGATGATGACGGGGATGATAGTGACACGTCCACTTCGGGTCAAGGTTTAGAATATCCTTTCC
AGATCCCTGAGAACTACCTCGGCCCTCTTCGTAGGAGATATAACATACCTGATGACATAACCCTTAGGCTTCCCAAGGGACGAGAAAGAGCCGATAATCCTCAAGATGGG
TGTAGAGAAGTGGACGACCTGGACCTCCTCGGAGTTGACCAACTTTTGGCTTTCTTCGAAGTCAAGCGAATTTCTAGAAAGCTAGGGACGTACTATCTGTGTGCTAGGAA
GGGCGCAGAAGGCGTTTTGAAAGGACCGACCTCCATAAAGAAGTGGGTTGGGAAGTGGTTCTTCTCCTCCGGATCGTGGCTGGCTAAGAACGAGTTCGACTTGCCCTTCC
ACAGCGTCCCTTGTAGGTTTAGGAACTTAGTTGCTATTCGGCCGATTCCTCAACTCTCTGAGCTGATCTTCAACGCCTTGAAATTTTTCAAAGATAAGTTCAAGAGTGGA
AAGCAGATCAGTACCCTTATAACAGATAAACTTCTCCTCGCTTCGAGACTCCTCGACTACAACCCTCGCGCGATGGTTTGTGGCTTTTCTCAAAGCGTGAAGCGCAAACG
CCTGAGCGCTAAAACAGCTGCCAAAAGCATTGAGGCGCCCAGCCCCATGGTAGCCGACCTTCCTGCCGAGGTCGAGGTGGTCGGGGCTGACCTTGCTGCTGCTTCATCTC
AAGGAGTAGTTCCGATTGGGACCTCGCAGGATCAGAAGATCCTCGAGGTCTCTCCCCTTAGGGAGATTAAAAAGAAGGCTTCTCCCAAGAAGTCCGAAAAGAAAAGGAGA
AAGACCCACCACTCTAGGGACGAAGTGAGGGAGATGGGTGCTAGTCGGCGGGTCAGTCCCTTCGAGGACCTGGTGGACGATCCTAAGGCCAGGATGGGTGGCACCTCTGA
CCTCGAGATTAGATTCAAGATCGAGCAATCAAGTGCTGGGGTGAGGGAGAGAGCCATGAAGATCTCCGGGTTTTGTTTTGACCGCTGCTGGAGGAGAGCTTCTAAGTTTG
TTAGCGCTCCGGGATCGGCCATCCAACGATTGTTGGATTACACTACCGAGGCTCACGCTATTGCTTGCCAGACGGCCCTCATGGTGAAGGCCGAACTAGAAGGGCGTGAC
TTGCTCACTGTGAAGGAGCGAGAGGCCTTTTCTGCTTCTTTGGAGGCTGCTGCTGCTCTAGAGGGGGAGCTCAAAGAGGCTCGCGCTGAGGCTCAGGCGTGGAAATCCAC
TTCTGATGCCGATAAGGCTGAGCTCAAAAGTGCACAAGCAGAGGCTGCCCTACACCTGGAGAACTTGCGAGGCATGCACGTTGTGGCCAACTGCCTGGAGAAGGAGAAGT
TCGCGCTGATGAAGCAGAACGACGACCTCGAACGTCTTCGAGATGACCTTGAGGGCAAACTAAAGGCCCGAGATGCCGAGATGGCAGAGCTGAGGGCCAAGCTTGAGCTA
TCTGAGTCCAAGCTCAGCAACAGAGTTCTGCTGGAGGAAGCTTTCCGCCAACATCCTGATTTTGATTGGTTCGCCAAAGATTTCAGCGATGCTGGCTTCAAGTTCCTGAT
GAAGGGAGTCCAGGAAGTGGCCCCCGAGCTCGACCTTACACCCATCAAAGTGCGATATGTAGAGAAGTGGGCTTCGGGTCCCAATGGGACCCCTGGCCCCCAGTACTGCA
TCAATCAATGCCTGAAGGAGCTTGACTCCGATGTGGAGCTCGACGAGGACCCTTCTTCCCAAAACGCTATCGGGGCTATTCCTTCTACAGTCAGTGTAAACTCCTTCCAA
GAACCTGGGGGGACAGACTCCCAGGAGGTTGACATCCTTGGGTCCCAGGGAGAGCTCAGATCGCATCTCGAAAGCAGCTAA
mRNA sequence
ATGTCGGAGTTCTCTAGTTCATCCTCCGTTAGTAGCTCATCTAGCTACATAGTTCGGAATCGGGACTCTTCGCCTAAGAAATCTGATTCCGTAGAGGAGTTCTCCAGTAG
GTTAGATTCCGAACTAGAGGAGGAAGAGATAGAAAACTTTAGGTTTTCTGATGATGACGGGGATGATAGTGACACGTCCACTTCGGGTCAAGGTTTAGAATATCCTTTCC
AGATCCCTGAGAACTACCTCGGCCCTCTTCGTAGGAGATATAACATACCTGATGACATAACCCTTAGGCTTCCCAAGGGACGAGAAAGAGCCGATAATCCTCAAGATGGG
TGTAGAGAAGTGGACGACCTGGACCTCCTCGGAGTTGACCAACTTTTGGCTTTCTTCGAAGTCAAGCGAATTTCTAGAAAGCTAGGGACGTACTATCTGTGTGCTAGGAA
GGGCGCAGAAGGCGTTTTGAAAGGACCGACCTCCATAAAGAAGTGGGTTGGGAAGTGGTTCTTCTCCTCCGGATCGTGGCTGGCTAAGAACGAGTTCGACTTGCCCTTCC
ACAGCGTCCCTTGTAGGTTTAGGAACTTAGTTGCTATTCGGCCGATTCCTCAACTCTCTGAGCTGATCTTCAACGCCTTGAAATTTTTCAAAGATAAGTTCAAGAGTGGA
AAGCAGATCAGTACCCTTATAACAGATAAACTTCTCCTCGCTTCGAGACTCCTCGACTACAACCCTCGCGCGATGGTTTGTGGCTTTTCTCAAAGCGTGAAGCGCAAACG
CCTGAGCGCTAAAACAGCTGCCAAAAGCATTGAGGCGCCCAGCCCCATGGTAGCCGACCTTCCTGCCGAGGTCGAGGTGGTCGGGGCTGACCTTGCTGCTGCTTCATCTC
AAGGAGTAGTTCCGATTGGGACCTCGCAGGATCAGAAGATCCTCGAGGTCTCTCCCCTTAGGGAGATTAAAAAGAAGGCTTCTCCCAAGAAGTCCGAAAAGAAAAGGAGA
AAGACCCACCACTCTAGGGACGAAGTGAGGGAGATGGGTGCTAGTCGGCGGGTCAGTCCCTTCGAGGACCTGGTGGACGATCCTAAGGCCAGGATGGGTGGCACCTCTGA
CCTCGAGATTAGATTCAAGATCGAGCAATCAAGTGCTGGGGTGAGGGAGAGAGCCATGAAGATCTCCGGGTTTTGTTTTGACCGCTGCTGGAGGAGAGCTTCTAAGTTTG
TTAGCGCTCCGGGATCGGCCATCCAACGATTGTTGGATTACACTACCGAGGCTCACGCTATTGCTTGCCAGACGGCCCTCATGGTGAAGGCCGAACTAGAAGGGCGTGAC
TTGCTCACTGTGAAGGAGCGAGAGGCCTTTTCTGCTTCTTTGGAGGCTGCTGCTGCTCTAGAGGGGGAGCTCAAAGAGGCTCGCGCTGAGGCTCAGGCGTGGAAATCCAC
TTCTGATGCCGATAAGGCTGAGCTCAAAAGTGCACAAGCAGAGGCTGCCCTACACCTGGAGAACTTGCGAGGCATGCACGTTGTGGCCAACTGCCTGGAGAAGGAGAAGT
TCGCGCTGATGAAGCAGAACGACGACCTCGAACGTCTTCGAGATGACCTTGAGGGCAAACTAAAGGCCCGAGATGCCGAGATGGCAGAGCTGAGGGCCAAGCTTGAGCTA
TCTGAGTCCAAGCTCAGCAACAGAGTTCTGCTGGAGGAAGCTTTCCGCCAACATCCTGATTTTGATTGGTTCGCCAAAGATTTCAGCGATGCTGGCTTCAAGTTCCTGAT
GAAGGGAGTCCAGGAAGTGGCCCCCGAGCTCGACCTTACACCCATCAAAGTGCGATATGTAGAGAAGTGGGCTTCGGGTCCCAATGGGACCCCTGGCCCCCAGTACTGCA
TCAATCAATGCCTGAAGGAGCTTGACTCCGATGTGGAGCTCGACGAGGACCCTTCTTCCCAAAACGCTATCGGGGCTATTCCTTCTACAGTCAGTGTAAACTCCTTCCAA
GAACCTGGGGGGACAGACTCCCAGGAGGTTGACATCCTTGGGTCCCAGGGAGAGCTCAGATCGCATCTCGAAAGCAGCTAA
Protein sequence
MSEFSSSSSVSSSSSYIVRNRDSSPKKSDSVEEFSSRLDSELEEEEIENFRFSDDDGDDSDTSTSGQGLEYPFQIPENYLGPLRRRYNIPDDITLRLPKGRERADNPQDG
CREVDDLDLLGVDQLLAFFEVKRISRKLGTYYLCARKGAEGVLKGPTSIKKWVGKWFFSSGSWLAKNEFDLPFHSVPCRFRNLVAIRPIPQLSELIFNALKFFKDKFKSG
KQISTLITDKLLLASRLLDYNPRAMVCGFSQSVKRKRLSAKTAAKSIEAPSPMVADLPAEVEVVGADLAAASSQGVVPIGTSQDQKILEVSPLREIKKKASPKKSEKKRR
KTHHSRDEVREMGASRRVSPFEDLVDDPKARMGGTSDLEIRFKIEQSSAGVRERAMKISGFCFDRCWRRASKFVSAPGSAIQRLLDYTTEAHAIACQTALMVKAELEGRD
LLTVKEREAFSASLEAAAALEGELKEARAEAQAWKSTSDADKAELKSAQAEAALHLENLRGMHVVANCLEKEKFALMKQNDDLERLRDDLEGKLKARDAEMAELRAKLEL
SESKLSNRVLLEEAFRQHPDFDWFAKDFSDAGFKFLMKGVQEVAPELDLTPIKVRYVEKWASGPNGTPGPQYCINQCLKELDSDVELDEDPSSQNAIGAIPSTVSVNSFQ
EPGGTDSQEVDILGSQGELRSHLESS