CuGenDBv2

MS004256 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS004256
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: proline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome location: scaffold92:1258919..1259820
RNA-Seq Expression: MS004256
Synteny: MS004256
Gene Ontology terms: NA
InterPro domains: NA
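
The genome location above follows a `scaffold:start..end` pattern. A minimal sketch of splitting it into its parts and computing the span length follows; the function names `parse_location` and `span_length` are hypothetical, and the length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold92:1258919..1259820")
print(scaffold, start, end, span_length(start, end))
```

Under that assumption the span is 902 bp, which is longer than the 738-nt CDS listed under Sequences.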


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606253.1 hypothetical protein SDJN03_03570, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.6e-57; % identity: 55.6)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID
        MSNL+QES E Q  E +  D RFS LCLN    GG HH     C+SCG   PR A  AT  KRRSPT  QDP +T+K+  LDP QHN +     +FS+ID
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID

Query:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE
        LPIPFGP SA     SPL RS+SDP  A++ + P+    L P    PPLPLRRTVSDP PS   D  S SP  +         DSPDSKRL  IK+RL+E
Subjt:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE

Query:  MNELWNELM--KEHEE------RDTKKSGGC---KDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        MNE WNE+M  +EHEE       +TKK   C   +++ EE+VGVERVGDSLE+ LKCPCGKGFEILL+GTSCFYKLL
Subjt:  MNELWNELM--KEHEE------RDTKKSGGC---KDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

XP_022139101.1 uncharacterized protein LOC111010096 [Momordica charantia] (e value: 2.4e-138; % identity: 100)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHHCASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDPQHNHSQDLPHAFSRIDLPIPFG
        MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHHCASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDPQHNHSQDLPHAFSRIDLPIPFG
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHHCASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDPQHNHSQDLPHAFSRIDLPIPFG

Query:  PSAQIQVHSPLRRSLSDPDPAQHLNGPTLPPPPLPLRRTVSDPNPSPGPDNISGSPNKVAPTRDSPDSKRLTGIKERLREMNELWNELMKEHEERDTKKS
        PSAQIQVHSPLRRSLSDPDPAQHLNGPTLPPPPLPLRRTVSDPNPSPGPDNISGSPNKVAPTRDSPDSKRLTGIKERLREMNELWNELMKEHEERDTKKS
Subjt:  PSAQIQVHSPLRRSLSDPDPAQHLNGPTLPPPPLPLRRTVSDPNPSPGPDNISGSPNKVAPTRDSPDSKRLTGIKERLREMNELWNELMKEHEERDTKKS

Query:  GGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        GGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
Subjt:  GGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

XP_022930994.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita moschata] (e value: 6.0e-57; % identity: 55.43)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID
        MSNL+QES E Q  E +  D RFS LCLN    GG HH     C+SCG   PR A  AT  KRRSPT  QDP +T+K+  LDP QHN +     +FS+ID
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID

Query:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE
        LPIPFGP SA     SPL RS+SDP  A++ + P+    L P    PPLPLRRTVSDP PS   D  S SP  +         DSPDSKRL  IK+RL+E
Subjt:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE

Query:  MNELWNELMKEHE-------ERDTKKSGGC---KDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        MNE WNE+M E E       E +TKK   C    ++ EE+VGVERVGDSLE+ LKCPCGKGFEILL+GTSCFYKLL
Subjt:  MNELWNELMKEHE-------ERDTKKSGGC---KDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

XP_022930995.1 uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata] (e value: 3.5e-57; % identity: 55.68)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID
        MSNL+QES E Q  E +  D RFS LCLN    GG HH     C+SCG   PR A  AT  KRRSPT  QDP +T+K+  LDP QHN +     +FS+ID
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID

Query:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE
        LPIPFGP SA     SPL RS+SDP  A++ + P+    L P    PPLPLRRTVSDP PS   D  S SP  +         DSPDSKRL  IK+RL+E
Subjt:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE

Query:  MNELWNELMKEHE-------ERDTKKSGGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        MNE WNE+M E E       E +TKK     ++ EE+VGVERVGDSLE+ LKCPCGKGFEILL+GTSCFYKLL
Subjt:  MNELWNELMKEHE-------ERDTKKSGGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

XP_022995233.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima] (e value: 4.6e-57; % identity: 56.36)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID
        MSNL+QES E Q  E +  D RFS LCLN    GG HH     C+SCG   PR A  AT  KRRSPT  QDP +T+K+  LDP QHN +     +FS+ID
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID

Query:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE
        LPIPFGP SA     SPL RS+SDP  A++ + P+    L P    PPLPLRRTVSDP PS   +  S SP  +         DSPDSKRL  IK RL+E
Subjt:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE

Query:  MNELWNELMKEHE-------ERDTKKSGGCKDE--GEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        MNE WNE+M E E       E +TKK   CKDE   EE+VGVERVGDSLE+ LKCPCGKGFEILL+GTSCFYKLL
Subjt:  MNELWNELMKEHE-------ERDTKKSGGCKDE--GEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CBN2 uncharacterized protein LOC111010096 (e value: 1.2e-138; % identity: 100)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHHCASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDPQHNHSQDLPHAFSRIDLPIPFG
        MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHHCASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDPQHNHSQDLPHAFSRIDLPIPFG
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHHCASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDPQHNHSQDLPHAFSRIDLPIPFG

Query:  PSAQIQVHSPLRRSLSDPDPAQHLNGPTLPPPPLPLRRTVSDPNPSPGPDNISGSPNKVAPTRDSPDSKRLTGIKERLREMNELWNELMKEHEERDTKKS
        PSAQIQVHSPLRRSLSDPDPAQHLNGPTLPPPPLPLRRTVSDPNPSPGPDNISGSPNKVAPTRDSPDSKRLTGIKERLREMNELWNELMKEHEERDTKKS
Subjt:  PSAQIQVHSPLRRSLSDPDPAQHLNGPTLPPPPLPLRRTVSDPNPSPGPDNISGSPNKVAPTRDSPDSKRLTGIKERLREMNELWNELMKEHEERDTKKS

Query:  GGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        GGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
Subjt:  GGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 (e value: 2.9e-57; % identity: 55.43)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID
        MSNL+QES E Q  E +  D RFS LCLN    GG HH     C+SCG   PR A  AT  KRRSPT  QDP +T+K+  LDP QHN +     +FS+ID
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID

Query:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE
        LPIPFGP SA     SPL RS+SDP  A++ + P+    L P    PPLPLRRTVSDP PS   D  S SP  +         DSPDSKRL  IK+RL+E
Subjt:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE

Query:  MNELWNELMKEHE-------ERDTKKSGGC---KDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        MNE WNE+M E E       E +TKK   C    ++ EE+VGVERVGDSLE+ LKCPCGKGFEILL+GTSCFYKLL
Subjt:  MNELWNELMKEHE-------ERDTKKSGGC---KDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X2 (e value: 1.7e-57; % identity: 55.68)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID
        MSNL+QES E Q  E +  D RFS LCLN    GG HH     C+SCG   PR A  AT  KRRSPT  QDP +T+K+  LDP QHN +     +FS+ID
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID

Query:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE
        LPIPFGP SA     SPL RS+SDP  A++ + P+    L P    PPLPLRRTVSDP PS   D  S SP  +         DSPDSKRL  IK+RL+E
Subjt:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE

Query:  MNELWNELMKEHE-------ERDTKKSGGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        MNE WNE+M E E       E +TKK     ++ EE+VGVERVGDSLE+ LKCPCGKGFEILL+GTSCFYKLL
Subjt:  MNELWNELMKEHE-------ERDTKKSGGCKDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 (e value: 6.5e-57; % identity: 55.07)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID
        MSNL+QES E Q  E +  D RFS LCLN    GG HH     C+SCG   PR A  AT  KRRSPT  QDP +T+K+  LDP QHN +     +FS+ID
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID

Query:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE
        LPIPFGP SA     SPL RS+SDP  A++ + P+    L P    PPLPLRRTVSDP PS   +  S SP  +         DSPDSKRL  IK RL+E
Subjt:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE

Query:  MNELWNELMKEHE-------ERDTKKSGGC---KDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        MNE WNE+M E E       E +TKK   C   +++ EE+VGVERVGDSLE+ LKCPCGKGFEILL+GTSCFYKLL
Subjt:  MNELWNELMKEHE-------ERDTKKSGGC---KDEGEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 (e value: 2.2e-57; % identity: 56.36)
Query:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID
        MSNL+QES E Q  E +  D RFS LCLN    GG HH     C+SCG   PR A  AT  KRRSPT  QDP +T+K+  LDP QHN +     +FS+ID
Subjt:  MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHH-----CASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDP-QHNHSQDLPHAFSRID

Query:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE
        LPIPFGP SA     SPL RS+SDP  A++ + P+    L P    PPLPLRRTVSDP PS   +  S SP  +         DSPDSKRL  IK RL+E
Subjt:  LPIPFGP-SAQIQVHSPLRRSLSDPDPAQHLNGPT----LPP----PPLPLRRTVSDPNPSPGPDNISGSPNKVAPT-----RDSPDSKRLTGIKERLRE

Query:  MNELWNELMKEHE-------ERDTKKSGGCKDE--GEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL
        MNE WNE+M E E       E +TKK   CKDE   EE+VGVERVGDSLE+ LKCPCGKGFEILL+GTSCFYKLL
Subjt:  MNELWNELMKEHE-------ERDTKKSGGCKDE--GEESVGVERVGDSLEIHLKCPCGKGFEILLNGTSCFYKLL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G32235.1 unknown protein (e value: 1.5e-16; % identity: 29.32)
Query:  MSNLVQESCEAQKGED-EVLDPRFSMLCLNSSGGGGQHHCASCGFPQPRS---------AASATPMKRRSPTPFQDPNSTSKRLFLDPQHNHSQDLPHAF
        +  L+  S E     D E  D   S+L LNS G       A+   PQ +S          A+ +P+KR SP   Q      K+LF+             +
Subjt:  MSNLVQESCEAQKGED-EVLDPRFSMLCLNSSGGGGQHHCASCGFPQPRS---------AASATPMKRRSPTPFQDPNSTSKRLFLDPQHNHSQDLPHAF

Query:  SRIDLP-IPFGPSAQIQVHSPL-RRSLSDP-------------------DPAQHL-----NGPTLPPPPLPLRRTVSDPNPSPGPDNISGSPNK------
        S+I LP + F P+   Q+ SPL +RSLSD                      AQ       N P+LPP P   RR+VSD +P+P   ++ GS         
Subjt:  SRIDLP-IPFGPSAQIQVHSPL-RRSLSDP-------------------DPAQHL-----NGPTLPPPPLPLRRTVSDPNPSPGPDNISGSPNK------

Query:  -VAPTRDSPDSKRLTGIKERLREMNELWNELMKEHEERDTKKSGGCKD------------------EGEESVGVERVGDSLEIHLKCPCGKGFEILLNGT
         +A    S  +K L  IK+ +RE+++  N+L+K  E      SG  K                   E +E V V R+G++  + + CPCG+ ++ L +G 
Subjt:  -VAPTRDSPDSKRLTGIKERLREMNELWNELMKEHEERDTKKSGGCKD------------------EGEESVGVERVGDSLEIHLKCPCGKGFEILLNGT

Query:  SCFYKLL
         C+YKLL
Subjt:  SCFYKLL


Sequences
CDS sequence
ATGAGCAATCTGGTTCAAGAATCGTGTGAAGCCCAAAAGGGCGAAGACGAGGTGTTGGATCCCCGTTTCTCGATGCTGTGCCTCAACTCCTCCGGCGGCGGAGGCCAACA
TCACTGTGCTTCCTGCGGCTTCCCTCAACCTCGCTCCGCCGCCAGTGCCACACCCATGAAACGCCGCTCTCCCACGCCGTTTCAAGACCCCAACTCCACCTCCAAGAGGC
TCTTTCTTGATCCACAACACAATCACAGTCAAGATCTCCCCCACGCTTTCTCCAGGATCGATCTCCCCATCCCTTTTGGGCCTTCAGCCCAAATCCAGGTCCACTCCCCC
CTCCGCCGCTCCCTTTCCGACCCTGACCCGGCCCAACACCTTAACGGACCCACTCTGCCTCCTCCGCCGCTGCCTCTCCGGCGTACTGTTTCTGACCCGAATCCATCTCC
CGGCCCCGACAACATTTCCGGATCCCCAAATAAGGTTGCCCCCACTAGAGACAGCCCTGATTCCAAGAGGCTGACAGGGATCAAGGAGCGATTGAGGGAGATGAATGAGT
TGTGGAATGAACTCATGAAAGAACACGAAGAACGGGACACAAAAAAGAGTGGTGGTTGTAAAGATGAAGGGGAAGAAAGTGTGGGAGTGGAGAGGGTGGGAGATTCATTG
GAGATTCATTTGAAGTGCCCATGTGGGAAAGGCTTCGAGATCCTTCTCAATGGAACTAGCTGTTTCTATAAGCTCCTC
mRNA sequence
ATGAGCAATCTGGTTCAAGAATCGTGTGAAGCCCAAAAGGGCGAAGACGAGGTGTTGGATCCCCGTTTCTCGATGCTGTGCCTCAACTCCTCCGGCGGCGGAGGCCAACA
TCACTGTGCTTCCTGCGGCTTCCCTCAACCTCGCTCCGCCGCCAGTGCCACACCCATGAAACGCCGCTCTCCCACGCCGTTTCAAGACCCCAACTCCACCTCCAAGAGGC
TCTTTCTTGATCCACAACACAATCACAGTCAAGATCTCCCCCACGCTTTCTCCAGGATCGATCTCCCCATCCCTTTTGGGCCTTCAGCCCAAATCCAGGTCCACTCCCCC
CTCCGCCGCTCCCTTTCCGACCCTGACCCGGCCCAACACCTTAACGGACCCACTCTGCCTCCTCCGCCGCTGCCTCTCCGGCGTACTGTTTCTGACCCGAATCCATCTCC
CGGCCCCGACAACATTTCCGGATCCCCAAATAAGGTTGCCCCCACTAGAGACAGCCCTGATTCCAAGAGGCTGACAGGGATCAAGGAGCGATTGAGGGAGATGAATGAGT
TGTGGAATGAACTCATGAAAGAACACGAAGAACGGGACACAAAAAAGAGTGGTGGTTGTAAAGATGAAGGGGAAGAAAGTGTGGGAGTGGAGAGGGTGGGAGATTCATTG
GAGATTCATTTGAAGTGCCCATGTGGGAAAGGCTTCGAGATCCTTCTCAATGGAACTAGCTGTTTCTATAAGCTCCTC
Protein sequence
MSNLVQESCEAQKGEDEVLDPRFSMLCLNSSGGGGQHHCASCGFPQPRSAASATPMKRRSPTPFQDPNSTSKRLFLDPQHNHSQDLPHAFSRIDLPIPFGPSAQIQVHSP
LRRSLSDPDPAQHLNGPTLPPPPLPLRRTVSDPNPSPGPDNISGSPNKVAPTRDSPDSKRLTGIKERLREMNELWNELMKEHEERDTKKSGGCKDEGEESVGVERVGDSL
EIHLKCPCGKGFEILLNGTSCFYKLL
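
The protein sequence above should correspond to a codon-by-codon translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code and a CDS that starts in frame; the helper name `translate` is hypothetical and not part of any CuGenDB tooling. Line breaks in the pasted sequence need to be removed before translating.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids.

    Whitespace is removed first; trailing bases that do not fill a
    complete codon are ignored. Stop codons are rendered as '*'.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First seven codons of the CDS listed above.
print(translate("ATGAGCAATCTGGTTCAAGAA"))  # MSNLVQE
```

Applied to the final line of the CDS, the same function yields `EIHLKCPCGKGFEILLNGTSCFYKLL`, which matches the last line of the protein sequence. The CDS as listed ends without a stop codon.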