; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020427 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020427
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationscaffold211:629075..629891
RNA-Seq ExpressionMS020427
SyntenyMS020427
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense]6.8e-2736.9Show/hide
Query:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQN--SLKSGVLINQTQTLSVNYVDG-------HRSNT
        MK+ AD+LA+A        L + +L  LD EY PIVV+ + + + +W E +  LL+YD +LE+ N  S K  +L + +  L+ N  +             
Subjt:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQN--SLKSGVLINQTQTLSVNYVDG-------HRSNT

Query:  YRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLA-VCHYRFDQN---TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPS
           GG +    G +  G GR+ G    N +S  +P CQVC + GH A VC++R+D N   + P +  + N       P+VFVA        TP+TV   +
Subjt:  YRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLA-VCHYRFDQN---TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPS

Query:  WYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYS-SGGSLKLKDVLCVPDIGKNL
        WYADSGAT+H+T +  N   K +Y G+ S++V NG  L ISH+G  +L S +  S+ LK VL VP+I KNL
Subjt:  WYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYS-SGGSLKLKDVLCVPDIGKNL

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-3139.47Show/hide
Query:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWG
        MK+++D L  A   V  +  +SQ L  LDE YNP++ V QGK   SW +   ELLT++KRLE+Q++ K+   I Q     VN      S+ +R       
Subjt:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWG

Query:  ILGQWNHGIGRWGGNNNNNG---SSGNKPICQVCNRIGHLA-VCHYRFDQN-----TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPSWYAD
             N+  G+ GG N   G     GNKP CQVC + GH A VC+ RF++       + +  Q  NF+  +S   V V   +     T  TVI  +WY D
Subjt:  ILGQWNHGIGRWGGNNNNNG---SSGNKPICQVCNRIGHLA-VCHYRFDQN-----TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPSWYAD

Query:  SGATSHLTTNPNNHSK-EDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL
        SGAT+HLT   +N S   +YSG   ++V NG +L IS+IG+  L      L LK+VLCVPDI KNL
Subjt:  SGATSHLTTNPNNHSK-EDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]4.7e-2836.61Show/hide
Query:  QDLVSQVLTS----LDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWGILGQWNHGIGRWG
        +D + Q+L +    LDE YN ++VV QGK + SW +   +LL ++KRL++QN+ K     N TQ+ ++N               ++ + GQ N    ++ 
Subjt:  QDLVSQVLTS----LDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWGILGQWNHGIGRWG

Query:  GNNNNN-----GSSGNKPICQVCNRIGHLA-VCHYRFDQNTRPQSTQHKNFTPSNS----GPNVFVAHHASAMVTTPKTVICPSWYADSGATSHLTTNPN
        G N  +     G+  N P CQ+C + GH A VC+ RF++       Q++N   SN      P VFV+   +    TP TV+ P+WY DSGAT+H+T   +
Subjt:  GNNNNN-----GSSGNKPICQVCNRIGHLA-VCHYRFDQNTRPQSTQHKNFTPSNS----GPNVFVAHHASAMVTTPKTVICPSWYADSGATSHLTTNPN

Query:  NHSK-EDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL
        N +   +YSG   V V NG+ L+IS++G+  L     SL LK++LCVPDI KNL
Subjt:  NHSK-EDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]3.2e-7763.24Show/hide
Query:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDG--------------
        MKSHADNLALA   VSV+DLVSQVLT LDEEYNPIVV  QGKVN SWSE H ELLTY+KRLEYQNSLKSG+ INQTQT SVNYVDG              
Subjt:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDG--------------

Query:  -HRSNTYRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLAVCHYRFDQNTRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVIC
         H SNT+RGGGYQ G  GQ N G G                                       PQ TQHKNFTPSNSGPNVF AHH S  VTTP+TVI 
Subjt:  -HRSNTYRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLAVCHYRFDQNTRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVIC

Query:  PSWYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL
        PSWYADSGATSH+T NPNN   K DYSG  +VIVANG+ LSISHIGS N+++SGGSLKLKDVL VPDI KNL
Subjt:  PSWYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]6.1e-2837.45Show/hide
Query:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVD---------GHRSNT
        MK + DNL  A   +  + LVSQVL  LDEEYN IV + QG+V+ SW +   ELL Y++RLE+Q++ K+ V  NQ    SVN  +          + SN 
Subjt:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVD---------GHRSNT

Query:  YRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLA-VCHYRFDQNTRPQSTQHK-----NFTPSNSGPN---VFVAHHASAMVTTPKT
          GGG +    G   HG GR  G NN       KP+CQVC ++GH+A  C  R+ ++  P S Q+K     N    N+ P+   + +A+ ++  +T  + 
Subjt:  YRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLA-VCHYRFDQNTRPQSTQHK-----NFTPSNSGPN---VFVAHHASAMVTTPKT

Query:  VICPSWYADSGATSHLTTNPNNHSKE-DYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLC
        +   +WY DSGA++H+T++ NN     +YSG        G+ L ISH+G+  L S   +LKL D+LC
Subjt:  VICPSWYADSGATSHLTTNPNNHSKE-DYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLC

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X12.3e-2836.61Show/hide
Query:  QDLVSQVLTS----LDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWGILGQWNHGIGRWG
        +D + Q+L +    LDE YN ++VV QGK + SW +   +LL ++KRL++QN+ K     N TQ+ ++N               ++ + GQ N    ++ 
Subjt:  QDLVSQVLTS----LDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWGILGQWNHGIGRWG

Query:  GNNNNN-----GSSGNKPICQVCNRIGHLA-VCHYRFDQNTRPQSTQHKNFTPSNS----GPNVFVAHHASAMVTTPKTVICPSWYADSGATSHLTTNPN
        G N  +     G+  N P CQ+C + GH A VC+ RF++       Q++N   SN      P VFV+   +    TP TV+ P+WY DSGAT+H+T   +
Subjt:  GNNNNN-----GSSGNKPICQVCNRIGHLA-VCHYRFDQNTRPQSTQHKNFTPSNS----GPNVFVAHHASAMVTTPKTVICPSWYADSGATSHLTTNPN

Query:  NHSK-EDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL
        N +   +YSG   V V NG+ L+IS++G+  L     SL LK++LCVPDI KNL
Subjt:  NHSK-EDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL

A0A5C7HHE9 Uncharacterized protein3.3e-2736.9Show/hide
Query:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQN--SLKSGVLINQTQTLSVNYVDG-------HRSNT
        MK+ AD+LA+A        L + +L  LD EY PIVV+ + + + +W E +  LL+YD +LE+ N  S K  +L + +  L+ N  +             
Subjt:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQN--SLKSGVLINQTQTLSVNYVDG-------HRSNT

Query:  YRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLA-VCHYRFDQN---TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPS
           GG +    G +  G GR+ G    N +S  +P CQVC + GH A VC++R+D N   + P +  + N       P+VFVA        TP+TV   +
Subjt:  YRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLA-VCHYRFDQN---TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPS

Query:  WYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYS-SGGSLKLKDVLCVPDIGKNL
        WYADSGAT+H+T +  N   K +Y G+ S++V NG  L ISH+G  +L S +  S+ LK VL VP+I KNL
Subjt:  WYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYS-SGGSLKLKDVLCVPDIGKNL

A0A5C7IJ06 Uncharacterized protein3.3e-2737.27Show/hide
Query:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQN--SLKSGVLINQTQTLSVNYVDG-------HRSNT
        MK+ AD+LA+A        L +  L  LD EY PIVV+ + + + +W E +  LL+YD +LE+ N  S K  +L + +  L+ N  +             
Subjt:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQN--SLKSGVLINQTQTLSVNYVDG-------HRSNT

Query:  YRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLA-VCHYRFDQN---TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPS
           GG +    G +  G GR+ G    N +S  +P CQVC + GH A VC++R+D N   + P +  + N       P+VFVA        TP+TV   +
Subjt:  YRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLA-VCHYRFDQN---TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPS

Query:  WYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYS-SGGSLKLKDVLCVPDIGKNL
        WYADSGAT+H+T +  N   K DY G+ S++V NG  L ISH+G  +L S +  S+ LK VL VP+I KNL
Subjt:  WYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYS-SGGSLKLKDVLCVPDIGKNL

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-945.8e-3239.47Show/hide
Query:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWG
        MK+++D L  A   V  +  +SQ L  LDE YNP++ V QGK   SW +   ELLT++KRLE+Q++ K+   I Q     VN      S+ +R       
Subjt:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWG

Query:  ILGQWNHGIGRWGGNNNNNG---SSGNKPICQVCNRIGHLA-VCHYRFDQN-----TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPSWYAD
             N+  G+ GG N   G     GNKP CQVC + GH A VC+ RF++       + +  Q  NF+  +S   V V   +     T  TVI  +WY D
Subjt:  ILGQWNHGIGRWGGNNNNNG---SSGNKPICQVCNRIGHLA-VCHYRFDQN-----TRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPSWYAD

Query:  SGATSHLTTNPNNHSK-EDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL
        SGAT+HLT   +N S   +YSG   ++V NG +L IS+IG+  L      L LK+VLCVPDI KNL
Subjt:  SGATSHLTTNPNNHSK-EDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL

A0A6J1DCW4 uncharacterized protein LOC1110195981.6e-7763.24Show/hide
Query:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDG--------------
        MKSHADNLALA   VSV+DLVSQVLT LDEEYNPIVV  QGKVN SWSE H ELLTY+KRLEYQNSLKSG+ INQTQT SVNYVDG              
Subjt:  MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDG--------------

Query:  -HRSNTYRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLAVCHYRFDQNTRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVIC
         H SNT+RGGGYQ G  GQ N G G                                       PQ TQHKNFTPSNSGPNVF AHH S  VTTP+TVI 
Subjt:  -HRSNTYRGGGYQWGILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLAVCHYRFDQNTRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVIC

Query:  PSWYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL
        PSWYADSGATSH+T NPNN   K DYSG  +VIVANG+ LSISHIGS N+++SGGSLKLKDVL VPDI KNL
Subjt:  PSWYADSGATSHLTTNPNN-HSKEDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.8e-1429.34Show/hide
Query:  DNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGK-VNPSWSETHVELLTYDKRLEYQNS-----LKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQW
        D LAL    +   + V +VL +L EEY P++     K   P+ +E H  LL ++ ++   +S     + +  + ++  T + N  +G+R+N Y       
Subjt:  DNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGK-VNPSWSETHVELLTYDKRLEYQNS-----LKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQW

Query:  GILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLAVCHYRFDQNTRPQSTQH-KNFTPSNSGPNVFVAHHASAMVTTPKTVICPSWYADSGATSHL
             W      +  NNN +     K  CQ+C   GH A          R    QH  +   S   P+ F      A +         +W  DSGAT H+
Subjt:  GILGQWNHGIGRWGGNNNNNGSSGNKPICQVCNRIGHLAVCHYRFDQNTRPQSTQH-KNFTPSNSGPNVFVAHHASAMVTTPKTVICPSWYADSGATSHL

Query:  TTNPNNHS-KEDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL
        T++ NN S  + Y+G   V+VA+GS + ISH GS +L +    L L ++L VP+I KNL
Subjt:  TTNPNNHS-KEDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.6e-1530.86Show/hide
Query:  DNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGK-VNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWGILGQ
        D LAL    +   + V +VL +L ++Y P++     K   PS +E H  L+  + +L   NS        +   ++ N V    +NT R          Q
Subjt:  DNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGK-VNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWGILGQ

Query:  WNHGIGRWGGNNNNN------GSSGN-------KPI---CQVCNRIGHLAVCHYRFDQNTRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPSW
         N G  R   NNNN        SSG+       KP    CQ+C+  GH A           PQ  Q ++ T      + F      A +         +W
Subjt:  WNHGIGRWGGNNNNN------GSSGN-------KPI---CQVCNRIGHLAVCHYRFDQNTRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPSW

Query:  YADSGATSHLTTNPNNHS-KEDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL
          DSGAT H+T++ NN S  + Y+G   V++A+GS + I+H GS +L +S  SL L  VL VP+I KNL
Subjt:  YADSGATSHLTTNPNNHS-KEDYSGNVSVIVANGSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGTCATGCTGATAATTTGGCCCTAGCAAGTATTCTGGTCTCTGTTCAAGACTTGGTTTCACAAGTATTGACAAGCTTAGATGAAGAGTACAATCCCATTGTAGT
CGTCTTTCAAGGCAAAGTAAATCCGTCATGGTCAGAAACGCACGTGGAGCTCTTAACATATGATAAGCGGTTGGAATACCAAAACTCCCTCAAAAGTGGCGTCCTGATCA
ACCAGACTCAAACACTCTCAGTGAACTATGTTGATGGCCATAGATCAAATACTTATCGTGGTGGTGGTTATCAATGGGGAATTTTGGGTCAATGGAATCATGGTATAGGT
CGATGGGGAGGTAATAACAACAACAATGGGAGTAGTGGAAATAAGCCCATCTGTCAGGTATGTAATCGCATAGGACATCTTGCTGTTTGTCACTATCGTTTTGATCAAAA
CACAAGACCCCAATCCACCCAGCACAAAAATTTCACCCCCTCAAATTCTGGACCAAATGTGTTTGTTGCCCATCACGCCTCTGCCATGGTCACTACCCCTAAGACTGTCA
TTTGTCCTAGTTGGTATGCCGACAGTGGAGCTACAAGTCATCTGACGACCAACCCGAACAATCATAGCAAAGAGGATTACTCAGGTAATGTAAGTGTAATTGTCGCAAAC
GGCAGTAATTTATCTATCTCTCACATTGGTAGCCCTAATCTCTATTCCTCAGGCGGTTCTTTAAAATTGAAAGATGTTCTCTGCGTTCCTGATATAGGTAAAAACCTT
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGTCATGCTGATAATTTGGCCCTAGCAAGTATTCTGGTCTCTGTTCAAGACTTGGTTTCACAAGTATTGACAAGCTTAGATGAAGAGTACAATCCCATTGTAGT
CGTCTTTCAAGGCAAAGTAAATCCGTCATGGTCAGAAACGCACGTGGAGCTCTTAACATATGATAAGCGGTTGGAATACCAAAACTCCCTCAAAAGTGGCGTCCTGATCA
ACCAGACTCAAACACTCTCAGTGAACTATGTTGATGGCCATAGATCAAATACTTATCGTGGTGGTGGTTATCAATGGGGAATTTTGGGTCAATGGAATCATGGTATAGGT
CGATGGGGAGGTAATAACAACAACAATGGGAGTAGTGGAAATAAGCCCATCTGTCAGGTATGTAATCGCATAGGACATCTTGCTGTTTGTCACTATCGTTTTGATCAAAA
CACAAGACCCCAATCCACCCAGCACAAAAATTTCACCCCCTCAAATTCTGGACCAAATGTGTTTGTTGCCCATCACGCCTCTGCCATGGTCACTACCCCTAAGACTGTCA
TTTGTCCTAGTTGGTATGCCGACAGTGGAGCTACAAGTCATCTGACGACCAACCCGAACAATCATAGCAAAGAGGATTACTCAGGTAATGTAAGTGTAATTGTCGCAAAC
GGCAGTAATTTATCTATCTCTCACATTGGTAGCCCTAATCTCTATTCCTCAGGCGGTTCTTTAAAATTGAAAGATGTTCTCTGCGTTCCTGATATAGGTAAAAACCTT
Protein sequenceShow/hide protein sequence
MKSHADNLALASILVSVQDLVSQVLTSLDEEYNPIVVVFQGKVNPSWSETHVELLTYDKRLEYQNSLKSGVLINQTQTLSVNYVDGHRSNTYRGGGYQWGILGQWNHGIG
RWGGNNNNNGSSGNKPICQVCNRIGHLAVCHYRFDQNTRPQSTQHKNFTPSNSGPNVFVAHHASAMVTTPKTVICPSWYADSGATSHLTTNPNNHSKEDYSGNVSVIVAN
GSNLSISHIGSPNLYSSGGSLKLKDVLCVPDIGKNL