CuGenDBv2

Moc07g09000 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g09000
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr7:6985342..6986354
RNA-Seq Expression: Moc07g09000
Synteny: Moc07g09000
Gene Ontology terms:
GO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
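
The genome location field above uses a chromosome:start..end notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the string can be split into its parts and the genomic span computed as shown here. The helper names parse_location and span_length are hypothetical and not part of CuGenDB.

```python
import re

def parse_location(loc):
    """Split a string such as 'chr7:6985342..6986354' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end = parse_location("chr7:6985342..6986354")
print(chrom, start, end, span_length(start, end))
```

Under a 0-based, half-open convention the span would be one base shorter, so the convention should be confirmed before relying on the computed length.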


Homology
GenBank top hits (e value, %identity, alignment)
XP_022143495.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia] (e value: 4.9e-112; %identity: 81.18)
Query:  LRPVAKGSSANPNQHHFLRARANDVTPSVLN------LGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFV
        L  V    S N N +    +R  D+ P +LN       GGLPAKTDPVGQNAPSNEKFEVL+ERLRA+EGT VFGNIDASQLCLVS LVIP KFKVP+F 
Subjt:  LRPVAKGSSANPNQHHFLRARANDVTPSVLN------LGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFV

Query:  KYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRD
        KYDGSSCPKNHL MYCRKMAAYVQNDKLLIHCFQDSL  PASRWYMQLDSSHV SWKNLADSFLK+YKHNIDMAPDRLDLQRM+KKST+SFKEYAQRWRD
Subjt:  KYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRD

Query:  TAAQVQPPLTNKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG
        TAAQVQPPLT+KELS MFINTLKHPFYDRM+GSASTNFSDIM IGE IEYGV+HG
Subjt:  TAAQVQPPLTNKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG

XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia] (e value: 8.9e-114; %identity: 80.84)
Query:  YPPWYGPP----LRPVAKGSSANPNQHHFLRARANDVTPSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKF
        Y P Y  P    L P  KG+   P    F         P+VLNLG L AKTDPVGQNAPSNEKFEVL+ERLRA+E T VFGNIDASQLC VSGLVIP K 
Subjt:  YPPWYGPP----LRPVAKGSSANPNQHHFLRARANDVTPSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKF

Query:  KVPKFVKYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEY
        KVP+F KY+GSSCPKNHL MYCRKMAAYVQNDKLLIHCFQDSL GPASRWYMQLDSSHV SWKNLADSFLK+YKHNIDMAPDRLDLQRM+KKSTKSFKEY
Subjt:  KVPKFVKYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEY

Query:  AQRWRDTAAQVQPPLTNKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG
        AQRWRDTAAQVQPPL +KELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGE IEYGV+HG
Subjt:  AQRWRDTAAQVQPPLTNKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia] (e value: 1.4e-114; %identity: 65.12)
Query:  MMEDQKAKQEKTRKDIDELQEKLDVILLALEKGKAAVDPATPSNPIHEPQKIPPYPPWY-------GPPLRPVAKGSSANPNQHH---------------
        M +D+K++QEKTRKDI+EL+EKLD ILLALEKGK     A  SNPIHEP   P +PP +       GP  RP+ +G  +    ++               
Subjt:  MMEDQKAKQEKTRKDIDELQEKLDVILLALEKGKAAVDPATPSNPIHEPQKIPPYPPWY-------GPPLRPVAKGSSANPNQHH---------------

Query:  -----------FLRARANDVTPSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNH
                   F         P+VLNLGG  A  DP  Q APS+EK EVLEERLRAVEGT VFGNIDASQLCL SGLVIP KFK+P+F KY+GSSCPKNH
Subjt:  -----------FLRARANDVTPSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNH

Query:  LRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTN
        L MYCRKMAAY+QNDKLLIHCFQDSL GP S WYM LDS HV SWKNLADSFLK+YKHNIDM  DRLDLQ M+KK+ +SFKEY QRWRDTAAQ QPP T+
Subjt:  LRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTN

Query:  KELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG
        KELS+MFINTLKHPFYDRMIGSAST+FSDI+TIGE IEYGV HG
Subjt:  KELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia] (e value: 5.7e-137; %identity: 76.72)
Query:  MEDQKAKQEKTRKDIDELQEKLDVILLALEKGKAAVDPATPSNPIHEPQKIPPYPPWYGPPLRPVAKG-----SSANP--------NQHHFLRARANDVT
        MEDQKA+QEKTRKDI+EL+EKLD I L LEKGKA  DPAT SNPIHEPQ+ PPYPP YGPPLRPVA+G     ++ NP          H F++      T
Subjt:  MEDQKAKQEKTRKDIDELQEKLDVILLALEKGKAAVDPATPSNPIHEPQKIPPYPPWYGPPLRPVAKG-----SSANP--------NQHHFLRARANDVT

Query:  -------------PSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNHLRMYCRKM
                     P+V NLGG PAKTDPV QNA S EK EVLEERLRA+EGT VFGNIDASQLCLVSGLVIP KFKVP+F KYDGSSCPKNHL MYCRKM
Subjt:  -------------PSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNHLRMYCRKM

Query:  AAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTNKELSAMFI
         AYVQN KLLIHCFQDSL G ASRWYMQLDSSHV SWKNLADSFLK+YKHNIDMAPDRLDLQRM+K ST+SFKEYAQRWRDTAAQVQPPLT+KELSAMFI
Subjt:  AAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTNKELSAMFI

Query:  NTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKH
        NTLKHPFYDRMIGSASTNFSDIMTIGE IEYGV+H
Subjt:  NTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKH

XP_022158986.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia] (e value: 3.3e-116; %identity: 92.38)
Query:  PSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHC
        P+VLNLGGLPAKTD VGQNAPSNEKFEVLEERLRA+EGTYVFGNIDASQLCLVSGLVIP KFKVP+F KYDGSSCPKNHL MYCRKMAAYVQNDKLLIHC
Subjt:  PSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHC

Query:  FQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTNKELSAMFINTLKHPFYDRMIG
        FQDSL GPASRWYMQLDSS+V SWKNLADSFLK+YKHNIDMAPDRLDLQRM+KKST+SFKEYAQRWRDTAAQVQPPLT+KELSAMFINTLKHPFYDRMIG
Subjt:  FQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTNKELSAMFINTLKHPFYDRMIG

Query:  SASTNFSDIMTIGEMIEYGVKHG
        +ASTNFSDIMTIGE IEYGV+HG
Subjt:  SASTNFSDIMTIGEMIEYGVKHG

TrEMBL top hits (e value, %identity, alignment)
A0A6J1CNY7 Ribonuclease H (e value: 2.4e-112; %identity: 81.18)
Query:  LRPVAKGSSANPNQHHFLRARANDVTPSVLN------LGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFV
        L  V    S N N +    +R  D+ P +LN       GGLPAKTDPVGQNAPSNEKFEVL+ERLRA+EGT VFGNIDASQLCLVS LVIP KFKVP+F 
Subjt:  LRPVAKGSSANPNQHHFLRARANDVTPSVLN------LGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFV

Query:  KYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRD
        KYDGSSCPKNHL MYCRKMAAYVQNDKLLIHCFQDSL  PASRWYMQLDSSHV SWKNLADSFLK+YKHNIDMAPDRLDLQRM+KKST+SFKEYAQRWRD
Subjt:  KYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRD

Query:  TAAQVQPPLTNKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG
        TAAQVQPPLT+KELS MFINTLKHPFYDRM+GSASTNFSDIM IGE IEYGV+HG
Subjt:  TAAQVQPPLTNKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG

A0A6J1D099 Ribonuclease H (e value: 4.3e-114; %identity: 80.84)
Query:  YPPWYGPP----LRPVAKGSSANPNQHHFLRARANDVTPSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKF
        Y P Y  P    L P  KG+   P    F         P+VLNLG L AKTDPVGQNAPSNEKFEVL+ERLRA+E T VFGNIDASQLC VSGLVIP K 
Subjt:  YPPWYGPP----LRPVAKGSSANPNQHHFLRARANDVTPSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKF

Query:  KVPKFVKYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEY
        KVP+F KY+GSSCPKNHL MYCRKMAAYVQNDKLLIHCFQDSL GPASRWYMQLDSSHV SWKNLADSFLK+YKHNIDMAPDRLDLQRM+KKSTKSFKEY
Subjt:  KVPKFVKYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEY

Query:  AQRWRDTAAQVQPPLTNKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG
        AQRWRDTAAQVQPPL +KELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGE IEYGV+HG
Subjt:  AQRWRDTAAQVQPPLTNKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231 (e value: 3.9e-115; %identity: 65.41)
Query:  MMEDQKAKQEKTRKDIDELQEKLDVILLALEKGKAAVDPATPSNPIHEPQKIPPYPPWY-------GPPLRPVAKGSSANPNQHH---------------
        M +D+K++QEKTRKDI+EL+EKLD ILLALEKGK     A  SNPIHEP   P +PP +       GP  RP+ +G  +    ++               
Subjt:  MMEDQKAKQEKTRKDIDELQEKLDVILLALEKGKAAVDPATPSNPIHEPQKIPPYPPWY-------GPPLRPVAKGSSANPNQHH---------------

Query:  -----------FLRARANDVTPSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNH
                   F         P+VLNLGG  A  DP  Q APS+EK EVLEERLRAVEGT VFGNIDASQLCL SGLVIP KFK+P+F KYDGSSCPKNH
Subjt:  -----------FLRARANDVTPSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNH

Query:  LRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTN
        L MYCRKMAAY+QNDKLLIHCFQDSL GP S WYM LDS HV SWKNLADSFLK+YKHNIDM  DRLDLQ M+KK+ +SFKEY QRWRDTAAQ QPP T+
Subjt:  LRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTN

Query:  KELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG
        KELS+MFINTLKHPFYDRMIGSAST+FSDI+TIGE IEYGV HG
Subjt:  KELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG

A0A6J1DZ90 Ribonuclease H (e value: 2.8e-137; %identity: 76.72)
Query:  MEDQKAKQEKTRKDIDELQEKLDVILLALEKGKAAVDPATPSNPIHEPQKIPPYPPWYGPPLRPVAKG-----SSANP--------NQHHFLRARANDVT
        MEDQKA+QEKTRKDI+EL+EKLD I L LEKGKA  DPAT SNPIHEPQ+ PPYPP YGPPLRPVA+G     ++ NP          H F++      T
Subjt:  MEDQKAKQEKTRKDIDELQEKLDVILLALEKGKAAVDPATPSNPIHEPQKIPPYPPWYGPPLRPVAKG-----SSANP--------NQHHFLRARANDVT

Query:  -------------PSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNHLRMYCRKM
                     P+V NLGG PAKTDPV QNA S EK EVLEERLRA+EGT VFGNIDASQLCLVSGLVIP KFKVP+F KYDGSSCPKNHL MYCRKM
Subjt:  -------------PSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNHLRMYCRKM

Query:  AAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTNKELSAMFI
         AYVQN KLLIHCFQDSL G ASRWYMQLDSSHV SWKNLADSFLK+YKHNIDMAPDRLDLQRM+K ST+SFKEYAQRWRDTAAQVQPPLT+KELSAMFI
Subjt:  AAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTNKELSAMFI

Query:  NTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKH
        NTLKHPFYDRMIGSASTNFSDIMTIGE IEYGV+H
Subjt:  NTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKH

A0A6J1E2J7 Ribonuclease H (e value: 1.6e-116; %identity: 92.38)
Query:  PSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHC
        P+VLNLGGLPAKTD VGQNAPSNEKFEVLEERLRA+EGTYVFGNIDASQLCLVSGLVIP KFKVP+F KYDGSSCPKNHL MYCRKMAAYVQNDKLLIHC
Subjt:  PSVLNLGGLPAKTDPVGQNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHC

Query:  FQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTNKELSAMFINTLKHPFYDRMIG
        FQDSL GPASRWYMQLDSS+V SWKNLADSFLK+YKHNIDMAPDRLDLQRM+KKST+SFKEYAQRWRDTAAQVQPPLT+KELSAMFINTLKHPFYDRMIG
Subjt:  FQDSLCGPASRWYMQLDSSHVDSWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTNKELSAMFINTLKHPFYDRMIG

Query:  SASTNFSDIMTIGEMIEYGVKHG
        +ASTNFSDIMTIGE IEYGV+HG
Subjt:  SASTNFSDIMTIGEMIEYGVKHG

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found
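
In the pairwise alignments of this Homology section, the unlabeled line under each Query line follows the usual BLAST midline convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how an identity percentage can be derived from one alignment block, the hypothetical helper midline_stats below counts those symbols. It is an illustration only, not the database's own calculation, and it covers a single block rather than a whole hit.

```python
def midline_stats(midline, aligned_length):
    """Return (identities, positives, percent identity) for one alignment block.

    aligned_length is taken from the Query line because trailing spaces in
    the midline may have been stripped.
    """
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return identities, positives, 100.0 * identities / aligned_length

# Final block of the XP_022158986.1 alignment, copied from this page.
query = "SASTNFSDIMTIGEMIEYGVKHG"
midline = "+ASTNFSDIMTIGE IEYGV+HG"
identities, positives, pct = midline_stats(midline, len(query))
print(identities, positives, round(pct, 2))
```

The %identity values reported for each hit span the full alignment, so reproducing them would require summing identities and aligned lengths over every block of that hit.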

Sequences
CDS sequence
ATGATGGAAGATCAAAAAGCCAAGCAGGAGAAAACAAGGAAAGATATTGATGAGTTGCAAGAAAAGTTAGATGTCATCCTCCTCGCATTGGAAAAAGGCAAAGCG
GCTGTGGATCCTGCTACACCTAGCAACCCTATCCATGAGCCGCAAAAAATCCCGCCTTATCCACCTTGGTATGGTCCACCTCTTAGGCCAGTGGCAAAGGGGAGC
TCAGCAAATCCCAACCAACATCATTTTTTGAGAGCCCGAGCAAATGATGTCACCCCCTCAGTCCTTAACCTAGGTGGTCTCCCAGCCAAGACAGACCCAGTCGGA
CAAAATGCCCCCAGTAATGAGAAGTTCGAAGTTCTGGAGGAAAGATTAAGAGCAGTAGAGGGGACATATGTTTTTGGCAACATCGATGCCTCACAATTGTGCTTG
GTGTCTGGATTAGTCATACCTTCAAAATTCAAGGTGCCAAAGTTTGTGAAGTACGATGGTTCTTCTTGTCCTAAGAATCATCTTAGAATGTACTGCAGGAAGATG
GCAGCGTACGTCCAAAATGACAAATTGTTGATACACTGCTTTCAAGATAGTTTATGTGGTCCAGCCTCTCGTTGGTACATGCAGTTAGATAGCTCTCATGTTGAC
TCATGGAAGAATCTGGCCGACTCCTTCCTAAAAAAATATAAGCATAACATAGACATGGCTCCAGATCGCTTAGATTTACAGAGGATGGATAAGAAGAGTACAAAG
AGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACGGCAGCTCAAGTCCAACCTCCTTTAACGAATAAGGAGCTATCTGCCATGTTCATCAATACCCTAAAACAT
CCTTTCTATGATCGGATGATAGGAAGCGCTTCCACAAATTTCTCTGATATTATGACCATCGGAGAAATGATCGAATATGGGGTTAAACATGGGTGA
mRNA sequence
ATGATGGAAGATCAAAAAGCCAAGCAGGAGAAAACAAGGAAAGATATTGATGAGTTGCAAGAAAAGTTAGATGTCATCCTCCTCGCATTGGAAAAAGGCAAAGCG
GCTGTGGATCCTGCTACACCTAGCAACCCTATCCATGAGCCGCAAAAAATCCCGCCTTATCCACCTTGGTATGGTCCACCTCTTAGGCCAGTGGCAAAGGGGAGC
TCAGCAAATCCCAACCAACATCATTTTTTGAGAGCCCGAGCAAATGATGTCACCCCCTCAGTCCTTAACCTAGGTGGTCTCCCAGCCAAGACAGACCCAGTCGGA
CAAAATGCCCCCAGTAATGAGAAGTTCGAAGTTCTGGAGGAAAGATTAAGAGCAGTAGAGGGGACATATGTTTTTGGCAACATCGATGCCTCACAATTGTGCTTG
GTGTCTGGATTAGTCATACCTTCAAAATTCAAGGTGCCAAAGTTTGTGAAGTACGATGGTTCTTCTTGTCCTAAGAATCATCTTAGAATGTACTGCAGGAAGATG
GCAGCGTACGTCCAAAATGACAAATTGTTGATACACTGCTTTCAAGATAGTTTATGTGGTCCAGCCTCTCGTTGGTACATGCAGTTAGATAGCTCTCATGTTGAC
TCATGGAAGAATCTGGCCGACTCCTTCCTAAAAAAATATAAGCATAACATAGACATGGCTCCAGATCGCTTAGATTTACAGAGGATGGATAAGAAGAGTACAAAG
AGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACGGCAGCTCAAGTCCAACCTCCTTTAACGAATAAGGAGCTATCTGCCATGTTCATCAATACCCTAAAACAT
CCTTTCTATGATCGGATGATAGGAAGCGCTTCCACAAATTTCTCTGATATTATGACCATCGGAGAAATGATCGAATATGGGGTTAAACATGGGTGA
Protein sequence
MMEDQKAKQEKTRKDIDELQEKLDVILLALEKGKAAVDPATPSNPIHEPQKIPPYPPWYGPPLRPVAKGSSANPNQHHFLRARANDVTPSVLNLGGLPAKTDPVG
QNAPSNEKFEVLEERLRAVEGTYVFGNIDASQLCLVSGLVIPSKFKVPKFVKYDGSSCPKNHLRMYCRKMAAYVQNDKLLIHCFQDSLCGPASRWYMQLDSSHVD
SWKNLADSFLKKYKHNIDMAPDRLDLQRMDKKSTKSFKEYAQRWRDTAAQVQPPLTNKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGEMIEYGVKHG
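
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene, is given below. The function name translate is hypothetical, and only short fragments of the CDS are used to keep the example compact.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()          # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 21 and last 9 nucleotides of the CDS listed on this page.
print(translate("ATGATGGAAGATCAAAAAGCC"))
print(translate("CATGGGTGA"))
```

Passing the full CDS, with its line breaks removed, and comparing the result to the listed protein sequence would confirm that the two records agree.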