; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g12700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g12700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:9826246..9831141
RNA-Seq ExpressionMoc06g12700
SyntenyMoc06g12700
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6429825.1 hypothetical protein SASPL_107879 [Salvia splendens]3.7e-4337.41Show/hide
Query:  QGAGSPRFKYVLRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYT-RIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGS--FSGVFTTI
        QG G PRF  V+R+  + Y PD++ +LET+V GI ADR+    L F Y+ R+E + + GGIW+ WK + +S+     H Q  H R+ +G+   S   T +
Subjt:  QGAGSPRFKYVLRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYT-RIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGS--FSGVFTTI

Query:  YGSPQRATRRELWIFLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTW-RGPLLQGCSRI-----FEKLDR
        YGSPQ A RR LW +L  IA  ++EPW+L GDFN +L S EKLG +S   S+  LF N M +  L D+G  GP+FTW RG LL+   R+     F+ +  
Subjt:  YGSPQRATRRELWIFLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTW-RGPLLQGCSRI-----FEKLDR

Query:  DTAGFCAS------------------SHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKK
        D   F  S                    + +PFRF   WL H  F   + + W     +T  +         WN+  F SI +RKK
Subjt:  DTAGFCAS------------------SHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKK

PKI35403.1 hypothetical protein CRG98_044205, partial [Punica granatum]3.1e-5041.46Show/hide
Query:  LRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGSPQRATRRELWI
        +++MI+ + P+++VI+E ++SG  AD VC  F  +S  R+EA  F GGIWV W+PN V +   +RH Q  H RI++   +  FT +Y SP    RR+LW 
Subjt:  LRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGSPQRATRRELWI

Query:  FLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR--------------------
         L +I++ I  PW++LGDFN IL + EK G A FNP  A  F   ++NC L+DL S+GP+FTW GP+  G  R+FE+LDR                    
Subjt:  FLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR--------------------

Query:  -----------DTAGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIW
                   DT GF  S   ++PFR+MAAW  HK FP F+ E W
Subjt:  -----------DTAGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIW

XP_022137804.1 uncharacterized protein LOC111009151 [Momordica charantia]1.5e-5744.78Show/hide
Query:  LVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGSPQRATRRELWIFLESIASEIAEP
        +VI+E K+SG +AD VC SF  FS+ R+EA   KGGIWV WK +RVSL+E   + Q  HFR  + + SG FTT+YGSPQR+++RELW FL+S+      P
Subjt:  LVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGSPQRATRRELWIFLESIASEIAEP

Query:  WLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR------------DT------------------
        WLL+GDFN I  + EK G A  +P  AT F+  +++CQL+DLGS+GPKFTW+GPL+ G  R+FE+LDR            DT                  
Subjt:  WLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR------------DT------------------

Query:  -AGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKKV
         A  C S      F F+AAW  H+ F SFL + W  S+P  + LA  +GK   W+K++F S+ K K +
Subjt:  -AGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKKV

XP_022153253.1 uncharacterized protein LOC111020790 [Momordica charantia]4.1e-119100Show/hide
Query:  RCNTETKVNDNSSGILDSVGADSRVPKESNIPEFPQKPFIPVTVSQLNPSPGHGPWMLVDHSKRNGGGKPPRTRSGRGFISSKNLNTKWNLNSDPNEEPM
        RCNTETKVNDNSSGILDSVGADSRVPKESNIPEFPQKPFIPVTVSQLNPSPGHGPWMLVDHSKRNGGGKPPRTRSGRGFISSKNLNTKWNLNSDPNEEPM
Subjt:  RCNTETKVNDNSSGILDSVGADSRVPKESNIPEFPQKPFIPVTVSQLNPSPGHGPWMLVDHSKRNGGGKPPRTRSGRGFISSKNLNTKWNLNSDPNEEPM

Query:  NVLDTIVDPTMKRDPIKSFQIKSVERVRLARDQDSWRFKNKIPFAGPGANFSSLPFDYQGTLVIQDTDGRRKPRWMDQDYDPGDMDSEQEDDTDPKSPIP
        NVLDTIVDPTMKRDPIKSFQIKSVERVRLARDQDSWRFKNKIPFAGPGANFSSLPFDYQGTLVIQDTDGRRKPRWMDQDYDPGDMDSEQEDDTDPKSPIP
Subjt:  NVLDTIVDPTMKRDPIKSFQIKSVERVRLARDQDSWRFKNKIPFAGPGANFSSLPFDYQGTLVIQDTDGRRKPRWMDQDYDPGDMDSEQEDDTDPKSPIP

Query:  RNGYFDSAGPQQ
        RNGYFDSAGPQQ
Subjt:  RNGYFDSAGPQQ

XP_031402735.1 uncharacterized protein LOC116212324 [Punica granatum]2.1e-5937.01Show/hide
Query:  QGAGSPRFKYVLRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGS
        +GAGS RF  V+++MI+ + P+++VI+E ++SG  AD VC  F  +S  R+EA  F GGIWV W+PN V +   +RH Q  H RI++   +  FT +Y S
Subjt:  QGAGSPRFKYVLRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGS

Query:  PQRATRRELWIFLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR---------
        P    RR+LW  L +I++ I  PW++LGDFN IL + EK G A FNP  A  F   ++NC L+DL S+GP+FTW GP+  G  R+FE+LDR         
Subjt:  PQRATRRELWIFLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR---------

Query:  ----------------------DTAGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKKVSFMGSSGKS
                              DT GF  S   ++PFR+MAAW  HK FP F+ E W     L  GL   + +   WN+ +F SI++RK+       G  
Subjt:  ----------------------DTAGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKKVSFMGSSGKS

Query:  YYYFKGVLRFDRENFRFWKMQVTDLLTYKKIQKTLKERSAIGMIDKEWVEMDEQ
            +G   F          ++ ++L+ ++I    K RS       EWV++ E+
Subjt:  YYYFKGVLRFDRENFRFWKMQVTDLLTYKKIQKTLKERSAIGMIDKEWVEMDEQ

TrEMBL top hitse value%identityAlignment
A0A2I0HUV7 Reverse transcriptase domain-containing protein (Fragment)1.5e-5041.46Show/hide
Query:  LRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGSPQRATRRELWI
        +++MI+ + P+++VI+E ++SG  AD VC  F  +S  R+EA  F GGIWV W+PN V +   +RH Q  H RI++   +  FT +Y SP    RR+LW 
Subjt:  LRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGSPQRATRRELWI

Query:  FLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR--------------------
         L +I++ I  PW++LGDFN IL + EK G A FNP  A  F   ++NC L+DL S+GP+FTW GP+  G  R+FE+LDR                    
Subjt:  FLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR--------------------

Query:  -----------DTAGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIW
                   DT GF  S   ++PFR+MAAW  HK FP F+ E W
Subjt:  -----------DTAGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIW

A0A4D9BNB2 Uncharacterized protein1.8e-4337.41Show/hide
Query:  QGAGSPRFKYVLRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYT-RIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGS--FSGVFTTI
        QG G PRF  V+R+  + Y PD++ +LET+V GI ADR+    L F Y+ R+E + + GGIW+ WK + +S+     H Q  H R+ +G+   S   T +
Subjt:  QGAGSPRFKYVLRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYT-RIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGS--FSGVFTTI

Query:  YGSPQRATRRELWIFLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTW-RGPLLQGCSRI-----FEKLDR
        YGSPQ A RR LW +L  IA  ++EPW+L GDFN +L S EKLG +S   S+  LF N M +  L D+G  GP+FTW RG LL+   R+     F+ +  
Subjt:  YGSPQRATRRELWIFLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTW-RGPLLQGCSRI-----FEKLDR

Query:  DTAGFCAS------------------SHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKK
        D   F  S                    + +PFRF   WL H  F   + + W     +T  +         WN+  F SI +RKK
Subjt:  DTAGFCAS------------------SHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKK

A0A6J1C8B2 uncharacterized protein LOC1110091517.4e-5844.78Show/hide
Query:  LVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGSPQRATRRELWIFLESIASEIAEP
        +VI+E K+SG +AD VC SF  FS+ R+EA   KGGIWV WK +RVSL+E   + Q  HFR  + + SG FTT+YGSPQR+++RELW FL+S+      P
Subjt:  LVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGSPQRATRRELWIFLESIASEIAEP

Query:  WLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR------------DT------------------
        WLL+GDFN I  + EK G A  +P  AT F+  +++CQL+DLGS+GPKFTW+GPL+ G  R+FE+LDR            DT                  
Subjt:  WLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR------------DT------------------

Query:  -AGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKKV
         A  C S      F F+AAW  H+ F SFL + W  S+P  + LA  +GK   W+K++F S+ K K +
Subjt:  -AGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKKV

A0A6J1DGZ9 uncharacterized protein LOC1110207902.0e-119100Show/hide
Query:  RCNTETKVNDNSSGILDSVGADSRVPKESNIPEFPQKPFIPVTVSQLNPSPGHGPWMLVDHSKRNGGGKPPRTRSGRGFISSKNLNTKWNLNSDPNEEPM
        RCNTETKVNDNSSGILDSVGADSRVPKESNIPEFPQKPFIPVTVSQLNPSPGHGPWMLVDHSKRNGGGKPPRTRSGRGFISSKNLNTKWNLNSDPNEEPM
Subjt:  RCNTETKVNDNSSGILDSVGADSRVPKESNIPEFPQKPFIPVTVSQLNPSPGHGPWMLVDHSKRNGGGKPPRTRSGRGFISSKNLNTKWNLNSDPNEEPM

Query:  NVLDTIVDPTMKRDPIKSFQIKSVERVRLARDQDSWRFKNKIPFAGPGANFSSLPFDYQGTLVIQDTDGRRKPRWMDQDYDPGDMDSEQEDDTDPKSPIP
        NVLDTIVDPTMKRDPIKSFQIKSVERVRLARDQDSWRFKNKIPFAGPGANFSSLPFDYQGTLVIQDTDGRRKPRWMDQDYDPGDMDSEQEDDTDPKSPIP
Subjt:  NVLDTIVDPTMKRDPIKSFQIKSVERVRLARDQDSWRFKNKIPFAGPGANFSSLPFDYQGTLVIQDTDGRRKPRWMDQDYDPGDMDSEQEDDTDPKSPIP

Query:  RNGYFDSAGPQQ
        RNGYFDSAGPQQ
Subjt:  RNGYFDSAGPQQ

A0A6P8E5K3 uncharacterized protein LOC1162123241.0e-5937.01Show/hide
Query:  QGAGSPRFKYVLRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGS
        +GAGS RF  V+++MI+ + P+++VI+E ++SG  AD VC  F  +S  R+EA  F GGIWV W+PN V +   +RH Q  H RI++   +  FT +Y S
Subjt:  QGAGSPRFKYVLRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVFTTIYGS

Query:  PQRATRRELWIFLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR---------
        P    RR+LW  L +I++ I  PW++LGDFN IL + EK G A FNP  A  F   ++NC L+DL S+GP+FTW GP+  G  R+FE+LDR         
Subjt:  PQRATRRELWIFLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDR---------

Query:  ----------------------DTAGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKKVSFMGSSGKS
                              DT GF  S   ++PFR+MAAW  HK FP F+ E W     L  GL   + +   WN+ +F SI++RK+       G  
Subjt:  ----------------------DTAGFCASSHSQKPFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKKVSFMGSSGKS

Query:  YYYFKGVLRFDRENFRFWKMQVTDLLTYKKIQKTLKERSAIGMIDKEWVEMDEQ
            +G   F          ++ ++L+ ++I    K RS       EWV++ E+
Subjt:  YYYFKGVLRFDRENFRFWKMQVTDLLTYKKIQKTLKERSAIGMIDKEWVEMDEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein2.7e-0432.32Show/hide
Query:  ATRRELWIFLESIASE---IAEPWLLLGDFNEILVSKEKLGVASFNPSLATL--FINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDRDTAGFC
        A RR LW  +  +++       PWL++GDFN+I    E   +   N SL  L      M +  L+DL   G  +TW     Q  + I  KLDR     C
Subjt:  ATRRELWIFLESIASE---IAEPWLLLGDFNEILVSKEKLGVASFNPSLATL--FINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDRDTAGFC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTGGGAGATACGGATGAAGACTTTGAGATGGGGGCTGAAACTCCCCTGGAAGATGCTTCCAGTAATGAAGAGGTAGATTCGTATCCCTCGGATGATGTGAAATT
TTCGAAAGTACAATGGAAGAATTTTAGGGCACCGTGGAGGTCTGCCTTAATTATAAAAGTTCTAGGCAAGTCTTTTGGGTATCAATTTCTTCTCCGAAGGTTAACCGCAA
TCTGGAAACCGAAAGGGGATTTCAAGTTGATCAACCTTAGAATGAAGGGAATCAGGCATTTCATAGGCAAGACTCTAAAAATTGATTTCAAGACCCAATCCGGGAAGATG
GGCCACTTTGCGAGAATCTACGTTGAGGTGGACCTTACGAAGAAGCTTCGACCTGACTTTACAATTTTACGAGAAAGATGCAACACTGAAACGAAGGTTAACGATAACTC
ATCTGGAATTCTTGATTCCGTTGGTGCAGATTCGAGGGTTCCAAAGGAAAGCAATATCCCTGAGTTTCCTCAAAAGCCTTTCATACCAGTGACAGTTTCTCAGCTAAATC
CTTCACCTGGTCATGGGCCTTGGATGTTGGTAGATCATTCCAAGAGAAATGGGGGAGGTAAACCACCTCGTACAAGGTCAGGAAGAGGTTTCATATCATCTAAAAACTTG
AACACTAAGTGGAATTTAAATTCGGACCCAAATGAAGAGCCCATGAATGTTCTTGACACAATTGTTGATCCAACTATGAAGCGTGATCCTATAAAGTCTTTCCAAATTAA
GTCAGTTGAGAGGGTGAGGTTAGCAAGAGATCAAGACTCTTGGAGGTTTAAGAACAAGATTCCTTTTGCTGGACCTGGGGCAAATTTTTCAAGTCTTCCTTTCGATTATC
AGGGGACCCTTGTAATTCAAGATACGGATGGAAGACGAAAACCTCGTTGGATGGATCAAGATTATGACCCTGGGGACATGGACTCAGAACAGGAAGATGATACTGATCCA
AAATCACCAATTCCTCGCAATGGATATTTTGATTCGGCTGGGCCTCAACAAGGGGCTGGCAGCCCCCGTTTTAAATATGTTCTTAGGGATATGATCCAAGCTTATAATCC
GGATGTGCTAGTAATCTTAGAAACCAAAGTTAGTGGTATTGTGGCCGATAGAGTTTGTGGTAGTTTTTTAAGATTTTCTTATACTCGCATTGAAGCTTCGAGATTTAAGG
GTGGCATTTGGGTGCTGTGGAAACCTAATCGTGTTTCACTTGTTGAGTCAAATAGACACAATCAAGTCTTTCACTTTAGGATCACTAAAGGTAGTTTCTCGGGTGTGTTT
ACTACTATATATGGGAGCCCGCAGAGGGCCACAAGAAGGGAATTATGGATTTTTCTCGAGTCAATCGCGTCTGAAATTGCTGAGCCTTGGCTACTATTAGGTGACTTTAA
TGAGATTTTGGTGAGCAAGGAAAAGTTGGGTGTTGCCTCTTTTAATCCGAGTTTAGCGACCCTGTTTATCAATATGATGAATAATTGCCAGCTATTGGACCTTGGCAGTA
CAGGACCTAAGTTTACTTGGAGAGGGCCGCTGTTACAAGGGTGCTCTAGAATTTTTGAGAAGTTGGATAGAGACACGGCTGGATTTTGTGCCAGTAGTCATTCTCAAAAA
CCTTTTCGGTTTATGGCAGCATGGCTAACCCATAAGATGTTTCCCTCATTCCTCGAAGAGATTTGGCCCAAATCTGAACCTCTAACAACAGGGTTAGCAACTGCCCAAGG
GAAATTTTCGCATTGGAATAAGGAAATATTCCAAAGCATTTCTAAAAGGAAAAAGGTTAGCTTCATGGGATCCAGTGGAAAGTCATATTACTATTTCAAGGGAGTTTTGA
GGTTTGATAGGGAGAATTTCAGGTTTTGGAAGATGCAAGTGACAGATCTTCTTACATACAAGAAGATACAGAAAACCCTGAAAGAACGATCGGCTATTGGGATGATAGAT
AAGGAATGGGTGGAGATGGATGAACAGATCGTAGCGATCATCAGGTTGTGCTTGTCGATGAATGTGACAAGTCTTGTAGCAAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTGGGAGATACGGATGAAGACTTTGAGATGGGGGCTGAAACTCCCCTGGAAGATGCTTCCAGTAATGAAGAGGTAGATTCGTATCCCTCGGATGATGTGAAATT
TTCGAAAGTACAATGGAAGAATTTTAGGGCACCGTGGAGGTCTGCCTTAATTATAAAAGTTCTAGGCAAGTCTTTTGGGTATCAATTTCTTCTCCGAAGGTTAACCGCAA
TCTGGAAACCGAAAGGGGATTTCAAGTTGATCAACCTTAGAATGAAGGGAATCAGGCATTTCATAGGCAAGACTCTAAAAATTGATTTCAAGACCCAATCCGGGAAGATG
GGCCACTTTGCGAGAATCTACGTTGAGGTGGACCTTACGAAGAAGCTTCGACCTGACTTTACAATTTTACGAGAAAGATGCAACACTGAAACGAAGGTTAACGATAACTC
ATCTGGAATTCTTGATTCCGTTGGTGCAGATTCGAGGGTTCCAAAGGAAAGCAATATCCCTGAGTTTCCTCAAAAGCCTTTCATACCAGTGACAGTTTCTCAGCTAAATC
CTTCACCTGGTCATGGGCCTTGGATGTTGGTAGATCATTCCAAGAGAAATGGGGGAGGTAAACCACCTCGTACAAGGTCAGGAAGAGGTTTCATATCATCTAAAAACTTG
AACACTAAGTGGAATTTAAATTCGGACCCAAATGAAGAGCCCATGAATGTTCTTGACACAATTGTTGATCCAACTATGAAGCGTGATCCTATAAAGTCTTTCCAAATTAA
GTCAGTTGAGAGGGTGAGGTTAGCAAGAGATCAAGACTCTTGGAGGTTTAAGAACAAGATTCCTTTTGCTGGACCTGGGGCAAATTTTTCAAGTCTTCCTTTCGATTATC
AGGGGACCCTTGTAATTCAAGATACGGATGGAAGACGAAAACCTCGTTGGATGGATCAAGATTATGACCCTGGGGACATGGACTCAGAACAGGAAGATGATACTGATCCA
AAATCACCAATTCCTCGCAATGGATATTTTGATTCGGCTGGGCCTCAACAAGGGGCTGGCAGCCCCCGTTTTAAATATGTTCTTAGGGATATGATCCAAGCTTATAATCC
GGATGTGCTAGTAATCTTAGAAACCAAAGTTAGTGGTATTGTGGCCGATAGAGTTTGTGGTAGTTTTTTAAGATTTTCTTATACTCGCATTGAAGCTTCGAGATTTAAGG
GTGGCATTTGGGTGCTGTGGAAACCTAATCGTGTTTCACTTGTTGAGTCAAATAGACACAATCAAGTCTTTCACTTTAGGATCACTAAAGGTAGTTTCTCGGGTGTGTTT
ACTACTATATATGGGAGCCCGCAGAGGGCCACAAGAAGGGAATTATGGATTTTTCTCGAGTCAATCGCGTCTGAAATTGCTGAGCCTTGGCTACTATTAGGTGACTTTAA
TGAGATTTTGGTGAGCAAGGAAAAGTTGGGTGTTGCCTCTTTTAATCCGAGTTTAGCGACCCTGTTTATCAATATGATGAATAATTGCCAGCTATTGGACCTTGGCAGTA
CAGGACCTAAGTTTACTTGGAGAGGGCCGCTGTTACAAGGGTGCTCTAGAATTTTTGAGAAGTTGGATAGAGACACGGCTGGATTTTGTGCCAGTAGTCATTCTCAAAAA
CCTTTTCGGTTTATGGCAGCATGGCTAACCCATAAGATGTTTCCCTCATTCCTCGAAGAGATTTGGCCCAAATCTGAACCTCTAACAACAGGGTTAGCAACTGCCCAAGG
GAAATTTTCGCATTGGAATAAGGAAATATTCCAAAGCATTTCTAAAAGGAAAAAGGTTAGCTTCATGGGATCCAGTGGAAAGTCATATTACTATTTCAAGGGAGTTTTGA
GGTTTGATAGGGAGAATTTCAGGTTTTGGAAGATGCAAGTGACAGATCTTCTTACATACAAGAAGATACAGAAAACCCTGAAAGAACGATCGGCTATTGGGATGATAGAT
AAGGAATGGGTGGAGATGGATGAACAGATCGTAGCGATCATCAGGTTGTGCTTGTCGATGAATGTGACAAGTCTTGTAGCAAGTTAG
Protein sequenceShow/hide protein sequence
MVLGDTDEDFEMGAETPLEDASSNEEVDSYPSDDVKFSKVQWKNFRAPWRSALIIKVLGKSFGYQFLLRRLTAIWKPKGDFKLINLRMKGIRHFIGKTLKIDFKTQSGKM
GHFARIYVEVDLTKKLRPDFTILRERCNTETKVNDNSSGILDSVGADSRVPKESNIPEFPQKPFIPVTVSQLNPSPGHGPWMLVDHSKRNGGGKPPRTRSGRGFISSKNL
NTKWNLNSDPNEEPMNVLDTIVDPTMKRDPIKSFQIKSVERVRLARDQDSWRFKNKIPFAGPGANFSSLPFDYQGTLVIQDTDGRRKPRWMDQDYDPGDMDSEQEDDTDP
KSPIPRNGYFDSAGPQQGAGSPRFKYVLRDMIQAYNPDVLVILETKVSGIVADRVCGSFLRFSYTRIEASRFKGGIWVLWKPNRVSLVESNRHNQVFHFRITKGSFSGVF
TTIYGSPQRATRRELWIFLESIASEIAEPWLLLGDFNEILVSKEKLGVASFNPSLATLFINMMNNCQLLDLGSTGPKFTWRGPLLQGCSRIFEKLDRDTAGFCASSHSQK
PFRFMAAWLTHKMFPSFLEEIWPKSEPLTTGLATAQGKFSHWNKEIFQSISKRKKVSFMGSSGKSYYYFKGVLRFDRENFRFWKMQVTDLLTYKKIQKTLKERSAIGMID
KEWVEMDEQIVAIIRLCLSMNVTSLVAS