; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g29360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g29360
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:21405538..21414404
RNA-Seq ExpressionMoc11g29360
SyntenyMoc11g29360
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4349753.1 hypothetical protein G4B88_029501, partial [Cannabis sativa]3.9e-2331.14Show/hide
Query:  LCLLWSSEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLD
        LC L   ++  L  + GGD NEIL   EK+GG  R  S +  F+  +D C L++M  D   FTW N+RQ    +++RLDRF C + +  LF    V   D
Subjt:  LCLLWSSEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLD

Query:  WAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYHLSESA---KALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLMAPYDKKEVEVELGLRPSYFWK
        + HS+HR     +E +  R ++K   + FR   + L +        + W  +    +  E +I++                         LG      W 
Subjt:  WAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYHLSESA---KALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLMAPYDKKEVEVELGLRPSYFWK

Query:  SFIWG---RELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRF
           +G   + LL  G+R+ VGDG+S+ AFRD W+PR  +F+P + P  ++ L V+DFI  D +WD   L   F
Subjt:  SFIWG---RELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRF

KAF4363712.1 hypothetical protein G4B88_030211, partial [Cannabis sativa]1.4e-2029.71Show/hide
Query:  LCLLWSSEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLD
        LC L   ++  L  + GGD NEIL   EK+GG  R  S +  F+  +D C L++M  D   FTW N+RQ    +++RLDRF C + +  LF    V   D
Subjt:  LCLLWSSEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLD

Query:  WAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYHLSESA---KALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLMAPYDKKEVEVELGLRPSYFWK
        + HS+HR     +E +  R ++K   + FR   + L +        + W  +    +  + +I++          +  +  ++K     + G  P    +
Subjt:  WAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYHLSESA---KALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLMAPYDKKEVEVELGLRPSYFWK

Query:  SFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVD
        +         K L   + DG+S+ AFRD W+PR  +F+P + P  +  + V+DFI  D +WD   L   F R DVD
Subjt:  SFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVD

KAF4386840.1 hypothetical protein F8388_006795 [Cannabis sativa]1.5e-2230.86Show/hide
Query:  GDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNHRATTVEVEVST
        GD NEI+   EK GG  R P ++  FR V+DDCRL++        TW N      +I +RLDR LC E + + F    +  LDW  S+HRA  V++ V  
Subjt:  GDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNHRATTVEVEVST

Query:  ARNRRKGGPKPFRSIQYHLSESAKALRLWGVRRPKEDVIEKVIEVINPKVTP-----EMNSMLMA--PYDKKEVEVELGLRPSYFWKSFIWGRELLVKGL
          +  K G K  R  ++H  E+      W       +++++V +  + +  P     ++N    A   ++KK+   + GL            ++++  G 
Subjt:  ARNRRKGGPKPFRSIQYHLSESAKALRLWGVRRPKEDVIEKVIEVINPKVTP-----EMNSMLMA--PYDKKEVEVELGLRPSYFWKSFIWGRELLVKGL

Query:  RRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLP
        R R+G+G S+R   D WLPR  TFK    PPL + L V D  + + +WD   ++  F   D D+I ++P
Subjt:  RRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLP

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.4e-2052.08Show/hide
Query:  RPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLPLCNLILK
        + SYFWK F+WGR+LLVKGLR RVG+G +I+AF D WLPR +TFKPL     + D TVA FI +D +WD   + + FC ED D+I  +P+ +  L+
Subjt:  RPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLPLCNLILK

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.0e-0740Show/hide
Query:  SEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNH
        S ++    L GGDMN ILW  E     S   S I AFRN++D C L +M      FTW N R   +Q+  RLDRFLC + F+ +F     H   W+++ H
Subjt:  SEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNH

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.1e-2024.81Show/hide
Query:  EGCFSVDSSGASGGLCLLWSSEVEVLVG---------------------------------------------------LGGGDMNEILWEEEKEGGCSR
        E  +SVD  G SGGL L+W   +++ V                                                    L  GD NEI+   EK GG  R
Subjt:  EGCFSVDSSGASGGLCLLWSSEVEVLVG---------------------------------------------------LGGGDMNEILWEEEKEGGCSR

Query:  RPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYH
            +  FR V+DDCRL+    +    TW N  +    + +RLDR LC E +   F    +  LDW  S+H+   V++ +    N+     K  R   +H
Subjt:  RPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYH

Query:  LSESAKALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLM-APYDKKEVEVELGLRPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTF
          E+      W     +ED  ++++E    K   E + ++    Y +K  EV   L          W R+             KS+R   D WLPR  TF
Subjt:  LSESAKALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLM-APYDKKEVEVELGLRPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTF

Query:  KPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLPLCNLILKGLAI------EDVCPMCGNGGETIEHTLLTCKRAREFWEVCSPKVRQR
        K    P L ++L V D  ++D  WD + +++ F   D ++I  +P  +   +   +       +     G+  E+I H L  CK ++ +W V       +
Subjt:  KPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLPLCNLILKGLAI------EDVCPMCGNGGETIEHTLLTCKRAREFWEVCSPKVRQR

Query:  KVL
        K+L
Subjt:  KVL

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248741.2e-2052.08Show/hide
Query:  RPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLPLCNLILK
        + SYFWK F+WGR+LLVKGLR RVG+G +I+AF D WLPR +TFKPL     + D TVA FI +D +WD   + + FC ED D+I  +P+ +  L+
Subjt:  RPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLPLCNLILK

A0A6J1DX30 uncharacterized protein LOC1110248745.0e-0840Show/hide
Query:  SEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNH
        S ++    L GGDMN ILW  E     S   S I AFRN++D C L +M      FTW N R   +Q+  RLDRFLC + F+ +F     H   W+++ H
Subjt:  SEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNH

A0A6J1DX30 uncharacterized protein LOC1110248747.0e-1848.86Show/hide
Query:  SYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLPL
        SYFWK FIWGR+LL+KGLR RVG+G +I  F D W+PR  +F+P+  P    D+ VAD I  +  WD +++   FC ED D+I  +P+
Subjt:  SYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLPL

A0A7J6DUI7 Uncharacterized protein (Fragment)1.9e-2331.14Show/hide
Query:  LCLLWSSEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLD
        LC L   ++  L  + GGD NEIL   EK+GG  R  S +  F+  +D C L++M  D   FTW N+RQ    +++RLDRF C + +  LF    V   D
Subjt:  LCLLWSSEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLD

Query:  WAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYHLSESA---KALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLMAPYDKKEVEVELGLRPSYFWK
        + HS+HR     +E +  R ++K   + FR   + L +        + W  +    +  E +I++                         LG      W 
Subjt:  WAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYHLSESA---KALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLMAPYDKKEVEVELGLRPSYFWK

Query:  SFIWG---RELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRF
           +G   + LL  G+R+ VGDG+S+ AFRD W+PR  +F+P + P  ++ L V+DFI  D +WD   L   F
Subjt:  SFIWG---RELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRF

A0A7J6EZ57 Uncharacterized protein6.8e-2129.71Show/hide
Query:  LCLLWSSEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLD
        LC L   ++  L  + GGD NEIL   EK+GG  R  S +  F+  +D C L++M  D   FTW N+RQ    +++RLDRF C + +  LF    V   D
Subjt:  LCLLWSSEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLD

Query:  WAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYHLSESA---KALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLMAPYDKKEVEVELGLRPSYFWK
        + HS+HR     +E +  R ++K   + FR   + L +        + W  +    +  + +I++          +  +  ++K     + G  P    +
Subjt:  WAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYHLSESA---KALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLMAPYDKKEVEVELGLRPSYFWK

Query:  SFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVD
        +         K L   + DG+S+ AFRD W+PR  +F+P + P  +  + V+DFI  D +WD   L   F R DVD
Subjt:  SFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVD

A0A7J6GWY1 Uncharacterized protein7.2e-2330.86Show/hide
Query:  GDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNHRATTVEVEVST
        GD NEI+   EK GG  R P ++  FR V+DDCRL++        TW N      +I +RLDR LC E + + F    +  LDW  S+HRA  V++ V  
Subjt:  GDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLCYEAFSSLFHVIQVHNLDWAHSNHRATTVEVEVST

Query:  ARNRRKGGPKPFRSIQYHLSESAKALRLWGVRRPKEDVIEKVIEVINPKVTP-----EMNSMLMA--PYDKKEVEVELGLRPSYFWKSFIWGRELLVKGL
          +  K G K  R  ++H  E+      W       +++++V +  + +  P     ++N    A   ++KK+   + GL            ++++  G 
Subjt:  ARNRRKGGPKPFRSIQYHLSESAKALRLWGVRRPKEDVIEKVIEVINPKVTP-----EMNSMLMA--PYDKKEVEVELGLRPSYFWKSFIWGRELLVKGL

Query:  RRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLP
        R R+G+G S+R   D WLPR  TFK    PPL + L V D  + + +WD   ++  F   D D+I ++P
Subjt:  RRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLP

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003104.5e-0644.44Show/hide
Query:  VEVELGLRPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPL
        +E  +G RPSY W+S I GRELL +GL R +GDG   + + D W+   +   PL
Subjt:  VEVELGLRPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPL

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein2.1e-0634Show/hide
Query:  LGLRPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWL---PRLSTFKPLVVPP-----LSQDLTVADFI-RSDRSWDNNMLQNRFCREDVDIIRRL
        LG RPS+ WKS    +E+L +G R  VG+G+ I  +R  WL   P  +  +   VPP     +S  L V+D I  S R W  ++++  F   +  +I  L
Subjt:  LGLRPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWL---PRLSTFKPLVVPP-----LSQDLTVADFI-RSDRSWDNNMLQNRFCREDVDIIRRL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.2e-0744.44Show/hide
Query:  VEVELGLRPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPL
        +E  +G RPSY W+S I GRELL +GL R +GDG   + + D W+   +   PL
Subjt:  VEVELGLRPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGCCGGTCGTCGTCGCTCCCTCTCCTATTGCCGTCGGCCTTTGCAGCTGCAGCCCGCCGTTGTCGTCGAAGCAGCTTCAATTCCGGAGGAACAAGACAAT
TCGATTGGCTTCACTCCTTCTAGTAATAGGACAAGTCCACTTCACAAGTCCATTGGGCTGCCAAAGGCTAGCACAATAAATTTTTTGCCTGATAGTTATAATGAT
GTTAAGTCTGCAATGAAGTATGGTAAATATGATCTAACGTCTGCTATTGTGATTAATGTCATTAAAATGAAAGAGTTAGAATTGAAGGATAGTAAAAATAATAAT
AGCAAGTCTTTGTTTGTTAGTGAGAGATCAAATAATAAGAAATCCCATAATAGGGGAAGAAGTAGAAACAGATCGAAGTCTAGGGAGCCTAGTCATACGAGGTGT
TATAATTGTAATAAAGAAGGTCATATTAGGAGATTTTGTTCCAACCTTAGGAAGAATGGGGGTGGTAACCATAACCAAAACCAAAAGGGTAAAGGGAAAGAGGAT
AAGGATGTGGTCAACCTCGGTGAGGAGTATGAGGAGTATGAGGAGTATGACCATGTCCTTATAGTGAGTGAGCCTAAAACAAAGTGTGACAAGTATGTATCAAAT
AATTTAAAGATAAAGCTTCAGTTTGAGGGTTGCTTCTCTGTTGATAGTAGTGGTGCAAGTGGCGGGCTGTGTTTGCTGTGGAGCTCTGAAGTTGAGGTCTTGGTT
GGTTTGGGGGGAGGGGATATGAATGAGATTTTGTGGGAGGAAGAGAAAGAAGGTGGCTGTTCTCGTCGACCGTCTTTGATTTTGGCATTTCGAAATGTTGTGGAT
GATTGTAGGCTTATGGAGATGGAGGTGGACGATGTTCCATTTACATGGAATAATAGACGTCAAAAGGAGGAGCAAATCAGGGATCGGTTAGACAGATTCTTATGT
TACGAAGCTTTTTCTTCCTTGTTTCATGTGATACAAGTCCACAATCTAGATTGGGCGCATTCGAATCATAGAGCCACAACAGTGGAGGTCGAGGTTTCTACGGCT
AGGAATAGGCGAAAGGGAGGGCCAAAACCTTTTCGGTCAATCCAGTACCATCTGTCAGAGTCGGCCAAGGCGCTTCGGTTGTGGGGGGTAAGGAGGCCGAAGGAA
GATGTTATTGAGAAGGTGATTGAGGTGATCAACCCAAAGGTGACTCCAGAGATGAACTCAATGTTGATGGCTCCTTATGATAAGAAAGAGGTGGAGGTTGAGTTG
GGGTTAAGACCTTCATACTTTTGGAAAAGCTTTATTTGGGGAAGAGAGTTGTTGGTTAAAGGCCTCCGTAGGAGAGTGGGAGATGGAAAGTCTATCAGAGCGTTT
AGGGATGGTTGGTTGCCTAGGCTGTCTACGTTTAAACCTTTGGTGGTGCCTCCTTTATCTCAGGACCTTACAGTTGCGGACTTTATTCGGTCGGATAGGAGTTGG
GATAATAATATGCTTCAGAATCGGTTTTGTCGAGAAGATGTGGATATAATTAGAAGATTACCGCTATGCAACCTAATCTTGAAGGGGTTAGCTATTGAGGATGTT
TGCCCTATGTGTGGTAATGGTGGAGAGACGATTGAGCATACCTTGTTGACATGCAAGCGGGCACGCGAATTCTGGGAAGTGTGTTCTCCTAAGGTGAGGCAAAGA
AAAGTGTTGCATGGTTCTTTGATGTCATTTTGGGAGCAGTTGGGCGACGTTATGTCGTTAGGTCAACGGGAGGTGGTGACAGTGGCTTTAAATCCAATGGAAGTT
AGGTTGTCCGATTTGCCCAAAGAGTGGTTCCGGGTGATGGTGGATGCGGCGTGTGCGACTTCAGTGCCGGTGTTAGGGTGGGTGTTGCTGTGTAGAATGGAGAAA
ATGCCCTTCAGAGGCAATTGCGCTGATCCATGGAGAGGGGCAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCGCCGGTCGTCGTCGCTCCCTCTCCTATTGCCGTCGGCCTTTGCAGCTGCAGCCCGCCGTTGTCGTCGAAGCAGCTTCAATTCCGGAGGAACAAGACAAT
TCGATTGGCTTCACTCCTTCTAGTAATAGGACAAGTCCACTTCACAAGTCCATTGGGCTGCCAAAGGCTAGCACAATAAATTTTTTGCCTGATAGTTATAATGAT
GTTAAGTCTGCAATGAAGTATGGTAAATATGATCTAACGTCTGCTATTGTGATTAATGTCATTAAAATGAAAGAGTTAGAATTGAAGGATAGTAAAAATAATAAT
AGCAAGTCTTTGTTTGTTAGTGAGAGATCAAATAATAAGAAATCCCATAATAGGGGAAGAAGTAGAAACAGATCGAAGTCTAGGGAGCCTAGTCATACGAGGTGT
TATAATTGTAATAAAGAAGGTCATATTAGGAGATTTTGTTCCAACCTTAGGAAGAATGGGGGTGGTAACCATAACCAAAACCAAAAGGGTAAAGGGAAAGAGGAT
AAGGATGTGGTCAACCTCGGTGAGGAGTATGAGGAGTATGAGGAGTATGACCATGTCCTTATAGTGAGTGAGCCTAAAACAAAGTGTGACAAGTATGTATCAAAT
AATTTAAAGATAAAGCTTCAGTTTGAGGGTTGCTTCTCTGTTGATAGTAGTGGTGCAAGTGGCGGGCTGTGTTTGCTGTGGAGCTCTGAAGTTGAGGTCTTGGTT
GGTTTGGGGGGAGGGGATATGAATGAGATTTTGTGGGAGGAAGAGAAAGAAGGTGGCTGTTCTCGTCGACCGTCTTTGATTTTGGCATTTCGAAATGTTGTGGAT
GATTGTAGGCTTATGGAGATGGAGGTGGACGATGTTCCATTTACATGGAATAATAGACGTCAAAAGGAGGAGCAAATCAGGGATCGGTTAGACAGATTCTTATGT
TACGAAGCTTTTTCTTCCTTGTTTCATGTGATACAAGTCCACAATCTAGATTGGGCGCATTCGAATCATAGAGCCACAACAGTGGAGGTCGAGGTTTCTACGGCT
AGGAATAGGCGAAAGGGAGGGCCAAAACCTTTTCGGTCAATCCAGTACCATCTGTCAGAGTCGGCCAAGGCGCTTCGGTTGTGGGGGGTAAGGAGGCCGAAGGAA
GATGTTATTGAGAAGGTGATTGAGGTGATCAACCCAAAGGTGACTCCAGAGATGAACTCAATGTTGATGGCTCCTTATGATAAGAAAGAGGTGGAGGTTGAGTTG
GGGTTAAGACCTTCATACTTTTGGAAAAGCTTTATTTGGGGAAGAGAGTTGTTGGTTAAAGGCCTCCGTAGGAGAGTGGGAGATGGAAAGTCTATCAGAGCGTTT
AGGGATGGTTGGTTGCCTAGGCTGTCTACGTTTAAACCTTTGGTGGTGCCTCCTTTATCTCAGGACCTTACAGTTGCGGACTTTATTCGGTCGGATAGGAGTTGG
GATAATAATATGCTTCAGAATCGGTTTTGTCGAGAAGATGTGGATATAATTAGAAGATTACCGCTATGCAACCTAATCTTGAAGGGGTTAGCTATTGAGGATGTT
TGCCCTATGTGTGGTAATGGTGGAGAGACGATTGAGCATACCTTGTTGACATGCAAGCGGGCACGCGAATTCTGGGAAGTGTGTTCTCCTAAGGTGAGGCAAAGA
AAAGTGTTGCATGGTTCTTTGATGTCATTTTGGGAGCAGTTGGGCGACGTTATGTCGTTAGGTCAACGGGAGGTGGTGACAGTGGCTTTAAATCCAATGGAAGTT
AGGTTGTCCGATTTGCCCAAAGAGTGGTTCCGGGTGATGGTGGATGCGGCGTGTGCGACTTCAGTGCCGGTGTTAGGGTGGGTGTTGCTGTGTAGAATGGAGAAA
ATGCCCTTCAGAGGCAATTGCGCTGATCCATGGAGAGGGGCAGAATAG
Protein sequenceShow/hide protein sequence
MFAGRRRSLSYCRRPLQLQPAVVVEAASIPEEQDNSIGFTPSSNRTSPLHKSIGLPKASTINFLPDSYNDVKSAMKYGKYDLTSAIVINVIKMKELELKDSKNNN
SKSLFVSERSNNKKSHNRGRSRNRSKSREPSHTRCYNCNKEGHIRRFCSNLRKNGGGNHNQNQKGKGKEDKDVVNLGEEYEEYEEYDHVLIVSEPKTKCDKYVSN
NLKIKLQFEGCFSVDSSGASGGLCLLWSSEVEVLVGLGGGDMNEILWEEEKEGGCSRRPSLILAFRNVVDDCRLMEMEVDDVPFTWNNRRQKEEQIRDRLDRFLC
YEAFSSLFHVIQVHNLDWAHSNHRATTVEVEVSTARNRRKGGPKPFRSIQYHLSESAKALRLWGVRRPKEDVIEKVIEVINPKVTPEMNSMLMAPYDKKEVEVEL
GLRPSYFWKSFIWGRELLVKGLRRRVGDGKSIRAFRDGWLPRLSTFKPLVVPPLSQDLTVADFIRSDRSWDNNMLQNRFCREDVDIIRRLPLCNLILKGLAIEDV
CPMCGNGGETIEHTLLTCKRAREFWEVCSPKVRQRKVLHGSLMSFWEQLGDVMSLGQREVVTVALNPMEVRLSDLPKEWFRVMVDAACATSVPVLGWVLLCRMEK
MPFRGNCADPWRGAE