; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g26340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g26340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:19878736..19892357
RNA-Seq ExpressionMoc06g26340
SyntenyMoc06g26340
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5758504.1 putative RNA-directed DNA polymerase [Helianthus annuus]5.6e-5851.62Show/hide
Query:  SPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHG
        SP++  VEK+DGRINFGLWQVQVKDVLIQSGLHKAL+G+P+  SSK         +SSG        S   DE+WE++DLR ASAIR  LAKN+LANVHG
Subjt:  SPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHG

Query:  ISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATS
        ISTAK+  EKLE LYQ K                                      AI VK++DEDKALR ILSL   YEHMKPILMYGK+ L + D T 
Subjt:  ISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATS

Query:  KLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNRAGSSKGSRQDADSVSLIRGDD
        KLL EE+RL S G TS E + L+  N   KK   QK   CW CGQSGH+K+NCP  A S+  S+  A++V+++ GDD
Subjt:  KLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNRAGSSKGSRQDADSVSLIRGDD

KAF5765959.1 putative RNA-directed DNA polymerase [Helianthus annuus]5.1e-5951.99Show/hide
Query:  SPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHG
        SP++ DVEK+DGRINFGLWQVQVKDVLIQSGLHKAL+G+P+  SSK         +SSG        S   DE+WE++DLR ASAIR  LAKN+LANVHG
Subjt:  SPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHG

Query:  ISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATS
        ISTAK+  EKLE LYQ K                                      AI VK++DEDKALR ILSL   YEHMKPILMYGK+ L + D T 
Subjt:  ISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATS

Query:  KLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNRAGSSKGSRQDADSVSLIRGDD
        KLL EE+RL S G TS E + L+  N   KK   QK   CW CGQSGH+K+NCP  A S+  S+  A++V+++ GDD
Subjt:  KLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNRAGSSKGSRQDADSVSLIRGDD

QHN81458.1 Retrovirus-related Pol polyprotein [Arachis hypogaea]2.6e-5547.89Show/hide
Query:  MSIFMSPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNIL
        MS + S VK+++EKFDGRINFGLWQ+QVKDVLIQSGLHKALK R                            S M DE+WEE+DLR ASAIR  LAKN+L
Subjt:  MSIFMSPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNIL

Query:  ANVHGISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNF
        ANV GI TAKE  +KLE LYQ+K                                      AI VKIDDEDKALR ILSLP  YE++KP+LMYGK+ LNF
Subjt:  ANVHGISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNF

Query:  VDATSKLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNRAGSSKGSRQDADSVSLIRGDDDL
         +  SKL+ EERR+K+EG TS  D  LV  +    K +  +   CW CG+SGH+K+NCP  A S K S+ D  +++L   +DD+
Subjt:  VDATSKLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNRAGSSKGSRQDADSVSLIRGDDDL

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia]4.0e-9673.88Show/hide
Query:  TKECKVGMSIFMSPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRT
        T EC+  MS FMSPVKIDVEKFDG INFGLWQVQVKDVLIQS LHKALKGRPS+G+S+KLS+DGG MESSGGSSRGSKKSSMS EDWEEMDLR ASAIRT
Subjt:  TKECKVGMSIFMSPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRT

Query:  GLAKNILANVHGISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMY
         LAKNILANVH ISTAKE  EKLEALYQAK                                      AIEVKIDDEDKALR ILSLP  YEHMKPILMY
Subjt:  GLAKNILANVHGISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMY

Query:  GKDNLNFVDATSKLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNR
        GKD LNF + TSKLL EERRLKSEGRTSHEDS LV SNWK KKDS QKK CCWGCGQSGHMKK+CPNR
Subjt:  GKDNLNFVDATSKLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNR

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]2.2e-6249.69Show/hide
Query:  IVDYLHSKELELPLEKKPDDMEEAKWKKLDRKVLGTY-----------------------------------------------------VAAHINEFDM
        ++DYLHSKELE PLE KPDDM E +WKKLDRKVLGT                                                      + AH+NEFD+
Subjt:  IVDYLHSKELELPLEKKPDDMEEAKWKKLDRKVLGTY-----------------------------------------------------VAAHINEFDM

Query:  LINKLVAVDLTFTDELNAILLLRSLPNSWEPMKATISNSCGKEKLKLAYVRDTALGEEIRRKNSSIASTS------------------------------
        LINKLVAVDL F+ E+ AILLLRSLP+SWEPM+A ISNSC KEKLK   VRD AL EEIRRK+S IA TS                              
Subjt:  LINKLVAVDLTFTDELNAILLLRSLPNSWEPMKATISNSCGKEKLKLAYVRDTALGEEIRRKNSSIASTS------------------------------

Query:  ------------GHLRRNCKAPKKTEGKEAGANVVAEEIHDALVLAVE----------GNHGNVYLADGEPLDIIGIGDVNLKMANSSVWKIRKVRHVQN
                    GHL+ NCKAPKK EG EA AN VAE+IHDALV+AVE          GNHG VYLADGEPLDIIGIG+VNLKMAN SVWKIRK      
Subjt:  ------------GHLRRNCKAPKKTEGKEAGANVVAEEIHDALVLAVE----------GNHGNVYLADGEPLDIIGIGDVNLKMANSSVWKIRKVRHVQN

Query:  MMKNLISVGQLDNEGCEISF
                  LDNEGCEISF
Subjt:  MMKNLISVGQLDNEGCEISF

TrEMBL top hitse value%identityAlignment
A0A6A2YS90 Transcription initiation factor IIA subunit 22.6e-5347.31Show/hide
Query:  KIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHGIST
        + D+EKFDGRINFGLWQVQVKD+LIQSGL+KALKG+P+  S      +G   +    SS    KS MS+E+WEE+D+R AS IR  LAKN+LANV   S+
Subjt:  KIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHGIST

Query:  AKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATSKLL
         KE  EKLE +YQAK                                      +I V+IDDEDKALR I SL   YEHM+ +LMYGK+N+NF + TSKL+
Subjt:  AKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATSKLL

Query:  LEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPN-RAGSSKGSRQDADSVSLIRGDDDLFL
         EERRLK+    S E   L     + K   ++KK  CWGCGQ GH+KK+C N  A S+ GS+ DA +V +   +DD F+
Subjt:  LEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPN-RAGSSKGSRQDADSVSLIRGDDDLFL

A0A6A3BK59 CCHC-type domain-containing protein3.1e-5447.67Show/hide
Query:  KIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHGIST
        + D+EKFDGRINFGLWQVQVKD+LIQSGL+KALKG+P+  S      +G   +    SS    KS MS+E+WEE+D+R AS IR  LAKN+LANV   S+
Subjt:  KIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHGIST

Query:  AKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATSKLL
         KE  EKLE +YQAK                                      +I V+IDDEDKALR I SLP  YEHM+ +LMYGK+N+NF + TSKL+
Subjt:  AKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATSKLL

Query:  LEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPN-RAGSSKGSRQDADSVSLIRGDDDLFL
         EERRLK+    S E   L     + K   ++KK  CWGCGQ GH+KK+C N  A S+ GS+ DA +V +   +DD F+
Subjt:  LEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPN-RAGSSKGSRQDADSVSLIRGDDDLFL

A0A6A3CWI3 CCHC-type domain-containing protein2.6e-5347.31Show/hide
Query:  KIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHGIST
        + D+EKFDGRINFGLWQVQVKD+LIQSGL+KALKG+P+  S      +G   +    SS    KS MS+E+WEE+D+R AS IR  LAKN+LANV   S+
Subjt:  KIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHGIST

Query:  AKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATSKLL
         KE  EKLE +YQAK                                      +I V IDDEDKALR I SLP  YEHM+ +LMYGK+N+NF + TSKL+
Subjt:  AKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATSKLL

Query:  LEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPN-RAGSSKGSRQDADSVSLIRGDDDLFL
         EERRLK+    S E   L     + K   ++KK  CWGCGQ GH+KK+C N  A  + GS+ DA +V +   +DD F+
Subjt:  LEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPN-RAGSSKGSRQDADSVSLIRGDDDLFL

A0A6J1CG82 uncharacterized protein LOC1110105211.9e-9673.88Show/hide
Query:  TKECKVGMSIFMSPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRT
        T EC+  MS FMSPVKIDVEKFDG INFGLWQVQVKDVLIQS LHKALKGRPS+G+S+KLS+DGG MESSGGSSRGSKKSSMS EDWEEMDLR ASAIRT
Subjt:  TKECKVGMSIFMSPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRT

Query:  GLAKNILANVHGISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMY
         LAKNILANVH ISTAKE  EKLEALYQAK                                      AIEVKIDDEDKALR ILSLP  YEHMKPILMY
Subjt:  GLAKNILANVHGISTAKERSEKLEALYQAK--------------------------------------AIEVKIDDEDKALRFILSLPPFYEHMKPILMY

Query:  GKDNLNFVDATSKLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNR
        GKD LNF + TSKLL EERRLKSEGRTSHEDS LV SNWK KKDS QKK CCWGCGQSGHMKK+CPNR
Subjt:  GKDNLNFVDATSKLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNR

A0A6J1DF43 uncharacterized protein LOC1110204691.1e-6249.69Show/hide
Query:  IVDYLHSKELELPLEKKPDDMEEAKWKKLDRKVLGTY-----------------------------------------------------VAAHINEFDM
        ++DYLHSKELE PLE KPDDM E +WKKLDRKVLGT                                                      + AH+NEFD+
Subjt:  IVDYLHSKELELPLEKKPDDMEEAKWKKLDRKVLGTY-----------------------------------------------------VAAHINEFDM

Query:  LINKLVAVDLTFTDELNAILLLRSLPNSWEPMKATISNSCGKEKLKLAYVRDTALGEEIRRKNSSIASTS------------------------------
        LINKLVAVDL F+ E+ AILLLRSLP+SWEPM+A ISNSC KEKLK   VRD AL EEIRRK+S IA TS                              
Subjt:  LINKLVAVDLTFTDELNAILLLRSLPNSWEPMKATISNSCGKEKLKLAYVRDTALGEEIRRKNSSIASTS------------------------------

Query:  ------------GHLRRNCKAPKKTEGKEAGANVVAEEIHDALVLAVE----------GNHGNVYLADGEPLDIIGIGDVNLKMANSSVWKIRKVRHVQN
                    GHL+ NCKAPKK EG EA AN VAE+IHDALV+AVE          GNHG VYLADGEPLDIIGIG+VNLKMAN SVWKIRK      
Subjt:  ------------GHLRRNCKAPKKTEGKEAGANVVAEEIHDALVLAVE----------GNHGNVYLADGEPLDIIGIGDVNLKMANSSVWKIRKVRHVQN

Query:  MMKNLISVGQLDNEGCEISF
                  LDNEGCEISF
Subjt:  MMKNLISVGQLDNEGCEISF

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.2e-2227.67Show/hide
Query:  MSPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVH
        MS VK +V KF+G   F  WQ +++D+LIQ GLHK L                             K  +M  EDW ++D R ASAIR  L+ +++ N+ 
Subjt:  MSPVKIDVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVH

Query:  GISTAKERSEKLEALYQAKAIE--------------------------------------VKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDAT
           TA+    +LE+LY +K +                                       VKI++EDKA+  + SLP  Y+++   +++GK  +   D T
Subjt:  GISTAKERSEKLEALYQAKAIE--------------------------------------VKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDAT

Query:  SKLLLEERRLK----------SEGR-----TSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPN---RAGSSKGSRQDADSVSLIRGDDDLFL
        S LLL E+  K          +EGR      S  +     +  K+K  S  +   C+ C Q GH K++CPN     G + G + D ++ ++++ +D++ L
Subjt:  SKLLLEERRLK----------SEGR-----TSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPN---RAGSSKGSRQDADSVSLIRGDDDLFL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAGAGAAGATAAATTGGTTATTTTTTATGGAACTGATTTTTCGTACTGGAAGGATCAGATAGTAGATTATCTACATTCCAAAGAATTGGAATTGCCATTAGAGAA
GAAGCCGGATGACATGGAAGAAGCCAAATGGAAAAAGTTGGACAGGAAGGTGTTGGGTACATATGTGGCTGCCCATATAAATGAATTTGATATGTTGATTAACAAACTGG
TTGCTGTGGATTTAACATTTACGGATGAATTAAATGCTATCTTGTTGTTGAGATCCTTACCTAACAGTTGGGAGCCTATGAAGGCAACTATTTCAAATTCTTGTGGAAAA
GAGAAATTGAAATTAGCATATGTCAGAGATACAGCTCTTGGAGAGGAGATTCGCAGAAAGAATTCTAGTATTGCGTCTACTTCTGGACATCTGAGGAGGAACTGCAAAGC
CCCAAAGAAAACTGAGGGGAAAGAAGCTGGTGCAAATGTTGTTGCTGAAGAAATACATGATGCTCTAGTTCTTGCAGTTGAGGGAAATCATGGAAATGTGTATCTTGCTG
ATGGAGAGCCTTTGGACATCATTGGGATTGGTGACGTTAATTTAAAAATGGCGAACAGTTCAGTCTGGAAGATTCGCAAGGTACGTCACGTTCAGAATATGATGAAAAAC
CTGATTTCCGTGGGGCAGCTTGATAATGAAGGATGTGAAATATCCTTCGAGATTGAAGATCATAATACAATTACTCTTGAAGAAACAGCTGTGGGATCTGATGAACAAGT
TGAGGAATCTGATGCACCAGTTGTGGAAACTGATCAGGTTACCCTAGCCTCTACAGCAACCAAAAGCGATAGTGAAAGAGTAGAATTCAAATCCCAAGAAAAGTCAGGAA
TTGCGCCTGGTACATTTTCCCAACATAGTGTGTTTTCCATGTTTTGCATCAAAACTAAGGAGTGTAAAGTAGGTATGTCAATCTTTATGAGTCCAGTGAAGATTGACGTG
GAGAAATTTGACGGAAGGATCAACTTCGGTTTGTGGCAAGTGCAAGTCAAGGATGTGCTTATACAATCTGGGTTACACAAGGCGTTGAAGGGAAGACCGAGTAAAGGTTC
TTCTAAAAAGCTAAGCAATGATGGTGGTGCAATGGAGTCCAGTGGTGGTTCCAGCAGAGGTTCTAAAAAGTCCAGCATGAGTGATGAAGATTGGGAGGAAATGGATTTGA
GGACTGCAAGTGCAATACGAACAGGTTTGGCTAAGAATATTCTTGCGAATGTGCATGGAATTTCGACTGCCAAAGAACGTTCGGAGAAGCTCGAAGCGTTGTATCAGGCA
AAGGCGATCGAAGTGAAGATAGATGACGAAGATAAAGCACTTAGGTTCATCTTATCACTTCCACCTTTTTATGAACACATGAAGCCGATCTTGATGTATGGGAAGGATAA
TTTGAATTTTGTCGATGCTACTAGTAAACTCTTGTTAGAGGAAAGAAGGTTGAAGAGTGAAGGACGTACTTCACATGAAGATTCGACACTGGTAACTAGCAATTGGAAGA
ATAAGAAAGACTCCGCACAAAAGAAAACTTGTTGCTGGGGATGCGGACAGTCTGGGCACATGAAGAAAAATTGCCCCAATAGAGCCGGTTCGTCAAAAGGCTCTAGGCAG
GATGCTGACAGTGTTTCTCTCATCAGGGGAGATGATGATCTCTTCCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTAGAGAAGATAAATTGGTTATTTTTTATGGAACTGATTTTTCGTACTGGAAGGATCAGATAGTAGATTATCTACATTCCAAAGAATTGGAATTGCCATTAGAGAA
GAAGCCGGATGACATGGAAGAAGCCAAATGGAAAAAGTTGGACAGGAAGGTGTTGGGTACATATGTGGCTGCCCATATAAATGAATTTGATATGTTGATTAACAAACTGG
TTGCTGTGGATTTAACATTTACGGATGAATTAAATGCTATCTTGTTGTTGAGATCCTTACCTAACAGTTGGGAGCCTATGAAGGCAACTATTTCAAATTCTTGTGGAAAA
GAGAAATTGAAATTAGCATATGTCAGAGATACAGCTCTTGGAGAGGAGATTCGCAGAAAGAATTCTAGTATTGCGTCTACTTCTGGACATCTGAGGAGGAACTGCAAAGC
CCCAAAGAAAACTGAGGGGAAAGAAGCTGGTGCAAATGTTGTTGCTGAAGAAATACATGATGCTCTAGTTCTTGCAGTTGAGGGAAATCATGGAAATGTGTATCTTGCTG
ATGGAGAGCCTTTGGACATCATTGGGATTGGTGACGTTAATTTAAAAATGGCGAACAGTTCAGTCTGGAAGATTCGCAAGGTACGTCACGTTCAGAATATGATGAAAAAC
CTGATTTCCGTGGGGCAGCTTGATAATGAAGGATGTGAAATATCCTTCGAGATTGAAGATCATAATACAATTACTCTTGAAGAAACAGCTGTGGGATCTGATGAACAAGT
TGAGGAATCTGATGCACCAGTTGTGGAAACTGATCAGGTTACCCTAGCCTCTACAGCAACCAAAAGCGATAGTGAAAGAGTAGAATTCAAATCCCAAGAAAAGTCAGGAA
TTGCGCCTGGTACATTTTCCCAACATAGTGTGTTTTCCATGTTTTGCATCAAAACTAAGGAGTGTAAAGTAGGTATGTCAATCTTTATGAGTCCAGTGAAGATTGACGTG
GAGAAATTTGACGGAAGGATCAACTTCGGTTTGTGGCAAGTGCAAGTCAAGGATGTGCTTATACAATCTGGGTTACACAAGGCGTTGAAGGGAAGACCGAGTAAAGGTTC
TTCTAAAAAGCTAAGCAATGATGGTGGTGCAATGGAGTCCAGTGGTGGTTCCAGCAGAGGTTCTAAAAAGTCCAGCATGAGTGATGAAGATTGGGAGGAAATGGATTTGA
GGACTGCAAGTGCAATACGAACAGGTTTGGCTAAGAATATTCTTGCGAATGTGCATGGAATTTCGACTGCCAAAGAACGTTCGGAGAAGCTCGAAGCGTTGTATCAGGCA
AAGGCGATCGAAGTGAAGATAGATGACGAAGATAAAGCACTTAGGTTCATCTTATCACTTCCACCTTTTTATGAACACATGAAGCCGATCTTGATGTATGGGAAGGATAA
TTTGAATTTTGTCGATGCTACTAGTAAACTCTTGTTAGAGGAAAGAAGGTTGAAGAGTGAAGGACGTACTTCACATGAAGATTCGACACTGGTAACTAGCAATTGGAAGA
ATAAGAAAGACTCCGCACAAAAGAAAACTTGTTGCTGGGGATGCGGACAGTCTGGGCACATGAAGAAAAATTGCCCCAATAGAGCCGGTTCGTCAAAAGGCTCTAGGCAG
GATGCTGACAGTGTTTCTCTCATCAGGGGAGATGATGATCTCTTCCTTTGA
Protein sequenceShow/hide protein sequence
MTREDKLVIFYGTDFSYWKDQIVDYLHSKELELPLEKKPDDMEEAKWKKLDRKVLGTYVAAHINEFDMLINKLVAVDLTFTDELNAILLLRSLPNSWEPMKATISNSCGK
EKLKLAYVRDTALGEEIRRKNSSIASTSGHLRRNCKAPKKTEGKEAGANVVAEEIHDALVLAVEGNHGNVYLADGEPLDIIGIGDVNLKMANSSVWKIRKVRHVQNMMKN
LISVGQLDNEGCEISFEIEDHNTITLEETAVGSDEQVEESDAPVVETDQVTLASTATKSDSERVEFKSQEKSGIAPGTFSQHSVFSMFCIKTKECKVGMSIFMSPVKIDV
EKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPSKGSSKKLSNDGGAMESSGGSSRGSKKSSMSDEDWEEMDLRTASAIRTGLAKNILANVHGISTAKERSEKLEALYQA
KAIEVKIDDEDKALRFILSLPPFYEHMKPILMYGKDNLNFVDATSKLLLEERRLKSEGRTSHEDSTLVTSNWKNKKDSAQKKTCCWGCGQSGHMKKNCPNRAGSSKGSRQ
DADSVSLIRGDDDLFL