; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G012290 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G012290
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr14:10439008..10440722
RNA-Seq ExpressionCmoCh14G012290
SyntenyCmoCh14G012290
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016020 - membrane (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3636042.1 hypothetical protein FXO37_25670 [Capsicum annuum]9.6e-4850.38Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        M  E+WKLKDRQALG IRLTLSRN+ FNI KEKTTSDLLKALSNMYEK  AMNKVYLM  L+NLQ+S+ G VA HINEFN+IVSQL SV+INFED+IK L
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLPESWDTVVAATGSS--RVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARS------AYFRSSPNKELFRNFKSGIFDK---------
        ILM SLPESWDT+VAA  SS  +   K K  +  D  +S +    +IGD     LS++    S      A F SSP+KE+F+NFK   F K         
Subjt:  ILMSSLPESWDTVVAATGSS--RVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARS------AYFRSSPNKELFRNFKSGIFDK---------

Query:  --------------------------------------LDSTGYATKFGKSSWKIVKSAMCL
                                              LDSTGY  +FGK  WK+VK AM +
Subjt:  --------------------------------------LDSTGYATKFGKSSWKIVKSAMCL

KAF3643966.1 Pleiotropic drug resistance protein 1 [Capsicum annuum]7.3e-5643.87Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        M  + WKLKDRQ LGLIRLTLSRN+ FNI+KEKTTSDLLKALSNMYEKPSA NKVYLMR L+NLQM E G VA HINEFNMIVSQL SV+INFED+IKAL
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRAR----------------------------------------
        ILMSSLPE   T+V A  SS  S KLKFD+I+DVV S+SIRKREIG+ SG+ALSVDRR R                                        
Subjt:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRAR----------------------------------------

Query:  --------------------------------------------SAYFRSSPNKELFRNFKSGIFDK---------------------------------
                                                     A F   P+KELF+NFK G F K                                 
Subjt:  --------------------------------------------SAYFRSSPNKELFRNFKSGIFDK---------------------------------

Query:  --------------LDSTGYATKFGKSSWKIVKSAMCLTAKELKKGSGTTKQVEVEVELLKDSTSDV
                      LDSTGYAT+FGK SWKI+K AM + A+  K G+  T    + + ++ +  SD+
Subjt:  --------------LDSTGYATKFGKSSWKIVKSAMCLTAKELKKGSGTTKQVEVEVELLKDSTSDV

KAG7011443.1 hypothetical protein SDJN02_26349, partial [Cucurbita argyrosperma subsp. argyrosperma]5.6e-6461.85Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        MTTEQWKLKDRQAL LIRLTLSRN  FNIIKEKTTSDLLKALSNMYEK SAMNKVYLMR L+NLQMSEGG +A +INEFNMIVS+LS VEINF+D+IKAL
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVD--RRARSAYFRSSPNKELF---RNFKSGIFDKLDS---------
        ILMSSLPESWDTVVAA  SSR S+KLKFDEI+D+VL ESIR R+ GD SG ALS D  +  +    +S  + +      + +  +   +DS         
Subjt:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVD--RRARSAYFRSSPNKELF---RNFKSGIFDKLDS---------

Query:  TGYATKFGKSSWKIVKSAMCLTAKELKKGSGTTKQVEVEVELLKDSTSD
         GYA +FGKSSWKIVK AM + A+  K G+  T    + +     STS+
Subjt:  TGYATKFGKSSWKIVKSAMCLTAKELKKGSGTTKQVEVEVELLKDSTSD

VFQ59121.1 unnamed protein product [Cuscuta campestris]2.1e-5874.85Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        MT EQWK+KDRQALG+IRLTL++N+ FNI+KE TT+ L+KALSN+YEKPSAMNKVYLMR L+NLQM E G VA HIN+FNMIVSQL  VEINFED+IK L
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAY
        IL+SS+PESWD VVAA  SSR S KL+FDEI+DVVLSESIRKRE+ D SG+ALSVDRR R  +
Subjt:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAY

VFR00719.1 unnamed protein product [Cuscuta campestris]1.8e-5438Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        MT EQWKLKDRQALG+IRLTL++N+ FNI+KE TT+ L+KALSNMYEKPSAMNKVYLMR L+NLQM E G VA HIN+FNMIVSQL SVEINFED+IKAL
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLP------------------------------------------------------------------------------------------ESW
        IL+SS+P                                                                                          ESW
Subjt:  ILMSSLP------------------------------------------------------------------------------------------ESW

Query:  DTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAY-------------FRSSPNKELF---------------------RNF
        DTVVAA  SSR S KL+FDEI+DVVLSESIRKRE+GD SG+ALSVD++ RS +                SPN+                        +N 
Subjt:  DTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAY-------------FRSSPNKELF---------------------RNF

Query:  KSGIFDKLDSTGYATKFGKS-----------SWKIVKSAMCLTAKELKKGSG--------------------------------------TTKQVEVEVE
        KSG  D  DS   A   G +            W +V  ++   A+   K S                                       +TKQV VEVE
Subjt:  KSGIFDKLDSTGYATKFGKS-----------SWKIVKSAMCLTAKELKKGSG--------------------------------------TTKQVEVEVE

Query:  LLKDSTSDV-ADTQETLETVVEELEILKKTPETIAEELEVEQVTLEKVLKRLSRTIRVPDRYVPSL-------HYQLVTAEGERKPLDEALQLEDTTKWE
        L K +  +V A+TQ               TP+TI EE EVEQVT E+VL+R SR  RVPDR V  L       HY+   A      L  +  +   +KWE
Subjt:  LLKDSTSDV-ADTQETLETVVEELEILKKTPETIAEELEVEQVTLEKVLKRLSRTIRVPDRYVPSL-------HYQLVTAEGERKPLDEALQLEDTTKWE

TrEMBL top hitse value%identityAlignment
A0A438IVL1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-4734.81Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        M  E+W L DRQ LG+IRLTLSR++  N++KEKTT+DL+KALS MYEK  A NKV+LM+ L+NL+M E   V  H+NEFN I +QLSSVEI+F+D I+AL
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPS--GNALSVDRRARSAYFRSSPNKELFRNFKSGIFDK---------------
        I+++S P SW+ +  A  +S    KLK+++IQD++L E I +R+  + S  G+AL+++ R + A F ++P++E+ +N+ +  F K               
Subjt:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPS--GNALSVDRRARSAYFRSSPNKELFRNFKSGIFDK---------------

Query:  --------------------------------LDSTGYATKFGKSSWKIVKSAMCLTAKELKKGSGTTKQVEVEVELLKDSTSDVADTQETLETVVEELE
                                        LD  G+A      +WK+ K A  L      K +GT        + +  + +D  +  ++L+ V     
Subjt:  --------------------------------LDSTGYATKFGKSSWKIVKSAMCLTAKELKKGSGTTKQVEVEVELLKDSTSDVADTQETLETVVEELE

Query:  ILK-------KTPETIAEELEVEQVTLEKVLKRLSRTIRVPDRYVPSLHYQLVTAEGERKPLDEALQLEDTTKWEQAMDDGMSRL
        +++       +  E +  ++++     E  ++R S+ IR P  Y   L+Y L+T  GE +  DEALQ E+++KWE AM D M+ L
Subjt:  ILK-------KTPETIAEELEVEQVTLEKVLKRLSRTIRVPDRYVPSLHYQLVTAEGERKPLDEALQLEDTTKWEQAMDDGMSRL

A0A484K039 Uncharacterized protein1.0e-5874.85Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        MT EQWK+KDRQALG+IRLTL++N+ FNI+KE TT+ L+KALSN+YEKPSAMNKVYLMR L+NLQM E G VA HIN+FNMIVSQL  VEINFED+IK L
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAY
        IL+SS+PESWD VVAA  SSR S KL+FDEI+DVVLSESIRKRE+ D SG+ALSVDRR R  +
Subjt:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAY

A0A484NK44 CCHC-type domain-containing protein8.7e-5538Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        MT EQWKLKDRQALG+IRLTL++N+ FNI+KE TT+ L+KALSNMYEKPSAMNKVYLMR L+NLQM E G VA HIN+FNMIVSQL SVEINFED+IKAL
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLP------------------------------------------------------------------------------------------ESW
        IL+SS+P                                                                                          ESW
Subjt:  ILMSSLP------------------------------------------------------------------------------------------ESW

Query:  DTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAY-------------FRSSPNKELF---------------------RNF
        DTVVAA  SSR S KL+FDEI+DVVLSESIRKRE+GD SG+ALSVD++ RS +                SPN+                        +N 
Subjt:  DTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAY-------------FRSSPNKELF---------------------RNF

Query:  KSGIFDKLDSTGYATKFGKS-----------SWKIVKSAMCLTAKELKKGSG--------------------------------------TTKQVEVEVE
        KSG  D  DS   A   G +            W +V  ++   A+   K S                                       +TKQV VEVE
Subjt:  KSGIFDKLDSTGYATKFGKS-----------SWKIVKSAMCLTAKELKKGSG--------------------------------------TTKQVEVEVE

Query:  LLKDSTSDV-ADTQETLETVVEELEILKKTPETIAEELEVEQVTLEKVLKRLSRTIRVPDRYVPSL-------HYQLVTAEGERKPLDEALQLEDTTKWE
        L K +  +V A+TQ               TP+TI EE EVEQVT E+VL+R SR  RVPDR V  L       HY+   A      L  +  +   +KWE
Subjt:  LLKDSTSDV-ADTQETLETVVEELEILKKTPETIAEELEVEQVTLEKVLKRLSRTIRVPDRYVPSL-------HYQLVTAEGERKPLDEALQLEDTTKWE

A0A5B7BAK4 Uncharacterized protein2.5e-4958.99Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        M  E W L DRQALG++RLTL+RN+ FNI KEKTT+ L+ ALSNMYEKPSA NKVYLMR L+NL+MSEG  VA H+NEFN++ +QLSSVEI F+D+I+AL
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAYFRSSPNKELFRNFKS
        IL+SSLPESW+  V A  SS  + KLK+D+++D++LSE IR+RE G+ SG+AL+V+ R R+    S  ++   R  +S
Subjt:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAYFRSSPNKELFRNFKS

A0A5B7BAK4 Uncharacterized protein1.0e-0232.89Show/hide
Query:  GYAT-KFGKSSW-----KIVKSA-MCLTAKELKKGSGTTKQVEVEVELLKDSTSDVADTQETLETVVEELEILKKTPETIAEELEVEQVTLEKVLKRLSR
        GY T +FG   W     KI +S  +    K L K     +    +    K    ++ +  E+     E    ++  PE I    +VE VT    L+R SR
Subjt:  GYAT-KFGKSSW-----KIVKSA-MCLTAKELKKGSGTTKQVEVEVELLKDSTSDVADTQETLETVVEELEILKKTPETIAEELEVEQVTLEKVLKRLSR

Query:  TIRVPDRYVPSLHYQLVTAEGERKPLDEALQLEDTTKWEQAMDDGMSRL
          + P  Y PSL+Y L++  GE +  DEALQ+ D+ KWE AM D M  L
Subjt:  TIRVPDRYVPSLHYQLVTAEGERKPLDEALQLEDTTKWEQAMDDGMSRL

A0A5B7BAK4 Uncharacterized protein6.1e-4878.17Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        M  + WKLKDRQALGLIRLTLSRN+ FNI+KEKTTSDLLKALSNMYEKPSA NKVYLMR L+NLQM E G VA HINEFN+IVSQL SV+INFED+IKAL
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRK
        ILMSSLPE   T+VAA  S   S KLKFD+I+DVV SESIRK
Subjt:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-1836.09Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL
        M  E W   D +A   IRL LS ++  NII E T   +   L ++Y   +  NK+YL + LY L MSEG     H+N FN +++QL+++ +  E++ KA+
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKAL

Query:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAYFRSSPN
        +L++SLP S+D +       + + +LK D    ++L+E +RK+   +  G AL  + R RS Y RSS N
Subjt:  ILMSSLPESWDTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAYFRSSPN

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein1.8e-0745.45Show/hide
Query:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKV
        M+ + W +  RQ L +IRLT+S+N+  N+ KEK+   L+K LS++Y+KPS  N V
Subjt:  MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACGGAGCAGTGGAAGCTCAAGGATCGTCAGGCTTTAGGGTTGATCCGGTTGACGCTGTCTAGAAACATGACGTTCAATATTATCAAGGAGAAGACAACGTCAGA
TTTATTGAAGGCGTTGTCGAATATGTACGAGAAACCGTCGGCTATGAACAAGGTGTATTTGATGCGGACACTGTACAATCTACAAATGTCTGAAGGTGGATTTGTTGCTG
GTCATATAAACGAATTCAATATGATTGTAAGTCAACTGAGTTCGGTGGAAATTAATTTCGAGGATAAAATTAAAGCATTGATTTTGATGTCATCTTTACCCGAGTCGTGG
GATACGGTTGTTGCCGCAACCGGCAGTTCACGAGTATCTAATAAACTGAAGTTCGATGAAATTCAAGATGTAGTTCTCAGCGAAAGTATTCGCAAACGAGAAATCGGAGA
TCCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGCAAGAAGTGCATATTTTCGTTCGTCTCCAAATAAAGAGTTGTTTAGAAATTTCAAGTCTGGAATTTTTGATAAGT
TGGATAGCACAGGTTATGCAACAAAGTTTGGGAAGAGTTCGTGGAAGATTGTGAAGAGTGCTATGTGCTTGACTGCCAAAGAATTGAAGAAAGGTTCCGGGACAACGAAG
CAAGTGGAAGTTGAGGTTGAGTTGCTGAAAGATTCAACTAGTGATGTAGCAGATACTCAAGAAACTCTTGAGACTGTTGTTGAGGAACTAGAGATACTCAAAAAAACTCC
TGAGACTATTGCTGAGGAACTAGAAGTGGAGCAAGTGACACTTGAGAAGGTGTTGAAAAGATTATCCAGAACTATCAGAGTACCAGATAGGTATGTACCTTCTTTACACT
ATCAGTTGGTGACTGCTGAAGGGGAACGAAAGCCCCTTGATGAGGCCCTACAGTTGGAGGATACAACCAAGTGGGAGCAAGCCATGGATGATGGGATGTCTAGGCTTCAG
AAATGCGTTGCTCTTTCACCTGGTGAGGCTAAGTACATGGCAGTAGCTGAAGCTGGAAAGGAGATGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCACGGAGCAGTGGAAGCTCAAGGATCGTCAGGCTTTAGGGTTGATCCGGTTGACGCTGTCTAGAAACATGACGTTCAATATTATCAAGGAGAAGACAACGTCAGA
TTTATTGAAGGCGTTGTCGAATATGTACGAGAAACCGTCGGCTATGAACAAGGTGTATTTGATGCGGACACTGTACAATCTACAAATGTCTGAAGGTGGATTTGTTGCTG
GTCATATAAACGAATTCAATATGATTGTAAGTCAACTGAGTTCGGTGGAAATTAATTTCGAGGATAAAATTAAAGCATTGATTTTGATGTCATCTTTACCCGAGTCGTGG
GATACGGTTGTTGCCGCAACCGGCAGTTCACGAGTATCTAATAAACTGAAGTTCGATGAAATTCAAGATGTAGTTCTCAGCGAAAGTATTCGCAAACGAGAAATCGGAGA
TCCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGCAAGAAGTGCATATTTTCGTTCGTCTCCAAATAAAGAGTTGTTTAGAAATTTCAAGTCTGGAATTTTTGATAAGT
TGGATAGCACAGGTTATGCAACAAAGTTTGGGAAGAGTTCGTGGAAGATTGTGAAGAGTGCTATGTGCTTGACTGCCAAAGAATTGAAGAAAGGTTCCGGGACAACGAAG
CAAGTGGAAGTTGAGGTTGAGTTGCTGAAAGATTCAACTAGTGATGTAGCAGATACTCAAGAAACTCTTGAGACTGTTGTTGAGGAACTAGAGATACTCAAAAAAACTCC
TGAGACTATTGCTGAGGAACTAGAAGTGGAGCAAGTGACACTTGAGAAGGTGTTGAAAAGATTATCCAGAACTATCAGAGTACCAGATAGGTATGTACCTTCTTTACACT
ATCAGTTGGTGACTGCTGAAGGGGAACGAAAGCCCCTTGATGAGGCCCTACAGTTGGAGGATACAACCAAGTGGGAGCAAGCCATGGATGATGGGATGTCTAGGCTTCAG
AAATGCGTTGCTCTTTCACCTGGTGAGGCTAAGTACATGGCAGTAGCTGAAGCTGGAAAGGAGATGATATGA
Protein sequenceShow/hide protein sequence
MTTEQWKLKDRQALGLIRLTLSRNMTFNIIKEKTTSDLLKALSNMYEKPSAMNKVYLMRTLYNLQMSEGGFVAGHINEFNMIVSQLSSVEINFEDKIKALILMSSLPESW
DTVVAATGSSRVSNKLKFDEIQDVVLSESIRKREIGDPSGNALSVDRRARSAYFRSSPNKELFRNFKSGIFDKLDSTGYATKFGKSSWKIVKSAMCLTAKELKKGSGTTK
QVEVEVELLKDSTSDVADTQETLETVVEELEILKKTPETIAEELEVEQVTLEKVLKRLSRTIRVPDRYVPSLHYQLVTAEGERKPLDEALQLEDTTKWEQAMDDGMSRLQ
KCVALSPGEAKYMAVAEAGKEMI