; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g27800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g27800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:20817281..20819895
RNA-Seq ExpressionMoc09g27800
SyntenyMoc09g27800
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]7.6e-8458.08Show/hide
Query:  MNPPNSHPRQPIPPNVRIEEIVDRVPVA-GPEVAVPLLNVVLLADNIDREIRVEKMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEV------
        MNPPN +  QPIPPNVRIEEIVD VPVA   EV VP LNVVLLA  IDREIR      T   F+   TE   +  KF +       DEG +KEV      
Subjt:  MNPPNSHPRQPIPPNVRIEEIVDRVPVA-GPEVAVPLLNVVLLADNIDREIRVEKMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEV------

Query:  --------------------------------------KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDA
                                              KNAKYRS+INNFQQF  ESV+ESWEHFK+L+QKC HH IPRCI I++YYN LDDATRLV   
Subjt:  --------------------------------------KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDA

Query:  STNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKN
        S N ALLAK Y EAFNILERISSN HS SD RAIQGRG+K LN+SKS+ST NSKI+N+IDL+ RSMTQ ST+ A TGK N SH QG SYSF  G HH  N
Subjt:  STNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKN

Query:  CPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHP
        CP NPESVY LGN  N+ NN YS+TYNP  RNHP
Subjt:  CPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHP

XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia]8.9e-10977.57Show/hide
Query:  KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRG
        KNAKYRSEI NFQQ   ESV+ESWE FKQLLQKCPHHGIPRCIQI+ YY  LDDATRLVIDASTNGALL K Y EAFNILERISSNNHSWSDPRAIQGRG
Subjt:  KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRG

Query:  SKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSW
         KGLN+S+S+  LNSK++NL +L+MRSMTQ +T+ AS GK NVSHIQGIS SFCEGEHH  N P NPESVYYLGN QNNG N YS+TYNP WRNHP FSW
Subjt:  SKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSW

Query:  SENQGGNSAGMSNAPAYQQKGNYSLGFSNQGQVAVQRQFEGSFASLENLIKHYMKNNDTTVQS
        S NQGGN+AG SNAPAYQQK +Y   FSNQGQV VQ   EGSFASLENL+K  M+ ND TVQS
Subjt:  SENQGGNSAGMSNAPAYQQKGNYSLGFSNQGQVAVQRQFEGSFASLENLIKHYMKNNDTTVQS

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]9.6e-6347.5Show/hide
Query:  KMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEVKNAKYRSEINNFQQFAW-ESVSE----SWEHF------------KQLLQKCPHHGIPRCI
        +M+  V QFH H TE PH HLKFFMGV NSFKDEG SK V   K  S     +   W ES+S     SW+              K+L Q+CP+HGIP  I
Subjt:  KMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEVKNAKYRSEINNFQQFAW-ESVSE----SWEHF------------KQLLQKCPHHGIPRCI

Query:  QIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNV
        QI+ YY  LD+ATRLVIDAS NGALL K Y +A NILERISS+NHSWSD RAI+G+ SK L +S+S++TLNSKI+ L DL                    
Subjt:  QIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNV

Query:  SHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSWSENQGGNSAGMSNAPAYQQKGNYSLGFSNQGQVAVQRQFEGSF
                                          NN N  YS+TYNP  RNHP F WS NQGG++ G+SNAP +QQK +Y  GF+ QGQ+    Q +GS 
Subjt:  SHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSWSENQGGNSAGMSNAPAYQQKGNYSLGFSNQGQVAVQRQFEGSF

Query:  ASLENLIKHYMKNNDTTVQS
         SLEN++K YM NND TVQS
Subjt:  ASLENLIKHYMKNNDTTVQS

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]2.1e-9757.06Show/hide
Query:  KMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEV--------------------------------------------KNAKYRSEINNFQQFA
        +MLQTVDQFH H TEDPH HLKFFMGVCNSFK+EG S EV                                            KNAKYRSEINNFQQFA
Subjt:  KMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEV--------------------------------------------KNAKYRSEINNFQQFA

Query:  WESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSK
         ESVSESWE FK+LLQ CPHHGIPRCIQI+ YY DL+DATRL                                 DPRA+QG+ SKGL +S+S++TLNS 
Subjt:  WESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSK

Query:  IKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSWSENQGGNSAGMSNAPA
        I+NL  L+MRSM Q S++ A TG  NV+ IQGIS SFCEG+HH  NCP NPESVYYLGNPQNN NN YS+TYNP WRNHP FSWS +QGG++AG S+APA
Subjt:  IKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSWSENQGGNSAGMSNAPA

Query:  YQQKGNYSLGFSNQGQVAVQRQFEGSFASLENLIKHYMKNNDTTVQS
        +Q K +Y  GF NQGQ+  +RQ EGS ASLE L+K YM NND TVQS
Subjt:  YQQKGNYSLGFSNQGQVAVQRQFEGSFASLENLIKHYMKNNDTTVQS

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia]3.0e-7277.3Show/hide
Query:  KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRG
        KNAKYRSEINNFQQF  ESVSESWE FK+L+QK  + GIPRCIQIK YYN LDDATRLVIDAS NGALLAK Y EAFNILERISSNN SWSDPRAI G+G
Subjt:  KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRG

Query:  SKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYS
        SKG N+S+SF+ LN KI+NL DL+MRSMT  ST+ AS GK NVSHIQGIS SFC GE+   NCP NPESV+YLGN QNN NNPYS
Subjt:  SKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYS

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189103.7e-8458.08Show/hide
Query:  MNPPNSHPRQPIPPNVRIEEIVDRVPVA-GPEVAVPLLNVVLLADNIDREIRVEKMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEV------
        MNPPN +  QPIPPNVRIEEIVD VPVA   EV VP LNVVLLA  IDREIR      T   F+   TE   +  KF +       DEG +KEV      
Subjt:  MNPPNSHPRQPIPPNVRIEEIVDRVPVA-GPEVAVPLLNVVLLADNIDREIRVEKMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEV------

Query:  --------------------------------------KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDA
                                              KNAKYRS+INNFQQF  ESV+ESWEHFK+L+QKC HH IPRCI I++YYN LDDATRLV   
Subjt:  --------------------------------------KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDA

Query:  STNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKN
        S N ALLAK Y EAFNILERISSN HS SD RAIQGRG+K LN+SKS+ST NSKI+N+IDL+ RSMTQ ST+ A TGK N SH QG SYSF  G HH  N
Subjt:  STNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKN

Query:  CPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHP
        CP NPESVY LGN  N+ NN YS+TYNP  RNHP
Subjt:  CPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHP

A0A6J1DRG1 uncharacterized protein LOC1110236694.3e-10977.57Show/hide
Query:  KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRG
        KNAKYRSEI NFQQ   ESV+ESWE FKQLLQKCPHHGIPRCIQI+ YY  LDDATRLVIDASTNGALL K Y EAFNILERISSNNHSWSDPRAIQGRG
Subjt:  KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRG

Query:  SKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSW
         KGLN+S+S+  LNSK++NL +L+MRSMTQ +T+ AS GK NVSHIQGIS SFCEGEHH  N P NPESVYYLGN QNNG N YS+TYNP WRNHP FSW
Subjt:  SKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSW

Query:  SENQGGNSAGMSNAPAYQQKGNYSLGFSNQGQVAVQRQFEGSFASLENLIKHYMKNNDTTVQS
        S NQGGN+AG SNAPAYQQK +Y   FSNQGQV VQ   EGSFASLENL+K  M+ ND TVQS
Subjt:  SENQGGNSAGMSNAPAYQQKGNYSLGFSNQGQVAVQRQFEGSFASLENLIKHYMKNNDTTVQS

A0A6J1DWK1 uncharacterized protein LOC1110250534.7e-6347.5Show/hide
Query:  KMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEVKNAKYRSEINNFQQFAW-ESVSE----SWEHF------------KQLLQKCPHHGIPRCI
        +M+  V QFH H TE PH HLKFFMGV NSFKDEG SK V   K  S     +   W ES+S     SW+              K+L Q+CP+HGIP  I
Subjt:  KMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEVKNAKYRSEINNFQQFAW-ESVSE----SWEHF------------KQLLQKCPHHGIPRCI

Query:  QIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNV
        QI+ YY  LD+ATRLVIDAS NGALL K Y +A NILERISS+NHSWSD RAI+G+ SK L +S+S++TLNSKI+ L DL                    
Subjt:  QIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNV

Query:  SHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSWSENQGGNSAGMSNAPAYQQKGNYSLGFSNQGQVAVQRQFEGSF
                                          NN N  YS+TYNP  RNHP F WS NQGG++ G+SNAP +QQK +Y  GF+ QGQ+    Q +GS 
Subjt:  SHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSWSENQGGNSAGMSNAPAYQQKGNYSLGFSNQGQVAVQRQFEGSF

Query:  ASLENLIKHYMKNNDTTVQS
         SLEN++K YM NND TVQS
Subjt:  ASLENLIKHYMKNNDTTVQS

A0A6J1DXK5 uncharacterized protein LOC1110255001.4e-7277.3Show/hide
Query:  KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRG
        KNAKYRSEINNFQQF  ESVSESWE FK+L+QK  + GIPRCIQIK YYN LDDATRLVIDAS NGALLAK Y EAFNILERISSNN SWSDPRAI G+G
Subjt:  KNAKYRSEINNFQQFAWESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRG

Query:  SKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYS
        SKG N+S+SF+ LN KI+NL DL+MRSMT  ST+ AS GK NVSHIQGIS SFC GE+   NCP NPESV+YLGN QNN NNPYS
Subjt:  SKGLNKSKSFSTLNSKIKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYS

A0A6J1E1F3 uncharacterized protein LOC1110250659.9e-9857.06Show/hide
Query:  KMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEV--------------------------------------------KNAKYRSEINNFQQFA
        +MLQTVDQFH H TEDPH HLKFFMGVCNSFK+EG S EV                                            KNAKYRSEINNFQQFA
Subjt:  KMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEV--------------------------------------------KNAKYRSEINNFQQFA

Query:  WESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSK
         ESVSESWE FK+LLQ CPHHGIPRCIQI+ YY DL+DATRL                                 DPRA+QG+ SKGL +S+S++TLNS 
Subjt:  WESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSK

Query:  IKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSWSENQGGNSAGMSNAPA
        I+NL  L+MRSM Q S++ A TG  NV+ IQGIS SFCEG+HH  NCP NPESVYYLGNPQNN NN YS+TYNP WRNHP FSWS +QGG++AG S+APA
Subjt:  IKNLIDLIMRSMTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSWSENQGGNSAGMSNAPA

Query:  YQQKGNYSLGFSNQGQVAVQRQFEGSFASLENLIKHYMKNNDTTVQS
        +Q K +Y  GF NQGQ+  +RQ EGS ASLE L+K YM NND TVQS
Subjt:  YQQKGNYSLGFSNQGQVAVQRQFEGSFASLENLIKHYMKNNDTTVQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCACCTAACTCACATCCGCGCCAGCCTATTCCACCGAATGTGAGGATTGAGGAAATAGTAGATAGGGTTCCTGTTGCTGGCCCTGAGGTAGCAGTGCCCCTTCT
CAATGTTGTATTACTAGCAGATAACATCGACAGAGAAATCAGGGTAGAGAAGATGCTCCAGACAGTGGACCAATTTCACAGACATCCTACGGAGGACCCGCATTCACATC
TGAAATTTTTCATGGGAGTTTGCAATTCGTTTAAGGATGAAGGATGCAGCAAAGAAGTCAAAAATGCTAAGTACAGAAGCGAAATCAACAATTTTCAGCAATTTGCTTGG
GAGTCGGTTAGTGAGTCTTGGGAGCATTTTAAACAGCTTTTGCAAAAATGTCCTCACCATGGGATCCCAAGATGCATCCAGATCAAAATATACTATAATGATCTGGATGA
TGCCACACGTCTGGTCATTGATGCCTCAACAAATGGCGCATTGCTAGCAAAGTCTTATAATGAAGCTTTCAACATTCTGGAGAGGATATCATCGAACAATCATTCATGGT
CTGACCCTAGAGCCATCCAAGGGAGAGGAAGCAAGGGACTGAACAAATCAAAGTCATTTTCTACATTGAATTCAAAGATTAAGAATCTGATAGACTTGATTATGAGGAGC
ATGACACAACATAGTACGATGAGAGCATCTACTGGTAAGACAAATGTTAGCCACATCCAAGGAATTTCTTACTCTTTTTGTGAAGGAGAGCATCATTGCAAGAATTGCCC
TAGCAATCCAGAGTCGGTGTACTATTTGGGAAATCCTCAGAATAATGGAAATAACCCGTATTCACATACATATAACCCGGACTGGAGGAACCATCCTAAATTTAGTTGGA
GCGAAAATCAGGGAGGAAATAGTGCTGGAATGTCCAATGCTCCGGCATACCAGCAAAAAGGAAACTATTCCCTAGGTTTTTCTAACCAAGGTCAGGTAGCAGTGCAAAGG
CAATTTGAAGGATCATTCGCATCTTTGGAGAATCTGATCAAACACTATATGAAAAATAATGATACCACGGTGCAAAGTCTGAAGGGATGTGAATCCCGGAGCGGAAGCAG
ATTTTCGTTGCTGATTTTCGGGATACAAAATCTAGTTTTAAAGAAGAAAAATACAGGGGAAACCATTACCTTTGAAGAACTTTTCTTCTACGGTTTCCCACGAACACCAC
CACTACGTCTTCCTCGCTATCCTCTTGGGTCTCGGGATCGTGCTTGTGGTAGCACCTTGATCGAAGAGAGGGAGAGATTGAAAGATTACATTTTACTCCTTAAACACCAC
TGCTCCTCTAATGAACAACCTGATTATGGTCCAACCAATAACCGGAAATCCCTCTCGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGTTCCGGAGTCAGCATTTAAGGG
AACACTCATCTACTCCCTAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATATTGAGTCGG
CGACTCAGGCCACTCTCACCCATACAGATCAAAGAAGATTCCATAACTCACTCAGGATTGAGAATGAGTTGCCTGGTCATCCTAAGAAATGGCAATCTATTAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCCACCTAACTCACATCCGCGCCAGCCTATTCCACCGAATGTGAGGATTGAGGAAATAGTAGATAGGGTTCCTGTTGCTGGCCCTGAGGTAGCAGTGCCCCTTCT
CAATGTTGTATTACTAGCAGATAACATCGACAGAGAAATCAGGGTAGAGAAGATGCTCCAGACAGTGGACCAATTTCACAGACATCCTACGGAGGACCCGCATTCACATC
TGAAATTTTTCATGGGAGTTTGCAATTCGTTTAAGGATGAAGGATGCAGCAAAGAAGTCAAAAATGCTAAGTACAGAAGCGAAATCAACAATTTTCAGCAATTTGCTTGG
GAGTCGGTTAGTGAGTCTTGGGAGCATTTTAAACAGCTTTTGCAAAAATGTCCTCACCATGGGATCCCAAGATGCATCCAGATCAAAATATACTATAATGATCTGGATGA
TGCCACACGTCTGGTCATTGATGCCTCAACAAATGGCGCATTGCTAGCAAAGTCTTATAATGAAGCTTTCAACATTCTGGAGAGGATATCATCGAACAATCATTCATGGT
CTGACCCTAGAGCCATCCAAGGGAGAGGAAGCAAGGGACTGAACAAATCAAAGTCATTTTCTACATTGAATTCAAAGATTAAGAATCTGATAGACTTGATTATGAGGAGC
ATGACACAACATAGTACGATGAGAGCATCTACTGGTAAGACAAATGTTAGCCACATCCAAGGAATTTCTTACTCTTTTTGTGAAGGAGAGCATCATTGCAAGAATTGCCC
TAGCAATCCAGAGTCGGTGTACTATTTGGGAAATCCTCAGAATAATGGAAATAACCCGTATTCACATACATATAACCCGGACTGGAGGAACCATCCTAAATTTAGTTGGA
GCGAAAATCAGGGAGGAAATAGTGCTGGAATGTCCAATGCTCCGGCATACCAGCAAAAAGGAAACTATTCCCTAGGTTTTTCTAACCAAGGTCAGGTAGCAGTGCAAAGG
CAATTTGAAGGATCATTCGCATCTTTGGAGAATCTGATCAAACACTATATGAAAAATAATGATACCACGGTGCAAAGTCTGAAGGGATGTGAATCCCGGAGCGGAAGCAG
ATTTTCGTTGCTGATTTTCGGGATACAAAATCTAGTTTTAAAGAAGAAAAATACAGGGGAAACCATTACCTTTGAAGAACTTTTCTTCTACGGTTTCCCACGAACACCAC
CACTACGTCTTCCTCGCTATCCTCTTGGGTCTCGGGATCGTGCTTGTGGTAGCACCTTGATCGAAGAGAGGGAGAGATTGAAAGATTACATTTTACTCCTTAAACACCAC
TGCTCCTCTAATGAACAACCTGATTATGGTCCAACCAATAACCGGAAATCCCTCTCGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGTTCCGGAGTCAGCATTTAAGGG
AACACTCATCTACTCCCTAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATATTGAGTCGG
CGACTCAGGCCACTCTCACCCATACAGATCAAAGAAGATTCCATAACTCACTCAGGATTGAGAATGAGTTGCCTGGTCATCCTAAGAAATGGCAATCTATTAGTTAA
Protein sequenceShow/hide protein sequence
MNPPNSHPRQPIPPNVRIEEIVDRVPVAGPEVAVPLLNVVLLADNIDREIRVEKMLQTVDQFHRHPTEDPHSHLKFFMGVCNSFKDEGCSKEVKNAKYRSEINNFQQFAW
ESVSESWEHFKQLLQKCPHHGIPRCIQIKIYYNDLDDATRLVIDASTNGALLAKSYNEAFNILERISSNNHSWSDPRAIQGRGSKGLNKSKSFSTLNSKIKNLIDLIMRS
MTQHSTMRASTGKTNVSHIQGISYSFCEGEHHCKNCPSNPESVYYLGNPQNNGNNPYSHTYNPDWRNHPKFSWSENQGGNSAGMSNAPAYQQKGNYSLGFSNQGQVAVQR
QFEGSFASLENLIKHYMKNNDTTVQSLKGCESRSGSRFSLLIFGIQNLVLKKKNTGETITFEELFFYGFPRTPPLRLPRYPLGSRDRACGSTLIEERERLKDYILLLKHH
CSSNEQPDYGPTNNRKSLSANERVGPLVQVPESAFKGTLIYSLKSGRSEFHLVKLCSQLPTRSRPQNGRNIESATQATLTHTDQRRFHNSLRIENELPGHPKKWQSIS