; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g17740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g17740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr3:11771627..11776461
RNA-Seq ExpressionMoc03g17740
SyntenyMoc03g17740
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ERM93404.1 hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]5.7e-8353.4Show/hide
Query:  DDIGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESL
        DD  R IR YAAP F   NP I  P+I+A +FE+KPVMFQM QTVGQF G   ED H HL+ F+ V +SFK + +S+EVLRLKLFP+SLRD  R+WL +L
Subjt:  DDIGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESL

Query:  PSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAF
        P  S+ +W+DLAEKFL+KYFPP++NAK+RSEI +FQQ   ES S++WERFK+LL+KCPHHGIP CIQ+ET+Y GL+ A+R+V+DASANGA+L+K Y EAF
Subjt:  PSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAF

Query:  NILERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGA-SVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL
         ILE I+SNN+ WS+ RA   +   G+ E ++ +AL +++ ++T++    +   S+  A ++  A   Q   +SC FC   H +  CP N ES+
Subjt:  NILERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGA-SVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL

XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]6.1e-9364.95Show/hide
Query:  IGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPS
        I REIRAYAAP FYNFNPVITE +I A KFE+K                                    DE  +KEVLRLKLF +SLRDE RTWL SLPS
Subjt:  IGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPS

Query:  KSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNI
        +SI SWDDLAE FL KYFPPSKNAKYRS+INNFQQF GESV+ESWE FK+L+QKC HH IPRCI IE YY GLDDATRLV   S N ALLAKPYAEAFNI
Subjt:  KSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNI

Query:  LERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL
        LERISSN HS SD RA+QG+ +K L ES+SYS  NSKIEN+ DLV RSMTQQS  GA    AN +  QG S SF  G HHYNNCPGN ES+
Subjt:  LERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL

XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia]7.5e-8367.06Show/hide
Query:  MFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQ
        MFQM QTVG+FHGHA ED HQHLKF MGVCNSFKDE LSK+V+RLKLFP+SLRDE RTWLESLPS+SI SWDDLAEKFL KYFPP+KNAKYR+EINNFQQ
Subjt:  MFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQ

Query:  FVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALN
        F GES +                                                    AEAFNILERISSNNHSW DP+AVQGKSSK L ESESY+ LN
Subjt:  FVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALN

Query:  SKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLES
        SKIENLTDLVMRS+TQQS AGASV   NVNQIQGISCSF EG+HHYNNCPGN ES
Subjt:  SKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLES

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]9.7e-9974.61Show/hide
Query:  MFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQ
        MFQM QTV QFHGHA ED HQHLKFFMGVCNSFK+E LS EVLRLKLFPYSLRDE RTWLESLP +SI SWDDLAEKFL KYFPPSKNAKYRSEINNFQQ
Subjt:  MFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQ

Query:  FVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALN
        F GESVSESWE FK+LLQ CPHHGIPRCIQIETYYK L+DATRL                                 DPRAVQGKSSKGL ESESY+ LN
Subjt:  FVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALN

Query:  SKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL
        S IENLT LVMRSM QQS  GA   TANVNQIQGISCSFCEG+HHYNNCPGN ES+
Subjt:  SKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL

XP_030483210.1 uncharacterized protein LOC115699807 [Cannabis sativa]3.0e-8453.58Show/hide
Query:  DDIGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESL
        DD  + IR YAAP F   NP I  P+I+A +FE+KPVMFQM QTVGQF G   ED H HL+ FM V +SFK   ++++ LRLKLFPYSLRD+ R WL SL
Subjt:  DDIGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESL

Query:  PSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAF
        PS S+ +W +LAE+FL KYFPP+KNAK R EI +FQQF  ES+ E+WERFK+LL+KCPHHGIP CIQ+ET+Y GL+  TR+V+DASANGALLAK Y EA+
Subjt:  PSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAF

Query:  NILERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL
        +I+ERIS+NN+ W   R   GK   G+ E ++ +AL++++ ++++++      Q +   +VS+  V Q++ +SC FC   H ++NCP N  S+
Subjt:  NILERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189103.0e-9364.95Show/hide
Query:  IGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPS
        I REIRAYAAP FYNFNPVITE +I A KFE+K                                    DE  +KEVLRLKLF +SLRDE RTWL SLPS
Subjt:  IGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPS

Query:  KSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNI
        +SI SWDDLAE FL KYFPPSKNAKYRS+INNFQQF GESV+ESWE FK+L+QKC HH IPRCI IE YY GLDDATRLV   S N ALLAKPYAEAFNI
Subjt:  KSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNI

Query:  LERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL
        LERISSN HS SD RA+QG+ +K L ES+SYS  NSKIEN+ DLV RSMTQQS  GA    AN +  QG S SF  G HHYNNCPGN ES+
Subjt:  LERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL

A0A6J1DTD1 uncharacterized protein LOC1110241363.6e-8367.06Show/hide
Query:  MFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQ
        MFQM QTVG+FHGHA ED HQHLKF MGVCNSFKDE LSK+V+RLKLFP+SLRDE RTWLESLPS+SI SWDDLAEKFL KYFPP+KNAKYR+EINNFQQ
Subjt:  MFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQ

Query:  FVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALN
        F GES +                                                    AEAFNILERISSNNHSW DP+AVQGKSSK L ESESY+ LN
Subjt:  FVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALN

Query:  SKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLES
        SKIENLTDLVMRS+TQQS AGASV   NVNQIQGISCSF EG+HHYNNCPGN ES
Subjt:  SKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLES

A0A6J1E1F3 uncharacterized protein LOC1110250654.7e-9974.61Show/hide
Query:  MFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQ
        MFQM QTV QFHGHA ED HQHLKFFMGVCNSFK+E LS EVLRLKLFPYSLRDE RTWLESLP +SI SWDDLAEKFL KYFPPSKNAKYRSEINNFQQ
Subjt:  MFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQ

Query:  FVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALN
        F GESVSESWE FK+LLQ CPHHGIPRCIQIETYYK L+DATRL                                 DPRAVQGKSSKGL ESESY+ LN
Subjt:  FVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALN

Query:  SKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL
        S IENLT LVMRSM QQS  GA   TANVNQIQGISCSFCEG+HHYNNCPGN ES+
Subjt:  SKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL

A0A6J1EEI2 uncharacterized protein LOC1114333944.0e-8247.99Show/hide
Query:  EIERTFHRNIREQRRQQAAANMDRVNQPDDIGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKD
        ++ R F        +++  AN   ++  DD  R IRAYA PA    NP I  P+++A+ FE+KPVMFQM QT+GQFHG   ED H HLK F+GV +SF+ 
Subjt:  EIERTFHRNIREQRRQQAAANMDRVNQPDDIGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKD

Query:  ERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYY
        +R+ K+V+RL LFPYSLRD  ++WL +L   +I SW+ L EKFL KYFPP++NA++R+EI  FQQF  +++SE+WERFK++L+KCPHHG+P CIQ+ET+Y
Subjt:  ERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYY

Query:  KGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMR-SMTQQSLAGASVST-ANVNQIQ
         GL+ AT+ V+DASANGA+L+K Y EA+ ILERI+SNN  W+D R+  G+ ++G+ E ++ S++N+++ ++T+++   ++ Q S+  A V T A +NQ  
Subjt:  KGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMR-SMTQQSLAGASVST-ANVNQIQ

Query:  GISCSFCEGNHHYNNCPGNLESL
          SC +C   H ++ CP N  S+
Subjt:  GISCSFCEGNHHYNNCPGNLESL

U5CUI2 Retrotrans_gag domain-containing protein2.8e-8353.4Show/hide
Query:  DDIGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESL
        DD  R IR YAAP F   NP I  P+I+A +FE+KPVMFQM QTVGQF G   ED H HL+ F+ V +SFK + +S+EVLRLKLFP+SLRD  R+WL +L
Subjt:  DDIGREIRAYAAPAFYNFNPVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESL

Query:  PSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAF
        P  S+ +W+DLAEKFL+KYFPP++NAK+RSEI +FQQ   ES S++WERFK+LL+KCPHHGIP CIQ+ET+Y GL+ A+R+V+DASANGA+L+K Y EAF
Subjt:  PSKSIKSWDDLAEKFLKKYFPPSKNAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAF

Query:  NILERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGA-SVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL
         ILE I+SNN+ WS+ RA   +   G+ E ++ +AL +++ ++T++    +   S+  A ++  A   Q   +SC FC   H +  CP N ES+
Subjt:  NILERISSNNHSWSDPRAVQGKSSKGLAESESYSALNSKIENLTDLVMRSMTQQSLAGA-SVSTANVNQIQGISCSFCEGNHHYNNCPGNLESL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGTCATGTTAAGGCAACAGAAAGTTGACTACGTGCTAATGGATCCTAAGAAAATTCCAGCAAGTGTCAGTGAGAACGAAGATCATCTTAAAGAAAAGGAT
AGGGTGGCGCATTGCTTGATCATCCTCCACTTGGCAAACAGTGTCATACGCCAAATGCGTAGTAAGGACACTACAAGTAAGCTTTGGAAAAGTTATAGTGATGTA
AAGTCTACAATGAAGTATGGTAGAGATGATCTGACGTCTGCTATTGTGATTAATGCCATTAAAATGAAAGAGAAGAATGGGGGTGGTAACCGTAACCAAAACCAA
AAGGGTAAGGCGAAAGAGGATACGGATGGGGTCAACCTTAGTGAGGAGTATGAGGAGTATGACCATGTCCTTATGGGATGCTCAGGAATGGGGTTCCCTAATCAT
GCCTATAGTTTTGATTTATTGCTTGGGTGCATGAACGATTATGAAGGATTAGAATATATTATTAATCACGAAATAGAGCGCACCTTCCACAGAAATATACGAGAG
CAGAGGCGACAACAAGCGGCAGCCAACATGGACAGAGTAAATCAGCCTGACGACATAGGCAGGGAGATAAGAGCATATGCAGCTCCGGCATTTTACAACTTCAAC
CCAGTTATCACAGAGCCAAAAATTGAAGCTTCCAAATTTGAGGTGAAACCAGTGATGTTTCAGATGTTTCAGACAGTGGGCCAATTTCATGGACATGCTATGGAG
GACCTGCATCAGCATCTGAAATTCTTTATGGGAGTTTGCAACTCATTCAAGGACGAGAGATTGAGCAAGGAAGTGCTACGACTTAAACTATTTCCGTACTCACTT
AGAGATGAAGACAGAACGTGGTTAGAGTCCCTTCCTTCAAAATCTATTAAAAGTTGGGATGACTTGGCTGAGAAGTTTCTGAAGAAATACTTCCCACCCAGCAAA
AATGCTAAGTATCGCAGTGAGATCAACAATTTTCAACAATTTGTTGGGGAATCAGTCAGTGAATCCTGGGAAAGGTTTAAGCAATTATTGCAGAAGTGCCCCCAC
CATGGAATCCCGAGATGTATACAGATCGAGACATATTACAAAGGTCTGGATGATGCCACACGCCTAGTGATTGATGCGTCAGCAAATGGGGCTTTGCTAGCAAAA
CCCTATGCTGAAGCATTTAATATTTTGGAAAGAATATCATCTAACAATCACTCATGGTCTGATCCTAGAGCTGTGCAAGGAAAATCGAGTAAGGGGTTGGCTGAA
TCCGAATCATATTCTGCATTGAATTCAAAGATTGAGAATCTGACGGATTTGGTAATGAGGAGTATGACGCAGCAGAGTCTAGCTGGAGCATCAGTCAGTACAGCT
AATGTTAATCAAATTCAAGGGATTTCCTGCTCATTCTGCGAAGGAAACCACCATTACAATAACTGCCCTGGAAATCTAGAGAGTTTATTATCTGGGGAACCCGTA
GAATAA
mRNA sequenceShow/hide mRNA sequence
ATGATGGTCATGTTAAGGCAACAGAAAGTTGACTACGTGCTAATGGATCCTAAGAAAATTCCAGCAAGTGTCAGTGAGAACGAAGATCATCTTAAAGAAAAGGAT
AGGGTGGCGCATTGCTTGATCATCCTCCACTTGGCAAACAGTGTCATACGCCAAATGCGTAGTAAGGACACTACAAGTAAGCTTTGGAAAAGTTATAGTGATGTA
AAGTCTACAATGAAGTATGGTAGAGATGATCTGACGTCTGCTATTGTGATTAATGCCATTAAAATGAAAGAGAAGAATGGGGGTGGTAACCGTAACCAAAACCAA
AAGGGTAAGGCGAAAGAGGATACGGATGGGGTCAACCTTAGTGAGGAGTATGAGGAGTATGACCATGTCCTTATGGGATGCTCAGGAATGGGGTTCCCTAATCAT
GCCTATAGTTTTGATTTATTGCTTGGGTGCATGAACGATTATGAAGGATTAGAATATATTATTAATCACGAAATAGAGCGCACCTTCCACAGAAATATACGAGAG
CAGAGGCGACAACAAGCGGCAGCCAACATGGACAGAGTAAATCAGCCTGACGACATAGGCAGGGAGATAAGAGCATATGCAGCTCCGGCATTTTACAACTTCAAC
CCAGTTATCACAGAGCCAAAAATTGAAGCTTCCAAATTTGAGGTGAAACCAGTGATGTTTCAGATGTTTCAGACAGTGGGCCAATTTCATGGACATGCTATGGAG
GACCTGCATCAGCATCTGAAATTCTTTATGGGAGTTTGCAACTCATTCAAGGACGAGAGATTGAGCAAGGAAGTGCTACGACTTAAACTATTTCCGTACTCACTT
AGAGATGAAGACAGAACGTGGTTAGAGTCCCTTCCTTCAAAATCTATTAAAAGTTGGGATGACTTGGCTGAGAAGTTTCTGAAGAAATACTTCCCACCCAGCAAA
AATGCTAAGTATCGCAGTGAGATCAACAATTTTCAACAATTTGTTGGGGAATCAGTCAGTGAATCCTGGGAAAGGTTTAAGCAATTATTGCAGAAGTGCCCCCAC
CATGGAATCCCGAGATGTATACAGATCGAGACATATTACAAAGGTCTGGATGATGCCACACGCCTAGTGATTGATGCGTCAGCAAATGGGGCTTTGCTAGCAAAA
CCCTATGCTGAAGCATTTAATATTTTGGAAAGAATATCATCTAACAATCACTCATGGTCTGATCCTAGAGCTGTGCAAGGAAAATCGAGTAAGGGGTTGGCTGAA
TCCGAATCATATTCTGCATTGAATTCAAAGATTGAGAATCTGACGGATTTGGTAATGAGGAGTATGACGCAGCAGAGTCTAGCTGGAGCATCAGTCAGTACAGCT
AATGTTAATCAAATTCAAGGGATTTCCTGCTCATTCTGCGAAGGAAACCACCATTACAATAACTGCCCTGGAAATCTAGAGAGTTTATTATCTGGGGAACCCGTA
GAATAA
Protein sequenceShow/hide protein sequence
MMVMLRQQKVDYVLMDPKKIPASVSENEDHLKEKDRVAHCLIILHLANSVIRQMRSKDTTSKLWKSYSDVKSTMKYGRDDLTSAIVINAIKMKEKNGGGNRNQNQ
KGKAKEDTDGVNLSEEYEEYDHVLMGCSGMGFPNHAYSFDLLLGCMNDYEGLEYIINHEIERTFHRNIREQRRQQAAANMDRVNQPDDIGREIRAYAAPAFYNFN
PVITEPKIEASKFEVKPVMFQMFQTVGQFHGHAMEDLHQHLKFFMGVCNSFKDERLSKEVLRLKLFPYSLRDEDRTWLESLPSKSIKSWDDLAEKFLKKYFPPSK
NAKYRSEINNFQQFVGESVSESWERFKQLLQKCPHHGIPRCIQIETYYKGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWSDPRAVQGKSSKGLAE
SESYSALNSKIENLTDLVMRSMTQQSLAGASVSTANVNQIQGISCSFCEGNHHYNNCPGNLESLLSGEPVE