CuGenDBv2

Moc09g38430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g38430
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: RNase H domain-containing protein
Genome location: chr9:29576468..29579326
RNA-Seq Expression: Moc09g38430
Synteny: Moc09g38430
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia] (e value 9.7e-77, 57.42% identity)
Query:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVMTELA--
        FLAASDA+K RAFQIALEGS RLWY+QLKPRSIDSYQQ+RRLFINQFSARQLLKLPPSHL TVKQ+DN+SLTEYIAR +DE VKVVSCT+DI M      
Subjt:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVMTELA--

Query:  ----SILMAWSCRRPTE--------------------PGEAAAIKFKTR----SLPVPR--SDVVMIEVCLDELTTTRIE---------------VCVNA
            ++ + +  R P                       G   + + K R    S P  R   D       +D+ +  + +                 +NA
Subjt:  ----SILMAWSCRRPTE--------------------PGEAAAIKFKTR----SLPVPR--SDVVMIEVCLDELTTTRIE---------------VCVNA

Query:  SIVETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGD
        S+ E Y  VE+TDM+ALFTA +  KLH+P GKRDK+LY RFHKDH H++SRCFHLKEQV+DLIRRGYLKKY GSRE+A+PEGS REEKRERSQPP RK D
Subjt:  SIVETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGD

Query:  RPAVINTIHG
        RPAVINTIHG
Subjt:  RPAVINTIHG

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia] (e value 2.3e-33, 36.74% identity)
Query:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVM------
        F AASDA+K RAFQIAL GS RLWYR+L  RSI +Y Q+RR F+ QFS+RQ  K   +HL T++Q++  +L EY+ RF +E +KV  C++D  M      
Subjt:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVM------

Query:  ---------------TELASI------------LMAWSCRRPT---------EPGEAAAIKFKTR-SLPVPRSDVVMIEVCLDELTTTRIEVCVNASIVE
                       T  A +            L+     RP          +  E A  K K + S    R++    E    +             I E
Subjt:  ---------------TELASI------------LMAWSCRRPT---------EPGEAAAIKFKTR-SLPVPRSDVVMIEVCLDELTTTRIEVCVNASIVE

Query:  TYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDRPAV
          T +E++ ME L    EKL+      +R K  Y RFH++H H+TS C+ LK Q+EDLI+ GY KK+ G    +  E   ++E+R+RS+ P R+ DRPAV
Subjt:  TYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDRPAV

Query:  INTIHGVHGMGRS
        INTI G    G+S
Subjt:  INTIHGVHGMGRS

XP_022152851.1 uncharacterized protein LOC111020475 [Momordica charantia] (e value 1.3e-65, 56.74% identity)
Query:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVMTELA--
        FLA SDAMK  AFQI LEGSTRLWYRQLK RSIDSYQQ+RRLFINQFS RQ LKLP SHLGTVKQ+DN+S T YIARF+DE VKVVSCT+DI M      
Subjt:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVMTELA--

Query:  ----SILMAWSCRRPT----------------------EPGEAAAIKFKTRSLPVPRSDVVMIEVCLDELTTTRIEVCVNASIVETYTAVEDTDMEALFT
            ++ + +   +P                          E AA++  T SL  PRS  VMIE+ L EL TTRI+V +      ++  V    +E    
Subjt:  ----SILMAWSCRRPT----------------------EPGEAAAIKFKTRSLPVPRSDVVMIEVCLDELTTTRIEVCVNASIVETYTAVEDTDMEALFT

Query:  ASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDR
             KL +P GKRDK+LY RFHKD  HDTSRCFHLKEQVEDLIRRGYLKKY G+ E+A PE SA EEKRERSQ P+R+ DR
Subjt:  ASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDR

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia] (e value 7.1e-35, 34.18% identity)
Query:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVM----TE
        F AA+DA+K RAFQIAL GS RLWYR+L  RSI +Y Q+R+ FI+QFS+    +   +HL T++Q++ ++L EY+ RF +E +KV  C++D  M    T 
Subjt:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVM----TE

Query:  LA-----------------------------SILMAWSCRRPTEPGEAAAIKFKTR-----------SLPVPRSDVVMIEVCLDELTTTRIEVCVNASIV
        LA                               L+     RP +  +   +  + R           S    R++   +E                  I 
Subjt:  LA-----------------------------SILMAWSCRRPTEPGEAAAIKFKTR-----------SLPVPRSDVVMIEVCLDELTTTRIEVCVNASIV

Query:  ETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDRPA
        E  T +E++ ME L    EKL+    L KR+K+ Y RFH+DH H+T+ C+ LK Q+EDLI+ GY KK+ G       E   ++E+R+RS+ P R+ DRPA
Subjt:  ETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDRPA

Query:  VINTIHGVHGMGRSPS--SSLIRDLRLIMPIV--HIARSSSFVSSLQFNGRHLP
        VINTI G    G+S +    L R+ R  + I+  H    S         G HLP
Subjt:  VINTIHGVHGMGRSPS--SSLIRDLRLIMPIV--HIARSSSFVSSLQFNGRHLP

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia] (e value 1.8e-38, 79.65% identity)
Query:  VNASIVETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRR
        +NASI E Y  VEDTDME LF + EKL+  +P GKR+K+LY RFHKDH HDTSRCFHLKEQVEDLIR GYLKKY GSREQAE EGSAREEKRERSQPPR 
Subjt:  VNASIVETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRR

Query:  KGDRPAVINTIHG
        K DRPAVINTIHG
Subjt:  KGDRPAVINTIHG

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D5T3 uncharacterized protein LOC111017548 (e value 4.7e-77, 57.42% identity)
Query:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVMTELA--
        FLAASDA+K RAFQIALEGS RLWY+QLKPRSIDSYQQ+RRLFINQFSARQLLKLPPSHL TVKQ+DN+SLTEYIAR +DE VKVVSCT+DI M      
Subjt:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVMTELA--

Query:  ----SILMAWSCRRPTE--------------------PGEAAAIKFKTR----SLPVPR--SDVVMIEVCLDELTTTRIE---------------VCVNA
            ++ + +  R P                       G   + + K R    S P  R   D       +D+ +  + +                 +NA
Subjt:  ----SILMAWSCRRPTE--------------------PGEAAAIKFKTR----SLPVPR--SDVVMIEVCLDELTTTRIE---------------VCVNA

Query:  SIVETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGD
        S+ E Y  VE+TDM+ALFTA +  KLH+P GKRDK+LY RFHKDH H++SRCFHLKEQV+DLIRRGYLKKY GSRE+A+PEGS REEKRERSQPP RK D
Subjt:  SIVETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGD

Query:  RPAVINTIHG
        RPAVINTIHG
Subjt:  RPAVINTIHG

A0A6J1D7S8 uncharacterized protein LOC111017807 (e value 1.4e-33, 33.52% identity)
Query:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVM----TE
        F AA+DA+K RAFQIAL G  RLWYR+L  RSI +Y Q+R+ FI+QF +R   +   +HL T++Q++ ++L EY+ RF +E +KVV C++D  M    T 
Subjt:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVM----TE

Query:  LASILMAWSCRRPTEPGEAAAIKFKTRSL-----------PVPRSDVVMIEVCLDELTT------------------TRIEVCVNAS------------I
        LA   +         P   A +  K + +             P   +   ++  ++  T                   R+E   + S            I
Subjt:  LASILMAWSCRRPTEPGEAAAIKFKTRSL-----------PVPRSDVVMIEVCLDELTT------------------TRIEVCVNAS------------I

Query:  VETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDRP
         E  T +E++ ME L  + EKL+      KR K    RFH+DHDH+T+ C+ LK Q+EDLI+ GY KK+ G       E   ++E+R+RS+ P R+ DRP
Subjt:  VETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDRP

Query:  AVINTIHGVHGMGRSPS--SSLIRDLRLIMPIVHIARSSSFVS--SLQFNGRHLP
        AVINTI G    G+S +    L R+ R  + I+   + +  ++       G HLP
Subjt:  AVINTIHGVHGMGRSPS--SSLIRDLRLIMPIVHIARSSSFVS--SLQFNGRHLP

A0A6J1D9W7 uncharacterized protein LOC111018708 (e value 1.1e-33, 36.74% identity)
Query:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVM------
        F AASDA+K RAFQIAL GS RLWYR+L  RSI +Y Q+RR F+ QFS+RQ  K   +HL T++Q++  +L EY+ RF +E +KV  C++D  M      
Subjt:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVM------

Query:  ---------------TELASI------------LMAWSCRRPT---------EPGEAAAIKFKTR-SLPVPRSDVVMIEVCLDELTTTRIEVCVNASIVE
                       T  A +            L+     RP          +  E A  K K + S    R++    E    +             I E
Subjt:  ---------------TELASI------------LMAWSCRRPT---------EPGEAAAIKFKTR-SLPVPRSDVVMIEVCLDELTTTRIEVCVNASIVE

Query:  TYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDRPAV
          T +E++ ME L    EKL+      +R K  Y RFH++H H+TS C+ LK Q+EDLI+ GY KK+ G    +  E   ++E+R+RS+ P R+ DRPAV
Subjt:  TYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDRPAV

Query:  INTIHGVHGMGRS
        INTI G    G+S
Subjt:  INTIHGVHGMGRS

A0A6J1DIZ8 uncharacterized protein LOC111020475 (e value 6.4e-66, 56.74% identity)
Query:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVMTELA--
        FLA SDAMK  AFQI LEGSTRLWYRQLK RSIDSYQQ+RRLFINQFS RQ LKLP SHLGTVKQ+DN+S T YIARF+DE VKVVSCT+DI M      
Subjt:  FLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVVSCTNDIVMTELA--

Query:  ----SILMAWSCRRPT----------------------EPGEAAAIKFKTRSLPVPRSDVVMIEVCLDELTTTRIEVCVNASIVETYTAVEDTDMEALFT
            ++ + +   +P                          E AA++  T SL  PRS  VMIE+ L EL TTRI+V +      ++  V    +E    
Subjt:  ----SILMAWSCRRPT----------------------EPGEAAAIKFKTRSLPVPRSDVVMIEVCLDELTTTRIEVCVNASIVETYTAVEDTDMEALFT

Query:  ASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDR
             KL +P GKRDK+LY RFHKD  HDTSRCFHLKEQVEDLIRRGYLKKY G+ E+A PE SA EEKRERSQ P+R+ DR
Subjt:  ASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDR

A0A6J1E0L8 uncharacterized protein LOC111025310 (e value 8.7e-39, 79.65% identity)
Query:  VNASIVETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRR
        +NASI E Y  VEDTDME LF + EKL+  +P GKR+K+LY RFHKDH HDTSRCFHLKEQVEDLIR GYLKKY GSREQAE EGSAREEKRERSQPPR 
Subjt:  VNASIVETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHKDHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRR

Query:  KGDRPAVINTIHG
        K DRPAVINTIHG
Subjt:  KGDRPAVINTIHG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGACGCGCCCATCCCGACGGGTTCGGGGATCCAGTCTCGTACGTTGAGATGTTTGAGGGAAATATGGATTTTTTTGGCCGCAAGCGACGCTATGAAATACCGAGCATT
CCAAATAGCTCTGGAAGGCTCGACGAGATTATGGTACCGACAGCTGAAGCCCAGGTCAATCGACAGTTATCAACAGATGAGAAGGTTGTTCATCAATCAGTTCTCAGCTC
GGCAGTTATTGAAGTTGCCGCCTTCCCACCTCGGAACAGTGAAGCAACAGGACAACAAGTCCCTTACGGAGTACATCGCTCGGTTCATAGACGAGGATGTCAAGGTGGTG
AGTTGTACCAACGACATTGTCATGACCGAGCTCGCCAGTATATTGATGGCCTGGAGTTGTAGAAGGCCAACGGAGCCAGGCGAAGCAGCCGCGATAAAGTTCAAGACCAG
AAGTCTCCCCGTTCCAAGAAGCGACGTAGTGATGATCGAAGTTTGTCTCGACGAACTGACGACGACAAGAATAGAGGTCTGCGTGAATGCCTCAATCGTGGAGACCTACA
CAGCAGTCGAAGACACCGACATGGAGGCGCTGTTCACAGCCTCAGAAAAGCTCAAGCTCCACCAACCCTTGGGAAAGCGGGACAAACAACTCTACTACCGATTCCATAAG
GATCACGACCACGATACCTCCCGTTGCTTTCATCTGAAGGAGCAGGTCGAGGATTTGATCCGGAGAGGTTATTTGAAGAAGTATTTCGGCAGTAGAGAACAAGCTGAGCC
AGAGGGATCAGCTCGGGAGGAGAAGCGAGAGAGATCACAGCCGCCCAGACGAAAGGGAGATCGCCCTGCCGTTATAAACACCATTCACGGGGTGCATGGTATGGGCCGAT
CACCATCTTCCTCGCTTATTAGAGACCTGCGACTCATCATGCCAATAGTCCACATCGCCAGGTCATCTTCATTTGTTTCTTCTCTTCAATTCAACGGTCGACATCTCCCA
GATTTTCCGTTGAATTTCACAGGCAAGCGGGAGAGACAAGATTGTTGGAACCATTCCGATCAGCGGTTACGATTGTTTGGCCCCCAGAACCCGCATCAGTTGAAGAGCGA
GAATGAGAAGGCCGCCATTGACAGAGCAAACACCGGCCAACTTATCCAGCAAAACCTCGCCGGAAGGAAGGCGGTCGCCATTAGGAAGAAGATAATGCAGGTTAAAACCG
AACCAAATTTCTCGAGTTTGTTTCCGAATCTTGGAAAGGAATTTCGAAGAGTAACGCCACAAAACGAAACTCCAAAGCTTATCACCACCATTACCTCTGCAATCTGTAGG
GACAGAGAAGGTGGAATCTGTGAAGAAACGACCAATCCAATGGCGGCCTGA
mRNA sequence
ATGGACGCGCCCATCCCGACGGGTTCGGGGATCCAGTCTCGTACGTTGAGATGTTTGAGGGAAATATGGATTTTTTTGGCCGCAAGCGACGCTATGAAATACCGAGCATT
CCAAATAGCTCTGGAAGGCTCGACGAGATTATGGTACCGACAGCTGAAGCCCAGGTCAATCGACAGTTATCAACAGATGAGAAGGTTGTTCATCAATCAGTTCTCAGCTC
GGCAGTTATTGAAGTTGCCGCCTTCCCACCTCGGAACAGTGAAGCAACAGGACAACAAGTCCCTTACGGAGTACATCGCTCGGTTCATAGACGAGGATGTCAAGGTGGTG
AGTTGTACCAACGACATTGTCATGACCGAGCTCGCCAGTATATTGATGGCCTGGAGTTGTAGAAGGCCAACGGAGCCAGGCGAAGCAGCCGCGATAAAGTTCAAGACCAG
AAGTCTCCCCGTTCCAAGAAGCGACGTAGTGATGATCGAAGTTTGTCTCGACGAACTGACGACGACAAGAATAGAGGTCTGCGTGAATGCCTCAATCGTGGAGACCTACA
CAGCAGTCGAAGACACCGACATGGAGGCGCTGTTCACAGCCTCAGAAAAGCTCAAGCTCCACCAACCCTTGGGAAAGCGGGACAAACAACTCTACTACCGATTCCATAAG
GATCACGACCACGATACCTCCCGTTGCTTTCATCTGAAGGAGCAGGTCGAGGATTTGATCCGGAGAGGTTATTTGAAGAAGTATTTCGGCAGTAGAGAACAAGCTGAGCC
AGAGGGATCAGCTCGGGAGGAGAAGCGAGAGAGATCACAGCCGCCCAGACGAAAGGGAGATCGCCCTGCCGTTATAAACACCATTCACGGGGTGCATGGTATGGGCCGAT
CACCATCTTCCTCGCTTATTAGAGACCTGCGACTCATCATGCCAATAGTCCACATCGCCAGGTCATCTTCATTTGTTTCTTCTCTTCAATTCAACGGTCGACATCTCCCA
GATTTTCCGTTGAATTTCACAGGCAAGCGGGAGAGACAAGATTGTTGGAACCATTCCGATCAGCGGTTACGATTGTTTGGCCCCCAGAACCCGCATCAGTTGAAGAGCGA
GAATGAGAAGGCCGCCATTGACAGAGCAAACACCGGCCAACTTATCCAGCAAAACCTCGCCGGAAGGAAGGCGGTCGCCATTAGGAAGAAGATAATGCAGGTTAAAACCG
AACCAAATTTCTCGAGTTTGTTTCCGAATCTTGGAAAGGAATTTCGAAGAGTAACGCCACAAAACGAAACTCCAAAGCTTATCACCACCATTACCTCTGCAATCTGTAGG
GACAGAGAAGGTGGAATCTGTGAAGAAACGACCAATCCAATGGCGGCCTGA
Protein sequence
MDAPIPTGSGIQSRTLRCLREIWIFLAASDAMKYRAFQIALEGSTRLWYRQLKPRSIDSYQQMRRLFINQFSARQLLKLPPSHLGTVKQQDNKSLTEYIARFIDEDVKVV
SCTNDIVMTELASILMAWSCRRPTEPGEAAAIKFKTRSLPVPRSDVVMIEVCLDELTTTRIEVCVNASIVETYTAVEDTDMEALFTASEKLKLHQPLGKRDKQLYYRFHK
DHDHDTSRCFHLKEQVEDLIRRGYLKKYFGSREQAEPEGSAREEKRERSQPPRRKGDRPAVINTIHGVHGMGRSPSSSLIRDLRLIMPIVHIARSSSFVSSLQFNGRHLP
DFPLNFTGKRERQDCWNHSDQRLRLFGPQNPHQLKSENEKAAIDRANTGQLIQQNLAGRKAVAIRKKIMQVKTEPNFSSLFPNLGKEFRRVTPQNETPKLITTITSAICR
DREGGICEETTNPMAA