; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g23460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g23460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr4:16949582..16954849
RNA-Seq ExpressionMoc04g23460
SyntenyMoc04g23460
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149417.1 protein NYNRIN-like [Momordica charantia]2.1e-6871.34Show/hide
Query:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF
        + CD C+RFA  IH PL+LLTPISAPWPFAQWG+D+I P P G+GQTKFAVVA+DYFTKWAEAE L+ ITE+++T F+W N+V RFGIP+AI+T+NG+QF
Subjt:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF

Query:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSYR
        DN KF  FC  LGI+H S SPAHP+ANGQVEA+NKIIKR LKLRL+ R GRW EELP+VLWSY+
Subjt:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSYR

XP_022153142.1 uncharacterized protein LOC111020710 [Momordica charantia]5.8e-6351.75Show/hide
Query:  VQVNPGRNFRSSLCSGARSNEYRHRDADLDDPPKSLPLWMRATRTSQPLQDAAQSHGVPTPRVARRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDL
        ++ NP ++ +       R+  +  ++  L     SLPL ++     + L    + H     +  + CD C+RFA  IH P +L+TPISAPWPFAQWG+D+
Subjt:  VQVNPGRNFRSSLCSGARSNEYRHRDADLDDPPKSLPLWMRATRTSQPLQDAAQSHGVPTPRVARRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDL

Query:  IWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKI
          P P G+GQ KFAV A+DYFTKWAEA+ L+ ITE+++T F+ TN+V RF IP+AI+ +NG+QFDN K   FC +LGI H S SP HP+ANGQVEA+NKI
Subjt:  IWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKI

Query:  IKRGLKLRLEDRKGRWVEELPDVLWSYR
        IKRGLKLRL+ RKGRW  ELP+VLW YR
Subjt:  IKRGLKLRLEDRKGRWVEELPDVLWSYR

XP_022156575.1 uncharacterized protein LOC111023451 [Momordica charantia]1.7e-6771.17Show/hide
Query:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF
        R CD C+R+ T I  P +LLTPISAPWPFAQWG+D+I   P G+GQTKFAVVA+DYFTKW EAE L+ ITE+++T FVWTN++ RFGIP AI+T+NG+QF
Subjt:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF

Query:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSY
        DN KF  FC +LGI H S SPAHPQANGQVEA+NKIIKRG+KLRL+ +KGRWVEELP+VLWSY
Subjt:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSY

XP_022157799.1 uncharacterized protein LOC111024419 [Momordica charantia]8.4e-6268.29Show/hide
Query:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF
        R  D C+RF   IH P +LLTPISA WPF QWGID+I P   G+G TKFAVVA+DYFTKWAEA  L+ ITE+++T FVW N+V RFGIPHAI+T+N +QF
Subjt:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF

Query:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSYR
        DN  F  FC +LGI H S SPAH QANGQVEA+NKIIKRG+KLRL+ RKGRW  ELP+VLWSYR
Subjt:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSYR

XP_022158759.1 uncharacterized protein LOC111025225 [Momordica charantia]1.4e-7798.67Show/hide
Query:  MRRDAEKHNRLKFCRFYKDHGHDTSDCYELKRQIEGLIQKGYFKKQVGQAHSRGRKKAGSSKEGRAKRERTRSPLKRTDRPTVINMIFGGPSGGQLGRKH
        MRRDAEKHNRLKFCRFYKDHGHDTSDCYELKRQIEGLIQKGYFKKQVGQAHSRGRKKAGSSKEGRAKRERTRSPLKRTDRPTVINMIFGGPSGGQLGRKH
Subjt:  MRRDAEKHNRLKFCRFYKDHGHDTSDCYELKRQIEGLIQKGYFKKQVGQAHSRGRKKAGSSKEGRAKRERTRSPLKRTDRPTVINMIFGGPSGGQLGRKH

Query:  KALVREAHHEICASYIQLTPYQISLSIEDMNGLYSPHNNALVIEAKIDHI
        KALVREAHHEICASYIQLTPYQISLSIEDMN LYSPHNNALVIEAKIDH+
Subjt:  KALVREAHHEICASYIQLTPYQISLSIEDMNGLYSPHNNALVIEAKIDHI

TrEMBL top hitse value%identityAlignment
A0A6J1D7W6 Ribonuclease H9.9e-6971.34Show/hide
Query:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF
        + CD C+RFA  IH PL+LLTPISAPWPFAQWG+D+I P P G+GQTKFAVVA+DYFTKWAEAE L+ ITE+++T F+W N+V RFGIP+AI+T+NG+QF
Subjt:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF

Query:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSYR
        DN KF  FC  LGI+H S SPAHP+ANGQVEA+NKIIKR LKLRL+ R GRW EELP+VLWSY+
Subjt:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSYR

A0A6J1DI47 Ribonuclease H2.8e-6351.75Show/hide
Query:  VQVNPGRNFRSSLCSGARSNEYRHRDADLDDPPKSLPLWMRATRTSQPLQDAAQSHGVPTPRVARRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDL
        ++ NP ++ +       R+  +  ++  L     SLPL ++     + L    + H     +  + CD C+RFA  IH P +L+TPISAPWPFAQWG+D+
Subjt:  VQVNPGRNFRSSLCSGARSNEYRHRDADLDDPPKSLPLWMRATRTSQPLQDAAQSHGVPTPRVARRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDL

Query:  IWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKI
          P P G+GQ KFAV A+DYFTKWAEA+ L+ ITE+++T F+ TN+V RF IP+AI+ +NG+QFDN K   FC +LGI H S SP HP+ANGQVEA+NKI
Subjt:  IWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKI

Query:  IKRGLKLRLEDRKGRWVEELPDVLWSYR
        IKRGLKLRL+ RKGRW  ELP+VLW YR
Subjt:  IKRGLKLRLEDRKGRWVEELPDVLWSYR

A0A6J1DQP4 Ribonuclease H8.4e-6871.17Show/hide
Query:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF
        R CD C+R+ T I  P +LLTPISAPWPFAQWG+D+I   P G+GQTKFAVVA+DYFTKW EAE L+ ITE+++T FVWTN++ RFGIP AI+T+NG+QF
Subjt:  RRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQF

Query:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSY
        DN KF  FC +LGI H S SPAHPQANGQVEA+NKIIKRG+KLRL+ +KGRWVEELP+VLWSY
Subjt:  DNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSY

A0A6J1DZU0 Ribonuclease H4.0e-6266.07Show/hide
Query:  TPRVARRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITE
        T    ++CD+C+RFAT    P + LT I +PWPFAQWGIDLI PLP G+GQTKFAVVA+DYFTKWAEA+ LATITE K+T F+W N++ RFGIP+AII++
Subjt:  TPRVARRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITE

Query:  NGRQFDNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSY
        NG+QFDN  F +F  +LGIKH   SPAHPQANGQVEA+NK+IKR LK RLE  KG W EELP+ LW+Y
Subjt:  NGRQFDNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSY

A0A6J1E0C5 uncharacterized protein LOC1110252256.9e-7898.67Show/hide
Query:  MRRDAEKHNRLKFCRFYKDHGHDTSDCYELKRQIEGLIQKGYFKKQVGQAHSRGRKKAGSSKEGRAKRERTRSPLKRTDRPTVINMIFGGPSGGQLGRKH
        MRRDAEKHNRLKFCRFYKDHGHDTSDCYELKRQIEGLIQKGYFKKQVGQAHSRGRKKAGSSKEGRAKRERTRSPLKRTDRPTVINMIFGGPSGGQLGRKH
Subjt:  MRRDAEKHNRLKFCRFYKDHGHDTSDCYELKRQIEGLIQKGYFKKQVGQAHSRGRKKAGSSKEGRAKRERTRSPLKRTDRPTVINMIFGGPSGGQLGRKH

Query:  KALVREAHHEICASYIQLTPYQISLSIEDMNGLYSPHNNALVIEAKIDHI
        KALVREAHHEICASYIQLTPYQISLSIEDMN LYSPHNNALVIEAKIDH+
Subjt:  KALVREAHHEICASYIQLTPYQISLSIEDMNGLYSPHNNALVIEAKIDHI

SwissProt top hitse value%identityAlignment
P03359 Gag-Pol polyprotein7.0e-1134.35Show/hide
Query:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN
        P   W +D     P GR   ++ +V +D F+ W EA P  T T   +   +   ++ RFGIP  + ++NG  F          QLGI         PQ++
Subjt:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN

Query:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELP
        GQVE +N+ IK  L KL LE     WV  LP
Subjt:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELP

P03360 Gag-Pol polyprotein (Fragment)1.8e-1132.09Show/hide
Query:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN
        P   W +D    + T +G  K+ +V +D F+ W EA P    T   +   +  +++ RFG+P  I ++NG  F      + CE L +         PQ++
Subjt:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN

Query:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELPDVL
        GQVE +N+ +K  + KLR+E   G WV  LP  L
Subjt:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELPDVL

P10273 Gag-Pol polyprotein4.5e-1032.61Show/hide
Query:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN
        P   W +D    +  G    K+ +V +D F+ WAEA P    T   +   +   +  R+GIP  + ++NG  F +         LGI         PQ++
Subjt:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN

Query:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELPDVLWSYR
        GQVE +N+ IK  L KL LE     WV  LP VL+  R
Subjt:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELPDVLWSYR

P21414 Gag-Pol polyprotein7.0e-1134.35Show/hide
Query:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN
        P   W +D    +  GR   K+ +V +D F+ W EA P  T T   +   +   ++ RFGIP  + ++NG  F          QLGI         PQ++
Subjt:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN

Query:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELP
        GQVE +N+ IK  L KL LE     WV  LP
Subjt:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELP

Q9TTC1 Gag-Pol polyprotein1.2e-1034.35Show/hide
Query:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN
        P   W +D     P GR   ++ +V +D F+ W EA P  T T   +   +   ++ RFGIP  + ++NG  F          QLGI         PQ++
Subjt:  PFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKWAEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQAN

Query:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELP
        GQVE +N+ IK  L KL LE     WV  LP
Subjt:  GQVEAINKIIKRGL-KLRLEDRKGRWVEELP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAGTGTCAGGGCCTCGGGTATAAATGGTCGAGGGCTGATACGTCACTAATTGGGTATCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATACGCCAATA
TTGGATAAAGATGAGCGTCGAGGCCTCGGGTATAAATGGTCGGGGGTCAATGTGAGGAGTCTTTCGAAAGGAGAACTATTGGGGCCTTGGGCTAGAGTAGCTGTG
ATAAGCGTAGAGGTACTTGAGATGTTAGGGAGACTAGAGGTTGGTACTGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTTCCCCTCCAG
TTCGCAGGTGTACTGCTCCCTGAGATCTCTGAGCTCGAGGTTTTGCATGACCCCCAAAGGGGAATAACTAAGGCTATCAGGACCGACCAGGAAAGGTCGAAACTC
GGGCGCAACCTGTCGAAACTCGTCCTACGAGGAAACAATCGACCTCGGGGACCGGAGGATATGCGGACCCGCCTCTCCTCGCTGAACGAGCGGGACCTTCGCAAC
AAGCTGGACCATCAACGCTCCAAGAGAGACACCCCTTCAACGAGGTCGGAGTCAAGCGTTCTCCGTAACTCTGCAAGGGGCTGCGAGGAAATGGTATTGGCAGCT
CGCACCCCACTCTATTTCCTCTTAGAAACAGCTCCACAAGAACTTTGTGGCCCAGTTTGCAGCCCAGCAAGAAACTCAACACCCGGCTCAGTTCTTGCTCACGAT
AAGGCAGAAGGAGTTCGGCATGAACAATTGAAATGGTCTCTTGCCAAGAAGCTCGAGCTCACTATAAAGGCCAGTATAGAACGGGCCAAGAAGTTCATGCAGGCG
AGCGAACTCATCCAGTCAAGGGAGGATCCTTCCAAAGTCGACCCCGCGAAGGAAAAAAGAGCTCGGGACAGCTCTAAAAGGTCACCGTGTCGCCACCAACGATAT
GATCACGAGCCTCGACATTCCAAGCTCGACCTACTCACCAAGTCTAGTCCCATGCGCCGAGACGCTGAGAAGCACAATAGGTTGAAGTTCTGCCGCTTCTACAAG
GATCACGGTCACGATACCTCCGACTGCTACGAGCTGAAGAGGCAGATCGAGGGCCTCATCCAGAAAGGGTACTTCAAGAAGCAAGTGGGACAAGCTCACAGCAGA
GGAAGAAAAAAGGCAGGTAGTTCAAAAGAAGGAAGGGCAAAGAGGGAGAGGACACGATCTCCTCTTAAACGCACAGATCGACCTACCGTGATCAACATGATCTTT
GGAGGTCCGAGCGGGGGGCAGTTAGGAAGGAAGCACAAGGCACTGGTGCGAGAAGCCCACCACGAGATTTGTGCAAGCTATATACAGCTCACTCCGTACCAGATT
TCGTTATCAATCGAGGACATGAACGGCCTGTACTCCCCTCACAACAACGCCTTGGTTATCGAAGCAAAGATTGACCACATAATGGGTACCACGCACGGCGAACAC
CAATGCGAACGCCTTCGCAAAACTGGCGTCCTCTTATCCGACCGAGCTGTCCAAGTCAATCCCGGTCGAAATTTTAGAAGCTCCCTCTGTTCAGGGGCCCGAAGC
AATGAATATCGACACCGCGACGCCGACCTGGATGATCCACCTAAAAGCCTTCCTCTGTGGATGAGAGCTACCAGAACAAGTCAGCCTCTGCAAGATGCGGCGCAA
AGCCATGGGGTACCTACTCCGAGAGTCGCGCGACGATGTGATCGGTGCCGGCGATTTGCTACGGCAATCCATCACCCGCTCCAGCTCCTCACACCGATCTCAGCC
CCTTGGCCATTCGCCCAGTGGGGAATCGATCTTATTTGGCCCCTTCCCACGGGGAGAGGCCAAACGAAATTCGCAGTTGTGGCCTTAGATTACTTCACCAAGTGG
GCGGAGGCTGAGCCGTTGGCCACCATAACCGAGGCTAAAATCACGGGCTTCGTGTGGACCAATCTCGTCTATAGGTTTGGCATACCCCACGCCATCATAACCGAA
AATGGAAGGCAGTTCGACAACCCCAAATTCAACAAGTTTTGTGAACAGCTTGGGATAAAGCATTTCAGCCCCTCGCCAGCTCACCCGCAAGCCAATGGCCAGGTG
GAGGCTATTAACAAGATTATAAAGCGCGGCCTAAAATTAAGACTAGAGGATCGCAAGGGCCGGTGGGTAGAGGAGCTACCCGATGTATTGTGGTCATACAGGATG
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGAGTGTCAGGGCCTCGGGTATAAATGGTCGAGGGCTGATACGTCACTAATTGGGTATCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATACGCCAATA
TTGGATAAAGATGAGCGTCGAGGCCTCGGGTATAAATGGTCGGGGGTCAATGTGAGGAGTCTTTCGAAAGGAGAACTATTGGGGCCTTGGGCTAGAGTAGCTGTG
ATAAGCGTAGAGGTACTTGAGATGTTAGGGAGACTAGAGGTTGGTACTGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTTCCCCTCCAG
TTCGCAGGTGTACTGCTCCCTGAGATCTCTGAGCTCGAGGTTTTGCATGACCCCCAAAGGGGAATAACTAAGGCTATCAGGACCGACCAGGAAAGGTCGAAACTC
GGGCGCAACCTGTCGAAACTCGTCCTACGAGGAAACAATCGACCTCGGGGACCGGAGGATATGCGGACCCGCCTCTCCTCGCTGAACGAGCGGGACCTTCGCAAC
AAGCTGGACCATCAACGCTCCAAGAGAGACACCCCTTCAACGAGGTCGGAGTCAAGCGTTCTCCGTAACTCTGCAAGGGGCTGCGAGGAAATGGTATTGGCAGCT
CGCACCCCACTCTATTTCCTCTTAGAAACAGCTCCACAAGAACTTTGTGGCCCAGTTTGCAGCCCAGCAAGAAACTCAACACCCGGCTCAGTTCTTGCTCACGAT
AAGGCAGAAGGAGTTCGGCATGAACAATTGAAATGGTCTCTTGCCAAGAAGCTCGAGCTCACTATAAAGGCCAGTATAGAACGGGCCAAGAAGTTCATGCAGGCG
AGCGAACTCATCCAGTCAAGGGAGGATCCTTCCAAAGTCGACCCCGCGAAGGAAAAAAGAGCTCGGGACAGCTCTAAAAGGTCACCGTGTCGCCACCAACGATAT
GATCACGAGCCTCGACATTCCAAGCTCGACCTACTCACCAAGTCTAGTCCCATGCGCCGAGACGCTGAGAAGCACAATAGGTTGAAGTTCTGCCGCTTCTACAAG
GATCACGGTCACGATACCTCCGACTGCTACGAGCTGAAGAGGCAGATCGAGGGCCTCATCCAGAAAGGGTACTTCAAGAAGCAAGTGGGACAAGCTCACAGCAGA
GGAAGAAAAAAGGCAGGTAGTTCAAAAGAAGGAAGGGCAAAGAGGGAGAGGACACGATCTCCTCTTAAACGCACAGATCGACCTACCGTGATCAACATGATCTTT
GGAGGTCCGAGCGGGGGGCAGTTAGGAAGGAAGCACAAGGCACTGGTGCGAGAAGCCCACCACGAGATTTGTGCAAGCTATATACAGCTCACTCCGTACCAGATT
TCGTTATCAATCGAGGACATGAACGGCCTGTACTCCCCTCACAACAACGCCTTGGTTATCGAAGCAAAGATTGACCACATAATGGGTACCACGCACGGCGAACAC
CAATGCGAACGCCTTCGCAAAACTGGCGTCCTCTTATCCGACCGAGCTGTCCAAGTCAATCCCGGTCGAAATTTTAGAAGCTCCCTCTGTTCAGGGGCCCGAAGC
AATGAATATCGACACCGCGACGCCGACCTGGATGATCCACCTAAAAGCCTTCCTCTGTGGATGAGAGCTACCAGAACAAGTCAGCCTCTGCAAGATGCGGCGCAA
AGCCATGGGGTACCTACTCCGAGAGTCGCGCGACGATGTGATCGGTGCCGGCGATTTGCTACGGCAATCCATCACCCGCTCCAGCTCCTCACACCGATCTCAGCC
CCTTGGCCATTCGCCCAGTGGGGAATCGATCTTATTTGGCCCCTTCCCACGGGGAGAGGCCAAACGAAATTCGCAGTTGTGGCCTTAGATTACTTCACCAAGTGG
GCGGAGGCTGAGCCGTTGGCCACCATAACCGAGGCTAAAATCACGGGCTTCGTGTGGACCAATCTCGTCTATAGGTTTGGCATACCCCACGCCATCATAACCGAA
AATGGAAGGCAGTTCGACAACCCCAAATTCAACAAGTTTTGTGAACAGCTTGGGATAAAGCATTTCAGCCCCTCGCCAGCTCACCCGCAAGCCAATGGCCAGGTG
GAGGCTATTAACAAGATTATAAAGCGCGGCCTAAAATTAAGACTAGAGGATCGCAAGGGCCGGTGGGTAGAGGAGCTACCCGATGTATTGTGGTCATACAGGATG
TAA
Protein sequenceShow/hide protein sequence
MGECQGLGYKWSRADTSLIGYRGLGYKWSGVDTPILDKDERRGLGYKWSGVNVRSLSKGELLGPWARVAVISVEVLEMLGRLEVGTDKGCLLSTVVVLIPLFPLQ
FAGVLLPEISELEVLHDPQRGITKAIRTDQERSKLGRNLSKLVLRGNNRPRGPEDMRTRLSSLNERDLRNKLDHQRSKRDTPSTRSESSVLRNSARGCEEMVLAA
RTPLYFLLETAPQELCGPVCSPARNSTPGSVLAHDKAEGVRHEQLKWSLAKKLELTIKASIERAKKFMQASELIQSREDPSKVDPAKEKRARDSSKRSPCRHQRY
DHEPRHSKLDLLTKSSPMRRDAEKHNRLKFCRFYKDHGHDTSDCYELKRQIEGLIQKGYFKKQVGQAHSRGRKKAGSSKEGRAKRERTRSPLKRTDRPTVINMIF
GGPSGGQLGRKHKALVREAHHEICASYIQLTPYQISLSIEDMNGLYSPHNNALVIEAKIDHIMGTTHGEHQCERLRKTGVLLSDRAVQVNPGRNFRSSLCSGARS
NEYRHRDADLDDPPKSLPLWMRATRTSQPLQDAAQSHGVPTPRVARRCDRCRRFATAIHHPLQLLTPISAPWPFAQWGIDLIWPLPTGRGQTKFAVVALDYFTKW
AEAEPLATITEAKITGFVWTNLVYRFGIPHAIITENGRQFDNPKFNKFCEQLGIKHFSPSPAHPQANGQVEAINKIIKRGLKLRLEDRKGRWVEELPDVLWSYRM