CuGenDBv2

Moc06g27660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g27660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr6:20824130..20830048
RNA-Seq Expression: Moc06g27660
Synteny: Moc06g27660
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR021109 - Aspartic peptidase domain superfamily
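
The genome location field can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, assuming the `chrom:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the page does not state this). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location format: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr6:20824130..20830048")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.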


Homology
GenBank top hits
XP_022154299.1 uncharacterized protein LOC111021593 [Momordica charantia] | e value: 2.2e-68 | % identity: 62.2
Query:  MSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILVLYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQAN
        M+ SN QRLGQKAPS V TQ GNQKARVF LTR+E  N EAVV G                 +LV         N   YVLFDSGSS TFISTAFVRQ N
Subjt:  MSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILVLYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQAN

Query:  LELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQANSNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLK
        LEL PL FLL +ST  GSVMI+SQ+VK G LSF  Q LGA LIQLD+RDFDVILGMDWLATNQA+ NC+K EV FQLP G  F FKGVTGGVPR VS L+
Subjt:  LELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQANSNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLK

Query:  AIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD
        A  LL+ GAWG+LASVV+T   TPSIDSVHVV E    V P  +   P + E D
Subjt:  AIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] | e value: 6.7e-73 | % identity: 53.29
Query:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV
        PVCPS ++ HA  CW+  K+CF+C++EG F REC M+ SNTQ L QK P+  +TQGG Q ARVF LTR +  + EAVV G   +L               
Subjt:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV

Query:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN
                 ++  Y LFDSGSSH+FI++ FVR A+LELE   F LS+ST  GSV++ SQ+VK G+LSFG QTL  +LIQL+M+DFDVILGMDWLA N+AN
Subjt:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN

Query:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD
         NC+K EV F L SGQ FTFKGV  GVPR VS LKA  LL+ G W YLASVV+  K  PSI+ V VV E  T V P  +   P   E D
Subjt:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia] | e value: 6.3e-71 | % identity: 53.29
Query:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV
        PVCPS ++ H   CW+   +C+RC++EG FARECPM+  NTQ LGQ+ P T + QGG  +ARVF LTR + A+ EAVV+G                 +LV
Subjt:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV

Query:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN
        L        ++  Y LFDS SSH+FI++ FVR A+LELE L FLLS+ST  GSV++ SQ+VK G+LSF  QTL   LIQLDM+DFDVILGMDWLA NQAN
Subjt:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN

Query:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD
         +C+K E  F+LPS Q FTFKGV   VPR VS LKA   L+ GAW YLASVV+  K  PSI++V VV E  T V P  +   P   E D
Subjt:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia] | e value: 6.3e-71 | % identity: 55.51
Query:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV
        PVCPS ++ HA  CW+  ++C+RC++EG FARECPM+ SNTQ LGQ+ P+T + QGG  +ARVF LTR +    EAVV       W           +LV
Subjt:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV

Query:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN
        L        ++  Y LFDSGSSH+FI++ FV  A+LELE L FLLS+ST  GSV++ SQ+VK G+LSF  QTL   LIQLDM+DFDVILGMDWLA N+AN
Subjt:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN

Query:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDS
         +C+K +V F+LPSGQ FTFKGV  GVPR V  LKA  LL+ GAW YLASVV+  K  PSI++
Subjt:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDS

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia] | e value: 3.1e-78 | % identity: 59.86
Query:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV
        PVCPS Q+R A QCW   + CFRC REG FAREC M+A+NTQRLGQ+A  TVSTQGG                                           
Subjt:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV

Query:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN
              L +NV  YVLFD GSSHTFISTAFVRQA LELEPL FLLS+ST  GSV+IASQ+V+AGELSF NQTL A LIQLDMRDFDVILGMDWLATNQAN
Subjt:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN

Query:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD
         NC+K EV FQLPSG+ FTFKGV+GGVPR VS LKA RLL  GAW YLASVV+ S T PSIDS HVVK   + V P  +   P + E D
Subjt:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD

TrEMBL top hits
A0A6J1DLN2 uncharacterized protein LOC111021593 | e value: 1.1e-68 | % identity: 62.2
Query:  MSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILVLYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQAN
        M+ SN QRLGQKAPS V TQ GNQKARVF LTR+E  N EAVV G                 +LV         N   YVLFDSGSS TFISTAFVRQ N
Subjt:  MSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILVLYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQAN

Query:  LELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQANSNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLK
        LEL PL FLL +ST  GSVMI+SQ+VK G LSF  Q LGA LIQLD+RDFDVILGMDWLATNQA+ NC+K EV FQLP G  F FKGVTGGVPR VS L+
Subjt:  LELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQANSNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLK

Query:  AIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD
        A  LL+ GAWG+LASVV+T   TPSIDSVHVV E    V P  +   P + E D
Subjt:  AIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD

A0A6J1DQB9 Reverse transcriptase | e value: 3.2e-73 | % identity: 53.29
Query:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV
        PVCPS ++ HA  CW+  K+CF+C++EG F REC M+ SNTQ L QK P+  +TQGG Q ARVF LTR +  + EAVV G   +L               
Subjt:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV

Query:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN
                 ++  Y LFDSGSSH+FI++ FVR A+LELE   F LS+ST  GSV++ SQ+VK G+LSFG QTL  +LIQL+M+DFDVILGMDWLA N+AN
Subjt:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN

Query:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD
         NC+K EV F L SGQ FTFKGV  GVPR VS LKA  LL+ G W YLASVV+  K  PSI+ V VV E  T V P  +   P   E D
Subjt:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD

A0A6J1DTE5 uncharacterized protein LOC111023821 | e value: 3.0e-71 | % identity: 53.29
Query:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV
        PVCPS ++ H   CW+   +C+RC++EG FARECPM+  NTQ LGQ+ P T + QGG  +ARVF LTR + A+ EAVV+G                 +LV
Subjt:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV

Query:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN
        L        ++  Y LFDS SSH+FI++ FVR A+LELE L FLLS+ST  GSV++ SQ+VK G+LSF  QTL   LIQLDM+DFDVILGMDWLA NQAN
Subjt:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN

Query:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD
         +C+K E  F+LPS Q FTFKGV   VPR VS LKA   L+ GAW YLASVV+  K  PSI++V VV E  T V P  +   P   E D
Subjt:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD

A0A6J1DWP4 uncharacterized protein LOC111025215 | e value: 3.0e-71 | % identity: 55.51
Query:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV
        PVCPS ++ HA  CW+  ++C+RC++EG FARECPM+ SNTQ LGQ+ P+T + QGG  +ARVF LTR +    EAVV       W           +LV
Subjt:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV

Query:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN
        L        ++  Y LFDSGSSH+FI++ FV  A+LELE L FLLS+ST  GSV++ SQ+VK G+LSF  QTL   LIQLDM+DFDVILGMDWLA N+AN
Subjt:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN

Query:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDS
         +C+K +V F+LPSGQ FTFKGV  GVPR V  LKA  LL+ GAW YLASVV+  K  PSI++
Subjt:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDS

A0A6J1DYU5 uncharacterized protein LOC111025517 | e value: 1.5e-78 | % identity: 59.86
Query:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV
        PVCPS Q+R A QCW   + CFRC REG FAREC M+A+NTQRLGQ+A  TVSTQGG                                           
Subjt:  PVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILV

Query:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN
              L +NV  YVLFD GSSHTFISTAFVRQA LELEPL FLLS+ST  GSV+IASQ+V+AGELSF NQTL A LIQLDMRDFDVILGMDWLATNQAN
Subjt:  LYFRYGLSYNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQAN

Query:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD
         NC+K EV FQLPSG+ FTFKGV+GGVPR VS LKA RLL  GAW YLASVV+ S T PSIDS HVVK   + V P  +   P + E D
Subjt:  SNCTKMEVFFQLPSGQGFTFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFD

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCTACCAGTGTGCCCTTCCTATCAGAGAAGACATGCGGTGCAATGTTGGATGAGAATTAAGGTCTGTTTCAGGTGTGAGAGAGAAGGGCGTTTTGCGAGGGAGTGTCC
CATGTCGGCCTCGAATACACAGAGGCTAGGTCAGAAGGCCCCCTCAACAGTCTCTACGCAGGGAGGTAACCAGAAGGCTCGTGTTTTCACACTTACCCGTAAGGAAGCGG
CGAATGTCGAAGCCGTTGTCATAGGTATTTCTTTCTTACTATGGGTTCTTGGGTATTGCATGTGGGGTTATATCTTTATCCTTGTATTATATTTCAGGTACGGTCTTAGC
TATAATGTGTCTGTTTACGTATTGTTTGATTCGGGGTCAAGTCACACTTTTATTTCCACCGCATTTGTTCGTCAAGCAAACCTCGAACTAGAGCCGTTACGTTTTTTGTT
GTCGATATCTACGTCACCGGGGTCAGTGATGATTGCTAGTCAAATAGTGAAAGCAGGCGAGTTATCCTTCGGCAATCAGACCTTGGGGGCAAGTTTGATCCAACTGGACA
TGCGGGATTTTGACGTTATTTTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACAGTAATTGCACGAAAATGGAAGTCTTCTTCCAACTACCTTCTGGTCAGGGCTTC
ACGTTTAAAGGAGTTACAGGTGGAGTTCCAAGGACAGTCTCGGTGCTAAAGGCAATACGCCTTTTACGGTGTGGTGCTTGGGGTTATTTAGCAAGTGTCGTCAACACTAG
TAAGACTACACCCAGTATCGACTCCGTTCACGTGGTCAAGGAGAGAGCGACGAGCGTAGCACCCCCGACAGTGCATCATAGACCGGTATTAGATGAGTTTGACCGTTCTG
AGGTAGAGTTAGCGGTGGAAGATGTGTCAGTACTGTTAGCTCGACTCTCAGTTGAACCCACCTCAAGACAGCGGATCATCGCTGCACAAAAGGGGGATCCCAACTTAATC
AGAGTTTTCGATGAAACCTTGTGCTATAGAGAGGTACCCATTGAGATCCTAGCAAAAGAGACCAAGGTGCTGAGGAACCAGGCAATTGACTTGGTGAAGGTCCGGTGGAG
GAATCACCAAGTGGAGGAAGCTACCTGGGAAAGAGAGGACGAGATCAGAGCCCGCTACCCTGTATTGTTCAATCAACGAACTTTCGAGGACGAAAGTTTTTTAAGAGGGG
AAGTCTGTAACGCCCCACATTACTCAGTAACCCTAGCCTCTCCCCCACTTTCACGTGTTGCCCCTCTCCCCCCGAAGGAAAACAAAAATTACCTCACCTCTTTGTGCTGC
CTCACGACCACCGCCAGTGGCCTCTCCAGCGAACCCAGCCACCGACAACCTCACCCAGCGACGGCACGGCAACCCCAACCCACGACAGCAGGTGCGACAGTTCTGTTCGT
GACAACAGCATGCAGTAGGGCGACGGTTCAGTTCGTGACAGCAGCGTACGACGGCTTCCGTTCGGGACAACATCTTGTGCGACGGCTCTCGTTCATGACAGCAGCGTGCA
GCAGGTGCGGCGGCTCCGTTTGTGACAGCGGCGTGCAGCAGAGCTTTGTTAGTTGTGGTTGGCTAGAATGTTGTGAAGCATCTGTTGGATCTTATGTTTTTGAGTATGAC
TGTTGTGTCGATTTTCTTGGCAAAGCTAATGGTCCTTCCGATCAAACCTCCTCCTCGCCATCACCATCCCAAGAACTAGCTCGATACCTGCAAACTGGAGGGGAAAAGAG
GGATGAGTACAACCACAGTACTCACTACTCTAGCCTTCCACAGTTAACTCCTTCAAGCAGAGTCGGCCCCTGA
mRNA sequence
ATGCTACCAGTGTGCCCTTCCTATCAGAGAAGACATGCGGTGCAATGTTGGATGAGAATTAAGGTCTGTTTCAGGTGTGAGAGAGAAGGGCGTTTTGCGAGGGAGTGTCC
CATGTCGGCCTCGAATACACAGAGGCTAGGTCAGAAGGCCCCCTCAACAGTCTCTACGCAGGGAGGTAACCAGAAGGCTCGTGTTTTCACACTTACCCGTAAGGAAGCGG
CGAATGTCGAAGCCGTTGTCATAGGTATTTCTTTCTTACTATGGGTTCTTGGGTATTGCATGTGGGGTTATATCTTTATCCTTGTATTATATTTCAGGTACGGTCTTAGC
TATAATGTGTCTGTTTACGTATTGTTTGATTCGGGGTCAAGTCACACTTTTATTTCCACCGCATTTGTTCGTCAAGCAAACCTCGAACTAGAGCCGTTACGTTTTTTGTT
GTCGATATCTACGTCACCGGGGTCAGTGATGATTGCTAGTCAAATAGTGAAAGCAGGCGAGTTATCCTTCGGCAATCAGACCTTGGGGGCAAGTTTGATCCAACTGGACA
TGCGGGATTTTGACGTTATTTTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACAGTAATTGCACGAAAATGGAAGTCTTCTTCCAACTACCTTCTGGTCAGGGCTTC
ACGTTTAAAGGAGTTACAGGTGGAGTTCCAAGGACAGTCTCGGTGCTAAAGGCAATACGCCTTTTACGGTGTGGTGCTTGGGGTTATTTAGCAAGTGTCGTCAACACTAG
TAAGACTACACCCAGTATCGACTCCGTTCACGTGGTCAAGGAGAGAGCGACGAGCGTAGCACCCCCGACAGTGCATCATAGACCGGTATTAGATGAGTTTGACCGTTCTG
AGGTAGAGTTAGCGGTGGAAGATGTGTCAGTACTGTTAGCTCGACTCTCAGTTGAACCCACCTCAAGACAGCGGATCATCGCTGCACAAAAGGGGGATCCCAACTTAATC
AGAGTTTTCGATGAAACCTTGTGCTATAGAGAGGTACCCATTGAGATCCTAGCAAAAGAGACCAAGGTGCTGAGGAACCAGGCAATTGACTTGGTGAAGGTCCGGTGGAG
GAATCACCAAGTGGAGGAAGCTACCTGGGAAAGAGAGGACGAGATCAGAGCCCGCTACCCTGTATTGTTCAATCAACGAACTTTCGAGGACGAAAGTTTTTTAAGAGGGG
AAGTCTGTAACGCCCCACATTACTCAGTAACCCTAGCCTCTCCCCCACTTTCACGTGTTGCCCCTCTCCCCCCGAAGGAAAACAAAAATTACCTCACCTCTTTGTGCTGC
CTCACGACCACCGCCAGTGGCCTCTCCAGCGAACCCAGCCACCGACAACCTCACCCAGCGACGGCACGGCAACCCCAACCCACGACAGCAGGTGCGACAGTTCTGTTCGT
GACAACAGCATGCAGTAGGGCGACGGTTCAGTTCGTGACAGCAGCGTACGACGGCTTCCGTTCGGGACAACATCTTGTGCGACGGCTCTCGTTCATGACAGCAGCGTGCA
GCAGGTGCGGCGGCTCCGTTTGTGACAGCGGCGTGCAGCAGAGCTTTGTTAGTTGTGGTTGGCTAGAATGTTGTGAAGCATCTGTTGGATCTTATGTTTTTGAGTATGAC
TGTTGTGTCGATTTTCTTGGCAAAGCTAATGGTCCTTCCGATCAAACCTCCTCCTCGCCATCACCATCCCAAGAACTAGCTCGATACCTGCAAACTGGAGGGGAAAAGAG
GGATGAGTACAACCACAGTACTCACTACTCTAGCCTTCCACAGTTAACTCCTTCAAGCAGAGTCGGCCCCTGA
Protein sequence
MLPVCPSYQRRHAVQCWMRIKVCFRCEREGRFARECPMSASNTQRLGQKAPSTVSTQGGNQKARVFTLTRKEAANVEAVVIGISFLLWVLGYCMWGYIFILVLYFRYGLS
YNVSVYVLFDSGSSHTFISTAFVRQANLELEPLRFLLSISTSPGSVMIASQIVKAGELSFGNQTLGASLIQLDMRDFDVILGMDWLATNQANSNCTKMEVFFQLPSGQGF
TFKGVTGGVPRTVSVLKAIRLLRCGAWGYLASVVNTSKTTPSIDSVHVVKERATSVAPPTVHHRPVLDEFDRSEVELAVEDVSVLLARLSVEPTSRQRIIAAQKGDPNLI
RVFDETLCYREVPIEILAKETKVLRNQAIDLVKVRWRNHQVEEATWEREDEIRARYPVLFNQRTFEDESFLRGEVCNAPHYSVTLASPPLSRVAPLPPKENKNYLTSLCC
LTTTASGLSSEPSHRQPHPATARQPQPTTAGATVLFVTTACSRATVQFVTAAYDGFRSGQHLVRRLSFMTAACSRCGGSVCDSGVQQSFVSCGWLECCEASVGSYVFEYD
CCVDFLGKANGPSDQTSSSPSPSQELARYLQTGGEKRDEYNHSTHYSSLPQLTPSSRVGP
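
The CDS and protein records above can be cross-checked by translating the CDS and comparing the result with the listed protein. The following is a minimal sketch, assuming the standard nuclear genetic code; the function name `translate` and the table-building code are illustrative and not part of any CuGenDB tool. Only the first 27 bases of the CDS are used in the demonstration.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS to protein; '*' marks a stop codon.

    Whitespace and line breaks are removed first, so a multi-line
    sequence block can be pasted in directly. Codons containing
    ambiguous bases such as N raise KeyError in this sketch.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 27 bases of the CDS sequence listed above.
print(translate("ATGCTACCAGTGTGCCCTTCCTATCAG"))
```

Applied to the full CDS, the translation should end in `*` for the terminal TGA codon; stripping that character gives a string that can be compared against the protein sequence.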