CuGenDBv2

Lag0035661 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035661
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr3:26538702..26543713
RNA-Seq Expression: Lag0035661
Synteny: Lag0035661
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
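The genome location above has the form `seqid:start..end`. The following is a minimal sketch of parsing that string and computing the span it covers. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the names `Location` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(seqid, start, end)


loc = parse_location("chr3:26538702..26543713")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the reported interval spans 5012 bases; under a half-open convention it would be one base shorter.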


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia] (e value: 1.7e-46; % identity: 49.02)
Query:  SQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKET
        +QFIRDF++YGPP+F+G S+    V+ WI  LEA++ Y+ C D LK +G VFMLRGEA NWW  VA VEDH NEP++W   KDLLYDYYF +T++D+KE 
Subjt:  SQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKET

Query:  EFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPP-TQA
        EFLHLTQ T+ V Q                             L + I G I L  PTT+A A++G LVMDK++ +K Q + +VG SSGVKRK PP + +
Subjt:  EFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPP-TQA

Query:  RPSQ
        +PS+
Subjt:  RPSQ

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia] (e value: 8.9e-48; % identity: 50.5)
Query:  SQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKET
        + FI+DFK+YGPP+FDG S+   A + WI  LEA + Y+ C+D  K +G VFMLRGEA NWW S+A  EDHAN  + W RFKDLLYDYY+LETV+D KE 
Subjt:  SQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKET

Query:  EFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPPTQAR
        EFLHL QGT+SV Q                             L + I G + L  P ++A A+RG L+MDK+++ K  S  EVGSSSGVKRK  PT A 
Subjt:  EFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPPTQAR

Query:  PS
        PS
Subjt:  PS

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia] (e value: 1.3e-43; % identity: 50)
Query:  PSRQQKTVEDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYD
        PSR Q + E+    QFIRDFK++GPP F+G S+ P A + W+  LEA++ Y+ C D  K RG VFML+GEA NWW+SVA  EDHAN PV+W RFKDLLY+
Subjt:  PSRQQKTVEDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYD

Query:  YYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSS
        YYF  TVR++K  EFL LTQ ++ V Q                             LR EI G + L  PTT+AAA+R  LVMDK L ++ QS+  +GSS
Subjt:  YYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSS

Query:  SGVKRK
        SGVKRK
Subjt:  SGVKRK

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] (e value: 8.9e-48; % identity: 48.44)
Query:  PFPPDQHEVDPPPPSRQQKTVEDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANE
        P P     V  PPP        +   ++FI+DFK+YGPP+FDG S+   AV+ WI  LEA++ Y+ C+D  K +G VFMLRGEA NWW SVA  ED+AN 
Subjt:  PFPPDQHEVDPPPPSRQQKTVEDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANE

Query:  PVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNL
        P+ W RFK+LLYDYY+ ETV+D KE EFLHL QGT+SV Q                             LR+ I G + L  PTT+A A+RG LVMDK++
Subjt:  PVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNL

Query:  AKKVQSRWEVGSSSGVKRKPPPTQA
        + K     EVGSSSGVKRK P T A
Subjt:  AKKVQSRWEVGSSSGVKRKPPPTQA

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia] (e value: 2.0e-44; % identity: 43.6)
Query:  NPPFPP--DQHEVDPP-PPSRQQKTV--------------------EDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARG
        +PP PP  DQ  V PP PP+  Q+                           +QFI+DFK+YGPP+F G S+     + W+  LEA++ Y+ C+D  K +G
Subjt:  NPPFPP--DQHEVDPP-PPSRQQKTV--------------------EDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARG

Query:  VVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEIC
         VFMLR EA NWW SVA  EDHAN PV W RFK+LLYD+Y+ ETV D KE EFLHL QGT++V Q                             L + I 
Subjt:  VVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEIC

Query:  GTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPPTQA
        G++ L  P T+A A+RG L+MDK+++ +VQ   EVGSS GVKRK PPT A
Subjt:  GTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPPTQA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DCW8 uncharacterized protein LOC111019603 (e value: 8.1e-47; % identity: 49.02)
Query:  SQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKET
        +QFIRDF++YGPP+F+G S+    V+ WI  LEA++ Y+ C D LK +G VFMLRGEA NWW  VA VEDH NEP++W   KDLLYDYYF +T++D+KE 
Subjt:  SQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKET

Query:  EFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPP-TQA
        EFLHLTQ T+ V Q                             L + I G I L  PTT+A A++G LVMDK++ +K Q + +VG SSGVKRK PP + +
Subjt:  EFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPP-TQA

Query:  RPSQ
        +PS+
Subjt:  RPSQ

A0A6J1DL73 uncharacterized protein LOC111022144 (e value: 4.3e-48; % identity: 50.5)
Query:  SQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKET
        + FI+DFK+YGPP+FDG S+   A + WI  LEA + Y+ C+D  K +G VFMLRGEA NWW S+A  EDHAN  + W RFKDLLYDYY+LETV+D KE 
Subjt:  SQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKET

Query:  EFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPPTQAR
        EFLHL QGT+SV Q                             L + I G + L  P ++A A+RG L+MDK+++ K  S  EVGSSSGVKRK  PT A 
Subjt:  EFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPPTQAR

Query:  PS
        PS
Subjt:  PS

A0A6J1DNV8 uncharacterized protein LOC111022925 (e value: 6.4e-44; % identity: 50)
Query:  PSRQQKTVEDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYD
        PSR Q + E+    QFIRDFK++GPP F+G S+ P A + W+  LEA++ Y+ C D  K RG VFML+GEA NWW+SVA  EDHAN PV+W RFKDLLY+
Subjt:  PSRQQKTVEDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYD

Query:  YYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSS
        YYF  TVR++K  EFL LTQ ++ V Q                             LR EI G + L  PTT+AAA+R  LVMDK L ++ QS+  +GSS
Subjt:  YYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSS

Query:  SGVKRK
        SGVKRK
Subjt:  SGVKRK

A0A6J1DUM2 uncharacterized protein LOC111023247 (e value: 4.3e-48; % identity: 48.44)
Query:  PFPPDQHEVDPPPPSRQQKTVEDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANE
        P P     V  PPP        +   ++FI+DFK+YGPP+FDG S+   AV+ WI  LEA++ Y+ C+D  K +G VFMLRGEA NWW SVA  ED+AN 
Subjt:  PFPPDQHEVDPPPPSRQQKTVEDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANE

Query:  PVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNL
        P+ W RFK+LLYDYY+ ETV+D KE EFLHL QGT+SV Q                             LR+ I G + L  PTT+A A+RG LVMDK++
Subjt:  PVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEICGTIFLNAPTTFAAALRGVLVMDKNL

Query:  AKKVQSRWEVGSSSGVKRKPPPTQA
        + K     EVGSSSGVKRK P T A
Subjt:  AKKVQSRWEVGSSSGVKRKPPPTQA

A0A6J1DVA0 uncharacterized protein LOC111023424 (e value: 9.9e-45; % identity: 43.6)
Query:  NPPFPP--DQHEVDPP-PPSRQQKTV--------------------EDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARG
        +PP PP  DQ  V PP PP+  Q+                           +QFI+DFK+YGPP+F G S+     + W+  LEA++ Y+ C+D  K +G
Subjt:  NPPFPP--DQHEVDPP-PPSRQQKTV--------------------EDPIVSQFIRDFKQYGPPSFDGRSDNPLAVKRWIDNLEAMFDYMNCDDCLKARG

Query:  VVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEIC
         VFMLR EA NWW SVA  EDHAN PV W RFK+LLYD+Y+ ETV D KE EFLHL QGT++V Q                             L + I 
Subjt:  VVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQ----------------------------RLREEIC

Query:  GTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPPTQA
        G++ L  P T+A A+RG L+MDK+++ +VQ   EVGSS GVKRK PPT A
Subjt:  GTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPPTQA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences

CDS sequence
ATGGGGGAAGTAAGTTGTATTCTTGGAAAGCCTTTTCCTAGGGTTTTTCTATTAAGAAAAAGAACGAAAATAAAGAAAAGAAAAGAATTAAGGAAAAGGAAAAAGGCCAA
AACCTCTATCTTCCCTTCATTCTTTCATGGAAGCCACCAACCCCCCTCTCGATTTCATCTTGCCGAAAGCCCACCAGCCACTGTCGCTGCCCACCAGCTGTCGTCTATAG
GAGCTTCCGTTGCCGTCGTGTTGTCGTTCCCACCAGTTGTTCGTCACAACTTGAGCACGCTGCAACGCATCATGCGATTTCCCTCGCCGCCTGCTCGAGCCACTGCCGTT
GCACGCTGCTCTCTCTCCCGCTCCGTCCGTGCATTTTTCTCCCTCTCAGTGGGTCTGTCCGTGGAGATTTTTCTCTCCCCTTGTGTTCGCGACGCCACCATTTCTCTCCT
CCATCCCGGTGTCGCAATCTCCTTGAGGAATCGGCGTAAGTTTCCCTTCTTTGATTTCGCTTGGTTAAGATTCAAGTATTTGGGTCTAAGTTCTGATCTGCAGTACCGTC
TAGGAAGCATTTGGCCTCGAATTTGCAGCATTCCTTCGGTGTCTTTTGGAAACTCTATTTTAGTTTCGAGTCAATTTGGATCTCGAACAACGTTAATTAAGGATTTTGGT
GCTGGTCGACAAATTCCAACAAGCAGGTTGGCTCGTTTATGCAATTCGGAGGATCAGGTCGACGATGTCCATGCTCAGCAGGAGATTAACCCGCCGATTCCTCCTGATCA
GCACGAAGTTGACCCTCCGCCCCCTTCTAGGCAGCAGAAGTCCTGCCAGCCAAATTGGACAGTCGAGGACCCGATAGTGTCCCAATTCATTCTCGACTTCAAGCAGTATG
GTCCTCCCTCTTTCGACGGGCGTTCAGATAACCCGTTGGCAGTCAAGTGTTGGATTGACAACCTCGAAGCTATGTTTGACTACATGAACTGTGATGACTGCCTGAAAGCT
CGAGGCGTAGTTTTCATGTTGAGGGGCGAAGCTCGTAATTGGTGGAAGTCAGTAGCAGTCGTCGTGGATCATGCTAATGAACCAGTTTCGTGGGAGAGGTTCAAAGATCT
TCTTTACGATTATTACTTCCTCGAGACTGTCAGAGATGACAAAGAGACAGAGTTCTTGCATTTAACTCAGGGAACCATGTCCGTTCATCAAGGCCTCCGGGAAGAAATTT
GTGGTACAATTTTCCTGAATGCACCTACGACCTTTGCTGCAGCCCTCCGTGGAGTGTTGGTCATGGATAAAAATTTGGCCAAGAAGGTGCAATCTCGTTGGGAGGTCGGT
TCGTCTTCTGGGGTTAAAAGAAAGCCCCCACCAACTCAAGCGAGACCACCACAGAAGGGCTTGGGCTTATCTAGCGAGTGTGGTAGATACCAGCAAGGAAGCCTGATGCG
CAATTTGCTTACCGCAAGTGAGGATCAGGTCGACGATGTCCATGCTCAGCAGGAGATTAACCCGCCGTTTCCTCCTGATCAGCACGAAGTTGACCCTCCGCCCCCTTCTA
GGCAGCAGAAGACAGTCGAGGACCCGATAGTGTCCCAATTCATTCGCGACTTCAAGCAGTATGGTCCTCCCTCTTTCGACGGGCGTTCAGATAACCCGTTGGCAGTCAAG
CGTTGGATTGACAACCTCGAAGCTATGTTTGACTACATGAACTGTGATGACTGCCTGAAAGCTCGAGGCGTAGTTTTCATGTTGAGGGGCGAAGCTCGTAATTGGTGGAA
GTCAGTAGCAGTCGTCGAGGATCATGCTAATGAACCAGTTTCGTGGGAGAGGTTCAAAGATCTTCTTTACGATTATTACTTCCTCGAGACTGTCAGAGATGACAAAGAGA
CAGAGTTCTTGCATTTAACTCAGGGAACCATGTCCGTTCATCAACGCCTCCGAGAAGAAATTTGTGGTACAATTTTCCTGAATGCACCTACGACCTTTGCTGCAGCCCTC
CGTGGAGTGTTGGTCATGGATAAAAATTTGGCCAAGAAGGTGCAATCTCGTTGGGAGGTCGGTTCGTCTTCTGGGGTTAAAAGAAAGCCCCCACCAACTCAAGCGAGACC
ATCACAGAAGGGCTTGGGCTTATCTAGCGAGTGTGGTAGATACCAGCAAGGAAGCCTGATGCGCAATTTGCTTACCGCAAGTGTACGGGTGAAGTAA

mRNA sequence
ATGGGGGAAGTAAGTTGTATTCTTGGAAAGCCTTTTCCTAGGGTTTTTCTATTAAGAAAAAGAACGAAAATAAAGAAAAGAAAAGAATTAAGGAAAAGGAAAAAGGCCAA
AACCTCTATCTTCCCTTCATTCTTTCATGGAAGCCACCAACCCCCCTCTCGATTTCATCTTGCCGAAAGCCCACCAGCCACTGTCGCTGCCCACCAGCTGTCGTCTATAG
GAGCTTCCGTTGCCGTCGTGTTGTCGTTCCCACCAGTTGTTCGTCACAACTTGAGCACGCTGCAACGCATCATGCGATTTCCCTCGCCGCCTGCTCGAGCCACTGCCGTT
GCACGCTGCTCTCTCTCCCGCTCCGTCCGTGCATTTTTCTCCCTCTCAGTGGGTCTGTCCGTGGAGATTTTTCTCTCCCCTTGTGTTCGCGACGCCACCATTTCTCTCCT
CCATCCCGGTGTCGCAATCTCCTTGAGGAATCGGCGTAAGTTTCCCTTCTTTGATTTCGCTTGGTTAAGATTCAAGTATTTGGGTCTAAGTTCTGATCTGCAGTACCGTC
TAGGAAGCATTTGGCCTCGAATTTGCAGCATTCCTTCGGTGTCTTTTGGAAACTCTATTTTAGTTTCGAGTCAATTTGGATCTCGAACAACGTTAATTAAGGATTTTGGT
GCTGGTCGACAAATTCCAACAAGCAGGTTGGCTCGTTTATGCAATTCGGAGGATCAGGTCGACGATGTCCATGCTCAGCAGGAGATTAACCCGCCGATTCCTCCTGATCA
GCACGAAGTTGACCCTCCGCCCCCTTCTAGGCAGCAGAAGTCCTGCCAGCCAAATTGGACAGTCGAGGACCCGATAGTGTCCCAATTCATTCTCGACTTCAAGCAGTATG
GTCCTCCCTCTTTCGACGGGCGTTCAGATAACCCGTTGGCAGTCAAGTGTTGGATTGACAACCTCGAAGCTATGTTTGACTACATGAACTGTGATGACTGCCTGAAAGCT
CGAGGCGTAGTTTTCATGTTGAGGGGCGAAGCTCGTAATTGGTGGAAGTCAGTAGCAGTCGTCGTGGATCATGCTAATGAACCAGTTTCGTGGGAGAGGTTCAAAGATCT
TCTTTACGATTATTACTTCCTCGAGACTGTCAGAGATGACAAAGAGACAGAGTTCTTGCATTTAACTCAGGGAACCATGTCCGTTCATCAAGGCCTCCGGGAAGAAATTT
GTGGTACAATTTTCCTGAATGCACCTACGACCTTTGCTGCAGCCCTCCGTGGAGTGTTGGTCATGGATAAAAATTTGGCCAAGAAGGTGCAATCTCGTTGGGAGGTCGGT
TCGTCTTCTGGGGTTAAAAGAAAGCCCCCACCAACTCAAGCGAGACCACCACAGAAGGGCTTGGGCTTATCTAGCGAGTGTGGTAGATACCAGCAAGGAAGCCTGATGCG
CAATTTGCTTACCGCAAGTGAGGATCAGGTCGACGATGTCCATGCTCAGCAGGAGATTAACCCGCCGTTTCCTCCTGATCAGCACGAAGTTGACCCTCCGCCCCCTTCTA
GGCAGCAGAAGACAGTCGAGGACCCGATAGTGTCCCAATTCATTCGCGACTTCAAGCAGTATGGTCCTCCCTCTTTCGACGGGCGTTCAGATAACCCGTTGGCAGTCAAG
CGTTGGATTGACAACCTCGAAGCTATGTTTGACTACATGAACTGTGATGACTGCCTGAAAGCTCGAGGCGTAGTTTTCATGTTGAGGGGCGAAGCTCGTAATTGGTGGAA
GTCAGTAGCAGTCGTCGAGGATCATGCTAATGAACCAGTTTCGTGGGAGAGGTTCAAAGATCTTCTTTACGATTATTACTTCCTCGAGACTGTCAGAGATGACAAAGAGA
CAGAGTTCTTGCATTTAACTCAGGGAACCATGTCCGTTCATCAACGCCTCCGAGAAGAAATTTGTGGTACAATTTTCCTGAATGCACCTACGACCTTTGCTGCAGCCCTC
CGTGGAGTGTTGGTCATGGATAAAAATTTGGCCAAGAAGGTGCAATCTCGTTGGGAGGTCGGTTCGTCTTCTGGGGTTAAAAGAAAGCCCCCACCAACTCAAGCGAGACC
ATCACAGAAGGGCTTGGGCTTATCTAGCGAGTGTGGTAGATACCAGCAAGGAAGCCTGATGCGCAATTTGCTTACCGCAAGTGTACGGGTGAAGTAA

Protein sequence
MGEVSCILGKPFPRVFLLRKRTKIKKRKELRKRKKAKTSIFPSFFHGSHQPPSRFHLAESPPATVAAHQLSSIGASVAVVLSFPPVVRHNLSTLQRIMRFPSPPARATAV
ARCSLSRSVRAFFSLSVGLSVEIFLSPCVRDATISLLHPGVAISLRNRRKFPFFDFAWLRFKYLGLSSDLQYRLGSIWPRICSIPSVSFGNSILVSSQFGSRTTLIKDFG
AGRQIPTSRLARLCNSEDQVDDVHAQQEINPPIPPDQHEVDPPPPSRQQKSCQPNWTVEDPIVSQFILDFKQYGPPSFDGRSDNPLAVKCWIDNLEAMFDYMNCDDCLKA
RGVVFMLRGEARNWWKSVAVVVDHANEPVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQGLREEICGTIFLNAPTTFAAALRGVLVMDKNLAKKVQSRWEVG
SSSGVKRKPPPTQARPPQKGLGLSSECGRYQQGSLMRNLLTASEDQVDDVHAQQEINPPFPPDQHEVDPPPPSRQQKTVEDPIVSQFIRDFKQYGPPSFDGRSDNPLAVK
RWIDNLEAMFDYMNCDDCLKARGVVFMLRGEARNWWKSVAVVEDHANEPVSWERFKDLLYDYYFLETVRDDKETEFLHLTQGTMSVHQRLREEICGTIFLNAPTTFAAAL
RGVLVMDKNLAKKVQSRWEVGSSSGVKRKPPPTQARPSQKGLGLSSECGRYQQGSLMRNLLTASVRVK